#48 - Good Medicine Starts With Saying I Don’t Know Artwork

Code & Cure

Decoding health in the age of AI

Hosted by an AI researcher and a medical doctor, this podcast unpacks how artificial intelligence and emerging technologies are transforming how we understand, measure, and care for our bodies and minds.

Each episode unpacks a real-world topic to ask not just what’s new, but what’s true—and what’s at stake as healthcare becomes increasingly data-driven.

If you're curious about how health tech really works—and what it means for your body, your choices, and your future—this podcast is for you.

We’re here to explore ideas—not to diagnose or treat. This podcast doesn’t provide medical advice.

All Episodes

Code & Cure

#48 - Good Medicine Starts With Saying I Don’t Know

June 11, 2026 • Vasanth Sarathy and Laura Hagopian

0:00 | 28:09

What if the most dangerous AI answer is the one that sounds the most certain? We start with a playful challenge about the moon’s diameter, then use it to explore a much bigger question in healthcare: how should AI systems communicate uncertainty instead of simply projecting confidence to the user?

We dive into how clinicians make decisions when the facts are incomplete. In the emergency department, documentation workflows, and automated ICD-10 coding, medical reasoning rarely depends on a single perfect answer. Clinicians rank a differential, search for evidence that could prove them wrong, prioritize what is urgent, and bring in specialists when needed. That process is built for uncertainty. Yet many healthcare AI tools, from large language models to traditional machine learning classifiers, are still designed to deliver one “best” answer, even when the situation calls for caution.

The episode breaks uncertainty into two practical categories: aleatoric uncertainty, which comes from ambiguity and noise in the data, and epistemic uncertainty, which appears when a case falls outside the model’s knowledge. Along the way, we unpack what probability scores really mean, why near-ties deserve attention, and why out-of-distribution detection matters when a model might confidently mistake the unfamiliar for the known. The key takeaway is simple: safer AI systems do not hide uncertainty. They make it visible, communicate it clearly, and know when to abstain.

References:

Uncertainty-aware abstention in medical diagnosis based on medical texts
Vazhentsev et al.
Nature Artificial Intelligence (2026)

Credits:

Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)
Licensed under Creative Commons: By Attribution 4.0
https://creativecommons.org/licenses/by/4.0/

The Moon Question And False Certainty

SPEAKER_01 0:00

Hey Vasanth, what's the diameter of the moon? What? Yeah, just just give me like uh like an estimate where you're you know like 90% certain it's in that range. Don't don't do like a zero to a million or something, but just like you know.

SPEAKER_00 0:16

I don't know, somewhere between five and fifteen thousand miles. I don't know. I I want you to be like ninety percent confident though. Oh. Um yeah, I think it's between five and fifteen thousand miles.

SPEAKER_01 0:30

Interesting. You were uh overconfidently wrong there, or maybe you were underconfidently wrong. I don't know. You didn't sound very confident in your estimate.

SPEAKER_00 0:38

Oh boy.

SPEAKER_01 0:39

Um, but but the moon's diameter, uh the mean diameter is around 2,159 miles. Did you feel certain or uncertain there?

SPEAKER_00 0:48

Oh boy. I was definitely uncertain, and I had no idea what I was what I was doing.

SPEAKER_01 0:53

Did I trick you into like trying to trying to be certain? Can but the AI sounds certain when it does this kind of stuff. Yeah, I mean I I had to be certain, right? I had to say something. I forced you to.

SPEAKER_00 1:05

You did Hello and welcome back to Code and Cure, uh, the podcast where we discuss decoding health in the age of AI. My name is Vasant Sarathi. I'm an AI researcher and cognitive scientist, and I'm here with Laura Hagopian. I'm an emergency medicine physician. I have to say that moon example is interesting, um, you know, and it's it's kind of the the space of questions that we just don't have a starting point for, right? There are some questions that we're just uncertain about. We just don't know where to begin, but maybe that's okay.

SPEAKER_01 1:46

Well, I was gonna say, I actually think that's a good thing. Like uncertainty isn't a bad thing. In fact, certainty when you're wrong is way worse.

How Clinicians Use Uncertainty

SPEAKER_01 1:56

That's true. And in medicine, it's not like you come in with wrist pain and I say, well, you have a scaphoid fracture by just, you know, with my x-ray vision. It's like, oh, well, what could this be? What's the differential diagnosis? What's the list of things it could be? And then I move things up and down the list based on, you know, what is the x-ray show or what what hurts when I touch it, or do I see a deformity or whatever it is? Um I'm, you know, moving things up and down that list. And I also kind of prioritize things on the list that are maybe more emergent, right? Like if you have an open wrist fracture or the bone's sticking out, that's that's something that needs to go to the operating. Obviously, yeah. And so I think uncertainty is a good thing. It forces us to try to narrow down, figure out what the next steps would be from a diagnostic perspective and also from a management perspective. And especially when you're a generalist like me, you can't know everything about everything, right? Right. So you have to realize, oh, hey, I'm uncertain. And maybe this other doctor who specializes in ex orthopedics or OBGYN or whatever knows more about this than me. Let me call them and see what they think about this. Let me have a consultation. And so I think uncertainty is a good thing. It's like you don't want to be sure about something that you're unsure about.

SPEAKER_00 3:24

Yeah. Yeah. Clearly.

AI Answers Anyway And That’s Risky

SPEAKER_00 3:26

I mean, and this is this is the tension, right? We build these AI models and they are always giving us an answer and sometimes overconfident. And we've talked about that in the past too. And they don't have a sense of the degree of uncertainty that they should be worried about and maybe answer the question with a consult or or ask for a consult, um, or even not answer the question.

SPEAKER_01 3:51

Oh, yeah. I think that's a key thing. It's like always it's trying to please you, right? So then it answers your question. Yeah. And it could be answering it wrong. Or maybe there's like four different answers, but because, you know, as humans, we want this like binary yes or no, or like this is the diagnosis, that's often what it's giving. Yes. Because it feels good to be on the receiving end of that. Yes. Even though that may not be right.

SPEAKER_00 4:15

Yeah. And those are when we talk about trying to please you and things like that, that we're referring to the the LLM world, the the open AI chat, chat GPT type of world, right? Of AI systems that are training to speak to you. But there's also the world of machine learning systems that are not chatbots, and they are just meant to, they're just trained for a specific task of identifying something, identifying, you know, what what is what is an object in an image? Like is it a cat, is it a dog, and so on. So they have a list of things it could be, and they have to pick one. So it's not like they can just abstain, right? They are act they're they're it's not even they're thinking about it. It's not that they're not trying to please you, they're just doing the thing that they're trained to do, which is predict one of those things. And how do we care characterize uncertainty in those cases, right? Also. Um, and I think in this paper it really comes up in the context of um medical classification tasks. So like looking at doctors' notes and figuring out ICD 10 codes, for example, um, which are important, right?

SPEAKER_01 5:17

Yeah, I mean, we use them for billing purposes. Um, and so it's like, well, you know, if someone just going with the wrist pain example, I guess I was using earlier, it's like, hey, what are we billing for? Are we billing for just like a sort of general wrist pain? Are we billing for a scaphoid fracture? Are we billing for a distal radius fracture? Are we, you know, so the diagnosis is important to get. And can we scan the chart and get that information?

SPEAKER_00 5:42

Right, right, exactly. And you can imagine a machine learning system to be trained on lots of data and lots of ICD 10 codes. I mean, for each code, maybe there's a bunch of data that they can be trained on. And you can imagine that it's set up to do this automatically and effectively, maybe even. And then you wonder, okay, it's telling me that this specific example is this specific code. How confident is it? Right? Because if it's not confident, I want to be able to jump in. I want to have a system that has maybe a human or somebody in the loop that can take it, escalate the issue, and identify and and figure out that this is an uncertain outcome. So I need to do something, dig deeper, and figure out the right answer.

SPEAKER_01 6:22

Or like, could the model even abstain from doing it? Right. Instead of being like, this is ICD 10 code I11, it's going to be like, oh yeah, I don't know what the ICD 10 code is. Someone needs to review this thing because my certainty level is like under 50% or something. I don't know.

SPEAKER_00 6:39

Yeah.

ICD-10 Coding And Human Review

SPEAKER_00 6:40

So I I thought that this could be a great opportunity to dive deep into uncertainty because we talk about uncertainty, we talk about confidence, we talk about what, you know, but what does that really mean in these contexts? And it's and we know some degree of uncertainty and randomness when we play dice games, right? You roll, you roll some dice, and it could be one of six options. And if it's a fair die, then you know that it's going to be, you know, one-sixth uh chance of getting playing with unfair die.

SPEAKER_01 7:08

Right. I mean, to the extent that it's what is happening here. Do you have trick dice that you play with? I'm just saying if there are trick dice, then it changes everything. But my point is like we're just doing a regular die with one, two, three, four, five, and six. Okay. Yeah.

SPEAKER_00 7:21

So there is there is randomness in that, right? And so you roll the dice and there's some randomness in that. And your uncertainty is already kind of set in stone because it's it's the same thing with the coin, right?

SPEAKER_01 7:29

There's like a one in six chance that you're gonna roll a five.

SPEAKER_00 7:32

Right, right, right, right. So that's that's one type of uncertainty. But there's the other type of uncertainty, which is we just talked about with the moon, right? The moon diameter. You can see that these are different. It's not like the moon diameter could be one of eight options, right? The the moon diameter is a thing. It it is it is a number and it's in a certain range and it exists. It's just that you don't know about it, right? It's an unknown piece of knowledge for you.

SPEAKER_01 7:57

Well, it's yeah, it's hard for me to even conceptualize. Like with a die, I know it's gonna roll one of those six numbers, but with the moon, especially when I was asking you, I was like, you can't do between zero and a million, because like I would have done that. Yeah, then I would have been really certain. But if I have to narrow it down, I'm like, I actually had to look it up for this for this episode. I didn't know it off the top of my head.

SPEAKER_00 8:16

Yeah, yeah, yeah, yeah. And it's really hard to do that. And so that's it, it's a different flavor of uncertainty. And I kind of wanted to talk about that.

Two Kinds Of Uncertainty Explained

SPEAKER_00 8:24

And and there's some fancy words that are going to be thrown here. One of which I love fancy words. One of which is alleatoric uncertainty.

SPEAKER_01 8:31

Alleatoric.

SPEAKER_00 8:32

And the other is epistemic uncertainty. Epistemic. Okay. The I think the easiest way um to think about this is the following, which is imagine looking at a blurry photo of a die. A die has been rolled, and somebody took a photo of it, and it's blurry, somewhat blurry, and some number came up, and your job is to predict what number came up. Um, and if the die was blurry, covered, tilted, or you know, genuinely hard to read, like it was really, if you're looking at it and you're like, I can't tell what this is, that is aleatoric uncertainty. The case that means that the uh information, the data itself is ambiguous. Okay. Um, but if they let's imagine that an AI system was used for this, something like this. And if an AI was trained on a let's imagine that the AI was trained to look at all photos of normal six-sided dies, right? But now in the blurry photo that you gave it, um, it is a strange-looking, uh, transparent die that has some gold dots on it or something, something weird, right? With some solid some other symbols in it.

SPEAKER_01 9:33

That's not even in its schema because it's how it been trained on that.

SPEAKER_00 9:35

That is epistemic uncertainty. So that is outside of the model's knowledge zone. So do you see the difference? So, like one is within the model's knowledge zone.

SPEAKER_01 9:45

But the data isn't there. Yeah. And the other is like, hey, this is outside of what it even trained on.

SPEAKER_00 9:51

Well, it's not even the data isn't there, it's just that there's so many so much noise and ambiguity and all that other stuff, right? Yeah. So, you know, for instance, in the ICD coding setting, uh, you could imagine that a clinical note is ambiguous or incomplete. And other times I I never do that. I don't know what you're talking about. But but the other times the other times the note is clear to a human expert, but the model has not seen enough examples of the disease. And so one, the second example is much more um about the model's knowledge. It just doesn't know. Yeah, okay. And the first is it does, it could know, but it just hasn't, you know, there's too many other factors in the data itself to make it hard to uh quantify. And these are two different kinds of uncertainties, and we need to be able to catch them both. Yeah, that makes sense. Yeah. So um, you know, and the solutions would be different for each of them, right? The solutions would be different for each of them. And

Aleatoric Uncertainty Inside The Data

SPEAKER_00 10:37

I think it's worthwhile also like even taking this die example further to think even deeper, which is what is that, right? That that first blurry, noisy type of thing that we just talked about. Well, one thing is that photo is off a die roll. So inherently, the die roll has some uncertainty. We just talked about that, right? You roll there, it's a one-sixth or whatever, right? There's a genuine chance of the die rolling in any of those six numbers before it landed. So there's that. Um, that's the inherent uh variability of the dice itself. Um, and and this you can think of in the ICD setting as being something like disease trajectories or um the underlying clinical outcome itself could be variable, right? There's something beyond the notes that exists. That's the same.

SPEAKER_01 11:18

Yeah, for example, if you have a clavicle fracture, most of them can be managed with a sling, but depending on the location of it, some of them might need to be operated on. And potentially if you have some that are really close to the sternum, like they may need a further workup because that that could indicate the trauma was very severe. So I'm going into unnecessary details, but like there's different like not all clavicle fractures are managed the same way.

SPEAKER_00 11:46

Yeah, no, exactly. And so that's so then then there's the distinct ambigu ambiguous observation, right? In the dye uh photo example, the photo was blurry, right? So that's an example. Uh in the ICD 10 um coding example, maybe the notes are vague, right? Um, something was like, you know, the example, you know, you might have better examples, but um it doesn't specify enough details for you to make a judgment, right? In some level.

SPEAKER_01 12:12

Um yeah, and that's probably not uncommon in some of these notes. I guess maybe AI scribes could help improve that, but it's like, you know, when you're dealing with a patient, the most important thing in front of you is the patient and the management of the patient. And maybe not all the detail goes into the note.

SPEAKER_00 12:28

Yeah, yeah, yeah. Then then there's missing evidence. That's another type of allegatory uncertainty. Where in the dice example, maybe in the photo, your finger covers part of the dice. So you can see it's occlusion, it's occluded somehow, and you can't see it. And in the ICD10 example, it could just be that maybe some values were uh lab value, lab results were omitted, or something was left out, um, or or there's some details that was like, you know, timing information was left out or something, right? Okay. Um that's like an example of missing evidence that's also eleatoric. Um, you have noisy measurements. Um, in the photo, you know, in the photo, it's like bad lighting. Um, can distort how those little pips look on the dice. Um so that's an example.

SPEAKER_01 13:07

Oh, pips, that's a good word.

SPEAKER_00 13:08

It is a good word. I I learned that when I was um preparing for this podcast. You just like dropped it in there.

SPEAKER_01 13:14

You're like, I sound so smart. I know the word pips.

SPEAKER_00 13:17

Yeah, yeah, yeah. Um, but your clinical text could contain typos, abbreviations, um, maybe even contradictory statements that you were not aware of, right? And that's an example of a noisy, noisy measurement at some level. Um and if a die was tilted in the photo, you might two people might disagree on that. That's a ambiguity of the label itself, right? Of what the die says. Um, in your ICD 10, two quoters might disagree on what that is. Um, you have maybe uh like fuzzy boundaries between classes. This relates to kind of the ambiguity of what it looks like. Like a five kind of looks like a six in a dice. Uh, and you know, you couldn't miss a square, maybe not in a dice, but like the numbers.

SPEAKER_01 14:00

I was gonna say, I don't know. Are you are you talking about yourself? Because I don't think I'd mistake a five and a six on a die.

SPEAKER_00 14:05

Yeah, but you could imagine mistaking um, you know, there are things that you could leave out, like there could be parts that are kind of again, this goes back to the point of blurry, I suppose, but there is inherent ambiguity there too. Um and you could have potentially other types of noise that I'm not captured here, but you know, there might be in an ICD 10 case, there might be one thing that relates to multiple ICD 10 codes.

SPEAKER_01 14:31

Oh, sure.

SPEAKER_00 14:32

Right?

SPEAKER_01 14:32

So then that's yeah, and in the ICD 10 coding system, there's like often a parent code, and then there's like subclassifications. Yes, like uh like a tree almost, right? So it's like here is the main diagnosis code, you have a clavicle fracture, and then underneath it, it's like, oh, the fracture is in the proximal, middle, or distal clavicle. Yeah. So you get more specific and you can bill for it more appropriately if you've if you like hone in on the most specific code available.

SPEAKER_00 15:00

Got it. And and the way to think about this aleatoric uncertainty is that the medical note itself does not cleanly determine the code.

SPEAKER_01 15:08

Yeah, I mean, the code is for billing purposes. The medical note is to communicate something in the electronic health record. And sometimes those things overlap, but not always, right?

SPEAKER_00 15:17

Yes, yes, yes, yes. So that is the space of aleatoric uncertainty. You notice that like it's it's the problem with the with the with the inherent data item itself that you're looking at. And it's really hard to, you know, that there's that that's one type of uncertainty. And we'll get to exactly how we measure these in a second. But um,

Epistemic Uncertainty Outside The Model

SPEAKER_00 15:35

then there's the whole family of uncertainty measures that's called epistemic, which is just a philosophically fancy word for knowledge, belief, the idea that you there are things you don't know about. Um, and that also has a bunch of different ways that it can sort of surface.

SPEAKER_01 15:50

Hold on. Do LLMs actually like have things that they think they don't know about, or do they think they know about everything?

SPEAKER_00 15:54

Or do you have to like build it in? Um, that's a good question. I don't know. I don't I don't know. Oh, are you uncertain?

SPEAKER_01 16:02

Did I just like plant that in there unplanned?

SPEAKER_00 16:05

That was good. That was really good. Um, you know, saying I don't know is a good thing, right? We've talked about this before. Um, it and and saying that you do know about certain things more accurately than others is also a really good thing. And if you're correct. If you're correct, right. Um, but moving on to sort of epistemic uncertainty, and this is interesting because here you have the type of thing that we talked about with the weird 20-sided die, right? Your model has uh sees a weird 20-sided die and it's only been trained on six-sided dies, right? And that's kind of okay, I it doesn't know what to do with it. It's outside the realm. Outside the realm. And in the case of ICD 10, maybe that manifests itself as the model, it's a rare disease or an unusual procedure or novel clinical phrase phrasing that it's not been trained on. Yeah, okay.

SPEAKER_01 16:51

So there's just not been enough data that's been put into the system for this specific thing.

SPEAKER_00 16:56

Right. Um, or there's some, you know, it it could be some weird kind of unseen style. Maybe there's a different style of writing in one hospital versus another. Um, in the dice example, it could just be a different transparent dice, right? Or something weird looking like that. Um, the model might have seen very few examples. That's also another thing. Um, so maybe it's seen very few examples of the die phase four, right? Um, and so it doesn't just doesn't have enough knowledge about what fours look like. And a similar thing could happen in in ICD 10 codes too. Like maybe it's there's some codes that it just hasn't seen enough examples of. Um, and again, a problem that can be solved with more data. Um and notice that, right? You can solve these problems with more data, whereas the electoric one, it's like you can't solve the problems with more data because the data is the problem.

SPEAKER_01 17:44

Yeah, okay, that makes sense.

SPEAKER_00 17:45

Um, and you know, but the the the business of like you're you're the other thing that could happen is you might have um lighting that's off in in the dice photo example. Uh maybe in the ICD 10 example, you might have um training data that came from some old EHR, right?

SPEAKER_01 18:01

Or old system, and uh, you know, so they've got to be a little bit more than a lot of ICD9 codes instead of ICD 10.

SPEAKER_00 18:06

Right, right, right, exactly. Um, and there's other things like the model itself might have some limitations on the types of things it can learn um in in the AI case. And so that's you know, I think there is a challenge there. Maybe it has it can't deal with in a dice example, it can't deal with certain types of 3D rotations of how the dice looks. It just doesn't know what that means. Um because to it it doesn't even maybe look like a dye, you know, or a roll of a dice, right? And so um and so you can imagine this this whole world of unknown things that can often be solved by finding the right kind of data, but again, you have to know that it has low epistemic uncertainty on something or high epistemic uncertainty on something.

SPEAKER_01 18:46

Right. I mean, I think the underlying thing here is like when these models are s give you an answer and they sound confident, then how would you even know that they're uncertain? Yes. How like are is there any safeguard in place to get it to say, like, oh, I don't know, or I'm not going to answer that question because I don't have enough information? Because my experience using LLMs in general, and even even like the medical specific LLMs, like open evidence, is they like give you a very clear and discrete answer every time. Yeah. That they sound confident in.

SPEAKER_00 19:20

Yeah. Yeah. And

Reading Probability Outputs The Right Way

SPEAKER_00 19:22

it's it's even more challenging even with non-LLM systems, where if it's an ICD 10 predictor, you probably don't need an LLM, right? Because all it needs to output is one of the is is is a set of codes that it's found for a piece of information. So it's not like answering you by saying, Oh, that's a great question. Let me think about the codes and let me answer them. It's just giving you the answer because it's trained to look at the patterns of the data and tell you which code it matches, right? And even in that example, it's hard because uh if you think about it this way, there's uh six, seven choices, right? In a six choices in a depth in a in a roll of a dice. There's a lot more IZ ten codes than that. Sure, sure. And you know, and and and what these sometimes these models give you is what's called a probability distribution over those choices. So you provide some input, you provide the photo of the dice, right? And output says it's you know, 40% chance that this is a 50 five, uh, 10% chance that this could be a six, uh, two percent, and then that those numbers, it has something for all of the six choices. It might be zero, very close to zero. Zero percent chance it's a one, yeah. But but but those things add up to 100%. So it's telling you that here are my here is my that's called the probability distribution, and it tells you that this is my my current thought about that picture, that it's most likely a five or a six, uh, five, because I have that's the highest number, right? So there are these family of methods for uncertainty quantification that looks at it and says, um, how strong is my top guess? Yeah. Maybe it's not 40% isn't enough, right? Maybe you need maybe 80%.

SPEAKER_01 20:55

I mean, I don't I don't think 40% is enough when you're talking about something clinical like diagnosis or whatever. Yeah.

SPEAKER_00 21:01

Yes. And and so um so if this if this uh model's favorite answer is weak in that in that sense, it should be less trusted, right? In some sense. Um, the other way to think about this, so that was like how strong is my best answer? The other way to think about it is was there almost a tie between multiple ones?

SPEAKER_01 21:18

Maybe there was like 40% versus 39%. Nine percent. Yeah.

SPEAKER_00 21:23

And then all of a sudden, now it's different again because now the model has barely chosen one answer over another. But because it's given you one answer, you have no idea how close it was. Yeah. You just decided that the best one is the winner. Um, and similarly, the model might not have one or two clear winners, it might just be spread across everything, or multiple ones. And um and is if the if it's if that's the case, then kind of all bets are off, right? It's more of an I don't know in that case. Yeah. Um, and so you can think about uh these questions that I'm asking, which is is it almost a tie? Did it almost win? Um, when I sound confident, am I using Usually correct. Do slightly different versions of the model agree with me? After repeated noisy passes, how strong is my top answer? How much do the probabilities fluctuate across different times I run this thing? You know, how much is how similar is this example to what I've seen before? These are all questions that are answered by specific mathematical techniques that this paper talks about that are all targeted at different ways of looking at the same uncertainty problem. And they're all sort of either aleatoric or epistemic. Okay. Okay. So there's a lot here. And I know that this is technically heavy, but I want people to kind of think about uncertainty broadly. Um sort of to recap, you have aleatory and epistemic. Alleatoric is sort of inherent in the data itself. And epistemic is more about what you don't know. And uh the questions you ask to judge your choices, to decide something, will tell you how you are quantifying your uncertainty. And maybe you need all of them. Maybe you need to think through all of these different choices.

SPEAKER_01 23:05

And that is something that can be mathematically modeled.

SPEAKER_00 23:07

Yes.

Bringing Uncertainty Into Clinical Decisions

SPEAKER_01 23:08

And then I guess my question becomes like, is that something that can be integrated into these systems where we ask, maybe for ICD Tenkodes, but what if you want help with a diagnosis or something? You it becomes more and more important to understand, okay, I think this is a hundred percent chance that this is a scaphoid fracture versus like, hey, 60% chance scaphoid fracture, 40% chance distal radius fracture, 20% chance it's both of those things together. Like that to me is like some something that I'd be clinically thinking about.

SPEAKER_00 23:46

Yeah, right. Exactly. You I think you were already thinking about both aleutoric and epistemic when a human thinks, a human expert thinks about this, right?

SPEAKER_01 23:54

Right. And so then it's like, well, how can we get the systems that are currently in place to do that? How can we build that into systems that we might be using for clinical purposes?

SPEAKER_00 24:04

Yeah, yeah. And and I want to I want to stress again that this is hard enough even on systems that are not LLMs, that are not just giving you diagnoses. This is just all it's doing is giving a piece of text, like a note, telling you what the ICD 10 codes are that are being acting, you know, that are relevant in that in that in that note.

SPEAKER_01 24:22

This is

The Cat X-Ray And When To Abstain

SPEAKER_01 24:23

how like what was the example? They showed like COVID lungs and then a cat, and then the cat was a COVID lung.

SPEAKER_00 24:29

Yeah, that was a great example. I don't know if that's a great example of uncertainty, but that was a funny funny. It should have been uncertain. Oh, actually, this is a great example of uncertainty.

SPEAKER_01 24:37

Look, I did it.

SPEAKER_00 24:38

You're right, you're right, you're right. So that was an example where they had trained the model on, and I want to ask you a quiz question after this.

SPEAKER_01 24:43

Oh no, he's gonna ask me if it's aleatoric or epistemic.

SPEAKER_00 24:46

Uh-huh. Exactly. Okay, hold on, let me think while you while you're talking about. Let me let me tell the example uh again, which is that there was a model that was trained on on lungs, on COVID versus non-COVID lungs, and it was shown, you know, and and you could tell, given an X-ray, you could tell which of those two it was. And um, it was shown a picture of a cat, and it was 100% confident that it was a COVID lung. And so here are the two choices in this. It's not a six-sided diaret, so it's only two choices.

SPEAKER_01 25:13

So there's a 50% chance I'm gonna get it right. No, no, no.

SPEAKER_00 25:16

Well, it's a 50% chance that it's a COVID lung. And and so the question is, is it aleatoric or is it um epistemic?

SPEAKER_01 25:26

Okay, um, for $700, I am going, I'm gonna say what is epistemic uncertainty because it was outside of the system that it was trained on, right? Like it was not trained on cats. Yes. But then it then, like, you want the system to be like, hey, wasn't trained on cats. Yes. I can't answer this. The answer is I don't know. And it didn't do that.

SPEAKER_00 25:49

Exactly. And there are technical methods, and this paper talks about them. Um, they're called density-based methods that are exactly for this, where it should come back and say, This is looks like nothing like my training data. It this looks the the appearance and the shape and whatever else that's in here, it looks totally different. I don't know what it is. Of course, it doesn't know that it's a cat because it's only trained on COVID and non-COVID lungs. But it should be able to say, This is not something I've seen before. I'm going to abstain. I'm not going to answer it.

SPEAKER_01 26:19

And that is like, I think, really important. Where if a model doesn't know the answer, yeah, just like every human doesn't know the answer, to be able to say, hey, I don't know. I need more information, or I need to talk to a specialist, or I want to get an x-ray, or let's do some lab work, or it could be one of these six things, and five of them are all treated with a volar splint. So we're gonna do that. And for the six, we're ruling it out with an x-ray. So that's where it's like you're trying to make a decision about what to do next, and you want the information that goes into that decision to be appropriate. And if you are uncertain, that needs to be front and center so you can make the most appropriate decision. Exactly. Exactly.

SPEAKER_00 27:08

Uncertainty is a good thing. It is a good thing. It is a good thing to be uncertain. It's a good thing to be, it just means that there are questions left unanswered. It means that you have to raise new questions and explore more and figure out what it is.

SPEAKER_01 27:20

Yeah. And in medicine, sometimes something's straightforward, right? Yeah. You fell, you broke your arm, but a lot of times things are not so straightforward. And being uncertain about what's going on and what to do next can actually be helpful so that you expand what your tests are and then find a path forward and narrow in over time. Exactly. It's a good thing. Yes. All right.

Here’s To Uncertainty

SPEAKER_01 27:48

Well, uh, here's to uncertainty. And um, maybe, maybe, just maybe, hopefully, between a zero and a hundred percent chance, we will see you next time on Code and Cure. Thank you.

Laura Hagopian

Host

Vasanth Sarathy

Host