2024-04-22 19:03:50

tante on Nostr

“AI” as support systems (for diagnostics)

One of the more reasonable use cases for modern “AI” (statistical pattern matching and generating machines) is to support doctors in diagnostics, especially in the evaluation of complex data sets and documents in order to detect potentially dangerous abnormalities.

It’s a problem tailor-made for neural networks (it’s actually one of the kinds of cases we were trained on back when I was still studying in the early 2000s): You have large, more or less clean, tagged data sets of diagnosed X-rays and the like that you can feed into your system to train a detector. Now, since human beings are not really standardized/normalized, the results are never 100%; one person’s abnormal is another person’s normal. But you can get decent results.
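To make the shape of this concrete, here is a minimal sketch of such a detector. Assumptions, loudly labeled: PyTorch as the framework, a tiny illustrative CNN, and random tensors standing in for a real tagged set of diagnosed X-rays.

```python
# Minimal, illustrative sketch (assumes PyTorch): train a small CNN to
# flag "abnormal" scans from a labeled data set. The random tensors below
# are a stand-in for real, tagged X-ray images.
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset

# Stand-in data: 256 grayscale 64x64 "scans" with binary labels.
images = torch.randn(256, 1, 64, 64)
labels = torch.randint(0, 2, (256,)).float()
loader = DataLoader(TensorDataset(images, labels), batch_size=32, shuffle=True)

model = nn.Sequential(
    nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
    nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
    nn.Flatten(),
    nn.Linear(32 * 16 * 16, 1),  # one logit: evidence for "abnormal"
)

optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.BCEWithLogitsLoss()

for epoch in range(3):
    for x, y in loader:
        optimizer.zero_grad()
        loss = loss_fn(model(x).squeeze(1), y)
        loss.backward()
        optimizer.step()

# Because "one person's abnormal is another person's normal", the output
# is a probability, not a verdict; a decision threshold has to trade
# false alarms against missed findings.
with torch.no_grad():
    print(torch.sigmoid(model(images[:4]).squeeze(1)))
```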

Results that “AI” hypesters often celebrate: Look, AI’s not just hype, studies show that AI systems are better than X% of doctors at diagnosing Y!

I think that many people don’t fully understand the context, though: What we see time and time again is that these systems are better than a significant percentage of doctors at diagnosis. Cool. But which doctors?

More often than not you will find that these systems are better than new or inexperienced doctors but worse than experienced specialists. Which isn’t surprising. It’s like with text: ChatGPT and others generate text that’s decent-ish and better than what first-semester students would write, but usually a lot worse than what experienced professionals create. AI is cheap and mediocre.
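To make that concrete with made-up numbers: a system can be “better than 60% of doctors” while still being worse than every single specialist, simply because the comparison cohort mixes juniors and seniors.

```python
# Toy illustration with invented accuracies: how "better than X% of
# doctors" depends entirely on who is in the comparison cohort.
junior = [0.72, 0.74, 0.75, 0.77, 0.78, 0.80]  # hypothetical junior accuracies
senior = [0.88, 0.90, 0.91, 0.93]              # hypothetical specialist accuracies
model = 0.85

cohort = junior + senior
print(f"better than {sum(a < model for a in cohort) / len(cohort):.0%} of all doctors")  # 60%
print(f"better than {sum(a < model for a in junior) / len(junior):.0%} of juniors")      # 100%
print(f"better than {sum(a < model for a in senior) / len(senior):.0%} of seniors")      # 0%
```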

Now we could say that that’s a positive: We’ve basically shifted the baseline up. No longer do you need to hope that a new doctor diagnoses your condition correctly; you have the AI level as a baseline, and more complex cases get sent to the experts (how many cases would still be flagged as complex once the mediocre machine has already diagnosed them is another very good question).

But there is an issue (one that you can see coming from a mile away if you’ve ever heard one of my talks on “AI”): How do young, inexperienced doctors learn if the machines diagnose all the routine cases? You learn diagnosis by diagnosing and by talking about cases with your peers, especially more experienced peers. It’s great that we’ve potentially raised the baseline, but how do we ensure that young physicians get enough time to practice their diagnostic skills? Because the super experienced people will die at some point. And experience comes neither cheap nor quick.

Especially in a world where the main promise of “AI” is efficiency and productivity, do we really believe that the “time savings” will be put into training young people who can’t yet do the work? Who take years to get to where they need to be to catch the stuff that the AIs didn’t find? In a different system we might, but in neoliberal capitalism? In austerity?

It’s great that we look at reasonable use cases for statistics, especially in critical and highly complex domains like medicine. But the studies presenting successes need to be read and understood correctly and fully.

“Better than X% of humans” is an empty statement. You need to know what exactly characterizes that X% to understand the actual capabilities and limitations of the systems that are supposedly so great.



This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.



#ai #diagnostics #medicine

Author Public Key
npub1j7sydwd2stdfj7ydmr2uhwa8p0ytf9z36ut5nx2u82tru82wvwpq8vt4mh