The Telltale Words That Could Identify Generative AI Text
A new study suggests at least 10% of scientific abstracts in 2024 were processed using large language models, researchers from the University of Tubingen and Northwestern University report. Analyzing 14 million PubMed abstracts from 2010-2024, the team identified an unprecedented surge in certain "style words" following LLMs' widespread adoption in late 2022.
Words like "delves" and "showcasing" saw a 25-fold and 9-fold increase respectively in 2024 abstracts compared to pre-LLM trends. Common terms such as "potential" and "findings" also spiked in usage. The researchers drew parallels to studies measuring COVID-19's impact through excess deaths, applying a similar methodology to detect "excess word usage" in scientific writing.
<a href="http://twitter.com/home?status=The+Telltale+Words+That+Could+Identify+Generative+AI+Text%3A+https%3A%2F%2Ftech.slashdot.org%2Fstory%2F24%2F07%2F01%2F1656208%2F%3Futm_source%3Dtwitter%26utm_medium%3Dtwitter"; rel="nofollow"><img src="https://a.fsdn.com/sd/twitter_icon_large.png"></a>
<a href="http://www.facebook.com/sharer.php?u=https%3A%2F%2Ftech.slashdot.org%2Fstory%2F24%2F07%2F01%2F1656208%2Fthe-telltale-words-that-could-identify-generative-ai-text%3Futm_source%3Dslashdot%26utm_medium%3Dfacebook"; rel="nofollow"><img src="https://a.fsdn.com/sd/facebook_icon_large.png"></a>
https://tech.slashdot.org/story/24/07/01/1656208/the-telltale-words-that-could-identify-generative-ai-text?utm_source=rss1.0moreanon&utm_medium=feed
at Slashdot.
https://tech.slashdot.org/story/24/07/01/1656208/the-telltale-words-that-could-identify-generative-ai-text?utm_source=rss1.0mainlinkanon&utm_medium=feed