Can AI Effectively Identify AI-Written Text?


We humans hold tight to our position as the world’s leading authors because we can craft lines and stories that an AI would balk at, right? As artificial intelligence evolves, we instinctively assume that its language will remain too un-humanlike to deceive us, and that this telling difference will keep machine-written text from infiltrating our human realm.

Like any defining characteristic, this one could come under threat and fall victim to an evolutionary arms race: artificial intelligence will develop counter-artificial intelligence. Iron sharpening iron and all that. It’s tough to go wrong. I’ve arranged the piece into six sections so you can ask: what are the technical issues, what is my experience as a reader, and what are the ethics? I’ll start with the rise of AI in content creation.

The Rise of AI in Content Creation

Many of us already use AI, in the form of bots or other automated systems, to write text for us: automated articles for news outlets, or automated email replies to issues that affect many people at once. These systems draw on data-driven algorithms and huge data sets to produce text of increasingly humanlike quality. In fact, the quantity of AI-generated material mimicking human work has grown so much in the past few years that media reports have labelled it ‘undetectable AI’. These developments raise questions about what we can trust when we communicate digitally. If we often can’t tell the difference between content created by bots and content created by humans, one kind of writer versus another, what are we to do? The need for tools that detect AI becomes a matter of urgency.

The advent of AI writing checkers has made them all but compulsory in fields where content creators seek prestige, from journalism to academia to publishing: anywhere there is a strong incentive to ensure that whatever appears under your name is uniquely yours. That claim can be tainted in various ways. AI might have been your guide and collaborator in the process, even if you did a substantial amount of the actual writing. But if your final product contains too many trace characteristics of AI, you’re walking the thin line of fraud, or at the very least putting your sterling ethical reputation at risk. With metrics designed to pinpoint the distinctive patterns of AI text, AI writing checkers have become core components of the editor’s and content manager’s toolkit.

How AI Detection Tools Work

AI detection tools analyse text in search of linguistic fingerprints that are statistically correlated with AI composition: overly smooth phrasing and flat responses that are formulaic, prolix or stereotyped, or that perhaps break the conventions of standard grammar. The hallmarks of a successful AI content detector will be linguistic fingerprint analysis, consistency checking, surface-level context analysis, and detection of anomalies in text flow.
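To make the idea of a linguistic fingerprint concrete, here is a minimal Python sketch that computes a few surface statistics for a passage. The function name `fingerprint` and the particular features chosen (sentence-length spread and vocabulary variety) are my own assumptions for illustration; real detectors are not documented here and may use entirely different signals. Very even sentence lengths or a low variety of words are, at most, weak hints and never proof of AI authorship.

```python
import re
import statistics


def fingerprint(text: str) -> dict:
    """Return a few surface-level style statistics for a passage.

    Hypothetical helper for illustration only, not any real detector's method.
    """
    # Rough sentence split: break on whitespace that follows ., ! or ?
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    # Words are runs of letters and apostrophes, lower-cased.
    words = re.findall(r"[a-z']+", text.lower())
    # Number of words in each sentence.
    lengths = [len(re.findall(r"[A-Za-z']+", s)) for s in sentences]
    return {
        "sentences": len(sentences),
        "mean_sentence_length": statistics.mean(lengths) if lengths else 0.0,
        # Spread of sentence lengths; 0.0 means every sentence is equally long.
        "sentence_length_stdev": statistics.pstdev(lengths) if lengths else 0.0,
        # Distinct words divided by total words; lower means more repetition.
        "type_token_ratio": len(set(words)) / len(words) if words else 0.0,
    }


if __name__ == "__main__":
    uniform = "The cat sat. The cat sat. The cat sat."
    varied = "Rain. It fell all night on the tin roof, loud and steady."
    print(fingerprint(uniform))
    print(fingerprint(varied))
```

A detector built on features like these would compare them against ranges observed in known human and known AI text, which is why such tools can only ever report a likelihood.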

These tools draw on a variety of computational techniques, from simple pattern-recognition models to increasingly complex neural networks that learn from large volumes of both human and AI-created text. Their developers constantly adjust the models to keep up with ever more sophisticated AI text generators, so that they can catch anything that isn’t ‘written’ with the subtleties of a real human writer.
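At the simple end of that range, learning from labelled examples can be sketched with a tiny naive Bayes word model written in plain Python. Everything here is an assumption for illustration: the class name `NaiveBayesDetector`, the two labels, and above all the two training sentences, which I invented as toy data and which say nothing about how real human or AI text looks. Production detectors train far larger models on far more text.

```python
import math
import re
from collections import Counter

LABELS = ("human", "ai")


def tokenize(text: str) -> list:
    """Lower-case the text and split it into word tokens."""
    return re.findall(r"[a-z']+", text.lower())


class NaiveBayesDetector:
    """Toy multinomial naive Bayes classifier (hypothetical, illustration only)."""

    def __init__(self):
        self.word_counts = {label: Counter() for label in LABELS}
        self.doc_counts = Counter()
        self.vocab = set()

    def train(self, text: str, label: str) -> None:
        tokens = tokenize(text)
        self.word_counts[label].update(tokens)
        self.doc_counts[label] += 1
        self.vocab.update(tokens)

    def log_score(self, text: str, label: str) -> float:
        # Requires at least one training example for the label.
        total_docs = sum(self.doc_counts.values())
        score = math.log(self.doc_counts[label] / total_docs)
        total_words = sum(self.word_counts[label].values())
        for token in tokenize(text):
            if token not in self.vocab:
                continue  # words never seen in training carry no evidence
            # Add-one (Laplace) smoothing avoids taking log(0).
            count = self.word_counts[label][token] + 1
            score += math.log(count / (total_words + len(self.vocab)))
        return score

    def predict(self, text: str) -> str:
        return max(LABELS, key=lambda label: self.log_score(text, label))


# Invented toy samples; a real system would need thousands of labelled texts.
detector = NaiveBayesDetector()
detector.train("honestly i dunno, the train was late again lol", "human")
detector.train(
    "in conclusion, it is important to note that punctuality is essential", "ai"
)

if __name__ == "__main__":
    print(detector.predict("it is important to note"))
    print(detector.predict("the train was late lol"))
```

The model simply asks which label’s word frequencies make the new text more probable, so it is only as good as its training data. That is one reason detectors must be retrained as text generators change.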

Challenges in Detecting AI-Written Text

Nevertheless, there is no easy way to detect AI-generated text today, even with these new AI-detection technologies (which still have problems of their own). The biggest reason is the arms race being waged between AI writers, whose capacity to reproduce human conversational styles is improving dramatically, and the systems used to detect them.

The second reason is the ‘undetectable AI’ problem: text deliberately generated to evade detection, by mimicking human-like errors (for example, spelling mistakes or unusual word order) and by using an adaptive machine-learning model that becomes better and better at ‘fooling’ the detector. These features make such texts particularly difficult for today’s AI detectors to tell apart from genuine human writing. More research and development is needed to improve the accuracy and reliability of detection.

Ethical Implications of AI and Detection Technologies

Using AI to write text raises one set of ethical issues (“Is your writing merely tech-assisted, or fully AI-generated?” “Did you write that lauded column or book, or did we purchase it from you?”), while using built-in systems that detect it raises another (preventing the deterioration of the “professional writer’s” discursive capacities).

Ethical use of these AI detectors also means being transparent about where and how content originates. But once the tools are deployed, both their use and their accuracy raise issues of privacy, of censorship, and of control over content on the internet. Striking the right balance among all of these factors is essential if technology is to assist and empower us without undermining the trust and integrity that should be at the core of every human endeavour.

The Future of AI in Writing and Detection

The AI-detection dynamic is likely to reach new levels of complexity in the future, as more sophisticated AI writing-assistance programs interact with more sophisticated AI-detection technologies. In the worst case, future iterations of AI might evolve to a point where it becomes possible to ‘humanise AI text’ so successfully that it couldn’t be distinguished from human-written text without some AGI-powered expertise. Detectors might then need to become more sophisticated in turn.

The cat-and-mouse game of AI-generated text and its detection will likely generate innovation on both sides. As humans and software continue to race ahead, we can expect new best practices to take root around AI in writing, in addition to more rigorous standards of oversight and accountability for AI tools themselves.


And as more and more AI stays hidden, embedded within digital content-creation tools, the problem of telling computer-written text from the work of human pens will only become more pressing. The tools that AI detectors rely on will have to keep pace with cutting-edge AI writing technologies. Developing these tools raises technical, ethical and philosophical questions.

How do we set the bar for what might be considered a harmonious relationship between human and machine, particularly in the area of authenticity when it comes to content creation? As we navigate this new digital reality, we need to find the right ideas and shape legislation or regulation to best fit human values. As AI continues to infiltrate our everyday lives, it will be essential to sort out how to reap the benefits of the technological arms race without ceding a disproportionate amount of creativity to machines.

