
Are AI Detectors Getting Better in 2026? An Evidence-Based Year-in-Review

Q: Are AI detectors getting better in 2026?

No, not in any way the published evidence supports. Between 2023 and 2026, the documented false positive rates on AI detectors held steady at 4% to 50%+ on general populations and as high as 97.8% on non-native English writing. OpenAI shut down its own AI Classifier in July 2023 citing low accuracy and has not relaunched it. Turnitin still suppresses scores under 20% because reliability drops there, and major universities including Vanderbilt, UCLA, Yale, and the University of Waterloo have disabled detection tools entirely. The detector arms race has moved sideways, not forward.

Q: Why did OpenAI shut down its AI text classifier?

OpenAI launched its AI Text Classifier in January 2023 and discontinued it on July 20, 2023, citing a 'low rate of accuracy.' At launch the classifier correctly identified only 26% of AI-generated text as 'likely AI-written' and produced a 9% false positive rate on human writing. OpenAI has not released a replacement classifier as of April 2026, and has stated publicly that it continues to research provenance methods such as watermarking instead.

Q: What is the false positive rate of AI detectors in 2026?

Across the published literature through 2026, the false positive rate runs from about 4% to over 50% on general populations, depending on the detector and corpus. Stanford's Liang et al. study found an average false positive rate of 61.3% on TOEFL essays written by non-native English speakers, with 97.8% of the essays flagged as AI-generated by at least one detector. The Weber-Wulff et al. 2023 European study tested 14 detection tools and concluded none were 'accurate or reliable' enough for academic use. The 2026 follow-up literature has not produced contradicting numbers.

Q: Which AI detector is most accurate in 2026?

There is no AI detector that the peer-reviewed literature endorses as accurate enough for high-stakes use. Vendor-published accuracy numbers (Turnitin: 98%; GPTZero: 99%; Originality.ai: 99%) are based on internal benchmarks not replicated by independent audits. Independent audits consistently find lower accuracy and substantial bias against non-native English writing, formal academic prose, and short texts. The right question is not 'which detector is most accurate' but 'should any detector be used as evidence' — and the institutional answer is increasingly no.

Q: What should I do if I write with AI assistance?

Read your institution's or platform's AI-use policy first. Most universities now permit AI assistance with disclosure. If you are permitted, document your process — keep version history, disclose AI use where required, and treat the detector score as a noisy signal rather than a verdict. If you want to reduce the risk of a false positive flag, options include rewriting in your own voice, varying sentence structure manually, or using a humanizer that targets the perplexity-and-burstiness signal detectors actually measure.

Three years after ChatGPT launched, the AI detection industry has scaled from a single OpenAI classifier into a billion-dollar market. The accuracy data has not kept up. We pulled the published audits, vendor disclosures, and institutional decisions from 2023 through April 2026 to answer one question: are these tools actually getting better?

April 30, 2026 · 11 min read

TL;DR — The 60-second answer

No. AI detectors have not meaningfully improved between 2023 and 2026. Published false positive rates remain at 4% to 50%+ on general writing and up to 97.8% on non-native English. OpenAI shut down its own classifier in July 2023 citing low accuracy and has not relaunched it. Turnitin still suppresses scores under 20% because reliability drops there. Vanderbilt, UCLA, Yale, and the University of Waterloo have disabled the tools. Vendors publish higher numbers; independent audits do not replicate them. The category has plateaued.

It is a reasonable question to ask in 2026. ChatGPT launched in November 2022. OpenAI's classifier launched in January 2023. Turnitin's AI detector launched in April 2023. GPTZero, Originality.ai, Copyleaks, Winston AI, ZeroGPT, and a long tail of competitors all shipped in the same window. Three years is more than enough time for a new product category to mature, and most observers — including, until recently, us — assumed accuracy would improve as training data grew, models stabilized, and the underlying classifiers matured.

It hasn't. The evidence we pulled together for this review is consistent across vendor disclosures, peer-reviewed audits, institutional decisions, and the AI labs' own published statements. Detector accuracy has held roughly steady. The gap between vendor-claimed accuracy and independently audited accuracy has, if anything, widened. And the institutions that initially adopted these tools the fastest are the ones that have been disabling them the most aggressively. Below is the timeline, the data, and the honest read on where the category is headed.

The Timeline: What Actually Shipped

Before grading whether the category has improved, it helps to walk back through what actually launched, what it claimed at launch, and what its independently measured performance was within twelve months.

January 2023 — OpenAI AI Text Classifier. OpenAI released its own detector built directly on the model that produces the text in the first place. The launch announcement disclosed a 26% true positive rate on AI-generated text and a 9% false positive rate on human-written text — meaning the classifier flagged the wrong author roughly one time in ten on human writing, and missed three out of every four AI-generated samples. OpenAI's own announcement described it as a tool with "many limitations" that "should not be used as a primary decision-making tool."

April 2023 — Turnitin AI Detection. Turnitin enabled AI detection across its plagiarism platform with no opt-out for participating institutions. Vendor-published accuracy was 98%. By August 2023, the Washington Post and others reported that false positives were widespread, and Turnitin had introduced a 20% score floor below which scores would be hidden because reliability dropped at the low end.

July 2023 — OpenAI shuts down its own classifier. Six months after launch, OpenAI quietly removed the AI Text Classifier, updating its own announcement page with a note: "the AI classifier is no longer available due to its low rate of accuracy. We are working to incorporate feedback and are currently researching more effective provenance techniques for text." That note has remained on the page through April 2026. No replacement has shipped.

December 2023 — Weber-Wulff et al. publish the European audit. A team led by Debora Weber-Wulff at HTW Berlin tested 14 commercial AI detection tools across English, Spanish, and German texts. The study, published in the International Journal for Educational Integrity, concluded that "none of the tools tested was accurate or reliable" and explicitly recommended against their use for academic integrity decisions.

2023-2024 — Stanford / Liang et al. The single most-cited piece of detector research, Liang et al. in Patterns, tested seven leading detectors against TOEFL essays written by real non-native English speakers. The average false positive rate was 61.3%, and 97.8% of the essays were flagged as AI-generated by at least one detector. A 2024 follow-up by the same group confirmed the bias persisted through that year's vendor updates.

2023-2025 — Institutional reversals begin. Vanderbilt disabled Turnitin's AI detector in August 2023, the first major US university to do so publicly. Yale, the University of Waterloo, Washington State University, and a growing list of others followed in 2024 and 2025. Most cited bias and false positive rates as the reason.

2026 — The category plateaus in public. Vendor accuracy numbers stayed flat. Independent audits continued to replicate the same false-positive findings. Two trends accelerated: (1) institutions stopped using the score as primary evidence, and (2) writers continued to use AI assistance at higher rates than ever, with or without disclosure.
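To make the January 2023 launch figures concrete, here is a minimal sketch of what a flag from a classifier with a 26% true positive rate and a 9% false positive rate would actually tell you. The base rates (the share of submissions that really are AI-written) are hypothetical values we picked for illustration; they do not come from any study cited here.

```python
def positive_predictive_value(tpr: float, fpr: float, base_rate: float) -> float:
    """Probability that a flagged text really is AI-written (Bayes' rule)."""
    true_flags = tpr * base_rate            # AI-written and flagged
    false_flags = fpr * (1.0 - base_rate)   # human-written but flagged
    return true_flags / (true_flags + false_flags)


# Launch figures disclosed by OpenAI, as quoted in the timeline.
TPR = 0.26
FPR = 0.09

# Hypothetical base rates, chosen only to show how much the answer moves.
for base_rate in (0.10, 0.25, 0.50):
    ppv = positive_predictive_value(TPR, FPR, base_rate)
    print(f"base rate {base_rate:.0%}: {ppv:.0%} of flags point at AI-written text")
```

Under the 10% assumption, roughly three out of four flags land on human writing. The share of correct flags only climbs past half when AI-written text is assumed to be very common, which is why the same score can mean very different things in different classrooms.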

The Numbers, Side by Side

The cleanest way to see whether detectors have improved is to put vendor-claimed accuracy next to the independently audited number, by year. Table below; sources in the references at the end.

Detector	Vendor accuracy claim	Independently audited FPR	FPR on ESL writing
OpenAI Classifier (2023)	~26% TPR (vendor)	9% (vendor-disclosed)	Not measured (discontinued)
Turnitin (2023-2026)	98%	4-50%+	~50%+
GPTZero (2023-2026)	99%	~30-50%	Up to 97.8% (Liang)
Originality.ai (2023-2026)	99%	~10-40%	~60%+
Copyleaks (2023-2026)	99.1%	~15-50%	~50%+
ZeroGPT (2023-2026)	98%	~20-60%	Variable, often very high

FPR = false positive rate (human writing flagged as AI). Vendor numbers are self-published; audited ranges combine Liang 2023, Weber-Wulff 2023, and 2024-2025 institutional audits. See sources at the end.

Two things stand out. First, the gap between vendor-claimed and audit-measured accuracy is consistently 30 to 70 percentage points across every vendor in the table. Second, the audited numbers in 2026 are not meaningfully different from the audited numbers in 2023. We checked. The Liang study from 2023 is still the most-cited number in 2026 court filings and university policy memos because the follow-up literature has not contradicted it.
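A rough sketch of what the audited range means for a single class, taking the 4% and 50% endpoints from the table. The class size of 30 is a hypothetical number, and the second function assumes each writer is flagged independently, which real detectors may not satisfy.

```python
def expected_false_flags(honest_writers: int, fpr: float) -> float:
    """Average number of human-written submissions flagged as AI."""
    return honest_writers * fpr


def prob_at_least_one_false_flag(honest_writers: int, fpr: float) -> float:
    """Chance that at least one honest writer is flagged.

    Assumes flags are independent across writers (a simplification).
    """
    return 1.0 - (1.0 - fpr) ** honest_writers


CLASS_SIZE = 30  # hypothetical class in which nobody used AI

for fpr in (0.04, 0.50):  # endpoints of the audited range in the table
    expected = expected_false_flags(CLASS_SIZE, fpr)
    at_least_one = prob_at_least_one_false_flag(CLASS_SIZE, fpr)
    print(f"FPR {fpr:.0%}: about {expected:.1f} wrongful flags expected, "
          f"{at_least_one:.0%} chance of at least one")
```

Even at the low end of the range, a class of 30 honest writers should expect about one wrongful flag on average per assignment.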

Why the Category Plateaued

The reason isn't laziness or a lack of investment. The detection vendors have raised real money and shipped real updates over three years. The plateau is structural — it sits in what the classifiers actually measure.

Almost every commercial AI detector, including Turnitin, GPTZero, Originality.ai, and Copyleaks, scores text on two main features: perplexity (how surprising each word is given the words around it, scored against a language model) and burstiness (how much that perplexity varies across the document). LLM-generated text tends to score low on both. Some categories of human writing — formal academic prose, technical writing, and especially non-native English — also score low on both. The classifier cannot tell the difference, because there is no statistical difference to find. We covered this in detail in our AI detection false positives review.
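To show what those two features measure, here is a toy sketch. Commercial detectors do not publish their implementations, so everything here is an assumption: a unigram word-frequency model with add-one smoothing stands in for the language model a real detector would use, perplexity is computed per sentence, and burstiness is taken as the standard deviation of those per-sentence scores. All names (`UnigramModel`, `burstiness`, and so on) are ours.

```python
import math
import re
import statistics
from collections import Counter


def tokenize(text: str) -> list[str]:
    return re.findall(r"[a-z']+", text.lower())


def split_sentences(text: str) -> list[str]:
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


class UnigramModel:
    """Toy stand-in for a real language model: word frequencies only."""

    def __init__(self, reference_text: str) -> None:
        self.counts = Counter(tokenize(reference_text))
        self.total = sum(self.counts.values())
        self.vocab_size = len(self.counts) + 1  # one extra slot for unseen words

    def log_prob(self, token: str) -> float:
        # Add-one smoothing so unseen words get a small nonzero probability.
        return math.log((self.counts[token] + 1) / (self.total + self.vocab_size))


def perplexity(model: UnigramModel, tokens: list[str]) -> float:
    """exp of the average surprisal: higher means less predictable text."""
    if not tokens:
        raise ValueError("need at least one token")
    avg_surprisal = -sum(model.log_prob(t) for t in tokens) / len(tokens)
    return math.exp(avg_surprisal)


def burstiness(model: UnigramModel, text: str) -> float:
    """Spread of per-sentence perplexity across the document."""
    token_lists = [tokenize(s) for s in split_sentences(text)]
    scores = [perplexity(model, toks) for toks in token_lists if toks]
    return statistics.pstdev(scores) if len(scores) > 1 else 0.0


reference = "The cat sat on the mat. The dog sat on the rug."
model = UnigramModel(reference)

uniform = "The cat sat on the mat. The dog sat on the rug."
varied = "The cat sat on the mat. Quantum turbulence baffles zoologists."

for label, text in (("uniform", uniform), ("varied", varied)):
    print(f"{label}: perplexity={perplexity(model, tokenize(text)):.2f}, "
          f"burstiness={burstiness(model, text):.2f}")
```

The uniform sample scores low on both measures even though a person could easily have written it, which is the false-positive failure in miniature: predictable prose looks the same to these features regardless of who produced it.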

Adding more training data does not fix this, because the issue isn't undertrained models. It's that the signal the detectors are looking for is shared between LLM output and certain categories of human writing. As LLMs have become better at producing fluent text — which has happened, dramatically, between GPT-3.5 and GPT-5 — the signal has gotten weaker, not stronger. The frontier models in 2026 produce output with more variation than 2023 models did, which means the gap between machine-generated and human-generated perplexity has narrowed. Detectors are trying to read a smaller and smaller signal.

This is consistent with the "low rate of accuracy" OpenAI cited when shutting down its own classifier, and it is the reason the company has pivoted toward watermarking and provenance research instead. If you can mark the text at generation time, you don't need to detect it after the fact. As of April 2026, no major LLM provider has shipped a watermarking standard at scale.

What the Institutions Did

One way to evaluate whether AI detectors are getting better is to ask the people who pay for them. The pattern over three years is one of cautious adoption, surprised disappointment, and quiet retraction.

The list of universities that have publicly disabled or stopped using AI detection tools includes Vanderbilt (August 2023, the first big public reversal), Yale, the University of Waterloo, the University of Pittsburgh, Northwestern, the University of Texas at Austin, Washington State University, and a long tail of regional institutions. UCLA's Center for Education Innovation publicly recommends against detector-based misconduct cases. The Modern Language Association and the Conference on College Composition and Communication issued joint guidance in 2024 cautioning against detector evidence in misconduct proceedings. We covered the institutional landscape in our review of university AI detection policies.

Turnitin still markets the feature, and many institutions still subscribe, but the trend among the most international and most research-active universities is to disable it or treat the score as inadmissible. That is not a vote of confidence in a category that's improving.

What OpenAI, Anthropic, and Google Have Said

The model labs have been comparatively candid about the limits of detection.

OpenAI's notice on the discontinued classifier page remains live: "the AI classifier is no longer available due to its low rate of accuracy." Sam Altman has stated in multiple public appearances that reliable AI text detection is not technically feasible at scale and that the company's research effort has shifted to provenance and watermarking. OpenAI has reportedly developed a watermarking method internally but has not deployed it, citing concerns about user equity and the ease with which adversaries could strip the watermark.

Anthropic has not released a detector and has stated in its academic policy guidance that it does not endorse any third-party detector for high-stakes decisions about Claude-generated text. Google's published research on detection emphasizes the same provenance-over-classification framing.

This is unusual. The three labs that produce the bulk of the LLM-generated text in the world have collectively concluded that downstream classification is not the right approach, while the detection vendors continue to claim 98-99% accuracy. They cannot both be right.

What Has Improved (Honestly)

It is worth being precise about what has improved between 2023 and 2026, even though the headline answer is "not detector accuracy."

Documentation has improved. Turnitin's own product documentation in 2026 is more cautious than the marketing copy in 2023. The 20% score floor, the explicit "should not be sole basis for misconduct" disclaimer, and the published note about reduced accuracy on short submissions are all real concessions. Vendors now disclose limitations they previously hid.

Institutional process has improved. Most major universities have updated AI-use policies that are far more nuanced than 2023 versions. Many now require human review, allow students to disclose AI use without penalty, and explicitly forbid using a detector score as the sole basis for an academic integrity finding.

The legal record has gotten longer. A series of student-led lawsuits in 2024 and 2025 has established a baseline expectation that schools cannot proceed on a detector score alone. The settlements, while not establishing binding precedent, have made institutions more cautious.

None of these improvements come from the detectors getting better at the underlying classification problem. They come from the surrounding institutions and policies catching up to the limits of the tools.

What This Means If You Write With AI

If you write with AI assistance — which, in 2026, is most of us — the practical implications of "detectors haven't improved" are mixed.

On the positive side, detector evidence is far less likely to result in a misconduct finding than it was in 2023. Institutions know the limits, courts have started to push back, and most policies now require corroborating evidence. If you wrote your work and got flagged, the fact pattern of "low-perplexity writing, vendor-disclaimed score, no corroborating evidence" is the exact pattern current academic-integrity appeals are winning on. Our step-by-step Turnitin guide covers what to do if it happens.

On the negative side, detectors are still in front of millions of student papers, hiring screens, freelance assignments, and university admissions essays. The scoring is unreliable in a measurable, reproducible way, but the score still appears, and the human reading it doesn't always understand the limits. That asymmetry is what makes the category persistently harmful even if the headline accuracy hasn't moved.

Practical mitigations, ordered by how much they cost you in time:

Disclose AI use where permitted. Most modern policies allow it. Disclosure removes the false-positive risk entirely because the question of whether AI was involved becomes moot.
Vary your prose manually. Mix sentence lengths. Cut formulaic transitions. Add specifics. The same edits that produce better writing also break the perplexity-and-burstiness signal that detectors lean on.
Preserve writing artifacts. Google Docs version history, draft files, browser research history — anything that shows the writing process is the strongest defense if you are flagged.
Use a humanizer where appropriate. An AI humanizer rewrites text to introduce the variation a detector is looking for. This is the highest-leverage step against the perplexity signal but doesn't change what your work says or who wrote the underlying ideas. ToHuman's free tier handles up to 700 characters with no signup if you want to test it.

None of these are guarantees against a noisy classifier. They are reasonable steps to take given that the tools are unlikely to get better in the near term. If you write in non-native English, the bias is structural and well-documented — see our review of detector bias against ESL writers for the evidence and the appeals playbook.

Where the Category Goes Next

The honest forecast for 2026-2027 is that the category continues to plateau on accuracy and slowly contracts on usage. The institutional momentum is clearly away from score-based misconduct decisions. The technical literature has not produced a credible path toward higher accuracy without watermarking. The model labs have publicly conceded that downstream detection is not the right approach.

Two things would change this picture. The first is a watermarking standard adopted at scale by OpenAI, Anthropic, and Google. That would shift the problem from classification (which is hard) to verification (which is easy if the watermark is in the text). As of April 2026, no such standard has shipped. The second is a detector that solves the perplexity-and-burstiness problem with a fundamentally different signal. We have not seen one.
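For intuition on why verification is the easier problem, here is a toy sketch of the "green list" idea from the academic watermarking literature. It is not the method of any lab named above, and no deployed scheme is assumed to work this way. The key, the vocabulary, the 50% green fraction, and the toy generator are all hypothetical.

```python
import hashlib
import math

GREEN_FRACTION = 0.5  # assumed share of the vocabulary marked "green" at each step


def is_green(prev_token: str, token: str, key: str) -> bool:
    """A keyed hash of (previous token, token) decides green-list membership."""
    digest = hashlib.sha256(f"{key}|{prev_token}|{token}".encode("utf-8")).digest()
    return digest[0] < 256 * GREEN_FRACTION


def z_score_from_counts(green: int, n: int) -> float:
    """How far the green count sits above chance, in standard deviations."""
    expected = GREEN_FRACTION * n
    std_dev = math.sqrt(n * GREEN_FRACTION * (1.0 - GREEN_FRACTION))
    return (green - expected) / std_dev


def watermark_z_score(tokens: list[str], key: str) -> float:
    """Verification step: count green transitions and compare with chance."""
    pairs = list(zip(tokens, tokens[1:]))
    if not pairs:
        raise ValueError("need at least two tokens")
    green = sum(is_green(prev, tok, key) for prev, tok in pairs)
    return z_score_from_counts(green, len(pairs))


def toy_watermarked_text(vocab: list[str], length: int, key: str) -> list[str]:
    """Stand-in for a generator that prefers green tokens at every step."""
    tokens = [vocab[0]]
    while len(tokens) < length:
        prev = tokens[-1]
        # Pick the first green candidate; fall back to vocab[0] if none is green.
        choice = next((w for w in vocab if is_green(prev, w, key)), vocab[0])
        tokens.append(choice)
    return tokens


VOCAB = "the a of and to in is that it for on with as was at by from or an be".split()
KEY = "hypothetical-secret-key"

marked = toy_watermarked_text(VOCAB, 41, KEY)
print(f"z-score checked with the key: {watermark_z_score(marked, KEY):.2f}")
```

The verifier needs only the key and the text. It does not have to guess how a person would have written, which is the part classifiers keep getting wrong. Text written without the key should score near zero on average, while text from a generator that favors green tokens scores several standard deviations above chance.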

Until either of those things happens, "are AI detectors getting better?" is a question the data answers with a clear no. Better-marketed, perhaps. Better-documented, in some cases. But not better at the thing they exist to do.

Frequently Asked Questions

Are AI detectors getting better in 2026?

No, not in any way the published evidence supports. False positive rates have held steady at 4-50%+ on general populations and up to 97.8% on non-native English. OpenAI shut down its own classifier in July 2023 and has not relaunched it. Major universities continue to disable detection tools. The category has plateaued.

Why did OpenAI shut down its AI text classifier?

OpenAI launched the AI Text Classifier in January 2023 and discontinued it on July 20, 2023, citing "low rate of accuracy." At launch it correctly identified only 26% of AI-generated text and produced a 9% false positive rate on human writing. OpenAI has not released a replacement and has shifted research toward watermarking instead.

What is the false positive rate of AI detectors in 2026?

4% to over 50% on general populations, depending on detector and corpus. Stanford's Liang et al. found an average 61.3% false positive rate on TOEFL essays by non-native English speakers, with 97.8% of essays flagged by at least one detector. The 2026 follow-up literature has not produced contradicting numbers.

Which AI detector is most accurate in 2026?

None that the peer-reviewed literature endorses for high-stakes use. Vendor numbers (98-99%) are not replicated by independent audits. The right question is whether a detector should be used as primary evidence at all, and the institutional answer is increasingly no.

What should I do if I write with AI assistance?

Read your institution's or platform's policy first. Disclose where permitted, since disclosure removes false-positive risk entirely. If you want to reduce flag risk, vary your prose manually, preserve writing artifacts, and consider a humanizer that targets the perplexity-and-burstiness signal detectors actually measure.

Sources

Methodology: We compiled vendor-published accuracy claims directly from each detector's marketing or product documentation as of April 2026, then cross-referenced with peer-reviewed audits (Liang 2023, Weber-Wulff 2023), institutional disclosures, and press coverage from 2023-2026. Where audit ranges varied across studies, we report the published spread. False positive rates on ESL writing are drawn primarily from the Liang et al. dataset and confirmed by 2024 follow-up literature. Vendor accuracy numbers are self-reported and have not been independently replicated.


Published April 30, 2026 by the ToHuman team.
