AI Detection · Posted by Chris Demers ·

AI Detector Says My Writing Is AI: False Positives Explained

11

This is one of the most frustrating experiences a student (or teacher) can have: you write something entirely on your own, run it through an AI detector, and it comes back flagged as AI-generated.

False positives are a real and documented problem with current AI detection tools. Let me explain why they happen and what you can do about them.

AI detectors look for statistical patterns associated with AI-generated text. The main pattern is low “perplexity,” meaning predictable word choices. The problem is that certain types of human writing also have low perplexity. Academic writing that follows formal conventions, writing by non-native English speakers who rely on common phrases, formulaic business or technical writing, and text that covers well-established topics with standard explanations can all trigger false positives.

ESL students are disproportionately affected. When English isn’t your first language, you tend to use more common vocabulary and simpler sentence structures. These are the exact patterns AI detectors flag. This creates a serious equity issue that schools need to address.

Students who write very structured, well-organized essays can also get flagged. Ironically, the “better” a student writes in terms of clarity and organization, the more likely they are to match AI patterns.

What should teachers do about this? Never use an AI detection score as sole evidence of academic dishonesty. Always investigate further. Look at the student’s writing history. Talk to them about their process. Consider whether the topic or assignment type might naturally produce writing that looks “clean.”

And if you’re a student reading this: keep your drafts, outlines, and revision history. Having documentation of your writing process is your best defense against a false accusation.

Have you dealt with false positives in your classroom? How did you handle it?

5 replies

5 Replies

0

yeah i had the same issue with turnitin last week. flagged a kid who definitely wrote it herself. super frustrating

5

I'm not convinced these detection tools will ever be reliable enough for high-stakes decisions. The fundamental problem is that they're trying to distinguish between two types of text that are becoming increasingly similar. As AI models improve, the statistical patterns detectors rely on will become less distinctive. We might be investing in a technological dead end.

0

Interesting data, but I'd push back on one point. Your sample size is too small to draw broad conclusions. I've seen Turnitin perform very differently across different writing genres, grade levels, and student populations. What works in your grade 11 English class might not generalize.

1

im new to all this ai stuff. my school just told us to 'use our judgment' which is super helpful lol. this thread is really helping me understand whats going on tho

2

totally agree with this