Skip to main content
Odd Lots

This Is How to Tell if Writing Was Made by AI

48 min episode · 2 min read
·

Episode

48 min

Read time

2 min

Topics

Artificial Intelligence

AI-Generated Summary

Key Takeaways

  • AI Detection Accuracy: Pangram Labs achieves a false positive rate of 1 in 10,000 and a false negative rate of roughly 1%, far exceeding the ~90% human baseline accuracy. The model scales beyond simple perplexity metrics by using deep learning trained on millions of side-by-side human and AI writing pairs to detect subtle decision patterns.
  • How AI Writing Gets Detected: LLMs make thousands of micro-decisions when constructing even 100 words of text, and their output clusters into a narrow region of all possible writing. Pangram trains a model to recognize these decision patterns through contrast learning — pairing a human review with an AI-generated version of the same content to identify imperceptible differences.
  • Active Learning Pipeline: After an initial training pass on known human and AI samples, Pangram scans a larger corpus to surface false positives and false negatives, then feeds those edge cases back into retraining. This self-improving loop continuously pushes the model closer to the human-AI boundary where detection is hardest.
  • AI Slop Economics on Reddit: Startups sell services to brands promising organic-seeming AI bot mentions on Reddit, where bots post normal-seeming replies and occasionally name-drop products. This gaming matters because LLMs train on Reddit data, meaning seeded brand mentions in Reddit threads increase the likelihood those brands appear in future AI-generated responses.
  • Internet Contamination Scale: Roughly 40% of internet pages are now AI-generated, driven largely by SEO content farms switching to AI to produce keyword-targeting articles at near-zero cost. Medium crossed 50% AI-generated new articles roughly 18 months ago, while Reddit sits at around 10% today, up from 7% a year prior.

What It Covers

Max Spiro, founder of Pangram Labs, explains how his AI detection platform achieves a 1-in-10,000 false positive rate by training deep learning models on tens of millions of paired human and AI writing samples, while approximately 40% of the current internet is already AI-generated content.

Key Questions Answered

  • AI Detection Accuracy: Pangram Labs achieves a false positive rate of 1 in 10,000 and a false negative rate of roughly 1%, far exceeding the ~90% human baseline accuracy. The model scales beyond simple perplexity metrics by using deep learning trained on millions of side-by-side human and AI writing pairs to detect subtle decision patterns.
  • How AI Writing Gets Detected: LLMs make thousands of micro-decisions when constructing even 100 words of text, and their output clusters into a narrow region of all possible writing. Pangram trains a model to recognize these decision patterns through contrast learning — pairing a human review with an AI-generated version of the same content to identify imperceptible differences.
  • Active Learning Pipeline: After an initial training pass on known human and AI samples, Pangram scans a larger corpus to surface false positives and false negatives, then feeds those edge cases back into retraining. This self-improving loop continuously pushes the model closer to the human-AI boundary where detection is hardest.
  • AI Slop Economics on Reddit: Startups sell services to brands promising organic-seeming AI bot mentions on Reddit, where bots post normal-seeming replies and occasionally name-drop products. This gaming matters because LLMs train on Reddit data, meaning seeded brand mentions in Reddit threads increase the likelihood those brands appear in future AI-generated responses.
  • Internet Contamination Scale: Roughly 40% of internet pages are now AI-generated, driven largely by SEO content farms switching to AI to produce keyword-targeting articles at near-zero cost. Medium crossed 50% AI-generated new articles roughly 18 months ago, while Reddit sits at around 10% today, up from 7% a year prior.

Notable Moment

When a researcher attempted to evade Pangram by running AI text through multiple translation layers — English to Chinese to formal Chinese to Hebrew and back to English — the model still correctly identified the output as AI-generated, suggesting the underlying decision patterns survive significant linguistic transformation.

Know someone who'd find this useful?

You just read a 3-minute summary of a 45-minute episode.

Get Odd Lots summarized like this every Monday — plus up to 2 more podcasts, free.

Pick Your Podcasts — Free

Keep Reading

More from Odd Lots

We summarize every new episode. Want them in your inbox?

Similar Episodes

Related episodes from other podcasts

Explore Related Topics

This podcast is featured in Best Finance Podcasts (2026) — ranked and reviewed with AI summaries.

Read this week's AI & Machine Learning Podcast Insights — cross-podcast analysis updated weekly.

You're clearly into Odd Lots.

Every Monday, we deliver AI summaries of the latest episodes from Odd Lots and 192+ other podcasts. Free for up to 3 shows.

Start My Monday Digest

No credit card · Unsubscribe anytime