This Is How to Tell if Writing Was Made by AI
Episode
48 min
Read time
2 min
Topics
Startups, Marketing, Artificial Intelligence
AI-Generated Summary
Key Takeaways
- ✓AI Detection Accuracy: Pangram Labs achieves a false positive rate of 1 in 10,000 and a false negative rate of roughly 1%, far exceeding the ~90% human baseline accuracy. The model scales beyond simple perplexity metrics by using deep learning trained on millions of side-by-side human and AI writing pairs to detect subtle decision patterns.
- ✓How AI Writing Gets Detected: LLMs make thousands of micro-decisions when constructing even 100 words of text, and their output clusters into a narrow region of all possible writing. Pangram trains a model to recognize these decision patterns through contrast learning — pairing a human review with an AI-generated version of the same content to identify imperceptible differences.
- ✓Active Learning Pipeline: After an initial training pass on known human and AI samples, Pangram scans a larger corpus to surface false positives and false negatives, then feeds those edge cases back into retraining. This self-improving loop continuously pushes the model closer to the human-AI boundary where detection is hardest.
- ✓AI Slop Economics on Reddit: Startups sell services to brands promising organic-seeming AI bot mentions on Reddit, where bots post normal-seeming replies and occasionally name-drop products. This gaming matters because LLMs train on Reddit data, meaning seeded brand mentions in Reddit threads increase the likelihood those brands appear in future AI-generated responses.
- ✓Internet Contamination Scale: Roughly 40% of internet pages are now AI-generated, driven largely by SEO content farms switching to AI to produce keyword-targeting articles at near-zero cost. Medium crossed 50% AI-generated new articles roughly 18 months ago, while Reddit sits at around 10% today, up from 7% a year prior.
What It Covers
Max Spiro, founder of Pangram Labs, explains how his AI detection platform achieves a 1-in-10,000 false positive rate by training deep learning models on tens of millions of paired human and AI writing samples, while approximately 40% of the current internet is already AI-generated content.
Key Questions Answered
- •AI Detection Accuracy: Pangram Labs achieves a false positive rate of 1 in 10,000 and a false negative rate of roughly 1%, far exceeding the ~90% human baseline accuracy. The model scales beyond simple perplexity metrics by using deep learning trained on millions of side-by-side human and AI writing pairs to detect subtle decision patterns.
- •How AI Writing Gets Detected: LLMs make thousands of micro-decisions when constructing even 100 words of text, and their output clusters into a narrow region of all possible writing. Pangram trains a model to recognize these decision patterns through contrast learning — pairing a human review with an AI-generated version of the same content to identify imperceptible differences.
- •Active Learning Pipeline: After an initial training pass on known human and AI samples, Pangram scans a larger corpus to surface false positives and false negatives, then feeds those edge cases back into retraining. This self-improving loop continuously pushes the model closer to the human-AI boundary where detection is hardest.
- •AI Slop Economics on Reddit: Startups sell services to brands promising organic-seeming AI bot mentions on Reddit, where bots post normal-seeming replies and occasionally name-drop products. This gaming matters because LLMs train on Reddit data, meaning seeded brand mentions in Reddit threads increase the likelihood those brands appear in future AI-generated responses.
- •Internet Contamination Scale: Roughly 40% of internet pages are now AI-generated, driven largely by SEO content farms switching to AI to produce keyword-targeting articles at near-zero cost. Medium crossed 50% AI-generated new articles roughly 18 months ago, while Reddit sits at around 10% today, up from 7% a year prior.
Notable Moment
When a researcher attempted to evade Pangram by running AI text through multiple translation layers — English to Chinese to formal Chinese to Hebrew and back to English — the model still correctly identified the output as AI-generated, suggesting the underlying decision patterns survive significant linguistic transformation.
You just read a 3-minute summary of a 45-minute episode.
Get Odd Lots summarized like this every Monday — plus up to 2 more podcasts, free.
Pick Your Podcasts — FreeKeep Reading
More from Odd Lots
How a Major Grocery Store Chain Can Dramatically Lower the Cost of Food
Jul 3 · 50 min
Eye on AI
#325 Phelim Brady: Why AI's Future Depends on Human Judgement
Mar 9
More from Odd Lots
What Dan Wang Saw on His Last Trip to China
Jul 2 · 48 min
Practical AI
Image Generation and Visual Intelligence with Black Forest Labs
Jul 2
Books, tools, and gear mentioned in this episode
SignalCast may earn commission on purchases via these links.
Tools
- Pangram LabsBy guest
“Max Spiro, founder of Pangram Labs, explains how his AI detection platform achieves a 1-in-10,000 false positive rate by training deep learning models on tens of millions of paired human and AI writing samples”
More from Odd Lots
We summarize every new episode. Want them in your inbox?
How a Major Grocery Store Chain Can Dramatically Lower the Cost of Food
What Dan Wang Saw on His Last Trip to China
Baidu's CFO on How It Became a Full-Stack AI Player
How Lenovo's CFO Is Allocating Capital During One of History's Biggest Booms
Rory Johnston on Why His $200 Oil Prediction Didn't Turn Out Right
Similar Episodes
Related episodes from other podcasts
Eye on AI
Mar 9
#325 Phelim Brady: Why AI's Future Depends on Human Judgement
Practical AI
Jul 2
Image Generation and Visual Intelligence with Black Forest Labs
Software Engineering Daily
Jun 23
Foundation Models for Structured Data
Masters of Scale
May 28
How to get better at money, with Carrie Joy Grimes
This Week in Startups
May 27
The Drone Company Quietly Taking Over Delivery
Explore Related Topics
This podcast is featured in Best Finance Podcasts (2026) — ranked and reviewed with AI summaries.
Read this week's Startups & Product Podcast Insights — cross-podcast analysis updated weekly.
You're clearly into Odd Lots.
Every Monday, we deliver AI summaries of the latest episodes from Odd Lots and 192+ other podcasts. Free for one show.
Start My Monday DigestNo credit card · Unsubscribe anytime