Opus 4.5 changed everything (Interview)
Episode
104 min
Read time
4 min
Topics
Fundraising & VC, Artificial Intelligence, Software Development
AI-Generated Summary
Key Takeaways
- ✓Agentic Model Quality Threshold: Sonnet 3.5 could build agentically but produced sloppy, spaghetti code that stalled on errors neither the developer nor the model could debug — because the developer hadn't written the code themselves. Opus 4.5 crossed a threshold where it one-shots functional native Windows tools using correct WinUI libraries, produces well-structured readable code, and completes full working apps in an afternoon. The practical test: Burke built a screen-capture-to-GIF tool, then extended it into a full screen recording editor in a few hours.
- ✓Personal Software Economics — The SaaS Killer Pattern: When a model reaches sufficient capability, replacing paid SaaS subscriptions with custom-built personal software becomes viable in a single afternoon. Burke replaced a paid routing app his wife used for her yard-sign business. Adam replaced an $500–$800/year invoicing service by prompting Claude Opus 4.6 with extended thinking to generate an optimal prompt, then handing that prompt to Augment Code's Auggie CLI overnight — waking to a working Rails app with invoicing, PDF generation, and email built in.
- ✓Subsidized Token Economics Won't Last: GitHub Copilot at $40/month offers request-based billing where one agent run doing 6,000 operations counts as a single request, with 1,500 requests per month included. Claude's $200/month max plan supports roughly one billion tokens monthly at an estimated provider cost of $25,000 — a $24,800 subsidy per user. Burke explicitly states this pricing cannot persist indefinitely and developers should maximize usage now, treating the current window as a finite opportunity before costs normalize.
- ✓Plan Mode as Context Extraction, Not Documentation: The value of agent plan mode is not producing a written plan — it is forcing the model to surface all the requirements and constraints the developer forgot to specify in the initial prompt. Running four to six planning loops with Opus 4.6 before execution dramatically improves output quality. Burke's current workflow: plan mode in Copilot CLI
What It Covers
Adam Stacoviak of The Changelog interviews Burke Holland from GitHub Copilot about how Claude Opus 4.5, released around December 2024, created a measurable step-function improvement in agentic coding capability. The conversation covers practical AI-assisted development workflows, the economics of subsidized model access, the future of software craftsmanship, and whether developers will be replaced or transformed into polymaths.
Key Questions Answered
- •Agentic Model Quality Threshold: Sonnet 3.5 could build agentically but produced sloppy, spaghetti code that stalled on errors neither the developer nor the model could debug — because the developer hadn't written the code themselves. Opus 4.5 crossed a threshold where it one-shots functional native Windows tools using correct WinUI libraries, produces well-structured readable code, and completes full working apps in an afternoon. The practical test: Burke built a screen-capture-to-GIF tool, then extended it into a full screen recording editor in a few hours.
- •Personal Software Economics — The SaaS Killer Pattern: When a model reaches sufficient capability, replacing paid SaaS subscriptions with custom-built personal software becomes viable in a single afternoon. Burke replaced a paid routing app his wife used for her yard-sign business. Adam replaced an $500–$800/year invoicing service by prompting Claude Opus 4.6 with extended thinking to generate an optimal prompt, then handing that prompt to Augment Code's Auggie CLI overnight — waking to a working Rails app with invoicing, PDF generation, and email built in.
- •Subsidized Token Economics Won't Last: GitHub Copilot at $40/month offers request-based billing where one agent run doing 6,000 operations counts as a single request, with 1,500 requests per month included. Claude's $200/month max plan supports roughly one billion tokens monthly at an estimated provider cost of $25,000 — a $24,800 subsidy per user. Burke explicitly states this pricing cannot persist indefinitely and developers should maximize usage now, treating the current window as a finite opportunity before costs normalize.
- •Plan Mode as Context Extraction, Not Documentation: The value of agent plan mode is not producing a written plan — it is forcing the model to surface all the requirements and constraints the developer forgot to specify in the initial prompt. Running four to six planning loops with Opus 4.6 before execution dramatically improves output quality. Burke's current workflow: plan mode in Copilot CLI
Notable Moment
Burke describes letting a GitHub Copilot CLI agent run continuously in a loop for days, autonomously deciding which features to add to a multiplayer game where users roleplay as baby birds. He acknowledges it will likely produce an unwieldy, unshippable result — and frames this as an honest illustration of exactly where autonomous agentic software development currently breaks down at scale.
You just read a 3-minute summary of a 101-minute episode.
Get The Changelog summarized like this every Monday — plus up to 2 more podcasts, free.
Pick Your Podcasts — FreeKeep Reading
Books, tools, and gear mentioned in this episode
SignalCast may earn commission on purchases via these links. As an Amazon Associate, SignalCast earns from qualifying purchases.
Tools
by OpenAI
“Burke's Anvil agent classifies tasks as easy, medium, or hard, then delegates design work to Gemini, code refactoring to GPT-5.3 Codex, and planning/communication to Claude Opus 4.6.”
by Google
“Burke's Anvil agent classifies tasks as easy, medium, or hard, then delegates design work to Gemini, code refactoring to GPT-5.3 Codex, and planning/communication to Claude Opus 4.6.”
- Claude Opus 4.5Recommended
by Anthropic
“Claude Opus 4.5, released around December 2024, created a measurable step-function improvement in agentic coding capability.”
- GitHub CopilotRecommended
by GitHub
“GitHub Copilot at $40/month offers request-based billing where one agent run doing 6,000 operations counts as a single request, with 1,500 requests per month included.”
- AnvilRecommended
“Burke's current workflow: plan mode in Copilot CLI → autopilot with a custom agent called Anvil → confidence-threshold loop.”
- Augment Code Auggie CLIRecommended
by Augment Code
“Adam replaced an $500–$800/year invoicing service by prompting Claude Opus 4.6 with extended thinking to generate an optimal prompt, then handing that prompt to Augment Code's Auggie CLI overnight — waking to a working Rails app with invoicing, PDF generation, and email built in.”
by Microsoft
“Teams like VS Code's engineering group are actively building AI-assisted workflows specifically for production-quality software.”
- Claude Opus 4.6Recommended
by Anthropic
“Adam replaced an $500–$800/year invoicing service by prompting Claude Opus 4.6 with extended thinking to generate an optimal prompt.”
company
“Sponsors: Squarespace”
- Augment CodeRecommended
“Adam replaced an $500–$800/year invoicing service by prompting Claude Opus 4.6 with extended thinking to generate an optimal prompt, then handing that prompt to Augment Code's Auggie CLI overnight.”
“Sponsors: Fly.io”
“Sponsors: Notion”
More from The Changelog
We summarize every new episode. Want them in your inbox?
Similar Episodes
Related episodes from other podcasts
Latent Space
Jun 2
GitHub's plan for Agents — Kyle Daigle, GitHub
The AI Breakdown
Apr 28
The AI Subsidy Era is Over
Startups For the Rest of Us
Mar 24
Episode 825 | Talking Tailwind CSS and Founder Fitness (with Adam Wathan)
WorkLife with Adam Grant
Mar 17
ReThinking: Breaking leadership barriers with hockey coach Jessica Campbell
So Money with Farnoosh Torabi
Feb 23
1948: Turning Side Hustles Into Sustainable Wealth with Money Girl Laura Adams
Explore Related Topics
This podcast is featured in Best Cybersecurity Podcasts (2026) — ranked and reviewed with AI summaries.
Read this week's AI & Machine Learning Podcast Insights — cross-podcast analysis updated weekly.
You're clearly into The Changelog.
Every Monday, we deliver AI summaries of the latest episodes from The Changelog and 192+ other podcasts. Free for up to 3 shows.
Start My Monday DigestNo credit card · Unsubscribe anytime