The problem at the heart of drug discovery: Lexogen & Ochre Bio on the power of AI on human data
Episode
38 min
Read time
2 min
Topics
Artificial Intelligence, Science & Discovery
AI-Generated Summary
Key Takeaways
- ✓Human data gap in liver disease: Only 40,000 liver transplants are performed globally each year against 1.5 million annual deaths from liver disease. Drug discovery fails here across three compounding problems: insufficient causal human biology knowledge, animal models that cannot predict fibrosis regression, and clinical trials that lack reliable human endpoint data — all requiring a human-first data strategy to address.
- ✓AI requires causal data, not just correlational data: AI models detect correlations that human brains cannot see, but correlation alone cannot resolve biological complexity or causality. To build genuinely predictive models, teams must generate perturbation data — knocking out every expressed gene across multiple human donors and disease states — then RNA sequence the results to construct a functional gene regulatory network.
- ✓Designing data for AI at scale: Ochre Bio and Lexogen generated a foundational perturbation dataset covering all expressed genes across multiple human liver donors and disease states. Lexogen's TAW technology, which eliminates the RNA extraction step, reduced variation and increased sensitivity, enabling detection of low-level gene expression. The project achieved a 95% success rate and was completed within five months.
- ✓Lab-in-the-loop scaling strategy: Ochre Bio perturbs approximately 2,000 genes directly in complex human liver tissue models, then trains AI on that causal data to predict outcomes for all remaining untested genes in silico. This approach avoids reverting to simple cell lines while scaling predictions beyond what physical experimentation alone can achieve, using complex human tissue as the AI training ground.
- ✓Regulatory environment shifting toward human models: The FDA no longer requires animal studies before first-in-human trials, and recently indicated that liver disease clinical trials can move away from biopsy-dependent endpoints toward biomarker-driven measures. Bayesian clinical trial designs now allow data borrowing across prior studies and indications, enabling more sophisticated and ethically grounded use of human data in drug development.
What It Covers
Ochre Bio CEO Quinn Wills and Lexogen CEO Stefan Baj examine why drug discovery fails due to non-human biology models, how they built one of the world's largest human liver functional genomics datasets, and how purpose-designed RNA sequencing data enables AI-driven target discovery for liver disease affecting 1.5 million deaths annually.
Key Questions Answered
- •Human data gap in liver disease: Only 40,000 liver transplants are performed globally each year against 1.5 million annual deaths from liver disease. Drug discovery fails here across three compounding problems: insufficient causal human biology knowledge, animal models that cannot predict fibrosis regression, and clinical trials that lack reliable human endpoint data — all requiring a human-first data strategy to address.
- •AI requires causal data, not just correlational data: AI models detect correlations that human brains cannot see, but correlation alone cannot resolve biological complexity or causality. To build genuinely predictive models, teams must generate perturbation data — knocking out every expressed gene across multiple human donors and disease states — then RNA sequence the results to construct a functional gene regulatory network.
- •Designing data for AI at scale: Ochre Bio and Lexogen generated a foundational perturbation dataset covering all expressed genes across multiple human liver donors and disease states. Lexogen's TAW technology, which eliminates the RNA extraction step, reduced variation and increased sensitivity, enabling detection of low-level gene expression. The project achieved a 95% success rate and was completed within five months.
- •Lab-in-the-loop scaling strategy: Ochre Bio perturbs approximately 2,000 genes directly in complex human liver tissue models, then trains AI on that causal data to predict outcomes for all remaining untested genes in silico. This approach avoids reverting to simple cell lines while scaling predictions beyond what physical experimentation alone can achieve, using complex human tissue as the AI training ground.
- •Regulatory environment shifting toward human models: The FDA no longer requires animal studies before first-in-human trials, and recently indicated that liver disease clinical trials can move away from biopsy-dependent endpoints toward biomarker-driven measures. Bayesian clinical trial designs now allow data borrowing across prior studies and indications, enabling more sophisticated and ethically grounded use of human data in drug development.
Notable Moment
Quinn Wills reveals that Ochre Bio's foundational genomics dataset is already generating drug target hypotheses — including identifying genes within hepatocytes that could upregulate FGF21 from within the cell itself, potentially replacing the current therapeutic approach of simply administering the protein externally as a treatment for liver fibrosis.
You just read a 3-minute summary of a 35-minute episode.
Get Beyond Biotech summarized like this every Monday — plus up to 2 more podcasts, free.
Pick Your Podcasts — FreeKeep Reading
More from Beyond Biotech
Freeze variability, not progress: strengthen your cell therapy supply chain from the start
May 15 · 30 min
Marketing School
The AI Search Strategy That Actually Works
May 25
More from Beyond Biotech
Making labs smarter for scientific breakthroughs
May 7 · 30 min
a16z Podcast
Why AI Isn’t Killing SaaS Yet
May 25
More from Beyond Biotech
We summarize every new episode. Want them in your inbox?
Freeze variability, not progress: strengthen your cell therapy supply chain from the start
Making labs smarter for scientific breakthroughs
How Epic Bio is leveraging CRISPR without cutting DNA
Diagonal Therapeutics’ innovative clustering antibodies for vascular diseases
Argobio: the venture model building Europe’s next biotech champions
Similar Episodes
Related episodes from other podcasts
Marketing School
May 25
The AI Search Strategy That Actually Works
a16z Podcast
May 25
Why AI Isn’t Killing SaaS Yet
Animal Spirits
May 25
Talk Your Book: Investing in the Rise of the Robots
Capital Allocators
May 25
Fundraising Mastery: The Tao of Kimmer – John Kim (EP.503)
How I Built This
May 25
Justin’s Nut Butter: Justin Gold. He Was Waiting Tables, Then...He Reinvented Peanut Butter.
Explore Related Topics
This podcast is featured in Best Biotech Podcasts (2026) — ranked and reviewed with AI summaries.
Read this week's AI & Machine Learning Podcast Insights — cross-podcast analysis updated weekly.
You're clearly into Beyond Biotech.
Every Monday, we deliver AI summaries of the latest episodes from Beyond Biotech and 192+ other podcasts. Free for up to 3 shows.
Start My Monday DigestNo credit card · Unsubscribe anytime