Milliseconds to Match: Criteo's AdTech AI & the Future of Commerce w/ Diarmuid Gill & Liva Ralaivola
Episode
87 min
Read time
3 min
Topics
Relationships, Fundraising & VC, Marketing
AI-Generated Summary
Key Takeaways
- ✓Real-time bidding architecture: Criteo pre-computes user and product embeddings offline, reducing runtime inference to a vector similarity comparison executed in milliseconds. The system ingests product data from 17,000 retailers daily—sometimes multiple times per day—ensuring pricing, stock levels, and catalog accuracy that static LLM training data cannot provide. This hybrid of offline computation and live data refresh is the core technical moat enabling sub-millisecond ad decisions at billions of daily transactions.
- ✓Foundation model strategy: Rather than building one monolithic model, Criteo operates three to four specialized foundation models that generate embeddings for products, user timelines, and contextual signals separately. These embeddings are made available company-wide as reusable inputs, allowing new product teams to warm-start models instead of training from scratch. A recent internal hackathon validated this approach, with multiple teams achieving faster performance gains by plugging into existing embedding infrastructure rather than building new feature pipelines.
- ✓Feature evolution from sparse to dense: Criteo's modeling progressed from sparse binary vectors of up to 2^20 dimensions fed into logistic regression, to dense embeddings of 200–1,000 dimensions computed automatically via their proprietary Deep KNN algorithm. This shift eliminated manual feature engineering, which became unsustainable as cookie signals and data sources changed. The AI Lab, founded in 2018 specifically to drive this transition, now publishes the methodology publicly, including training loss functions and model architectures, in academic papers and technical blogs.
- ✓LLM partnership fills a specific gap: LLMs excel at general reasoning and natural language product queries but become stale immediately after training—missing flash sales, stock outages, and price changes. Criteo's OpenAI partnership addresses this by routing product queries through Criteo's live commerce data layer via MCP protocols, giving ChatGPT accurate real-time inventory context. The emerging agentic protocol standard makes this integration significantly easier than previous surface-by-surface API customization, reducing deployment complexity across chat interfaces and web surfaces simultaneously.
- ✓Privacy architecture as competitive advantage: Criteo stores no personally identifiable information—only anonymous random cookie IDs paired with behavioral signals like product views and purchase history, roughly 150 features per profile. Built under European GDPR constraints from inception, Criteo applies the same privacy-compliant tech stack globally rather than maintaining separate regional systems. This single-stack approach means US advertisers receive the same data handling as EU users, and Criteo pioneered the AdChoices opt-out icon before regulatory mandates required it.
What It Covers
Criteo CTO Diarmuid Gill and AI Lab VP Liva Ralaivola explain how their ad tech platform processes over one billion user profiles in milliseconds using cached embeddings and multiple foundation models, while exploring how their OpenAI partnership combines real-time commerce data from 17,000 retailers with LLM reasoning to power next-generation product discovery.
Key Questions Answered
- •Real-time bidding architecture: Criteo pre-computes user and product embeddings offline, reducing runtime inference to a vector similarity comparison executed in milliseconds. The system ingests product data from 17,000 retailers daily—sometimes multiple times per day—ensuring pricing, stock levels, and catalog accuracy that static LLM training data cannot provide. This hybrid of offline computation and live data refresh is the core technical moat enabling sub-millisecond ad decisions at billions of daily transactions.
- •Foundation model strategy: Rather than building one monolithic model, Criteo operates three to four specialized foundation models that generate embeddings for products, user timelines, and contextual signals separately. These embeddings are made available company-wide as reusable inputs, allowing new product teams to warm-start models instead of training from scratch. A recent internal hackathon validated this approach, with multiple teams achieving faster performance gains by plugging into existing embedding infrastructure rather than building new feature pipelines.
- •Feature evolution from sparse to dense: Criteo's modeling progressed from sparse binary vectors of up to 2^20 dimensions fed into logistic regression, to dense embeddings of 200–1,000 dimensions computed automatically via their proprietary Deep KNN algorithm. This shift eliminated manual feature engineering, which became unsustainable as cookie signals and data sources changed. The AI Lab, founded in 2018 specifically to drive this transition, now publishes the methodology publicly, including training loss functions and model architectures, in academic papers and technical blogs.
- •LLM partnership fills a specific gap: LLMs excel at general reasoning and natural language product queries but become stale immediately after training—missing flash sales, stock outages, and price changes. Criteo's OpenAI partnership addresses this by routing product queries through Criteo's live commerce data layer via MCP protocols, giving ChatGPT accurate real-time inventory context. The emerging agentic protocol standard makes this integration significantly easier than previous surface-by-surface API customization, reducing deployment complexity across chat interfaces and web surfaces simultaneously.
- •Privacy architecture as competitive advantage: Criteo stores no personally identifiable information—only anonymous random cookie IDs paired with behavioral signals like product views and purchase history, roughly 150 features per profile. Built under European GDPR constraints from inception, Criteo applies the same privacy-compliant tech stack globally rather than maintaining separate regional systems. This single-stack approach means US advertisers receive the same data handling as EU users, and Criteo pioneered the AdChoices opt-out icon before regulatory mandates required it.
- •Generative creative democratizes long-tail advertising: Historically, mid-to-long-tail advertisers were excluded from high-quality creative campaigns due to production costs. Criteo's self-service product Criteo Gold, combined with generative AI partners like Waymark, now enables smaller advertisers to produce campaign-quality creative assets. Dynamic creative optimization assembles pre-generated visual assets at runtime rather than rendering full generative outputs live—current generative video latency remains too high for real-time ad serving, but the modular assembly approach bridges the gap until on-device rendering speeds improve within an estimated two to three years.
Notable Moment
Liva Ralaivola proposed a future advertising model where users actively instruct their AI assistants to evaluate a fixed number of options—say, five shoes or five travel packages—and request curated ad exposure on their own terms. This reframes advertising not as interruption but as a user-initiated, agent-mediated discovery service, collapsing the boundary between search and advertising entirely.
You just read a 3-minute summary of a 84-minute episode.
Get Cognitive Revolution summarized like this every Monday — plus up to 2 more podcasts, free.
Pick Your Podcasts — FreeKeep Reading
More from Cognitive Revolution
AI:AM #3: Zvi on Fable, the Cases For & Against the Ban, + AI for Math, Logistics & More
Jun 21 · 134 min
Decoder
Yahoo CEO Jim Lanzone on reviving the web's homepage
Mar 16
More from Cognitive Revolution
Dean Ball, on Joining OpenAI: New Power Centers, Frontier AI Policy, & Main Character Energy
Jun 20 · 159 min
Darknet Diaries
171: Melody Fraud
Mar 3
Books, tools, and gear mentioned in this episode
SignalCast may earn commission on purchases via these links. As an Amazon Associate, SignalCast earns from qualifying purchases.
Tools
“Sponsor: AvePoint - listed in the episode sponsors section.”
“Sponsor: Sequence - listed in the episode sponsors section.”
Products
by Criteo
“Criteo's self-service product Criteo Gold, combined with generative AI partners like Waymark, now enables smaller advertisers to produce campaign-quality creative assets.”
company
“Criteo's self-service product Criteo Gold, combined with generative AI partners like Waymark, now enables smaller advertisers to produce campaign-quality creative assets.”
“Criteo's OpenAI partnership combines real-time commerce data from 17,000 retailers with LLM reasoning to power next-generation product discovery.”
“Criteo CTO Diarmuid Gill and AI Lab VP Liva Ralaivola explain how their ad tech platform processes over one billion user profiles in milliseconds using cached embeddings and multiple foundation models.”
More from Cognitive Revolution
We summarize every new episode. Want them in your inbox?
AI:AM #3: Zvi on Fable, the Cases For & Against the Ban, + AI for Math, Logistics & More
Dean Ball, on Joining OpenAI: New Power Centers, Frontier AI Policy, & Main Character Energy
Radically Better Reasoning: Elicit's Andreas Stuhlmüller & Jungwon Byun on World Models for Research
AI in the AM — Week 2 Highlights (June 2026)
Babysitting the Machine: Glean's Rebecca Hinds on the Hidden Human Labor of AI at Work
Similar Episodes
Related episodes from other podcasts
Decoder
Mar 16
Yahoo CEO Jim Lanzone on reviving the web's homepage
Darknet Diaries
Mar 3
171: Melody Fraud
The Biotech Startups Podcast
Feb 12
🧬 How Curiosity Creates Breakthroughs in AI, Data & Biotech | Caleb Appleton (Part 4/4)
SaaStr Podcast
Feb 11
SaaStr 841: Going From Blobs to Billions. Clay's Co-Founder Breaks Down Inbound, Outbound, and AI-Powered Sales.
Coaching for Leaders
Feb 9
769: How to Connect Better with Remote Colleagues, with Charles Duhigg
Explore Related Topics
This podcast is featured in Best AI Podcasts (2026) — ranked and reviewed with AI summaries.
You're clearly into Cognitive Revolution.
Every Monday, we deliver AI summaries of the latest episodes from Cognitive Revolution and 192+ other podcasts. Free for one show.
Start My Monday DigestNo credit card · Unsubscribe anytime