Products mentioned by Sriram Raghavan

Products Sriram Raghavan has mentioned or recommended across podcast appearances.

SignalCast may earn a small commission on purchases through these links — at no extra cost to you. As an Amazon Associate we earn from qualifying purchases.

Granite models (2B and 8B)
Author
by IBM
“IBM trains its Granite models — currently 2B and 8B parameters — directly using reinforcement learning rather than distilling from larger models.”
Amazon
Mentioned on: Eye on AI · #335 Sriram Raghavan: Why IBM Is Betting Everyt...
Granite 3.3
Author
by IBM
“IBM's Granite 3.3 (8B parameters) matches GPT-4o and Claude 3.5 on code and math benchmarks using inference-time scaling techniques including particle filtering and majority voting.”
Amazon
Mentioned on: Eye on AI · #335 Sriram Raghavan: Why IBM Is Betting Everyt...
Granite 3.0
Author
by IBM
“Granite 3.3 was measurably ahead of Granite 3.0 at the halfway point of training, solely due to data quality improvements.”
Amazon
Mentioned on: Eye on AI · #335 Sriram Raghavan: Why IBM Is Betting Everyt...
Bamba 2
Author
by IBM
“Hybrid state-space/attention models (like IBM's Bamba 2, developed with CMU and Princeton) mean a 50B-parameter model can require less memory than a 34B model.”
Amazon
Mentioned on: Eye on AI · #335 Sriram Raghavan: Why IBM Is Betting Everyt...

← Back to Sriram Raghavan's podcast appearances

Granite models (2B and 8B)

Granite 3.3

Granite 3.0

Bamba 2