Products mentioned by Sriram Raghavan
Products Sriram Raghavan has mentioned or recommended across podcast appearances.
Granite models (2B and 8B)
by IBM
“IBM trains its Granite models — currently 2B and 8B parameters — directly using reinforcement learning rather than distilling from larger models.”
Granite 3.3
by IBM
“IBM's Granite 3.3 (8B parameters) matches GPT-4o and Claude 3.5 on code and math benchmarks using inference-time scaling techniques including particle filtering and majority voting.”
Granite 3.0
by IBM
“Granite 3.3 was measurably ahead of Granite 3.0 at the halfway point of training, solely due to data quality improvements.”
Bamba 2
by IBM
“Hybrid state-space/attention models (like IBM's Bamba 2, developed with CMU and Princeton) mean a 50B-parameter model can require less memory than a 34B model.”