#340 Steffen Cruz: Training AI Without Data Centres
Episode
46 min
Read time
2 min
Topics
Remote Work, Personal Finance, Fundraising & VC
AI-Generated Summary
Key Takeaways
- ✓Distributed Pretraining Economics: Training large language models through geographically distributed nodes enables cost arbitrage unavailable to centralized data centers. When a facility builds out thousands of GPUs, training costs are fixed at construction. Distributed systems can target surplus energy pockets — such as Icelandic renewable energy available only 12 hours daily — reducing training costs to roughly 10–20% of conventional rates.
- ✓Model Parallelism Architecture: Macrocosmos's IOTA system (Incentivized Orchestrated Training Architecture) splits models into small slivers across nodes rather than running full model copies on each machine. This approach allows training of frontier-scale models — targeting 70 billion parameters by mid-2025 and 100 billion-plus by 2026 — using consumer-grade hardware like Mac minis and CUDA-enabled GPUs.
- ✓Supply-Side GPU Utilization Strategy: Cloud providers and neo-clouds with idle GPU inventory can plug surplus capacity into IOTA's network during rental gaps. Since training commands higher margins than inference token sales, providers earn better returns on underutilized hardware than selling compute at discounted spot rates, creating a direct bottom-line improvement without additional capital expenditure.
- ✓Consumer Passive Income via Train-at-Home: Individuals with idle Mac minis, MacBooks, or consumer GPUs can download a one-click app, set availability windows — for example, 10PM to 6AM — and earn passive income contributing to model training runs. Macrocosmos reports 2,500 app downloads within the first two weeks, with the payout system rewarding participation proportionally to hours of compute contributed daily.
- ✓Blockchain as Coordination Layer, Not Compute: The blockchain in BitTensor functions as an identity registry, synchronization clock, and transparent payout trigger — not as a compute or storage layer. Off-chain tracking records each node's contribution, then pushes verified totals on-chain to trigger token payouts. This architecture allowed Macrocosmos to scale beyond BitTensor's native 256-node limit to support thousands of simultaneous participants.
What It Covers
Steffen Cruz, CTO of Macrocosmos, explains how his company uses BitTensor's blockchain infrastructure to train large language models through distributed compute nodes worldwide, eliminating the need for centralized data centers and enabling cost arbitrage through surplus energy and idle consumer hardware like Mac minis and spare GPUs.
Key Questions Answered
- •Distributed Pretraining Economics: Training large language models through geographically distributed nodes enables cost arbitrage unavailable to centralized data centers. When a facility builds out thousands of GPUs, training costs are fixed at construction. Distributed systems can target surplus energy pockets — such as Icelandic renewable energy available only 12 hours daily — reducing training costs to roughly 10–20% of conventional rates.
- •Model Parallelism Architecture: Macrocosmos's IOTA system (Incentivized Orchestrated Training Architecture) splits models into small slivers across nodes rather than running full model copies on each machine. This approach allows training of frontier-scale models — targeting 70 billion parameters by mid-2025 and 100 billion-plus by 2026 — using consumer-grade hardware like Mac minis and CUDA-enabled GPUs.
- •Supply-Side GPU Utilization Strategy: Cloud providers and neo-clouds with idle GPU inventory can plug surplus capacity into IOTA's network during rental gaps. Since training commands higher margins than inference token sales, providers earn better returns on underutilized hardware than selling compute at discounted spot rates, creating a direct bottom-line improvement without additional capital expenditure.
- •Consumer Passive Income via Train-at-Home: Individuals with idle Mac minis, MacBooks, or consumer GPUs can download a one-click app, set availability windows — for example, 10PM to 6AM — and earn passive income contributing to model training runs. Macrocosmos reports 2,500 app downloads within the first two weeks, with the payout system rewarding participation proportionally to hours of compute contributed daily.
- •Blockchain as Coordination Layer, Not Compute: The blockchain in BitTensor functions as an identity registry, synchronization clock, and transparent payout trigger — not as a compute or storage layer. Off-chain tracking records each node's contribution, then pushes verified totals on-chain to trigger token payouts. This architecture allowed Macrocosmos to scale beyond BitTensor's native 256-node limit to support thousands of simultaneous participants.
Notable Moment
Cruz describes a near-future scenario where a personal AI agent, after completing its assigned tasks by mid-morning, autonomously decides to contribute the machine's idle compute to a training network and earns money before the owner returns home — reframing personal computers as proactive economic participants rather than passive tools.
You just read a 3-minute summary of a 43-minute episode.
Get Eye on AI summarized like this every Monday — plus up to 2 more podcasts, free.
Pick Your Podcasts — FreeKeep Reading
More from Eye on AI
One Company Now Has More AI Agents Than Human Employees | Ryan Gavin of Slack
Jun 13 · 53 min
Beyond Biotech
How Epic Bio is leveraging CRISPR without cutting DNA
Apr 30
More from Eye on AI
AI Is Already Resolving 90% of Customer Service Tickets - and It's Getting Smarter | Shashi Upadhyay, Zendesk
Jun 12 · 57 min
a16z Podcast
Alex Blania on Proof of Human and Building World's Identity Network
Apr 2
Books, tools, and gear mentioned in this episode
SignalCast may earn commission on purchases via these links. As an Amazon Associate, SignalCast earns from qualifying purchases.
Tools
“Steffen Cruz, CTO of Macrocosmos, explains how his company uses BitTensor's blockchain infrastructure to train large language models through distributed compute nodes worldwide.”
by Macrocosmos
“Macrocosmos's IOTA system (Incentivized Orchestrated Training Architecture) splits models into small slivers across nodes rather than running full model copies on each machine.”
company
- MacrocosmosBy guest
“Steffen Cruz, CTO of Macrocosmos, explains how his company uses BitTensor's blockchain infrastructure to train large language models through distributed compute nodes worldwide.”
More from Eye on AI
We summarize every new episode. Want them in your inbox?
One Company Now Has More AI Agents Than Human Employees | Ryan Gavin of Slack
AI Is Already Resolving 90% of Customer Service Tickets - and It's Getting Smarter | Shashi Upadhyay, Zendesk
Every Enterprise Is About to Have a 100,000 Agent Problem | Oren Michaels of Barndoor AI
More Customers Chose the AI Agent Than Anyone Expected | Tom Chen, Aircall
Why the Future of AI Isn't Just Bigger Models. It's Models That Evolve | Risto Miikkulainen of Cognizant
Similar Episodes
Related episodes from other podcasts
Beyond Biotech
Apr 30
How Epic Bio is leveraging CRISPR without cutting DNA
a16z Podcast
Apr 2
Alex Blania on Proof of Human and Building World's Identity Network
Latent Space
Feb 25
🔬Searching the Space of All Possible Materials — Prof. Max Welling, CuspAI
The Bio Report
Feb 4
Reprogramming T Cells to Cross the Brain’s Border
Unchained
Jan 22
DEX in the City: When NYSE Goes Onchain, What Happens to Financial Intermediaries?
Explore Related Topics
This podcast is featured in Best AI Podcasts (2026) — ranked and reviewed with AI summaries.
You're clearly into Eye on AI.
Every Monday, we deliver AI summaries of the latest episodes from Eye on AI and 192+ other podcasts. Free for up to 3 shows.
Start My Monday DigestNo credit card · Unsubscribe anytime