All-In Podcast · the podbrain notes ·
5 min read

Four CEOs on the Future of AI: CoreWeave, Perplexity, Mistral, and IREN

Jason Calacanis interviews four AI CEOs at NVIDIA's GTC conference: Michael Intrator (CoreWeave CEO), Arvind Srinivas (Perplexity CEO), Arthur Mensch (Mistral AI CEO), and Daniel Roberts (IREN co-CEO). These leaders represent different aspects of the AI infrastructure stack - from GPU cloud computing and AI search to...

All-In Podcast All-In Podcast
Subscribe to Notes Upgrade
All-In Podcast episode thumbnail: Four CEOs on the Future of AI: CoreWeave, Perplexity, Mistral, and IREN
All-In Podcast
Key Takeaways
  1. 01

    CoreWeave's average contract length is five years, with clients buying compute for 5-6 years, contradicting claims about GPU obsolescence - Michael

  2. 02

    Perplexity serves tens of millions of users monthly and has saved over $100 million for Enterprise Max customers through AI automation

  3. 03

    IREN operates 100% on renewable energy since inception, using hydro in British Columbia and wind/solar in West Texas

  4. 04

    CoreWeave dropped its cost of capital by 600 basis points over two years through structured financing boxes with long-term contracts

  5. 05

    Mistral AI deploys portable training platforms directly on customer infrastructure to ensure complete data segregation and security

  6. 06

    A million tokens cost $32 when ChatGPT-3 launched but now costs just nine cents, demonstrating massive efficiency gains - Sarah Fryer

  7. 07

    West Texas has 45-50 gigawatts of wind and solar capacity but only 12 gigawatts of transmission lines to export power

  8. 08

    IREN signed a $9.7 billion contract with Microsoft representing just 5% of their total pipeline capacity

Get the latest ideas from All-In Podcast.

Plus the best new takeaways about artificial intelligence from other top podcasts — read in minutes, not hours.

or

By continuing, you agree to podbrain's Terms and Privacy Policy.

These notes may contain occasional inaccuracies. Learn how podbrain notes are made

Jason Calacanis interviews four AI CEOs at NVIDIA's GTC conference: Michael Intrator (CoreWeave CEO), Arvind Srinivas (Perplexity CEO), Arthur Mensch (Mistral AI CEO), and Daniel Roberts (IREN co-CEO). These leaders represent different aspects of the AI infrastructure stack - from GPU cloud computing and AI search to specialized model training and data center power infrastructure.

The conversations explore the evolution from crypto mining to AI compute, the competitive dynamics between frontier models, the challenges of scaling AI infrastructure, and the massive capital requirements driving the industry. Key themes include GPU depreciation debates, the shift toward specialized vertical models, agentic AI capabilities, and the critical role of power infrastructure in enabling AI scaling.

Each CEO discusses their company's unique positioning in the AI ecosystem, from CoreWeave's financing innovations and long-term contracts to Perplexity's multi-model orchestration approach, Mistral's focus on open-source specialization, and IREN's renewable energy-powered data centers supporting the industry's massive compute demands.

CoreWeave's Evolution from Crypto Mining to AI Infrastructure

CoreWeave began in 2017 as an algorithmic hedge fund focused on natural gas, pivoting to crypto mining with GPUs because "there's some brilliant engineer that built the ASIC, and they're probably going to be better at running it than we are" - Michael

The company weathered crypto winters through disciplined risk management from their hedge fund background, then progressively moved up the complexity stack from crypto to CGI rendering, batch computing, medical research, and finally neural networks by 2020-2021

CoreWeave donated A100s to Eleuther AI as "tuition we paid to learn how to run this business," with volunteers later requesting the same infrastructure at their day jobs, launching the commercial business

The company recognized early that "computing decommoditizes at scale" - while anyone can run a GPU, building clusters large enough to train world-changing models requires different capabilities entirely

GPU Depreciation Debate and Hardware Lifecycle Reality

Michael dismisses GPU depreciation concerns as "nonsense" driven by traders with short positions, noting CoreWeave's average contract length is five years with clients willing to pay for that duration

A100 Ampere GPU prices have actually appreciated through the year as new companies emerge with different use cases and model sizes, similar to how older iPhones retain value in developing markets

CoreWeave uses six-year depreciation schedules, believing GPUs will last beyond that timeframe, with obsolescence ultimately defined by when power costs exceed infrastructure margins rather than technical capability

"If people are willing to pay me for it, it still has value" - Michael's simple approach to GPU valuation, emphasizing market demand over theoretical depreciation timelines

Innovative Financing Through Structured 'Box' Contracts

CoreWeave pioneered GPU-based loans through structured financing vehicles called 'boxes' containing client contracts, GPU purchases, and data center agreements with waterfall cash flows

The box structure ensures clients pay directly into the vehicle, which first covers data center costs, power bills, interest and principal before returning profits to CoreWeave

Within two and a half years of five-year deals, all principal and interest are paid off, giving lenders confidence in the "one rule of lending: give me my money back"

This financing innovation enabled CoreWeave to raise $35 billion in 18 months while dropping their cost of capital by 600 basis points toward hyperscaler borrowing rates

Perplexity's Multi-Model Orchestra Strategy

Perplexity positions itself as "Switzerland" in the AI model wars, with multi-model orchestration as their key advantage since "if GPT wins, Gemini wins, Claude wins, Llama wins, it doesn't matter to us" - Arvind

The platform serves tens of millions of users monthly with thousands of enterprise customers, growing faster on the corporate side than consumer, with Enterprise Max at $400/month per person

Perplexity Computer represents an "orchestra of everything AI can do today" with models as instruments and sub-agents as musicians, orchestrating capabilities across hundreds of specialized models

The company maintains positive gross margins on all revenue through subscription fees and efficient token routing, unlike "wrapper companies" that simply resell tokens at a loss

Agentic AI and the Future of Computer Interaction

Perplexity announced Personal Computer, synchronizing server-side AI orchestration with local Mac Mini hardware to provide hybrid local/cloud agent execution with privacy controls

The vision positions "AI as the operating system" where users operate at the objective level rather than giving specific instructions, with models handling file systems, connectors, and tools automatically

Arvind predicts AI will enable "one person, $1 billion companies" by automating business operations from ad campaigns to customer support, allowing entrepreneurs to "be sipping wine in Napa" while their business runs autonomously

The Model Council feature addresses the common practice of querying multiple AIs by automatically identifying where models agree, disagree, and showing nuanced differences rather than requiring manual comparison

Mistral's Specialized Vertical AI Approach

Mistral AI focuses on open-source models that can be deeply customized for enterprise verticals, working with companies like ASML and HSBC through their Forge platform for specialized training

The company deploys portable training platforms directly on customer infrastructure, ensuring "the flow of data doesn't go, there's no flow coming back to Mistral because everything stays there" - Arthur

Mistral sends PhD-level forward deployment engineers to work with subject matter experts, transferring knowledge about domain-specific tasks like image scanning and defect detection

While general-purpose models handle orchestration, enterprises benefit from specialized models that can leverage "decades of IP that you've been accruing in financial services, in heavy manufacturing"

Power Infrastructure as the New Constraint

IREN operates entirely on renewable energy, using hydro in British Columbia and wind/solar in West Texas where 45-50 gigawatts of renewable capacity exceeds 12 gigawatts of transmission capacity

The company's strategy involves going "to the source of low-cost, excess renewable energy, monetize it into this digital commodity, export it at the speed of light as tokens"

IREN's flagship Texas site operates at 750 megawatts with four gigawatts total capacity - "almost as much power annually as the Bay Area uses in its entirety" - Daniel

The constraint has shifted from GPU availability to "time to compute" - the physical construction and deployment of data centers, requiring thousands of tradespeople in remote locations

Economic Impact and Workforce Transformation

Data center construction is creating high-paying trade jobs with salaries in the $150-300k range, while IREN hires locally and provides over $1 million in community grants for playgrounds and fire departments

The industry locates where "heavy electrical infrastructure" exists, often where "old manufacturing and industry has closed down," leveraging existing infrastructure while retraining local workforces

Token costs have dropped dramatically from "$32 and change" for a million tokens when ChatGPT-3 launched to "nine cents" today, demonstrating the power of capital markets driving engineering competition - Sarah Fryer

The technology is "bringing down this incredible barrier that kept human creativity contained" by enabling anyone with a good idea to overcome previously insurmountable technical obstacles - Michael

All-In Podcast
From All-In Podcast. Get a note like this from every new episode.
Subscribe to Notes Upgrade

These notes may contain occasional inaccuracies. Learn how podbrain notes are made

0 / 0
Link copied