Generative AI Startups Continue to Attract Major Funding, But Consolidation is Key
Investments in generative AI startups—those developing AI-driven products for generating text, audio, video, and beyond—show no signs of slowing, though they are increasingly concentrated among a select group of early-stage ventures. From January to July 16, 2023, 225 startups successfully raised $12.3 billion from venture capitalists, as reported by Crunchbase. If this trend continues, generative AI companies are poised to match or surpass the approximately $21.8 billion they attracted in total funding in 2023.
Here’s the breakdown of funding by stage for the first half of 2024:
- 198 angel/seed deals: $500 million
- 39 early-stage deals: $8.7 billion
- 18 late-stage deals: $3.1 billion
Early-stage startups emerged as the significant winners. Notable examples include Elon Musk’s xAI, which secured $6 billion in May, China’s Moonshot AI with $1 billion in February, Mistral AI at $502.6 million in June, Glean with $203.2 million in February, and Cognition at $175 million in April. Chris Metinko, an analyst and senior reporter at Crunchbase, notes that investors are increasingly favoring major startups perceived to have a higher likelihood of succeeding, while letting less promising ventures “wither away” earlier in the process.
Metinko points out, “Some VCs anticipate that legal and regulatory challenges facing AI companies both in the U.S. and abroad may lead to a slowdown in AI funding.” He adds, “Others believe that just as established tech companies triumphed during the mobile revolution over a decade ago, they will again emerge as the leading players.”
Yet the futures of many generative AI companies—even those with substantial funding—remain uncertain. These models are typically trained on data such as images and text pulled from public websites. Companies argue that fair use protects them against legal issues when this data is copyrighted. However, the legal landscape is still unclear, prompting some firms to begin establishing licensing agreements with copyright holders.
Regardless of how individual cases play out, the competition for high-quality training data is intensifying, driving up costs and making it harder to source as startups deplete existing web resources and more creators block data scrapers. One analysis predicts that the market for AI training data could skyrocket from $2.5 billion to $30 billion in the next decade. Additionally, according to a recent report from Stanford, the costs to train models remain steep, with OpenAI’s GPT-4 priced at $78 million and Google Gemini at $191 million.
Given the substantial upfront investments needed to develop flagship models, it’s no surprise that very few generative AI startups are currently profitable—even industry leaders like OpenAI and Anthropic. Reports suggest that despite generating around $3.4 billion in revenue, OpenAI could face losses totaling $5 billion this year.
It appears that investors in generative AI are taking a long-term approach, particularly notable tech giants such as Google, Amazon, and Nvidia, who view these investments as strategic opportunities. However, the question remains: could the bubble burst soon? If generative AI startups fail to navigate the existential challenges ahead, this possibility may not be far-fetched.