Why World Models Are the Next Big Thing in AI

by Sophie Williams
0 comments

Medal Founder Launches AI Lab, General Intuition, With $133.7 Million Seed Funding

A new artificial intelligence laboratory, General Intuition, has launched today with $133.7 million in seed funding, aiming to develop AI agents capable of navigating and interacting with the physical world using data derived from video games.

The company is a spin-off from Medal, a popular video game clipping platform, founded by Pim de Witte. De Witte began exploring the potential of Medal’s data – roughly 2 billion video uploads annually from tens of thousands of games – after reading research from Google DeepMind demonstrating the utility of gaming data in training AI for 3D environments. He reportedly received acquisition offers, including one from OpenAI for $500 million, before deciding to pursue the AI lab independently. “Initially, we were quite interested in them,” de Witte said of the offers, “but that was mostly a result of us not understanding what we were sitting on.”

The seed round is led by Vinod Khosla, founder of Khosla Ventures, a firm that was also an early investor in OpenAI. Other investors include General Catalyst and the Raine Group, with Moritz Baier-Lentz of Lightspeed joining the startup part-time. Khosla believes General Intuition has the potential to be as impactful in the field of AI agents as OpenAI has been with large language models, and this represents Khosla Ventures’ largest seed check since their 2018 investment in OpenAI. This investment signals growing interest in “world models,” a branch of AI research focused on giving AI spatial understanding – enabling it to predict physical outcomes, like preventing a falling object. The Raine Group is a global merchant bank focused on technology, media, and telecommunications.

General Intuition plans to initially focus on applications like search and rescue drones, with longer-term goals including humanoid robots and self-driving cars. De Witte emphasized the value of gaming environments as a “verifiable domain for spatial-temporal reasoning,” where AI can learn to distinguish between effective and ineffective actions. He cautioned that gaming companies may become attractive acquisition targets as demand for this type of data increases, advising them that “you are at an information disadvantage.”

The company expects its first model to be operational within the next year, and officials stated they will continue to refine their approach as the field of world models evolves.

This is an excerpt of Sources by Alex Heath, a newsletter about AI and the tech industry, syndicated just for The Verge subscribers once a week.

Around the middle of last year, Pim de Witte started reaching out to a handful of prominent AI labs to see if they’d be interested in using data from Medal, his popular video game clipping platform, to train their agents.

Within weeks, it became clear that Medal’s data was more valuable to the labs than he expected. “We received multiple acquisition offers very quickly,” he told me. (He declined to name names, but it has been reported that OpenAI offered $500 million.) “Initially, we were quite interested in them,” he said of the offers, but that “was mostly a result of us not understanding what we were sitting on.”

He had read the Google DeepMind research paper showing that gaming data can be used to teach AI how to navigate a 3D environment. However, the interest from AI labs made him realize that his data from Medal, which receives roughly 2 billion video uploads per year from tens of thousands of video games, could be used to develop a unique foundational model for extending AI to the real world.

“It’s a pretty big bet.”

Today, Pim de Witte announced that Medal is spinning out a new AI lab called General Intuition that has raised a $133.7 million seed round. The money for the round is primarily from Vinod Khosla, founder of Khosla Ventures and one of the first investors in OpenAI. Other investors include General Catalyst and the Raine Group. Moritz Baier-Lentz, who oversees Lightspeed’s gaming investments, is also joining the startup part-time as a founding team member.

Khosla believes that General Intuition could be as impactful in the field of AI agents as OpenAI was on how people use large language models. It’s his firm’s largest seed check since it backed OpenAI in 2018. “It’s a pretty big bet,” he told me. “They have a unique dataset and a unique team.”

Unless you’re steeped in the AI world, you probably haven’t heard much about world models yet. It’s a branch of research that trains AI to have spatial understanding like a human. The idea is that a robot could, for example, predict when a glass of water will spill when knocked off a table and grab it before it falls. More practically, AI researchers are increasingly looking to world models as a way to train agents that can reliably generate and interact with a 3D space.

Among the prominent AI leaders, Google DeepMind CEO Demis Hassabis has been the most vocal advocate for world models and their importance in achieving AGI. Google recently demoed Genie 3, a model that generates a video game-like environment from scratch as you navigate through it. There are also a handful of startups working on similar models, including Fei-Fei Li’s World Labs, which this week released its own demo of a model that generates interactive video in real-time.

For General Intuition, the goal is to control any kind of device that can be mapped to a keyboard and mouse or has a game controller-like input scheme, according to de Witte. He expects the startup’s first model to be used by search and rescue drones but sees the potential for applications in other areas, including humanoid robots and self-driving cars.

Just as LLMs were initially trained on internet text data, de Witte believes that gaming environments will unlock AI’s ability to reliably predict the proper action to take in the physical world. “Games are basically the only verifiable domain for spatial-temporal reasoning,” he explained. “You can separate a good action from a bad action, which is why it’s so valuable.”

Still, it’s a risky bet. The correct technical path for developing world models is hotly debated in the AI industry, and as even Khosla noted to me, it’s unclear what data will ultimately prove the most valuable. Members of de Witte’s early research team have published notable research in the field, but the startup is still competing with better-funded giants like Google. “Somebody will win big in this market,” said Khosla, who told me thinks it’s an area where “multiple hundred-billion-dollar and potentially even trillion-dollar companies will be built.”

De Witte predicts that gaming companies will become prime takeover targets for AI labs as interest in world models heats up. His decision to start General Intuition was driven by the realization that, thanks to Medal’s data, he’s in the unique position to be more than a data supplier. However, he warned me that others might find it challenging to resist licensing checks and acquisition offers from the big AI labs.

“You are at an information disadvantage,” he said when I asked if he had advice for the gaming industry. “The better these models get, the less data they’re likely going to need.”

Follow topics and authors from this story to see more like this in your personalized homepage feed and to receive email updates.

You may also like

Leave a Comment

This site uses Akismet to reduce spam. Learn how your comment data is processed.

This website uses cookies to improve your experience. We'll assume you're ok with this, but you can opt-out if you wish. Accept Read More

Privacy & Cookies Policy