NVIDIA unveils Cosmos: A game-changer for physical AI development
https://t.co/kVmxSt2pwS (Throwback C Magnon, @throwback_c, January 9, 2025)
At CES 2025, NVIDIA unveiled its groundbreaking Cosmos platform, a suite of generative world foundation models (WFMs) designed to revolutionize the development of physical AI systems like autonomous vehicles (AVs) and robots. The announcement, made by NVIDIA founder and CEO Jensen Huang during his keynote address, marks a significant leap forward in AI-driven simulation and synthetic data generation. With major players like Uber, Waabi, and Agility already on board, Cosmos is poised to democratize physical AI development, making it accessible to development teams of all sizes.
What Cosmos does and why it matters
Cosmos WFMs are purpose-built to generate photorealistic, physics-based synthetic data, enabling developers to train and evaluate AI models without the need for costly real-world data collection. These models can simulate complex environments, such as warehouses, factories, and driving scenarios, with remarkable accuracy. By combining inputs like text, images, and video with sensor or motion data, Cosmos can create realistic simulations that help robots and AVs learn to navigate the physical world.
“The ChatGPT moment for robotics is coming,” said Huang. “Like large language models, world foundation models are fundamental to advancing robot and AV development, yet not all developers have the expertise and resources to train their own. We created Cosmos to democratize physical AI and put general robotics in reach of every developer.”
Cosmos isn’t just about generating synthetic data—it’s also about efficiency. The platform features an NVIDIA AI and CUDA-accelerated data processing pipeline that can process, curate, and label 20 million hours of video in just 14 days using the NVIDIA Blackwell platform. This is a dramatic improvement over traditional CPU-based pipelines, which would take over three years to accomplish the same task.
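To put those figures in perspective, here is a rough back-of-the-envelope calculation in Python. It uses only the numbers quoted above and treats "over three years" as exactly three 365-day years, which is an assumption, so the speedup it reports should be read as a lower bound rather than an official NVIDIA benchmark. The function and constant names are illustrative only.

```python
# Figures quoted in the article.
VIDEO_HOURS = 20_000_000  # hours of video to process, curate, and label
GPU_DAYS = 14             # wall-clock days on the Blackwell-based pipeline
CPU_YEARS = 3             # "over three years", so this is a lower bound

# Assumption: plain 365-day years, leap days ignored.
DAYS_PER_YEAR = 365


def video_hours_per_day(video_hours: float, wall_clock_days: float) -> float:
    """Average hours of video handled per wall-clock day."""
    return video_hours / wall_clock_days


def speedup_lower_bound(cpu_years: float, gpu_days: float) -> float:
    """Ratio of CPU wall-clock time to GPU wall-clock time for the same job."""
    return (cpu_years * DAYS_PER_YEAR) / gpu_days


gpu_rate = video_hours_per_day(VIDEO_HOURS, GPU_DAYS)
cpu_rate = video_hours_per_day(VIDEO_HOURS, CPU_YEARS * DAYS_PER_YEAR)
speedup = speedup_lower_bound(CPU_YEARS, GPU_DAYS)

print(f"Accelerated pipeline: {gpu_rate:,.0f} video hours per day")
print(f"CPU-only pipeline:    {cpu_rate:,.0f} video hours per day (at most)")
print(f"Speedup:              at least {speedup:.0f}x")
```

Under these assumptions the accelerated pipeline works through roughly 1.4 million hours of video per day, against about 18,000 hours per day for the CPU-only baseline, a gap of at least 78x.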
Additionally, Cosmos includes a state-of-the-art visual tokenizer that delivers 8x more compression and 12x faster processing than current leading tokenizers. These tools, combined with the NVIDIA NeMo framework for model training and customization, make Cosmos a powerful resource for developers looking to build and refine physical AI systems.
h/t Dan