NVIDIA Unveils Free, Open AI Model

NVIDIA has launched Nemotron 3 Super, a powerful AI assistant that is completely free and open-source. It matches the performance of top proprietary models from a year ago but is significantly faster, especially with its NVFP4 version. This release democratizes access to advanced AI technology.

3 hours ago
3 min read

NVIDIA Unveils Free, Open AI Model

NVIDIA has released a powerful new AI assistant called Nemotron 3 Super. This AI is notable because it is completely free for everyone to use. Unlike many other advanced AI systems that require costly subscriptions and keep their inner workings secret, NVIDIA has shared all the details about Nemotron 3 Super. This includes a 51-page research paper that explains exactly how the AI was built and the data it was trained on.

A Look Inside Nemotron 3 Super

Nemotron 3 Super was trained on a massive 25 trillion pieces of data, called tokens. From this huge amount of information, it developed into an AI with 120 billion parameters. A parameter is like a knob that an AI can adjust to learn and make decisions. More parameters generally mean a smarter AI. This AI’s abilities are comparable to top closed-source AI models from about a year and a half ago. Those models cost billions of dollars to create and their details were kept hidden.

Speed and Performance

NVIDIA has released two versions of Nemotron 3 Super: BF16 and NVFP4. While both versions perform similarly in terms of accuracy, the NVFP4 version is significantly faster. It runs about 3.5 times quicker than the BF16 version. More impressively, NVFP4 is up to 7 times faster than other open-source AI models that are just as smart. This combination of speed and intelligence makes Nemotron 3 Super stand out.

Key Innovations Behind the Speed

NVIDIA shared four main innovations that contribute to Nemotron 3 Super’s performance:

  • NVFP4 (Quantization): This technique speeds up the AI by simplifying the math it uses. It’s like rounding off numbers to make calculations quicker. Normally, this can lead to errors and less accurate results. However, NVIDIA’s method smartly protects the most important calculations, keeping accuracy high while boosting speed.
  • Multi-Token Prediction: Instead of generating text one word at a time, this AI predicts several words, or 7 tokens, at once. It then checks these tokens together. This process greatly speeds up text generation.
  • Mamba Layers: Traditional AI systems can struggle with remembering information, like a student who keeps rereading a book. Mamba layers act like taking highly compressed notes. This allows the AI to efficiently remember important details from conversations without getting bogged down by unnecessary information. This helps it process vast amounts of data effectively.
  • Stochastic Rounding: When AI models perform many calculations, small errors can add up and magnify over time, similar to how small missteps can make you miss your car. To fix this, NVIDIA adds a tiny amount of controlled randomness, or ‘noise,’ to the calculations. This noise averages out to zero, meaning the errors don’t build up. The result is that the AI stays accurate even after many steps.

Why This Matters

The release of Nemotron 3 Super marks a significant shift in the AI landscape. By providing a powerful, open-source, and free AI model, NVIDIA is empowering researchers, developers, and the public. This transparency allows for greater understanding, collaboration, and innovation in the field of artificial intelligence. Previously, advanced AI capabilities were largely locked behind expensive, proprietary systems. Now, anyone can access and build upon state-of-the-art AI technology without financial barriers. NVIDIA’s commitment to investing billions in open systems suggests a future where cutting-edge AI is more accessible to everyone.

Availability and Future

Nemotron 3 Super is available for use, with NVIDIA providing extensive documentation through its research paper. While the AI is incredibly fast and smart, the paper acknowledges that some complex tasks might still take time. For instance, very math-intensive problems could require powerful computing resources. However, the overall impact of this release is immense. It challenges the dominance of closed AI systems and ushers in an era of open, collaborative AI development.


Source: NVIDIA’s New AI: A Revolution…For Free! (YouTube)

Written by

Joshua D. Ovidiu

I enjoy writing.

14,232 articles published
Leave a Comment