Nvidia stellt Nemotron 3 Nano Omni mit Bild- und Sprachverarbeitung für leistungsstarke agentic KI-Anwendungen vor

Nvidia introduces Nemotron 3 Nano Omni with vision and speech for powerful agentic AI use

North America / United States1 views1 min

Nvidia Corp. launched Nemotron 3 Nano Omni, a 30 billion parameter AI model that unifies text, vision, and speech for agentic AI applications. The model provides low latency and high flexibility, allowing for rapid understanding of documents, computer displays, voice activity, and video.

Nvidia Corp. introduced Nemotron 3 Nano Omni, a powerful reasoning AI model that combines text, vision, and speech. The 30 billion parameter model uses mixture-of-experts architecture to deliver low latency and high flexibility. It eliminates the need for separate perception modules, improving efficiency and providing up to nine times faster throughput than other open omni models. Nemotron 3 Nano Omni can be compressed to run on higher-end consumer hardware and execute efficiently on enterprise cloud deployments. The model is available on Hugging Face, OpenRouter, and build.nvidia.com as an Nvidia NIM microservice. Nvidia's Nemotron family has seen over 50 million downloads in the past year, and the Omni variant extends its capabilities into multimodal and agentic domains.

This content was automatically generated and/or translated by AI. It may contain inaccuracies. Please refer to the original sources for verification.

Nvidia introduces Nemotron 3 Nano Omni with vision and speech for powerful agentic AI use

Comments (0)