OpenAI's Open Models Optimized for NVIDIA GPUs: Boosting AI Applications and Democratizing Access

Tuesday, Aug 5, 2025 1:07 pm ET

NVIDIA has optimized OpenAI's new open-source gpt-oss models for its GeForce RTX and RTX PRO GPUs, enabling fast inference from the cloud to the PC. The two models, gpt-oss-20b and gpt-oss-120b, support agentic AI applications such as web search and in-depth research. Developers can run them with popular tools such as Ollama and llama.cpp on NVIDIA RTX AI PCs and workstations, reaching up to 256 tokens per second on the GeForce RTX 5090 GPU.
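
For readers who want to check a throughput figure like the one quoted above on their own hardware, here is a minimal sketch in Python. It assumes the model is served through Ollama's local REST API, whose non-streaming responses report eval_count (tokens generated) and eval_duration (generation time in nanoseconds). The helper name tokens_per_second is hypothetical, and the sample values are illustrative rather than measured.

```python
def tokens_per_second(response: dict) -> float:
    """Compute decode throughput from an Ollama-style response dict.

    Assumes the dict carries 'eval_count' (generated tokens) and
    'eval_duration' (nanoseconds spent generating them).
    """
    eval_count = response["eval_count"]
    eval_duration_ns = response["eval_duration"]
    if eval_duration_ns <= 0:
        raise ValueError("eval_duration must be positive")
    # Convert nanoseconds to seconds before dividing.
    return eval_count / (eval_duration_ns / 1e9)


# Illustrative values only: 512 tokens generated in 2 seconds.
sample = {"eval_count": 512, "eval_duration": 2_000_000_000}
print(tokens_per_second(sample))
```

Actual results depend on the GPU, the model size, the prompt length and the context length in use, so a single run is only a rough indication.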

The collaboration between NVIDIA and OpenAI underscores the companies' commitment to advancing AI technology and making it accessible to a broader audience. The gpt-oss models were trained on NVIDIA's H100 GPUs and optimized for the NVIDIA CUDA platform, which is the world's most widely available computing infrastructure.

Developers can now use these models on NVIDIA RTX AI PCs and workstations through various applications and frameworks, including Ollama, llama.cpp, and Microsoft AI Foundry Local. Ollama, in particular, offers an easy and optimized way to test the models with out-of-the-box support and customizable context lengths.
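
As a rough illustration of the customizable context length mentioned above, the sketch below builds a request for a locally running Ollama server using only the Python standard library. Several details are assumptions rather than facts from this article: the default endpoint http://localhost:11434/api/generate, the model tag gpt-oss:20b, the num_ctx value of 8192, and the helper names build_request and generate, which are hypothetical.

```python
import json
import urllib.request

# Assumed default address of a local Ollama server.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(prompt, model="gpt-oss:20b", num_ctx=8192):
    """Build a non-streaming generate request with a custom context length."""
    payload = {
        "model": model,                   # assumed model tag
        "prompt": prompt,
        "stream": False,                  # ask for one JSON object, not a stream
        "options": {"num_ctx": num_ctx},  # context window size in tokens
    }
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt, **kwargs):
    """Send the request and return the generated text (needs a running server)."""
    with urllib.request.urlopen(build_request(prompt, **kwargs)) as resp:
        return json.loads(resp.read())["response"]


# Example call, left commented out because it needs a running server
# with the model already pulled:
# print(generate("Explain what an agentic AI application is.", num_ctx=16384))
```

Larger context windows generally use more GPU memory, so the value that works in practice depends on the card and on the model variant.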

The release of these open-source models marks a significant step forward in AI innovation, allowing developers to build on the state-of-the-art open-source foundation provided by OpenAI and NVIDIA. This collaboration highlights NVIDIA's leadership in AI from training to inference and from cloud to AI PC.

