AInvest Newsletter
Daily stocks & crypto headlines, free to your inbox
NVIDIA has optimized OpenAI's new open-source gpt-oss models for their GeForce RTX and RTX PRO GPUs, enabling fast inference from the cloud to the PC. The models support agentic AI applications like web search and in-depth research. Developers can use popular tools like Ollama and llama.cpp on NVIDIA RTX AI PCs and workstations, achieving performance of up to 256 tokens per second on the GeForce RTX 5090 GPU.
NVIDIA has optimized OpenAI's new open-source gpt-oss models for their GeForce RTX and RTX PRO GPUs, enabling fast inference from the cloud to the PC. These models, gpt-oss-20b and gpt-oss-120b, support agentic AI applications such as web search and in-depth research. Developers can leverage popular tools like Ollama and llama.cpp on NVIDIA RTX AI PCs and workstations, achieving performance of up to 256 tokens per second on the GeForce RTX 5090 GPU.
Daily stocks & crypto headlines, free to your inbox
Comments
No comments yet