Símbolos

NVIDIA TensorRT Boosts Stable Diffusion 3.5 Performance with 40% Less VRAM on RTX GPUs

jueves, 12 de junio de 2025, 12:35 pm ET1 min de lectura

NVIDIA TensorRT boosts Stable Diffusion 3.5 performance on NVIDIA GeForce RTX and RTX PRO GPUs. The software development kit (SDK) reduces VRAM consumption by 40% and doubles performance. It also enables just-in-time engine building and seamless AI deployment to over 100 million RTX AI PCs. The SDK is now available as a standalone for developers. NVIDIA collaborated with Stability AI to quantize Stable Diffusion 3.5 Large to FP8, reducing VRAM consumption by 40%.

NVIDIA's TensorRT has significantly boosted the performance of Stable Diffusion 3.5 on NVIDIA GeForce RTX and RTX PRO GPUs, according to a recent collaboration with Stability AI. The software development kit (SDK) reduces VRAM consumption by 40% and doubles the performance of the model. This optimization enables just-in-time engine building and seamless AI deployment to over 100 million RTX AI PCs. The SDK is now available as a standalone for developers [1].

The collaboration involved quantizing Stable Diffusion 3.5 Large to FP8, which reduced VRAM consumption by 40%. This optimization means that five GeForce RTX 50 Series GPUs can now run the model from memory instead of just one. The optimized models are now available on Stability AI’s Hugging Face page [1].

Additionally, TensorRT for RTX was released as a standalone SDK, making it easier for developers to create optimized AI engines. This new version of TensorRT allows developers to create a generic TensorRT engine that is optimized on device in seconds, streamlining the process and reducing development time [1].

NVIDIA's advancements in AI performance and efficiency are likely to attract more developers and users to their RTX GPUs, potentially driving sales and market share in the competitive GPU market. The collaboration with Stability AI also demonstrates the company's commitment to fostering innovation in AI and its willingness to work with industry leaders to achieve these goals [1].

References:
[1] https://blogs.nvidia.com/blog/rtx-ai-garage-gtc-paris-tensorrt-rtx-nim-microservices/

Comentarios

﻿

Add a public comment...

Aún no hay comentarios

Divulgación editorial y transparencia de la IA: Ainvest News utiliza tecnología avanzada de Modelos de Lenguaje Largo (LLM) para sintetizar y analizar datos de mercado en tiempo real. Para garantizar los más altos estándares de integridad, cada artículo se somete a un riguroso proceso de verificación con participación humana. Mientras la IA asiste en el procesamiento de datos y la redacción inicial, un miembro editorial profesional de Ainvest revisa, verifica y aprueba de forma independiente todo el contenido para garantizar su precisión y cumplimiento con los estándares editoriales de Ainvest Fintech Inc. Esta supervisión humana está diseñada para mitigar las alucinaciones de la IA y garantizar el contexto financiero. Advertencia sobre inversiones: Este contenido se proporciona únicamente con fines informativos y no constituye asesoramiento profesional de inversión, legal o financiero. Los mercados conllevan riesgos inherentes. Se recomienda a los usuarios que realicen una investigación independiente o consulten a un asesor financiero certificado antes de tomar cualquier decisión. Ainvest Fintech Inc. se exime de toda responsabilidad por las acciones tomadas con base en esta información. ¿Encontró un error? Reportar un problema