Subscribe to Our Newsletter

Success! Now Check Your Email

To complete Subscribe, click the confirmation link in your inbox. If it doesn’t arrive within 3 minutes, check your spam folder.

Ok, Thanks

NVIDIA rolls out RTX upgrades for local generative AI workflows

The tech giant has announced a package of RTX software and performance updates aimed at improving speed and reducing memory use for local generative artificial intelligence tasks.

Defused News Writer profile image
by Defused News Writer
NVIDIA rolls out RTX upgrades for local generative AI workflows
Photo by Christian Wiediger / Unsplash

NVIDIA said it is introducing a set of RTX optimisations across GeForce RTX, NVIDIA RTX PRO and NVIDIA DGX Spark systems, targeting faster performance and lower memory consumption for on-device generative AI workloads.

The company said the updates include PyTorch-CUDA optimisations and native NVFP4 and NVFP8 precision support in ComfyUI, alongside RTX Video Super Resolution integration for real-time 4K video upscaling.

Additional enhancements include NVFP8 tuning for Lightricks’ LTX-2 model, a Blender-guided 4K video workflow and RTX acceleration for Nexa.ai’s Hyperlink application. Improvements to small language model inference are also being delivered through llama.cpp and Ollama, NVIDIA added.

According to the company, ComfyUI performance is up to three times faster with up to 60% lower video memory usage when using NVFP4 on RTX 50 Series graphics processors, and around twice as fast with 40% lower memory use using NVFP8. Open-weight checkpoints using NVFP4 and NVFP8 are being made available for models including LTX-2, FLUX.1, FLUX.2, Qwen-Image and Z-Image.

For search and language workloads, NVIDIA said Hyperlink enables local video content search on RTX-powered PCs, with significantly faster indexing and response times compared with central processing unit-based systems. The company also reported inference speed gains over recent months for llama.cpp and Ollama, with further updates due to appear in upcoming software releases.

Related reading

NVIDIA added that NVIDIA Broadcast 2.1 extends support for Virtual Key Light effects to a wider range of RTX graphics cards, while DGX Spark systems have received performance updates delivering faster results and new deployment playbooks.

The company said several of the updates are available immediately, with further video and application features due to roll out over the coming weeks.

The Recap

  • NVIDIA launches RTX upgrades to speed 4K AI video generation.
  • Up to 3x faster generation and 60% less VRAM usage.
  • Video workflow and RTX Video node available next month.
Defused News Writer profile image
by Defused News Writer

Read More