Category: NVIDIA News

New NVIDIA RTX PRO GPUs Deliver AI Acceleration in Compact Workstations

Post author By xenya
Post date 15/09/2025

Source: NVIDIA Blackwell Architecture Powers AI Acceleration in Compact Workstations | NVIDIA Blog.

NVIDIA expands its desktop GPU family with the NVIDIA RTX PRO 4000 SFF Edition and RTX PRO 2000 Blackwell GPUs, delivering AI capabilities in compact, energy-efficient form factors for professional workflows in engineering, content creation, and 3D visualization.

Key Features:

NVIDIA RTX PRO 4000 Blackwell SFF Edition: Offers up to 2.5x higher AI performance, 1.7x faster ray tracing, and 1.5x more bandwidth compared to the previous generation, with the same 70-watt power consumption.
NVIDIA RTX PRO 2000 Blackwell: Provides 1.6x faster 3D modeling, 1.4x faster CAD performance, and 1.6x quicker rendering, with a 1.4x boost in image generation and 2.3x in text generation.

Software Ecosystem: NVIDIA AI Enterprise software suite and NVIDIA Cosmos platforms enable fast, efficient AI model deployment, while NVIDIA Omniverse supports generative AI and simulations for 3D design, facilitating digital twins and visual workflows. In addition, the Blackwell platform builds on NVIDIA’s ecosystem of powerful development tools, NVIDIA CUDA-X libraries, over 6 million developers and close to 6,000 applications to scale performance across thousands of GPUs.

Tags Blackwell, Nvidia, NVIDIA RTX PRO 2000 Blackwell, NVIDIA RTX PRO 4000 Blackwell

NVIDIA News

OpenAI and NVIDIA: New Open-Weight Models gpt-oss-20B and gpt-oss-120B Optimized for Local Use on RTX/RTX PRO

Post author By xenya
Post date 15/09/2025

Source: OpenAI’s New Models on RTX GPUs | NVIDIA Blog.

NVIDIA has optimized OpenAI’s new open-source gpt-oss-20b and gpt-oss-120b models for its GPUs, enabling fast AI inference from cloud to PC. These models support advanced agentic AI applications such as web search, in-depth research, and more, with chain-of-thought capabilities and adjustable reasoning effort levels using a mixture-of-experts architecture. The models were trained on NVIDIA H100 GPUs and support context lengths up to 131,072, suitable for tasks like coding assistance and document comprehension.

On the NVIDIA GeForce RTX 5090 GPU, they can achieve performance up to 256 tokens per second, with optimizations for RTX AI PCs and workstations using tools like Ollama, llama.cpp, and Microsoft AI Foundry Local. This emphasizes NVIDIA’s leadership in AI from training to inference and from cloud to AI PC.

Tags Nvidia, RTX GPU

NVIDIA News

Chat with Your Enterprise Data Through the Open‑Source AI‑Q NVIDIA Blueprint

Post author By Pavel Snoj
Post date 31/07/2025

Source: Chat With Your Enterprise Data Through Open-Source AI-Q NVIDIA Blueprint | NVIDIA Technical Blog.

NVIDIA has released AI‑Q, a free, open‑source NVIDIA Blueprint that simplifies building enterprise AI agents capable of conversing with internal data securely and at scale. Designed for companies drowning in unstructured information (Gartner estimates that up to 68 % goes unused), AI‑Q agents can reason across diverse data sources—such as PDFs, emails, databases, chat logs, images, and tables—and provide fast, accurate answers via semantic search, Retrieval‑Augmented Generation (RAG), and live web search integrations (e.g. Tavily).

The blueprint comprises three core components:

NVIDIA NIM microservices for high‑performance model inference,
NVIDIA NeMo Retriever microservices for fast multimodal ingestion, embedding, and reranking,
NeMo Agent toolkit – open-source library that provides framework-agnostic profiling and optimization for production AI agent systems.

In sum, AI‑Q offers enterprises a scalable, secure, and customizable reference implementation to unlock the value of their data using agentic AI.

Tags AI-Q, Nvidia

NVIDIA News

NVIDIA Accelerates European AI Infrastructure for the Next Industrial Revolution

Post author By Pavel Snoj
Post date 13/06/2025

Source: Europe Builds AI Infrastructure With NVIDIA to Fuel Region’s Next Industrial Transformation | NVIDIA Newsroom.

At GTC Paris during VivaTech, NVIDIA announced partnerships with European nations and technology leaders to build advanced AI infrastructure based on NVIDIA Blackwell systems. The initiative aims to strengthen digital sovereignty, support economic growth, and position Europe as a leader in the AI-driven industrial revolution.

Jensen Huang, NVIDIA founder and CEO, emphasized the importance of this shift:

“Every industrial revolution begins with infrastructure. AI is the essential infrastructure of our time, just as electricity and the internet once were.”

France, Italy, Spain, and the United Kingdom are developing domestic AI infrastructure in collaboration with companies such as Domyn, Mistral AI, Nebius, and Nscale, as well as major telecom providers Orange, Swisscom, Telefónica, and Telenor. Together, these deployments will deliver over 3,000 exaflops of NVIDIA Blackwell compute power, enabling European enterprises, startups, and the public sector to securely develop and deploy advanced AI applications.

NVIDIA is also expanding its network of AI technology centers in Germany, Sweden, Italy, Spain, the UK, and Finland to accelerate research, workforce development, and scientific breakthroughs. In France, Mistral AI is building a cloud platform powered by 18,000 Grace Blackwell systems; in the UK, Nebius and Nscale will deploy 14,000 Blackwell GPUs in new data centers. In Germany, NVIDIA and its partners are constructing the world’s first industrial AI cloud for European manufacturers, based on DGX B200 and RTX PRO Server systems.

These efforts represent a strategic investment in Europe’s future, as artificial intelligence becomes essential infrastructure for innovation and competitiveness on the global stage.

Tags Blackwell, HPC, Nvidia

NVIDIA News

NVIDIA GTC 2025 – Jensen Huang Unveils the Future of AI with New Chips and Robotics

Post author By Pavel Snoj
Post date 19/03/2025

At the GTC 2025 conference, NVIDIA CEO Jensen Huang unveiled key innovations set to shape the future of artificial intelligence (AI) and computing.

New Chip Announcements:

Blackwell Ultra: The next generation of graphics processing units (GPUs), scheduled for release in the second half of 2025.
Vera Rubin: Named after renowned astronomer Vera Rubin, this chip is slated for a 2026 release, offering much higher performance of the current Blackwell chip.
Vera Rubin Ultra: Planned for 2027, this chip will further enhance capabilities and energy requirements.

Advancements in Robotics:

Isaac GR00T N1: An open-source model designed for the development of humanoid robots, enabling faster and more efficient learning and adaptation to various tasks.
Cosmos AI Model: An updated model that facilitates the generation of synthetic data for robot training, reducing the costs and time associated with collecting real-world data.

Partnerships and Infrastructure:

Collaboration with General Motors (GM): NVIDIA and GM are partnering to develop systems for autonomous vehicles and integrate AI into manufacturing processes and future vehicles.
NVIDIA Dynamo: A new open-source software platform designed to optimize data center operations and improve efficiency in executing complex AI models.

Attention was given to two new personal AI supercomputers: DGX Spark and DGX Station. DGX Spark, dubbed “the world’s smallest AI supercomputer,” is powered by the Grace Blackwell chip and designed for researchers, students, and developers to build advanced AI models locally. Meanwhile, DGX Station, equipped with the Blackwell Ultra chip and 784 GB of memory, is tailored for large-scale AI tasks on a desktop, making cutting-edge technology more accessible.

These innovations underscore NVIDIA’s commitment to advancing artificial intelligence, robotics, and computing infrastructure, opening new possibilities for industries worldwide.

View more on GTC 2025 – Announcements and Live Updates | NVIDIA Blog.

Tags Nvidia

NVIDIA News

Invitation to NVIDIA GTC 2025: Explore the Future of AI

Post author By Pavel Snoj
Post date 11/02/2025

Join NVIDIA GTC 2025, the world’s premier event for artificial intelligence, high-performance computing, and innovation. Discover the latest breakthroughs, connect with industry experts, and see how cutting-edge AI is solving today’s biggest challenges.

📅 Date: March 17–21, 2025
📍 Location: San Jose, California & Online

Don’t miss exclusive keynotes, hands-on workshops, and networking opportunities with top AI professionals. Register now and shape the future of AI!

🔗 More information: NVIDIA GTC 2025.

Tags Nvidia

NVIDIA News

DeepSeek-R1 Now Live With NVIDIA NIM

Post author By Pavel Snoj
Post date 11/02/2025

Source: DeepSeek-R1 Now Live With NVIDIA NIM | NVIDIA Blog

To help developers securely experiment with DeepSeek-R1 capabilities and build their own specialized agents, the 671-billion-parameter DeepSeek-R1 model is now available as an NVIDIA NIM microservice preview on build.nvidia.com. The DeepSeek-R1 NIM microservice can deliver up to 3,872 tokens per second on a single NVIDIA HGX H200 system.

Developers can test and experiment with the application programming interface (API), which is expected to be available soon as a downloadable NIM microservice, part of the NVIDIA AI Enterprise software platform.

The DeepSeek-R1 NIM microservice simplifies deployments with support for industry-standard APIs. Enterprises can maximize security and data privacy by running the NIM microservice on their preferred accelerated computing infrastructure. Using NVIDIA AI Foundry with NVIDIA NeMo software, enterprises will also be able to create customized DeepSeek-R1 NIM microservices for specialized AI agents.

Tags Nvidia

NVIDIA News

Fast Forward to Generative AI With NVIDIA Blueprints

Post author By Pavel Snoj
Post date 31/01/2025

NVIDIA Expands AI Workflows With NVIDIA NIM™ and NVIDIA Blueprints

Source: https://blogs.nvidia.com/blog/nim-agent-blueprints/

NVIDIA offers a wide range of software, including NIM (NVIDIA Inference Microservices) and NVIDIA Blueprints, to simplify the deployment of generative AI across industries. NVIDIA NIM™ provides optimized, cloud-native inference microservices for seamless integration of AI models, while NVIDIA Blueprints offer pre-built workflows for faster development and deployment.

These solutions help businesses accelerate AI implementation, reduce infrastructure complexity, and enhance productivity. Whether in the cloud, on-premises, or hybrid environments, NVIDIA’s new AI tools provide flexibility and scalability.

Learn more about NVIDIA Blueprints: NVIDIA AI Workflows.

Tags Nvidia

NVIDIA News

NVIDIA Brings Grace Blackwell AI Supercomputing to Every Desk

Post author By Pavel Snoj
Post date 22/01/2025

Source: NVIDIA Puts Grace Blackwell on Every Desk and at Every AI Developer’s Fingertips | NVIDIA Newsroom

At CES 2025, NVIDIA introduced Project DIGITS, a personal AI supercomputer designed to provide AI researchers, data scientists, and students with desktop access to the NVIDIA Grace Blackwell platform. Central to this system is the new NVIDIA GB10 Grace Blackwell Superchip, delivering up to 1 petaflop of AI performance at FP4 precision. The GB10 integrates an NVIDIA Blackwell GPU with the latest CUDA® cores and fifth-generation Tensor Cores, connected via NVLink®-C2C to a high-performance NVIDIA Grace™ CPU comprising 20 Arm-based cores. Developed in collaboration with MediaTek, the GB10 emphasizes power efficiency and performance. Each Project DIGITS unit includes 128GB of unified memory and up to 4TB of NVMe storage, enabling the handling of AI models with up to 200 billion parameters. For larger models, two units can be linked to support up to 405 billion parameters. This setup allows users to develop and run inference on models locally and seamlessly deploy them on accelerated cloud or data center infrastructures.

Tags Nvidia, Supercomputing

NVIDIA News

The Importance of GPU Memory for AI Performance

Post author By Pavel Snoj
Post date 22/01/2025

Source: GPU Memory Essentials for AI Performance | NVIDIA Technical Blog

The NVIDIA blog highlights the critical role of GPU memory capacity in running advanced artificial intelligence (AI) models. Large AI models, such as Llama 2 with 7 billion parameters, require significant amounts of memory. For instance, processing at FP16 precision demands at least 28 GB of memory.

NVIDIA offers high-performance RTX GPUs, such as the RTX 6000 Ada Generation, featuring up to 48 GB of VRAM. These GPUs are designed to handle the largest AI models, enabling local development and execution of complex tasks. Additionally, they come equipped with specialized hardware, including Tensor Cores, which significantly accelerate computations required for AI workloads.

With NVIDIA’s powerful solutions, businesses and researchers can optimize the development and deployment of AI models directly on local devices, opening up new possibilities for advancements in artificial intelligence.

For more details, visit the official NVIDIA blog: developer.nvidia.com.

Interested in learning more about NVIDIA’s powerful solutions? Contact Xenya d.o.o., and we’ll be happy to help you find the right solution for your needs!

Tags Nvidia