nvidia在

首席執行官詹森黃(Jensen Huang)在NVIDIA對AI工廠的廣泛願景,旨在處理實時推理和自動決策AI的大規模計算基礎設施中構成了這些發展。

該公司還強調href=“ https://nvidianews.nvidia.com/news/oracle-and-nvidia-collaborate-to-help-enterprises-accelerate-agentic-agentic-ai-inference”> Enterprise AI夥伴關係品牌表明,推動了整合行業中AI驅動的自動化的推動。這些進步標誌著NVIDIA從AI培訓到實時AI部署的過渡。

在NVIDIA AI策略的核心是AI,這是一個超出傳統生成模型的飛躍。與傳統的AI系統基於模式識別產生響應不同,推理AI模型旨在執行多步驟的決策,提高上下文意識和自主性。

領導這項工作是NVIDIA的Llama Nemotron家族,NVIDIA的新開放推理AI模型。這些是針對需要深入上下文理解的應用程序進行了優化的href=“ https://nvidianews.nvidia.com/news/nvidia-announces-major-release-cosmos-world-found-models-models-models-models-and-physical-ai-data-tools> Systems。

Blackwell Ultra和Vera Rubin:未來的AI硬件

引入Blackwell Ultra Signals的引入AI推理硬件中有重大升級。該GPU預計將在2025年下半年發貨,提供20個PETAFLOPS的AI性能和288GB的HBM3E內存,使大型型號無需分區即可運行。 It also supports FP4 precision, allowing for more efficient processing of AI reasoning tasks.

Following Blackwell Ultra, NVIDIA confirmed the development of the Vera Rubin AI architecture, set to launch in 2026. This architecture will introduce the Vera Rubin Ultra in 2027, delivering 50 petaflops of FP4 performance—a leap in computational capacity aimed at enterprise-scale AI inference工作負載。

但是,這些芯片的增加功能引起了人們對能源效率和散熱的擔憂。 While NVIDIA claims performance improvements, previous problems suggest that chip manufacturing constraints could impact availability and scaling.

[embedded content]

DGX AI Supercomputers: Expanding AI Access

Alongside its high-end AI chips, NVIDIA introduced DGX Spark, a compact $3,000 AI supercomputer designed for researchers, developers, and初創公司。 DGX Spark由GB10 Blackwell SuperChip提供支持,旨在使AI計算機民主化。

對於企業用戶,該公司推出了DGX站,以GB300 Blackwell Ultra芯片為特色,具有20個PETAFLOPS,具有20個PETAFLOPS的性能和784GB的統一記憶。該機器是針對醫療保健,金融和自動化的高性能AI推理工作負載而定制的。 href=“ https://nvidianews.nvidia.com/news/oracle-and-nvidia-collaborate-to-help-enterprises-accelerate-accelerate-agentic-ai-inference”>與Oracle Cloud cloud 合作,使企業能夠在規模的範圍內進行AI的界定。與GE Healthcare和YUM的合作!還宣布了品牌,展示了AI對醫學診斷和自動零售業務的影響不斷增長。

AI工廠策略

隨著NVIDIA擴展AI跨工業的AI採用,其AI Factory Vision旨在擴展實時應用程序的範圍範圍。 Throughout the keynote, NVIDIA emphasized the concept of AI Factories—large-scale data centers optimized for AI reasoning and inference.

These are designed to support the next wave of AI applications, from autonomous systems to real-time industrial simulations.

Despite the excitement surrounding these advancements, NVIDIA’s stock declined by 3.4% during the presentation, reflecting investor caution over AI和Google的AI基礎架構成本,芯片可用性以及競爭。

與Blackwell Ultra和Vera Rubin一起定義了AI計算的下一階段,行業分析師將密切關注NVIDIA的可伸縮性,功率效率和市場採用。

Categories: IT Info