英特尔锐炫Pro B60显卡
Search documents
4卡96GB显存暴力输出!英特尔锐炫Pro B60和长城世恒X-AIGC工作站评测
Xin Lang Cai Jing· 2026-02-10 12:41
Core Viewpoint - Intel's Arc Pro B60 graphics card is positioned as a cost-effective solution for AI inference, offering significant advantages in memory capacity and performance compared to NVIDIA's offerings, particularly in the context of large model inference. Group 1: Product Overview - Intel's Arc Pro B60 features a complete BMG-G21 GPU core with 20 Xe2 cores, 2560 FP32 units, and 24GB of GDDR6 memory, which is double the capacity of its predecessor, the Intel Arc B580 [6][59]. - The card provides 12.28 TFLOPS of FP32 performance and 197 TOPS of INT8 AI performance, with a memory bandwidth of 456GB/s [8][59]. - Compared to NVIDIA's RTX Pro 2000, the Arc Pro B60 offers 50% more memory capacity and bandwidth at a significantly lower price point, making it a competitive option for high-performance AI inference [9][46]. Group 2: Market Positioning - Intel's transition to a "full-stack AI company" is challenging NVIDIA's previous dominance in the GPU market, particularly in AI applications [1][52]. - The introduction of oneAPI allows developers to easily migrate code from NVIDIA's CUDA environment to Intel hardware, enhancing the usability of Intel's GPUs for AI tasks [4][55]. - The Arc Pro B60 is highlighted as the most cost-effective solution for building large memory pools (96GB to 192GB) necessary for running extensive AI models [9][59]. Group 3: Performance Testing - In tests with the GPT-OSS-120B model, the Arc Pro B60 demonstrated the ability to handle 100 concurrent requests successfully, indicating its robustness for real-time applications [27][50]. - The mean time to first token (TTFT) was recorded at 91.37ms, showcasing the card's strong performance in the prefill phase [31][50]. - As concurrency increased, the throughput of the Arc Pro B60 improved significantly, reaching a maximum of 701 tokens per second at high loads, which is sufficient to support up to 1000 simultaneous users [36][40]. Group 4: Competitive Analysis - When compared to NVIDIA's RTX Pro 2000, the Arc Pro B60 outperformed in both memory capacity and processing power, achieving approximately 50% better performance in multi-GPU setups [46][49]. - The Arc Pro B60's large memory capacity allows it to run larger models without the need for extreme quantization, which is a limitation for NVIDIA's offerings at similar price points [47][49]. - Intel's pricing strategy for the Arc Pro B60 positions it as a viable alternative for enterprises looking to build high-performance local LLM inference stations at a fraction of the cost of NVIDIA's equivalent products [50][51].