Workflow
Cross-platform applications
icon
Search documents
Foundry Local: Cutting-Edge AI experiences on device with ONNX Runtime/Olive โ€” Emma Ning, Microsoft
AI Engineerยท 2025-06-27 10:21
Key Benefits of Local AI - Addresses limitations of cloud AI in low-bandwidth or offline environments, exemplified by conference Wi-Fi issues [2][3] - Enhances privacy and security by processing sensitive data locally, crucial for industries handling legal documents and patient information [4] - Improves cost efficiency for applications deployed on millions of devices with high inference call volumes, such as game applications [5] - Reduces real-time latency, essential for AI applications requiring immediate responses [5] Foundry Local Overview - Microsoft introduces Foundry Local, an optimized end-to-end solution for seamless on-device AI, leveraging existing assets like Azure AI Foundry and ONNX Runtime [9] - ONNX Runtime accelerates performance across various hardware platforms, with over 10 million downloads per month [8] - Foundry Local Management Service hosts and manages models on client devices and connects to Azure AI Foundry to download open-source models on demand [10] - Foundry Local CLI and SDK enable developers to explore models and integrate Foundry Local into applications [11] - Foundry Local is available on Windows and macOS, integrated into the Windows platform for simpler AI development [12] Performance and Customer Feedback - Foundry Local accelerates performance across different silicon vendors, including NVIDIA, Intel, AMD, and Qualcomm [12] - Early adopters report ease of use and performance improvements, highlighting benefits like enhanced memory management and faster token generation [13][15][16] - Foundry Local enables hybrid solutions, allowing parts of applications to run locally, addressing data sensitivity concerns [17][18]