Multimodal

Search documents
This might be OpenAI's New Open-Source Model...
Matthew Berman· 2025-08-01 00:00
Model Capabilities & Performance - Horizon Alpha demonstrates impressive spatial awareness and problem-solving skills, accurately visualizing complex rotations [1] - The model exhibits multimodal capabilities, effectively understanding and interpreting images with speed [2] - Horizon Alpha successfully solves the Tower of Hanoi puzzle despite lacking chain-of-thought reasoning [6] - The model shows an ability to recognize its limitations, indicating when it lacks knowledge [20][21] - Horizon Alpha achieves top rankings in creative writing and emotional intelligence benchmarks [23][11] Model Characteristics & Limitations - Horizon Alpha is a fast model, outputting tokens at approximately 150 tokens per second [2] - The model lacks a "thinking mode," initially outputting the first response that comes to mind [2] - Horizon Alpha provides incorrect answers to simple logic and percentage-based questions [7][8] - The model refuses to provide instructions for illegal activities, such as hotwiring a car [8][9] - The model incorrectly identifies itself as a GPT4 class model from OpenAI, despite likely being an open-source model [9] Open Router & Box AI - Horizon Alpha is available on Open Router and free to use [1] - Box AI allows users to leverage the latest AI models, including open-source options, for document workflows with enterprise-level security [3][4]
国产大模型与AI芯片联盟,意义有多重大?
Guan Cha Zhe Wang· 2025-07-30 12:03
Core Insights - The establishment of the "Model-Chip Ecological Innovation Alliance" by ten domestic large model, AI chip, and computing acceleration companies marks a significant step towards adapting domestic AI chips from the development stage of large models, opening new avenues for collaboration in the domestic chip industry [1][3][4] - The release of the new generation multimodal reasoning large model Step 3 by Jumpspace, which boasts a remarkable adaptation capability to domestic chips, achieving inference efficiency up to 300% compared to DeepSeek-R1 on domestic chips [3][8] - The trend of increasing reliance on domestic computing power is driven by supply risks associated with NVIDIA chips, prompting more users and computing power vendors to shift towards domestic alternatives like Huawei Ascend [4][6][10] Industry Developments - The "Model-Chip Ecological Innovation Alliance" includes major players such as Huawei Ascend, Mu Xi, and others, indicating a strong collaborative effort within the industry [3][14] - Jumpspace's proactive approach in integrating model development with hardware capabilities aims to address inefficiencies in adapting models to chips, which traditionally lagged behind model iterations [10][11] - The new attention mechanism architecture, Multi-Matrix Factorization Attention (MFA), significantly reduces key-value cache usage during inference, making it more compatible with domestic chips [13] Market Dynamics - Jumpspace anticipates a revenue of 1 billion yuan for the year, showcasing its strong market position compared to competitors like Zhipu AI, which is projected to generate 200-300 million yuan in revenue but face losses of up to 2 billion yuan [22] - The rapid application of multimodal models is seen as a key growth area, with Jumpspace already collaborating with major domestic smartphone manufacturers and automotive companies to enhance user experiences [23] Regional Insights - Shanghai's dominance in the "Model-Chip Ecological Innovation Alliance" reflects its robust industrial foundation and emphasis on soft-hard integration, supported by local semiconductor manufacturing capabilities [24][25] - The city's AI industry has seen significant growth, with over 24,733 AI companies registered in 2024, marking a 5.1% increase from the previous year [24]
The State of AI Powered Search and Retrieval — Frank Liu, MongoDB (prev Voyage AI)
AI Engineer· 2025-06-27 09:57
Voyage AI & MongoDB Partnership - Voyage AI was acquired by MongoDB approximately 3-4 months ago [1] - The partnership aims to create a single data platform for embedding, re-ranking, query augmentation, and query decomposition [29][30][31] AI-Powered Search & Retrieval - AI-powered search finds related concepts beyond identical wording and understands user intent [7][8][9] - Embedding quality is a core component, with 95-99% of systems using embeddings [12] - Real-world applications include chatting with codebases, where evaluation is crucial to determine the best embedding model and LLM for the specific application [14][15] - Structured data, beyond embeddings, is often necessary for building powerful search and retrieval systems, such as filtering by state or document type in legal documents [16][17][18] - Agentic retrieval involves feedback loops where the AI search system is no longer just input-output, but can expand or decompose queries [19][20] Future Trends - The future of AI-powered search is multimodal, involving understanding images, text, and audio together [23][24][25] - Instruction tuning will allow steering vectors based on instructions, enabling more specific document retrieval [27][28]