量子位
Search documents
量子位编辑作者招聘
量子位· 2025-11-25 09:32
Core Viewpoint - The article emphasizes the ongoing AI boom and invites individuals to join the company "Quantum Bit," which focuses on tracking AI advancements and has established itself as a leading content platform in the industry [1]. Recruitment Opportunities - The company is hiring for three main directions: AI Industry, AI Finance, and AI Product, with positions available for both experienced professionals and fresh graduates [2][4]. - All positions are full-time and based in Beijing, Zhongguancun [2]. Job Responsibilities - **AI Industry Direction**: Focuses on innovations in infrastructure, including chips, AI infrastructure, and cloud computing [5]. - **AI Finance Direction**: Involves tracking venture capital and financial reports in the AI sector, monitoring capital movements within the industry [6]. - **AI Product Direction**: Concentrates on the application and hardware developments of AI [6]. Benefits and Growth Opportunities - Employees will have the chance to engage with the latest AI technologies and tools, enhancing their work efficiency and creativity [6]. - The company offers a vibrant team environment, competitive salaries, and comprehensive benefits, including social insurance, meal allowances, and performance bonuses [6][12]. - New hires will receive mentorship from senior editors to accelerate their professional growth [6]. Company Impact and Reach - By 2025, Quantum Bit aims to have over 2.4 million subscribers on WeChat and more than 7 million users across all platforms, with a daily reading volume exceeding 2 million [12]. - The company is recognized as the top new media outlet in the AI and frontier technology sector according to third-party data platforms [12].
小米打通智驾和具身大模型,然后开源了
量子位· 2025-11-25 09:32
Core Insights - The article discusses the launch of MiMo-Embodied, the world's first unified base model for autonomous driving and embodied operations, developed by Xiaomi's Chen Long team [1][3]. Group 1: Model Overview - MiMo-Embodied is based on the MiMo-VL architecture and addresses the knowledge transfer challenges between autonomous driving and embodied operation scenarios by creating a high-quality dataset that includes general vision, embodied tasks, and driving scenes [3][10]. - The model employs a progressive four-stage training strategy that incorporates Chain of Thought (CoT) and Reinforcement Learning (RL), achieving state-of-the-art (SOTA) performance across 29 benchmarks in both autonomous driving and embodied intelligence [3][21]. Group 2: Challenges Addressed - Previous models in the embodied and autonomous driving fields lacked a unified embodied Visual Language Model (VLM), which limited their ability to interact effectively with the physical world in dynamic environments [6][9]. - The significant domain gap between indoor operations and outdoor driving has hindered the transfer of capabilities across these two areas [8][10]. Group 3: Training Strategy - The training data encompasses three dimensions: general multimodal understanding, embodied AI (including affordance prediction, planning, and spatial understanding), and autonomous driving (covering perception, prediction, and planning) [15][19]. - The four-stage training strategy includes: 1. **Stage 1**: Embodied AI Supervised Fine-tuning with general and embodied data [18]. 2. **Stage 2**: Autonomous Driving Supervised Fine-tuning, focusing on multi-view spatial reasoning and complex traffic scene analysis [20]. 3. **Stage 3**: CoT Supervised Fine-tuning, enhancing the model's ability to handle complex multi-step problems [20]. 4. **Stage 4**: RL Fine-Tuning using the GRPO algorithm to optimize accuracy and reliability [20]. Group 4: Performance Evaluation - MiMo-Embodied was evaluated through both qualitative and quantitative assessments, demonstrating competitive results against existing models in various benchmarks for embodied intelligence and autonomous driving [21][23]. - In embodied capabilities, MiMo-Embodied showed particular advantages in affordance prediction and spatial understanding compared to other models [23][24]. - The model also excelled in autonomous driving capabilities, showcasing strong performance in perception, prediction, and planning across diverse real-world driving scenarios [25][26]. Group 5: Real-World Applications - In embodied navigation tasks, MiMo-Embodied outperformed models like GPT-4o and Qwen2.5-VL in object localization and consistent performance across varied household scenarios [27]. - The model demonstrated robust affordance and spatial reasoning abilities in operational tasks [29]. - In autonomous driving, MiMo-Embodied effectively handled complex tasks such as turning at intersections and lane changes, integrating road context and vehicle state for coherent decision-making [33][36].
国产手机卖到1万6!华为新旗舰,搭载麒麟9030
量子位· 2025-11-25 09:32
Core Viewpoint - Huawei has launched its flagship Mate 80 series and the foldable Mate X7, featuring the new Kirin 9030 series chips, marking a significant advancement in both hardware and software capabilities. Group 1: Product Launch and Specifications - The Mate 80 Pro 12GB version is equipped with the new Kirin 9030 chip, while the 16GB version and Mate X7 utilize the Kirin 9030 Pro chip [3] - The starting price for the Mate 80 series is 4699 yuan, while the Mate X7 starts at 12999 yuan, going up to 15999 yuan for higher configurations [10][12] - The Mate 80 Pro Max is the first all-metal flagship phone post-5G era [5] Group 2: Software and AI Features - The Mate 80 series and Mate X7 debut with HarmonyOS 6, featuring upgraded AI capabilities [6] - The AI assistant, now called "Xiao Yi Intelligent Body," can autonomously learn app operations and collaborate with third-party applications [6][15] - The AI can assist with various tasks, such as reordering frequently purchased items and providing real-time updates on flight changes [19][25] Group 3: Imaging Capabilities - The Mate 80 series introduces the second-generation Red Maple imaging system, significantly enhancing color accuracy and dynamic range [8][51] - The main camera features a 5000-megapixel sensor with a variable aperture and optical image stabilization, while the long-focus camera offers 5.5x optical zoom [42][43] - The Mate 80 Pro Max includes advanced imaging hardware, improving processing speed, color accuracy, and noise reduction [63] Group 4: Kirin 9030 Chip Insights - The Kirin 9030 chip features a 1+4+4 core design with a CPU clocked at up to 2.75GHz and a GPU using Maleoon 935, indicating a notable performance improvement [83][84] - The return of the Kirin 9030 chip, combined with comprehensive software and hardware upgrades, enhances the flagship's appeal [84] Group 5: Additional Innovations - Huawei has introduced an AI pet called "Smart Huan Huan," reflecting its expansion into AI hardware ecosystems [86][88] - The company is set to showcase further innovations in the AI sector at the upcoming Quantum Meet 2026 conference [89]
马斯克开始用Grok替代员工了!最惨部门裁员90%
量子位· 2025-11-25 05:31
Core Viewpoint - Elon Musk is replacing employees at X (formerly Twitter) with AI technology, specifically using Grok to take over roles previously held by human engineers [1][2][9]. Group 1: Employee Reductions - Musk has laid off half of the engineering team responsible for trust and safety issues at X, reducing the team from over 100 members to fewer than 10 [2][3][4]. - The layoffs are part of a broader strategy to automate processes and reduce human labor in favor of AI solutions [22][32]. - The remaining engineering staff at X is estimated to be around 100, but further layoffs may occur [21]. Group 2: AI Integration - Musk aims to fully automate X's algorithmic recommendations, transferring responsibilities to Grok, which will match user interests through content consumption [6][23]. - The introduction of Grok as a central figure in content management signifies a shift from human oversight to AI-driven processes [24][25]. - Musk's Macrohard initiative seeks to automate software development, indicating a desire to replicate Microsoft's functions using AI [28][30]. Group 3: Leadership Changes - Musk has appointed twin engineers Dima and Ievgin Soboliev from xAI to lead the transformation at X, promoting a demanding work culture [11][20]. - The twins have a background in applied mathematics and have worked at major tech companies, bringing significant expertise to their roles [12][16]. Group 4: Risks and Challenges - The shift to AI raises concerns about accountability, particularly regarding the safety team’s ability to manage content generated by Grok [34][35]. - The ongoing layoffs and restructuring may hinder critical projects, such as the proposed payment service "X Money," which requires stable leadership and staffing to gain regulatory approval [36][38].
学生3年投稿6次被拒,于是吴恩达亲手搓了个评审Agent
量子位· 2025-11-25 05:31
Core Insights - The article discusses the development of an AI paper review system created by Andrew Ng in response to a student's repeated rejections in academic submissions, aiming to expedite the review process and provide actionable feedback [2][24]. Group 1: AI Paper Review System - The AI review system was trained on ICLR 2025 review data, achieving a correlation coefficient of 0.42 with human reviewers, which is comparable to the 0.41 correlation among human reviewers [4][14]. - The system allows users to select the conference or journal for submission, tailoring the review process to the specific style of that venue [9]. - Upon submission, the system converts the PDF to Markdown, extracts keywords, and searches arXiv for relevant research to summarize and provide a complete review with specific modification suggestions [11][12]. Group 2: Performance and Accuracy - The AI system scores papers on a scale of 1-10 across seven dimensions, including originality and the importance of the research question, with a final score calculated by a model [13][14]. - While the AI's scoring correlates well with human scores, human reviewers have a higher accuracy rate of 0.84 in predicting acceptance compared to the AI's 0.75 [14]. - The AI review system reflects the likelihood of a paper's acceptance to some extent, although it primarily references content from arXiv, which may introduce some inaccuracies [20][21]. Group 3: User Experience - Users have expressed that receiving a quick rejection from the AI is preferable to waiting months for human feedback, allowing for faster revisions and resubmissions [6][7]. - The system is currently available for researchers to try, potentially increasing their chances of acceptance [29].
荣耀500系列2699元起:人物能实况、路人能消除、照片还能自己“跳出来”
量子位· 2025-11-25 03:20
Core Viewpoint - Honor has launched the Honor 500 series, featuring two versions: Super Standard and Super Pro, with starting prices of 2699 yuan and 3599 yuan respectively [8]. Group 1: Product Features - The Honor 500 series introduces the industry's first front and rear Live portrait feature, supporting six film styles for enhanced atmosphere [2]. - A new feature called "Breaking the Frame" Live effect allows users to create a 3D-like effect by automatically recognizing subjects in photo collages [4]. - The series is equipped with an 8000mAh battery, setting a record for battery capacity in its price range, and supports 27W wired reverse charging [6][16]. Group 2: Performance and Gaming - The standard version uses the Snapdragon 8s Gen4 chip, while the Pro version features the Snapdragon 8 Supreme version, providing strong performance for its price point [9]. - The devices are optimized for high-load gaming, with the ability to run games like "Honor of Kings" and "Genshin Impact" at full frame rates without lag [11][12]. Group 3: Target Audience and Connectivity - Honor has implemented network optimizations specifically for students, including automatic network switching during power outages and campus network login without authentication [19]. - The design includes four color options and a thinner body at 7.75mm, enhancing the overall aesthetic and grip [21][23]. Group 4: Imaging Capabilities - The series maintains a 200-megapixel AI ultra-clear portrait foundation, with the Pro version adding a 50-megapixel dual-stabilized telephoto lens [25][26]. - The Live portrait feature allows for depth-of-field effects and beauty enhancements, making each frame closer to a finished product [29]. - The "Live passerby removal" function enhances image clarity by removing unwanted subjects from dynamic environments [32]. Group 5: Accessories and Launch - Alongside the Honor 500 series, several accessories were launched, including the Honor Watch X5 and Honor Earbuds S, with competitive pricing [38][40]. - The Honor 500 series is set to officially launch on November 27 [44].
Nano Banana新玩法无限套娃!“GPT-5都不会处理这种级别的递归”
量子位· 2025-11-25 03:20
Core Insights - The article discusses the innovative use of the "Nano Banana" AI tool, highlighting its recursive image generation capabilities and the excitement it has generated among users [1][8][26]. Group 1: Nano Banana and Its Features - Nano Banana has introduced a new recursive image generation feature that allows users to create complex images by layering prompts, leading to a unique "nested" visual experience [1][10]. - Users have reported impressive results, with many praising Nano Banana's understanding of specified backgrounds and perspectives in prompts [13][26]. - Despite its capabilities, the generated images often contain bugs and imperfections, particularly when users set the context to resemble old photographs, which introduces noise and low resolution [23][25]. Group 2: Market Impact and User Engagement - Following the release of Gemini 3, Gemini's market share increased from 23% to 30%, indicating a significant rise in user interest and engagement [28][29]. - The increase in market share suggests that new users may be attracted to Gemini due to its advanced features, although its user loyalty is lower compared to ChatGPT, which has an 82% loyalty rate [33][36]. - Prominent figures, such as Salesforce's CEO, have publicly expressed their preference for Gemini over ChatGPT, citing improvements in reasoning, speed, and overall performance [37].
Claude Opus 4.5发布!2小时工程测试超人类,前代Sonnet搞不定的活它轻松拿捏
量子位· 2025-11-25 01:17
Core Insights - Claude Opus 4.5 has been released, showcasing significant advancements in coding, agent capabilities, and computer usage, outperforming all human candidates in a two-hour engineering task [1][16][10] Performance Metrics - In the SWE-bench Verified coding tests, Opus 4.5 achieved a score of 80.9%, surpassing Sonnet 4.5's 77.2% and Opus 4.1's 74.5% [2][19] - The model demonstrated a 10.6% improvement in high-difficulty coding challenges compared to Sonnet 4.5 [22] - In visual reasoning, Opus 4.5 scored 80.7%, outperforming Sonnet 4.5's 77.8% [19] Enhanced Capabilities - Opus 4.5 shows improved performance in deep research, PPT creation, and spreadsheet handling, with the ability to autonomously process complex scenarios and provide solutions without human guidance [6][14] - The model can efficiently manage multiple sub-agents, supporting the construction of complex multi-agent systems [38] Developer Platform Upgrades - The Claude API has introduced an "effort parameter," allowing developers to optimize for time and cost or maximize performance, resulting in a 76% reduction in token usage while maintaining high performance [32][36] - Claude Code has launched new features, including a Plan Mode for generating precise execution plans and the ability to run multiple sessions simultaneously [41][42] Accessibility and Usage - Opus 4.5 is available through apps, APIs, and major cloud platforms, with a pricing model of $5 per million tokens for input and $2.5 for output [12] - The usage limits for Max and Team Premium users have been increased, aligning Opus token usage with previous Sonnet models [43]
奥特曼谈OpenAI首款AI硬件:我想拿起它咬一口
量子位· 2025-11-25 01:17
Core Viewpoint - OpenAI's first AI hardware product is anticipated to be released within two years, with a design that aims to evoke a strong emotional response from users, described metaphorically as something they would want to "bite" or "lick" [2][7][27]. Group 1: Product Design and Philosophy - The collaboration between Sam Altman and Jony Ive is rooted in a shared vision regarding design, intelligence, and technology's role in human life [9][10]. - The initial phase of their partnership focused on exploring metaphysical themes rather than predefined product goals, emphasizing curiosity and creative exploration [16][17]. - The design process involved extensive research and the creation of detailed books on design history, which guided the development of the new hardware's form [18]. Group 2: Product Features and User Experience - The final product is expected to be striking in quality, with a design that feels both inevitable and obvious upon first sight [20][21]. - It will feature a simple and serene interface, contrasting with the complexity of modern devices, and will be powered by a reliable AI that filters information and understands context [22][23]. - The user experience is designed to be intuitive and effortless, allowing for immediate use without overwhelming the user [24][25]. Group 3: Market Context and Future Developments - OpenAI has recently partnered with Foxconn to produce AI hardware, indicating a significant step towards the realization of their hardware ambitions [27]. - The anticipated product may present both potential benefits and challenges, but its initial appeal is expected to be strong, capturing user interest at first glance [27].
波士顿动力前CTO加盟DeepMind,Gemini要做机器人界的安卓
量子位· 2025-11-24 09:30
Core Insights - Google is positioning Gemini as a potential universal operating system for robots, akin to Android, aiming to create a system that can adapt to various physical configurations [5][10][30] - The hiring of Aaron Saunders, former CTO of Boston Dynamics, signifies a strategic move to enhance hardware capabilities in conjunction with the Gemini software [2][12][21] Group 1: Gemini's Development and Vision - The release of Gemini 3 has shifted Google's approach from a cautious exploration of robotics to a more aggressive strategy, indicating a desire to build a versatile AI system [6][31] - Google aims to create a universal robot OS that can accommodate different body configurations, which is essential for the adaptability of AI in robotics [7][10] - The Gemini Robotics series, launched earlier this year, showcases Google's commitment to enhancing robots' multimodal understanding capabilities [22][23] Group 2: Strategic Hiring and Expertise - Aaron Saunders, who has extensive experience in robotics and led the development of key robots at Boston Dynamics, will now oversee hardware engineering at DeepMind [3][13][20] - His expertise in dynamics and control systems is expected to significantly contribute to the development of Gemini as a robust robotic platform [20][21] - The combination of Gemini's software advancements and Saunders' hardware experience positions Google to make significant strides in the robotics sector [21][30]