量子位
Search documents
量子位编辑作者招聘
量子位· 2026-02-13 13:19
Core Viewpoint - The article emphasizes the ongoing AI boom and invites individuals to join the company "Quantum Bit," which focuses on tracking AI advancements and has established itself as a leading content platform in the industry [1]. Group 1: Job Opportunities - The company is hiring for three main directions: AI Industry, AI Finance, and AI Product, with positions available for both experienced professionals and fresh graduates [2][4]. - Positions are open for various levels, including editors, lead writers, and chief editors, with a focus on matching roles to individual capabilities [6]. Group 2: Job Responsibilities - **AI Industry Direction**: Responsibilities include tracking innovations in infrastructure, such as chips, AI infrastructure, and cloud computing, as well as interpreting technical reports from conferences [6][7]. - **AI Finance Direction**: Focuses on venture capital, financial reports, and capital movements within the AI industry, requiring strong analytical skills and a passion for interviews [11]. - **AI Product Direction**: Involves monitoring AI applications and hardware developments, producing in-depth evaluations of AI products, and engaging with industry experts [11]. Group 3: Benefits and Work Environment - Employees will have the opportunity to engage with cutting-edge AI technologies, enhance their work efficiency, and build personal influence through original content creation [6]. - The company offers competitive salaries, comprehensive benefits including social insurance, meal allowances, and performance bonuses, and promotes a dynamic and open work culture [6][12]. Group 4: Company Growth - By 2025, Quantum Bit aims to have over 2.4 million subscribers on WeChat and more than 7 million users across platforms, with a daily reading volume exceeding 2 million [12].
Seedance 2.0现象级刷屏!深度测评:复杂场景贼稳,连asmr都会?
量子位· 2026-02-13 08:23
梦瑶 发自 凹非寺 量子位 | 公众号 QbitAI 太热闹了!整个一个现象级show time~ 前脚字节刚上线 Seedance 2.0 ,后脚直接全网现象级刷屏!!! 这边 马斯克 怒赞,那边 美国导演 直呼好莱坞要完蛋了。 甚至急得不少老外狂催更:啥时候能开放全球使用?咋注册中国账号?在线等!挺急的! 如此之火爆,如此之amzing,那咱高低也得上手搓一把试试。 看我搓的这个全网超火的「猫咪大战哥斯拉」同款视频,小猫一跃,直接一个重拳出击~ 再来看这个AI版《F1狂飙飞车》,转速表飙升、刹车尖啸,太有好莱坞内味儿了嗷: 再来试试Chinese kungfu大战钢铁侠,俩人一来一回近身过招,音效刺激感拉满,太带派了! 脑洞大开的网友们更会整活儿,快看下面这位网友做的一镜到底,镜头从街头一路滑进地铁站、钻进车厢,超有梦核感: 还有这位网友,只是上传了一张漫画截图,Seedance 2.0直接给他整出了一整段剧情视频,别太amazing啊我说!! 老实说,Seedance 2.0确实对镜头语言理解更到位也更可控了,参考能力直接next level,很适合咱日常做多镜头和精细化控制使用~ 老规矩不废话,咱直接 ...
我把Agent拉进群聊,它竟然开始带队干活?全球首个AI社交通用平台来了!
量子位· 2026-02-13 08:23
Core Viewpoint - The article discusses the launch of "Teamily AI," a groundbreaking AI-native instant messaging application that integrates AI agents into social interactions, allowing them to participate in group discussions and decision-making processes, thus transforming the role of AI from a personal assistant to a collaborative entity within social networks [1][5][8]. Group 1: AI Integration in Social Interactions - Teamily AI allows users to invite AI agents into their social circles, enabling real-time responses to group needs and facilitating collaborative tasks [1][6]. - The platform supports various content forms, including images, videos, and long texts, allowing AI agents to engage in diverse discussions and tasks [6][10]. - AI agents can understand and respond to casual prompts in group chats, making interactions feel natural and seamless [15][19]. Group 2: AI in Work Environments - In professional settings, Teamily AI can assist in brainstorming sessions, project planning, and data analysis, significantly reducing communication overhead and improving collaboration efficiency [22][24]. - The AI agent can quickly analyze lengthy reports and provide structured insights, enhancing decision-making processes within teams [23][30]. - The platform's ability to manage multiple roles and tasks allows for a more organized workflow, addressing the complexities of team dynamics [42][44]. Group 3: Technical Framework and User Experience - Teamily AI eliminates the need for local deployment, allowing users to create personalized AI agents easily without technical barriers [31][33]. - The platform emphasizes privacy and user control, enabling individuals to manage permissions and data access for their AI agents [36][38]. - The underlying technology consists of a three-layer architecture that facilitates context management, user intent understanding, and task coordination among AI agents and human members [44]. Group 4: Future Implications - The integration of AI agents into social and work environments signifies a shift in human collaboration, where AI becomes an active participant rather than a passive tool [55][56]. - The potential for each individual to have a dedicated AI agent team suggests a future where AI is deeply embedded in daily social and professional interactions [55].
对话原力灵机周而进:模型2.4B就够用,关键是“具身原生”;能闭环才是最高效方法
量子位· 2026-02-13 05:42
Core Viewpoint - The company has introduced a lightweight embodiment model DM0 with 2.4 billion parameters, claiming it is sufficient for real-time processing and capable of continuous evolution through reinforcement learning [1][5][4]. Group 1: Model Specifications - DM0 is designed to handle three perspectives of 728x728 images with a reasoning delay of only 60 milliseconds [4]. - The model is considered the first "embodiment native large model" due to its unique training approach from scratch, differing from industry norms [7][18]. - The model's training process consists of three phases: VLM Train, VLA Pre-Train, and VLA Post-Train, focusing on multi-source and multi-task training [26][29][30]. Group 2: Technical Framework - Alongside DM0, the company released an open-source framework Dexbotic 2.0 and a production workflow DFOL, aimed at enhancing embodied applications [8][97]. - Dexbotic 2.0 is designed to unify embodied operations and navigation, allowing for modular architecture [98][100]. - DFOL aims to bridge the gap between traditional automation and human-like flexibility, focusing on efficiency and adaptability [101]. Group 3: Data Collection and Training Philosophy - The company emphasizes a "from zero" training approach, arguing that early exposure to physical world interactions is crucial for model understanding [40][42]. - Data collection is comprehensive, involving internet data, intelligent driving data, and embodied data, with a focus on high-resolution inputs for precise actions [62][64][66]. - The data collection strategy is dynamic, adjusting based on experimental results to ensure effective model training [68][70]. Group 4: Application and Market Strategy - The company is initially focusing on logistics as a practical application for embodied intelligence, aiming to refine capabilities in a controlled environment [125][146]. - The logistics scenario is chosen for its scalability and replicability, allowing for rapid data feedback loops to enhance model performance [149][150]. - Future plans include expanding from logistics to more complex environments, ultimately targeting consumer applications [155][156]. Group 5: Long-term Vision - The ultimate goal is to develop robots with broad social identities, capable of independent transactions and interactions in various environments [168][171]. - The company believes that achieving this vision requires a phased approach, ensuring reliability in hardware and model capabilities before expanding to more complex tasks [169][172].
姚顺宇谷歌首秀,Gemini新模型刷爆SOTA:人类仅剩7人捍卫碳基编程
量子位· 2026-02-13 05:42
Core Insights - Google has significantly upgraded its AI model, Gemini 3 Deep Think, in response to competition from Claude Opus 4.6 and GPT Codex 5.3 [1] Performance Metrics - Gemini 3 Deep Think achieved an unprecedented score of 84.6% on the ARC-AGI-2 benchmark, surpassing previous models that scored between 60%-70% [3][26] - In the Humanity's Last Exam (HLE), it scored 48.4%, setting a new state-of-the-art (SOTA) [4][22] - The model also scored 3455 Elo points on Codeforces, ranking it as the 8th in the world [2] - In the International Math Olympiad 2025, it reached gold medal level with a score of 81.5% [5][33] Cost Efficiency - The upgrade has reduced the reasoning cost by 82%, from $77.16 to $13.62 per task [29] Applications and Capabilities - Gemini 3 Deep Think can analyze sketches, model complex shapes, and generate files for 3D printing [8] - It successfully identified a subtle logical flaw in a complex mathematical paper that was missed during human peer review [10][11] - The model optimized a method for growing complex crystals, achieving a thickness greater than 100 microns, which was previously difficult [14] Research and Development Team - The development team includes notable Chinese scientists, such as Yi Tay and Shunyu Yao, who have significant backgrounds in AI and physics [36][41] - Yi Tay has previously worked on early large language models and returned to Google DeepMind after a stint in a startup [38] - Shunyu Yao has a strong academic background, having published in top journals and worked on advanced topics in quantum physics [41][42]
一键搞定百万行Excel和PPT排版!这家杭州电力AI初创要给打工人减负
量子位· 2026-02-13 02:52
Core Viewpoint - The article discusses the limitations of cloud-based AI solutions and introduces XMO-AgentBox, a local AI agent platform designed to enhance productivity while ensuring data security and system integration [1][5][28]. Group 1: Challenges with Current AI Solutions - Users often face difficulties with cloud-based AI tools, especially when dealing with large local databases, leading to frustration [2][3]. - There is a need for an AI partner that operates locally, capable of handling complex systems without the limitations of cloud execution [4][11]. Group 2: Introduction of XMO-AgentBox - XMO-AgentBox is presented as a local intelligent agent workspace that integrates advanced AI capabilities while maintaining data sovereignty [5][12]. - The platform is developed by Ximo Decision, a company with expertise in power energy and large-scale mathematical optimization [7][8]. Group 3: Capabilities of XMO-AgentBox - The platform supports various professional software and can handle large datasets, automate document generation, and optimize decision-making processes [29][30]. - It features the latest DeepSeek model, allowing for comprehensive understanding and context awareness of entire business systems [14][12]. Group 4: Practical Applications - XMO-AgentBox can generate high-quality presentations, write professional documents, and create engaging content, thus alleviating the burden of repetitive tasks [20][21][25]. - It enables professionals to focus on strategic design and optimization rather than mundane information handling [22][32]. Group 5: Conclusion and Future Potential - The article emphasizes the potential of localized AI solutions to enhance productivity, security, and reliability for professionals facing data management challenges [36]. - Users are encouraged to experience the platform firsthand, highlighting its capabilities in generating personalized content [37][38].
1美金时薪雇个全栈替身,MiniMax M2.5让打工人也能体验当老板的感觉
量子位· 2026-02-13 02:52
Core Insights - The article discusses the launch of MiniMax's new model M2.5, which excels in full-stack coding and intelligent agent capabilities, positioning itself alongside Claude Opus 4.6 in performance [2][7][21] - M2.5 is designed to handle both front-end and back-end development, offering a comprehensive solution for coding tasks and data management at a low cost [6][10][61] - The model's rapid processing speed and cost-effectiveness signal a significant advancement in AI applications, indicating an impending explosion of AI utility in various sectors [59][61][64] Group 1: Model Capabilities - M2.5 supports multiple platforms including PC, mobile apps, React Native, and Flutter, making it a versatile tool for developers [3][4] - It can generate complete, functional code for complex projects, such as an e-commerce website with advanced features [12][14] - The model has achieved an impressive 80.2% score on the SWE-Bench Verified leaderboard and ranks first in multi-language tasks [8][10] Group 2: Performance Metrics - M2.5 operates at a speed of 100 transactions per second (TPS), which is double that of mainstream flagship models, enhancing its efficiency in data processing and bug fixing [21][61] - The model's activation parameter count is only 10 billion, making it the smallest flagship model in its class while still delivering top-tier performance [20][64] Group 3: Practical Applications - M2.5 has been successfully integrated into real-world scenarios, such as automating financial report generation and data organization tasks [27][32] - The model's ability to analyze data and provide business insights, such as identifying cost-saving opportunities, showcases its advanced analytical capabilities [38][40] Group 4: Industry Implications - The rapid advancements in the M2 series indicate a shift towards more capable AI models that can independently manage complex tasks, reducing the need for constant developer oversight [59][66] - M2.5's introduction is seen as a catalyst for broader AI adoption across industries, with the potential to transform workflows and productivity [59][64]
小米的首代机器人VLA大模型来了!丝滑赛德芙,推理延迟仅80ms丨全面开源
量子位· 2026-02-12 12:42
Core Insights - The article discusses the rising prominence of embodied intelligence and robotics, highlighting the increasing interest from both large and small companies, as well as capital investment and media coverage [2][3] - There is a growing expectation for embodied robots to transition from being merely demonstrative to becoming practical tools that enhance productivity in real-world applications [3][4] - Xiaomi's new embodied VLA model, Xiaomi-Robotics-0, addresses critical issues such as the frequent pauses and slow corrections in robotic execution, aiming for greater autonomy and efficiency [7][8] Group 1: Industry Trends - The embodied robotics sector is at a pivotal point, characterized by impressive demonstrations of capabilities while also facing scrutiny regarding their actual value in industrial settings [3][4] - The industry is experiencing a paradigm shift where the focus is on the autonomy of robots, moving beyond human-assisted operations to fully autonomous systems [4][6] Group 2: Xiaomi-Robotics-0 Innovations - Xiaomi-Robotics-0 features three core technological innovations: architecture design, pre-training strategies, and post-training mechanisms, all aimed at enabling robots to understand complex environments and execute actions continuously and accurately [12][13] - The model employs a dual-brain architecture, separating the "brain" for decision-making and the "small brain" for generating continuous action blocks, which enhances the smoothness and precision of robotic movements [16][21] - A two-phase pre-training approach is utilized to maintain the model's visual understanding while training it to perform actions, ensuring that the robot can interpret complex instructions and plan continuous movements [24][30] Group 3: Performance Metrics - Xiaomi-Robotics-0 has achieved outstanding results in various benchmarks, surpassing approximately 30 existing models in environments like LIBERO, CALVIN, and SimplerEnv [44][45] - The model demonstrated a 100% success rate in the Libero-Object task and maintained high throughput in real-world tasks such as towel folding and LEGO disassembly, showcasing its practical capabilities [47][54][57] - The model's performance indicates that it does not sacrifice understanding capabilities for control abilities, maintaining high scores across multiple evaluation metrics [49][58] Group 4: Strategic Direction - Xiaomi's approach in the embodied intelligence field appears to focus on practical applications rather than merely showcasing advanced technology, aiming to address real-world industrial challenges [61][65] - The company has recently open-sourced its models, including TacRefineNet, which enhances fine-grained control without relying on visual input, indicating a commitment to transparency and collaboration within the industry [74][76] - This open-source strategy lowers barriers for smaller developers, allowing them to build upon Xiaomi's foundational work and contribute to the development of specialized applications in robotics [78][79]
马斯克回应xAI联创离职潮:这是组织的进化
量子位· 2026-02-12 11:00
Core Viewpoint - The recent departures at xAI, including two co-founders, have raised questions about whether this is a normal turnover or indicative of deeper issues within the company. Elon Musk has stated that the restructuring is aimed at improving execution efficiency and is not merely a layoff [4][9][10]. Group 1: Company Restructuring - Musk emphasized that the departures are part of a necessary organizational restructuring to enhance efficiency at the current scale of the company [9][10]. - The company is still actively recruiting, indicating a focus on future growth despite the recent changes [5][12]. - The restructuring is described as a response to the company's rapid growth, suggesting that some individuals may be better suited for earlier stages of a startup rather than a more mature organization [11][14]. Group 2: Employee Departures - The recent wave of departures includes co-founders Tony Wu and Jimmy Ba, who publicly expressed intentions to pursue new opportunities, suggesting a voluntary exit rather than a forced departure [15][17]. - The narrative from departing employees contrasts with Musk's framing of the situation, leading to speculation about the true nature of these departures [20][21]. - A significant number of core team members have left xAI since February, indicating a potential trend of increasing turnover within the organization [27][38]. Group 3: External Factors - xAI was recently acquired by SpaceX in an all-stock deal, which is expected to influence the company's operational focus and priorities moving forward [38]. - The company is facing external regulatory scrutiny, particularly related to its AI products, which may contribute to internal pressures and the recent departures [40][42]. - The competitive landscape in AI, with major players like OpenAI and Google ramping up efforts, raises concerns about the potential impact of talent turnover on xAI's future capabilities [43][44].
这个春节P图不求人!小红书开源图像编辑新SOTA
量子位· 2026-02-12 11:00
Core Viewpoint - The article highlights the launch of Xiaohongshu's foundational model FireRed-Image-Edit, which demonstrates exceptional capabilities in AI image generation and editing, achieving state-of-the-art (SOTA) performance in various benchmarks [2][3]. Group 1: Performance and Evaluation - FireRed-Image-Edit excels in handling complex editing instructions, style transfers, and high-precision text editing, showcasing superior understanding and efficiency compared to competitors [3][4]. - The model's performance is validated through a newly introduced evaluation framework called RedEdit Bench, which includes 15 sub-tasks covering real-world editing scenarios such as portrait beautification and low-quality enhancement [9][10]. - The RedEdit Bench will be open-sourced to establish a new standard for evaluating image editing models in the open-source community [11]. Group 2: Technical Foundation - The model's architecture is supported by a robust data engine and a three-phase training process, which includes pre-training, fine-tuning, and reinforcement learning stages to enhance its capabilities [13][16]. - The data engine efficiently generates training data by breaking down complex editing tasks into manageable sub-tasks, ensuring high-quality data through a rigorous cleaning process [14]. Group 3: Core Capabilities - FireRed-Image-Edit features advanced instruction adherence, allowing it to understand the semantic relationship between commands and images rather than relying on rote memorization [20]. - The model introduces a Layout-Aware OCR-based Reward system during the reinforcement learning phase, improving text editing accuracy by penalizing errors in character placement and layout [26][27]. - It supports creative scene generation and multi-reference image generation, enabling style transfer and image fusion capabilities [33]. Group 4: Future Developments - Xiaohongshu plans to further enhance the foundational model's capabilities in portrait beautification, consistency, and text editing, with ongoing updates and open-source releases in the coming months [49].