腾讯研究院AI速递 20250618

Group 1 - DeepSeek-R1 ranks 6th overall in LMArena and 1st among open-source models, with a 2nd place in programming tests [1] - MiniMax-M1 is a cost-effective reasoning model trained for 3 weeks at a cost of 3.8 million, achieving 4 times the generation efficiency of DeepSeek-R1 [2] - Kimi-Dev, an open-source code model with 72 billion parameters, achieved a 60.4% score in SWE-bench Verified, marking a new state-of-the-art in open-source [3] Group 2 - Alibaba has released 32 Qwen3 MLX quantization models, each available in four precision versions: 4bit, 6bit, 8bit, and BF16 [4][5] - Tencent's Yuanbao desktop version introduces an AI programming mode using DeepSeek V3, allowing users to write code with a single command [6] - Panasonic's OmniFlow multimodal model supports various transformations between text, image, and audio, enhancing training efficiency through modular design [7] Group 3 - A 13-year-old CEO, Michael Goldstein, founded FloweAI, which offers a general AI agent capable of performing various tasks like PPT creation and flight booking [8] - The "Meteor One" chip developed by the Shanghai Institute of Optics and Fine Mechanics achieves over 100 parallel optical computations, with a theoretical peak performance of 2560 TOPS [10] - Django's creator warns of three critical threats posed by AI agents, emphasizing the risks of accessing private data and exposure to untrusted content [11] Group 4 - Anthropic reveals details about Claude's deep research functionality, which utilizes a multi-agent architecture that outperforms single-agent systems by 90.2% but incurs 15 times the token consumption [12]