Artificial Intelligence

GPT-5 Surpasses Human Doctors! Reasoning 24% Better Than Experts, Understanding 29% Better
量子位· 2025-08-15 06:44
Core Insights
- GPT-5 outperforms human experts in medical imaging reasoning and understanding, with accuracy exceeding the human level by 24.23% and 29.40% respectively [2][5][16].

Group 1: Model Comparison
- A study from Emory University compared GPT-5 with its predecessor GPT-4o and with the smaller variants GPT-5-mini and GPT-5-nano, focusing on their ability to handle multimodal information in the medical field [3][5].
- GPT-5 outperformed all other models in standardized tests, particularly in the MedXpertQA multimodal test, showing improvements of nearly 30% in reasoning and 36% in understanding over GPT-4o [5][13].
- In the MedXpertQA Text and MM tests, GPT-5 scored 56.96 and 69.99 respectively, significantly higher than human experts and other models [15][17].

Group 2: Testing Methodology
- The tests included the USMLE exam, MedXpertQA, and VQA-RAD, all conducted in a zero-shot setting without fine-tuning [7][10].
- On the USMLE exam, a critical benchmark for medical education, GPT-5 was better than GPT-4o across the board [8][10].
- MedXpertQA consists of 4460 questions across 17 medical specialties, with a multimodal subset that includes diverse images and clinical information [11][12].

Group 3: Technical Advancements
- The core advancement of GPT-5 lies in its end-to-end multimodal architecture, which enhances cross-modal attention and alignment [18][19].
- Unlike GPT-4o, which relied on indirect methods for cross-modal tasks, GPT-5 integrates text, images, and audio into a unified vector space, facilitating seamless perception, reasoning, and decision-making [19].
- Chain-of-thought prompting combined with GPT-5's enhanced internal reasoning significantly boosts its performance on reasoning-intensive tasks [19].

Group 4: Real-World Application
- Despite its impressive performance in standardized tests, GPT-5 still requires further real-world testing to validate its effectiveness in clinical settings [20][22].
- A recent "ultimate exam" in radiology revealed that all AI models, including GPT-5, scored lower than intern doctors, indicating a gap between AI capabilities and human expertise [20][22].
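A National-Level AI Innovation and Application Competition Heats Up! A Prize Pool of Over 2 Million Yuan Plus Tracks for Every Scenario: Last Call for Teams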
量子位· 2025-08-15 06:44
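西风 (Xifeng), reporting from 凹非寺 (Aofeisi). 量子位 | WeChat official account QbitAI

A total prize pool of more than 2 million yuan, plus employment and residency channels, startup support, partnership matchmaking, and project incubation: multiple incentives are on offer across the board. As what is currently the largest AI-focused competition in China, with the most diverse range of participants, the second "兴智杯" (Xingzhi Cup) National Artificial Intelligence Innovation and Application Competition is recruiting its final wave of teams.

To lay the cards on the table first: the value of the Xingzhi Cup was already proven by its first edition. Co-hosted by the Ministry of Industry and Information Technology, the Ministry of Science and Technology, and others, it attracted more than 16,000 contestants and more than 9,000 teams, focused on industry trends and hot technology applications, and drew close attention from industry, academia, and research. Its standing and influence speak for themselves.

The second edition continues that authoritative lineage. It is co-hosted by the China Academy of Information and Communications Technology and relevant local government departments, is positioned as "promoting application through competition, promoting industry through competition," and is open to the whole of society. AI-related enterprises and institutions, university teams, and individual developers in China and abroad may all take part.

This year's theme is "兴智赋能,创造无界" ("Empowering through intelligence, creating without boundaries"). The competition comprises three main themed tracks plus additional specialty tracks. Whether your team has original capability in models, agents, or multimodality, excels at engineering optimization for performance, efficiency, and stability on domestic software and hardware stacks, or has real data and solutions in typical scenarios such as finance, energy, healthcare, and network communications, you can find a "just right" track and evaluation benchmark this year.

The detailed competition rules are here; come take a look. From large ...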
SoundHound AI: The Next Leg Of Growth Is Just Beginning
Seeking Alpha· 2025-08-15 06:19
SoundHound AI (NASDAQ: SOUN) is one of the few AI companies already operating in real markets with a real product and growing revenue, and it has a fairly clear path to large-scale commercialization. This is my opinion only, but ... I'm an independent equity trader and licensed financial advisor focused on uncovering high-upside opportunities in overlooked sectors, especially small-caps, energy, commodities, and special situations. My investment strategy is rooted in the CAN SLIM framework but goes fur ...
Edge AI Industry In-Depth Report: Edge AI, a New Engine for the Intelligent Connection of Everything
NORTHEAST SECURITIES· 2025-08-15 06:16
Investment Rating
- The report rates the industry as "Outperform" [1][9].

Core Insights
- Edge AI is reshaping the traditional cloud computing landscape, moving from a cloud-centric model to a hybrid architecture in which cloud, edge, and terminal collaborate [3][23].
- The edge AI industry is growing exponentially, with a market size projected to exceed 1.9 trillion yuan by 2028, a compound annual growth rate (CAGR) of 58% from 2023 to 2028 [36][40].
- A complete edge AI industry chain has formed, with significant contributions from chip manufacturers, algorithm optimizers, and application developers [4][26].

Summary by Sections

1. Edge AI as a New Engine for IoT
- Edge AI is becoming a critical component in the evolution of intelligent terminal devices, enabling real-time data processing and decision-making [3][5].
- The market for edge AI is expanding rapidly, with 22.8 billion consumer devices expected by 2023, including smartphones (29.8%), smart home devices (26.3%), and PCs/PADs (17.6%) [4][36].

2. Growth Drivers for Edge AI
- Hardware performance breakthroughs are anticipated, with flagship smartphones expected to reach 100 TOPS of NPU computing power by 2025 [40].
- The penetration rate of AI smartphones is projected to reach 38% by 2025, with significant growth in industrial and smart city applications [40].

3. Edge AI Industry Chain
- The edge AI industry encompasses a complete ecosystem, from hardware components such as AI chips and sensors to software solutions for diverse applications [26].
- Key players include Guanghe Tong, Lexin Technology, and Rockchip, which are positioned to benefit from the growth of edge AI [6][5].

4. Applications of Edge AI
- Edge AI is transforming various sectors, including consumer electronics, automotive, industrial applications, and smart home devices [5][32].
- In the automotive sector, edge AI supports autonomous driving systems and enhances user experience through intelligent cockpit features [29][30].

5. Financial Data of Key Companies
- Guanghe Tong: current price 28.06, 2025 EPS forecast 0.89, 2025E PE ratio 31.46, rated "Buy" [6].
- Lexin Technology: current price 161.26, 2025 EPS forecast 4.13, 2025E PE ratio 39.05, rated "Hold" [6].
- Rockchip: current price 177.88, 2025 EPS forecast 2.37, 2025E PE ratio 75.02, rated "Buy" [6].
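The growth and valuation figures quoted in this summary can be sanity-checked with a few lines of arithmetic. The sketch below is illustrative only: the function names are hypothetical, the 1.9 trillion yuan figure is treated as an exact 2028 value although the report says "exceed", and the PE ratio is assumed to be simply price divided by forecast EPS.

```python
def implied_base(final_value: float, cagr: float, years: int) -> float:
    """Starting value implied by a final value and a compound annual growth rate."""
    return final_value / (1.0 + cagr) ** years


def price_to_earnings(price: float, eps: float) -> float:
    """Forward PE, assumed here to be price divided by forecast EPS."""
    return price / eps


# Market size: 1.9 trillion yuan in 2028 at a 58% CAGR over 2023-2028 (5 years).
base_2023 = implied_base(1.9, 0.58, 5)  # in trillion yuan

# (price, 2025 EPS forecast) pairs as quoted in the summary.
quotes = {
    "Guanghe Tong": (28.06, 0.89),
    "Lexin Technology": (161.26, 4.13),
    "Rockchip": (177.88, 2.37),
}
pe_ratios = {name: price_to_earnings(p, e) for name, (p, e) in quotes.items()}

if __name__ == "__main__":
    print(f"Implied 2023 market size: {base_2023:.3f} trillion yuan")
    for name, pe in pe_ratios.items():
        print(f"{name}: recomputed PE = {pe:.2f}")
```

Under these assumptions the implied 2023 base is a little under 0.2 trillion yuan, and the recomputed PE ratios come out at roughly 31.5, 39.0, and 75.1. These are close to, but not identical with, the quoted 31.46, 39.05, and 75.02, which would be consistent with the report using unrounded EPS forecasts.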
GPT-5, Grok 4, and o3 Pro All Score Zero: A New Contender for the Hardest AI Benchmark Ever
机器之心· 2025-08-15 04:17
Core Viewpoint
- The recent performance of leading AI models on the FormulaOne benchmark indicates that they struggle significantly with complex reasoning tasks, raising questions about their ability to solve advanced scientific problems [2][10][12].

Group 1: AI Model Performance
- Google's and OpenAI's models achieved gold-medal-level results at the International Mathematical Olympiad (IMO), suggesting potential for high-level reasoning [2].
- The FormulaOne benchmark, developed by AAI, resulted in zero scores for several advanced models, including GPT-5 and Gemini 2.5 Pro, highlighting their limitations on complex dynamic programming problems over graph structures [2][3].
- Overall success rates on the benchmark were notably low: GPT-5 achieved only 3.33% overall, and all models scored 0% in the deepest difficulty category [3][10][12].

Group 2: Benchmark Structure
- The FormulaOne benchmark consists of 220 novel dynamic programming problems over graph structures, categorized into three levels: shallow, deeper, and deepest [3][4].
- The shallow category includes 100 easier problems, the deeper category contains 100 challenging problems, and the deepest category has 20 highly challenging problems [4].

Group 3: AAI Company Overview
- AAI, founded by Amnon Shashua in August 2023, focuses on advancing Artificial Expert Intelligence (AEI), which combines domain knowledge with rigorous scientific reasoning [14][18].
- The company aims to overcome traditional AI limitations by enabling AI to solve complex scientific or engineering problems as top human experts do [19].
- Within its first year, AAI attracted significant investment and was selected for the AWS 2024 Generative AI Accelerator program, receiving $1 million in computing resources [19].
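The three-tier structure described above explains how a model can score 0% on the hardest tier while still posting a nonzero overall rate. The following sketch shows the aggregation under stated assumptions: the tier sizes (100, 100, 20) come from the summary, but the solved counts are hypothetical placeholders chosen for illustration, not results from the benchmark, and the `Tier` class and function names are invented for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Tier:
    name: str
    total: int   # number of problems in the tier (from the summary)
    solved: int  # HYPOTHETICAL solved count, for illustration only


def success_rate(solved: int, total: int) -> float:
    """Percentage of problems solved."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * solved / total


def overall_rate(tiers: list[Tier]) -> float:
    """Pooled success rate across all tiers (weighted by tier size)."""
    return success_rate(sum(t.solved for t in tiers), sum(t.total for t in tiers))


tiers = [
    Tier("shallow", 100, 50),  # hypothetical
    Tier("deeper", 100, 5),    # hypothetical
    Tier("deepest", 20, 0),    # every model scored 0% here per the summary
]
overall = overall_rate(tiers)

if __name__ == "__main__":
    for t in tiers:
        print(f"{t.name}: {success_rate(t.solved, t.total):.1f}%")
    print(f"overall: {overall:.1f}%")
```

Because the deepest tier holds only 20 of the 220 problems, a pooled figure is dominated by the two larger tiers, which is why per-tier rates are the more informative number when reading results like these.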
Accused of Distilling DeepSeek and Faking Results! "Europe's OpenAI" Takes a Fall
猿大侠· 2025-08-15 04:11
Core Viewpoint
- Mistral, a prominent player in the open-source AI sector, is accused of distilling its latest model from DeepSeek and of misleading the public about the model's performance and testing results [3][22][24].

Group 1: Allegations and Evidence
- A former Mistral employee alleged in a mass email that the company's latest model may have been distilled directly from DeepSeek and then misrepresented as a successful reinforcement learning case [2][3].
- Analysis by Twitter user Sam Paech indicated a surprising similarity between Mistral-small-3.2 and DeepSeek-v3, suggesting that the resemblance is more likely the result of distillation than coincidence [7][14].
- The analysis identified overused words and n-grams in the models' outputs and built a similarity map from them; Mistral-small-3.2 and DeepSeek-v3 sat close together on that map, indicating highly similar outputs [16][18].

Group 2: Company Background and Market Position
- Mistral, founded in 2023 and based in Paris, is often called the European version of OpenAI; it was co-founded by former Google DeepMind and Meta employees [24].
- The company has drawn significant attention, with a valuation reaching $10 billion and plans for a new $1 billion funding round, following a previous round that raised €600 million (approximately $645 million) [25].
- Mistral has maintained an open-source approach, releasing models such as Mistral Small and Mistral Code, and has developed a chatbot named Le Chat to compete with ChatGPT [27][28].
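The n-gram comparison described above can be illustrated with a small sketch. This is not the analyst's actual pipeline, which the summary does not specify: it is a simplified stand-in that builds raw n-gram count profiles from each model's outputs and compares them with cosine similarity. The sample texts, the tokenizer regex, and the function names are all hypothetical.

```python
import math
import re
from collections import Counter


def ngram_profile(texts: list[str], n: int = 2) -> Counter:
    """Count word n-grams across a collection of model outputs."""
    counts: Counter = Counter()
    for text in texts:
        tokens = re.findall(r"[a-z']+", text.lower())
        # zip over n shifted copies of the token list yields consecutive n-grams
        counts.update(zip(*(tokens[i:] for i in range(n))))
    return counts


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical toy outputs standing in for sampled generations from three models.
outputs_a = ["a tapestry of whispers and shadows", "the tapestry of whispers grew"]
outputs_b = ["a tapestry of whispers in the dark"]
outputs_c = ["quarterly revenue rose five percent"]

profile_a = ngram_profile(outputs_a)
profile_b = ngram_profile(outputs_b)
profile_c = ngram_profile(outputs_c)

sim_ab = cosine_similarity(profile_a, profile_b)
sim_ac = cosine_similarity(profile_a, profile_c)

if __name__ == "__main__":
    print(f"A vs B: {sim_ab:.3f}")
    print(f"A vs C: {sim_ac:.3f}")
```

A pairwise similarity matrix built this way could then feed a clustering or tree layout to produce a "similarity map". A real analysis would presumably also normalize against a human-text baseline so that only over-represented phrases count, and high similarity on its own would be suggestive rather than proof of distillation.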
I Interviewed a Young Woman From ByteDance Earning 75k and Wanted to Give Her an Offer on the Spot
猿大侠· 2025-08-15 04:11
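Among the positions DeepSeek has posted, most have a starting salary above 30,000 yuan, and the highest annual salary reaches 1.54 million yuan. Data from Liepin show that salaries for people with DeepSeek's core skills, such as deep reinforcement learning and multimodal fusion, have risen by more than 120% year over year.

DeepSeek is not only a technology disruptor; it has also set off a global "high-salary revolution" and a "storm of career opportunity." Technical staff are eager to switch careers or jobs into algorithm roles, which offer bright prospects and high pay. (Deep learning and algorithm engineers are clearly the best paid of all technical roles; for average salaries of other technical roles, see the figure below.)

To retain and attract talent, other companies have raised pay as well, with some positions paying as much as 70% more than in previous years. ByteDance hires new graduates at an annual salary of 735,000 yuan, and Alibaba DAMO Academy offers annual salaries above 2 million yuan.

According to China Fund News, a recruitment platform shows that Hangzhou DeepSeek Artificial Intelligence Basic Technology Research Co., Ltd. (that is, DeepSeek) has posted openings for multiple positions.

2025 will be a watershed for AI talent: either you reap the dividends of DeepSeek's technology, or you are ruthlessly left behind by the times!

High salaries are factual evidence that the AI field is short of people, but it is also a fact that plenty of people cannot find a job. The problem is that many people apply for algorithm positions while few can actually do the work. Job seekers' abilities simply do not match what first-tier ...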
Express | Quantum Scientists Rework AI Compression Algorithms: Multiverse Has Raised $215 Million and Built Two of the Smallest Models Ever
Z Potentials· 2025-08-15 03:53
Core Viewpoint
- Multiverse Computing has developed two of the smallest high-performance AI models, named after animal brain sizes, aimed at bringing AI capabilities to IoT devices and at running locally on smartphones and personal computers [2][3].

Company Overview
- Multiverse Computing is a European AI startup based in Donostia, Spain, founded by experts in quantum computing and AI, including Román Orús and Samuel Mugel [4].
- The company has raised approximately €189 million (around $215 million) in funding, bringing its total to about $250 million since its inception in 2019 [4].

Technology and Innovation
- The company uses a quantum-inspired compression algorithm called CompactifAI, which allows significant model size reduction without sacrificing performance [4][5].
- Multiverse has released compressed versions of popular open-source models, including Llama 4 Scout and Mistral Small 3.1, and has also compressed large models such as DeepSeek R1 Slim [4].

New Model Launch
- The two new models, SuperFly and ChickBrain, are designed for IoT applications. SuperFly is a compressed version of the SmolLM2-135M model, reduced from 135 million parameters to 94 million [6].
- ChickBrain, with 3.2 billion parameters, is a compressed version of Meta's Llama 3.1 8B model and can run offline on devices such as MacBooks [6][7].

Performance Metrics
- ChickBrain has outperformed its original model in several benchmark tests, including language and mathematical ability tests [7].
- Multiverse has not claimed that its models can surpass the most advanced large models; its focus is on maintaining performance while reducing size [10].

Market Engagement
- The company is in discussions with major device and appliance manufacturers, including Apple, Samsung, Sony, and HP, which has also invested in the company [10].
- Multiverse also offers its compression technology for other forms of machine learning, such as image recognition, and has secured clients including BASF and Bosch [11].
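The size reductions quoted for the two models can be expressed as percentages with a short calculation. This is a minimal sketch: the function name is hypothetical, and the Llama 3.1 8B base is taken at its nominal 8 billion parameters, which is an assumption since the summary gives only the model name.

```python
def reduction_pct(original_params: float, compressed_params: float) -> float:
    """Percentage of parameters removed by compression."""
    if original_params <= 0:
        raise ValueError("original_params must be positive")
    return 100.0 * (1.0 - compressed_params / original_params)


# SuperFly: 135 million -> 94 million parameters (figures from the summary).
superfly_reduction = reduction_pct(135e6, 94e6)

# ChickBrain: nominal 8 billion (assumed) -> 3.2 billion parameters.
chickbrain_reduction = reduction_pct(8e9, 3.2e9)

if __name__ == "__main__":
    print(f"SuperFly:   {superfly_reduction:.1f}% fewer parameters")
    print(f"ChickBrain: {chickbrain_reduction:.1f}% fewer parameters")
```

On these inputs SuperFly comes out at roughly a 30% parameter reduction and ChickBrain at roughly 60%, so the larger model is compressed proportionally harder.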
Malaysia-China Entrepreneurs Conference | Malaysian Entrepreneur 卢传文: The Global AI Investment Window Is Now Open
Sou Hu Cai Jing· 2025-08-15 03:37
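"Today we stand at a crossroads of dramatic change in global technology. Artificial intelligence is not only a value curve for investment; it is also shaping how enterprises redefine the entire direction of investment." Speaking on August 13 at the Business, Trade and Investment Exchange of the 15th Malaysia-China Entrepreneurs Conference, 卢传文, chief operating officer of the Selangor Information Technology and Digital Economy Corporation in Malaysia, said the global AI investment window is now open.

(Photo: 卢传文, chief operating officer of the Selangor Information Technology and Digital Economy Corporation, gives a presentation. Photo by 班浪.)

"Artificial intelligence is the new engine of the global economy, and Selangor, through AI applications, IC chip manufacturing, and the rollout of smart cities, has built the most competitive digital economy ecosystem in ASEAN. The Selangor Smart City and Digital Economy Convention will be your best gateway into the ASEAN market, and a platform where you can connect with AI technology, the IC industry, and government projects at the same time. I hope to see you all in Kuala Lumpur in 2025," 卢传文 said, once again extending a warm invitation.

Guizhou Daily Tianyan News reporters 王怡 and 牟绍莉

"Back in 2023, the Malaysian government set the goal of raising the digital economy's share of national GDP to 25% and listed AI as a core industry for priority development. Selangor covers IC chip design, the application of automation tools, and AI chip R&D, and we have brought together international semiconductor leaders and top local companies, forming a cluster of coordinated upstream and downstream industries," 卢传文 said. The Selangor Smart City and Digital Economy Convention will use a series of efficient brand events to, for invest ...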
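AI Applications Boom: 20 Companies Complete Funding Rounds of 100 Million Yuan or More as Platforms and Applications Grow and Create Together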
Sou Hu Cai Jing· 2025-08-15 03:35
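Foreword

At Baidu AI DAY, I experienced all kinds of "innovation and entrepreneurship," and I also sensed the deep "non-consensus" among these founders.

Does technology pull user needs forward, or do user needs pull technology iteration? Does the model set the ceiling for an AI product, or does the scenario? Is the better bet on consumer-facing (ToC) AI agents or on enterprise-facing (ToB) AI agents? Will the next breakthrough come in AI software or in AI hardware?

As the preface to 《从0-1》 (Zero to One) puts it: "If you find that every car around you is driving the wrong way, chances are that you are the one going the wrong way. Peter Thiel is just such a car: he not only presses ahead, fearlessly driving against traffic, but also leaves the other cars on the road confused and wondering whether they are the ones heading in the wrong direction."

01 AI Entrepreneurship Under "Non-Consensus"

You may have had a similar feeling: every day you open the news, large AI models are in the headlines. Even after you put down your phone, AI works its way into the intelligent analysis of your work reports and shows up in the personalized tutoring of your child's online classes. Hail a ride or order takeout, and AI is tucked away inside those systems too.

袁佛玉 (Yuan Foyu), vice president of Baidu Group, said bluntly at AI DAY: "Large AI models are not just the daily news headlines; they are truly changing every industry around us, familiar or not, and even everyone's work and life."

It is like a towering wave, carrying the momentum of rapid development as well as the uncertainty of exploring the unknown. AI startups must, across technical depth, iteration speed, and cost control ...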