Workflow
大型语言模型
icon
Search documents
喝点VC|a16z对话AI领袖:AI的“蛮力”之路能走多远?从根本上具备人性,才能真正理解人们想要什么
Z Potentials· 2025-11-22 03:21
Core Insights - The discussion highlights the rapid advancements in AI technology and its potential to create a new wave of independent entrepreneurs, transforming the software development landscape [5][30]. - There is a divergence in opinions regarding the timeline and feasibility of achieving Artificial General Intelligence (AGI), with some experts expressing optimism about imminent breakthroughs while others remain skeptical [9][19]. AI Development Status and Path to AGI - Adam D'Angelo emphasizes that there are no fundamental challenges that cannot be solved by the brightest minds in the coming years, citing significant progress in reasoning models and code generation [3][8]. - Amjad Masad compares the current AI evolution to historical revolutions, suggesting that humanity is undergoing a transformative change that may not be easily defined [4][27]. - D'Angelo believes that the next five years will see a drastically different world, contingent on resolving current limitations in AI context and usability [8][10]. Economic Transformation and Future Societal Landscape - D'Angelo predicts that the economic impact of AI could lead to GDP growth far exceeding 4-5% if AI can perform tasks at a lower cost than human labor [21]. - Masad raises concerns about the second-order effects of AI on the job market, particularly the potential for entry-level jobs to be automated while expert roles remain [22][23]. - The conversation suggests that as AI automates more tasks, the nature of work will shift, with a potential increase in demand for roles that leverage human creativity and emotional intelligence [24][25]. Technological Landscape Evolution and Entrepreneurial Ecosystem Outlook - D'Angelo expresses excitement about the increase in independent entrepreneurs enabled by AI technologies, which allow individuals to bring ideas to fruition without the need for large teams [28][30]. - The discussion touches on the balance between large-scale companies and new entrants in the market, suggesting that both can coexist and thrive in the evolving landscape [32][36]. - Masad highlights the importance of AI in programming, indicating that as these tools improve, they will democratize software development, allowing more people to create complex applications [44]. Future Challenges and Ultimate Thoughts - The conversation reflects on the cultural implications of increased reliance on AI, particularly regarding knowledge sharing and collaboration among employees [49]. - D'Angelo and Masad both acknowledge the need for ongoing research and innovation in AI to unlock its full potential and address the challenges that arise from its integration into society [41][42].
推出全新AMD Instinct MI350系列GPU优化服务器解决方案 超微电脑(SMCI.US)小幅上涨
Zhi Tong Cai Jing· 2025-11-20 16:20
周四,超微电脑(SMCI.US)股价盘初小幅上涨,盘前一度涨超5%。截至发稿,该股涨超1.6%,报34.27 美元。消息面上,该公司宣布推出最新的AMD Instinct MI350系列GPU优化服务器解决方案,进一步强 化其在高性能计算与AI基础设施领域的产品布局。 超微电脑表示,此次推出的全新系统专为需要AMD Instinct MI355X GPU高端算力、但又必须部署在空 气冷却环境中的企业而设计。相比上一代产品,新款服务器可实现最高4倍的AI训练算力提升,以及高 达35倍的推理性能飞跃,显著增强企业在大型语言模型(LLM)、生成式AI、科学计算等领域的部署能 力。 ...
速递|百人团队ARR突破2亿美元,Lovable启动新一轮融资,估值预计超60亿美元
Z Potentials· 2025-11-19 11:30
图片来源: Lovable 这家尚未盈利的公司即将完成新一轮融资,估值将超过 60 亿美元,一位知情人士透露。该公司拒绝置评。福布斯最早报道了此次融资的部分细节。 这家尚未盈利的公司即将完成新一轮融资,估值将超过 60 亿美元。 该公司成立于 2023 年,能让专业开发者和没有编程背景的用户快速从零开始构建应用或网站。它采用免费增值模式,提供从免费到每月 100 美元不等的多 个服务层级。 "我们最大的新客户来源是口碑传播,"首席执行官安东·奥西卡在接受采访时表示。Lovable 是众多所谓氛围编程初创公司之一,这些公司依托日益精密的大 型语言模型开发而成,能够辅助软件开发。 这些企业承诺实现应用程序构建的民主化—— Osika 将 Lovable 誉为"终极软件"——并已吸引大量投资者关注。 Lovable 的估值预计将从夏季的 18 亿美元增长逾三倍。其竞争对手 Cursor 近期估值达 293 亿美元。 与上市公司不同,初创企业无需披露财务指标,这使得评估这些估值变得困难。年经常性收入( ARR )是 Lovable 等软件公司用来标榜用户增长的一种方 式。 据 Osika 透露, Lovable 通过 ...
谷歌推出Gemini3,芯片ETF(512760)小幅回调,近20日净流入超4亿元
Mei Ri Jing Ji Xin Wen· 2025-11-19 06:53
消息面,Alphabet旗下谷歌(Google)发布新一代大型语言模型Gemini3,并从发布当天起将其部署至 谷歌搜索的AI模式、Gemini应用、API接口、VertexAI等核心产品。首席执行官桑达尔·皮查伊(Sundar Pichai)在公司博客中将其描述为"我们最智能的模型"。 芯片ETF(512760)跟踪的是中华半导体芯片指数(990001),该指数聚焦于中国半导体芯片行业,从 市场中选取涉及材料、设备、设计、制造、封装和测试等环节的上市公司证券作为指数样本,以反映半 导体芯片相关上市公司证券的整体表现。该指数精选不超过40只成分股,侧重信息技术领域,集中体现 了行业内的核心资产与技术发展态势。 (文章来源:每日经济新闻) 此次发布的另一核心亮点是谷歌正式推出"Gemini Agents"(双子座代理),这是谷歌首次将能执行多步 骤任务的AI助手以系统化方式向消费者开放。 根据现场演示,Gemini Agent可执行的能力包括:自动整理用户邮箱、提取关键信息;规划完整旅行行 程,包括日程、交通与预算要素;执行具备多个步骤链条的复杂任务;在不同应用场景中作为可调用助 手运行。 ...
科技博主曝光OpenAI烧钱真相:2033年才能勉强覆盖推理成本,甚至“永远”亏损
Jin Shi Shu Ju· 2025-11-13 11:21
据外媒报道,英国科技博主埃德·齐特龙(Ed Zitron)发表了一篇有关OpenAI现金消耗的文章。核心观 点是,OpenAI的运营成本可能远高于外界先前的估计,而其主要资助方微软,则在双方的收入分成协 议中获益颇丰。 文章引用了一组据称显示OpenAI在微软Azure云托管平台上用于推理的支出数据。所谓"推理",指的是 ChatGPT等应用程序调用大型语言模型生成回答的过程。 OpenAI在 更重要的是,这些数据是否准确?英国《金融时报》将上文所示数字的近似值分别展示给微软和 OpenAI,并询问他们是否认为数据大体可信,同时也将这些信息提供给熟悉两家公司的业内人士,请 他们给予一些参考意见。 微软的一位女发言人回应称:"我们不便透露具体数字,但可以说,这些数据并不完全准确。"当记者追 问这句话具体含义时,微软拒绝进一步评论,也未回复后续的请求。OpenAI的一位发言人则只回复称 应向微软咨询此事。 一位熟悉OpenAI情况的人士表示,所展示的数据并不能反映全貌,但他拒绝进一步说明。简而言之, 虽然无法核实这些数据的真实性,但同样也没有获得任何足以质疑其主要内容的理由。《金融时报》 称,读者可自行判断。 如 ...
NeurIPS 2025 | 中科大、港中深、通义千问联合发布CoRT:仅30个样本教会大模型高效推理,token消耗降低50%
机器之心· 2025-11-12 13:23
近年来,以 OpenAI-o1、Qwen3、DeepSeek-R1 为代表的大型推理模型(LRMs)在复杂推理任务上取得了惊人进展,它们能够像人类一样进行长链条的思考、反 思和探索。然而,这些模型在面对精确的数学计算时,仍然会「心有余而力不足」,常常出现效率低下甚至算错的问题。 那么,如何 让大模型学会「何时」以及「如何」高效地使用工具,将自身的抽象推理能力与工具的精确计算能力完美结合? 来自 中国科学技术大学、香港中文大学(深圳)、通义千问的联合研究团队 给出了他们的答案: CoRT (Code-Optimized Reasoning Training) —— 一个旨在教会 大型语言模型高效利用代码工具进行推理的后训练(post-training)框架。 该框架通过创新的数据合成策略和多阶段训练流程,显著提升了模型的数学推理能力和 效率。 一个直观的解决方案, 是为模型配备代码解释器(Code Interpreter)等计算工具。 但这引入了一个更深层次的挑战,也是当前领域面临的关键瓶颈: 1. 认知冲突: 模型内部基于概率的、模糊的「思考」,与外部工具返回的确定性的、精确的「知识」之间存在冲突,导致模型陷 ...
一文读懂人工智能在供应链领域的典型应用
3 6 Ke· 2025-11-07 06:31
Overview - The article discusses the transformative impact of artificial intelligence (AI) and machine learning (ML) on marketing and supply chain management, emphasizing the need for businesses to adapt to these technologies for improved decision-making and operational efficiency [1][6]. AI Terminology Overview - AI encompasses a broad field focused on creating machines capable of tasks requiring human-like intelligence, while ML is a subset of AI that enables computers to learn from data without explicit programming [2][4]. Importance of AI - AI is being rapidly adopted across industries as it directly correlates with business efficiency, profitability, and competitiveness, moving beyond experimental phases to practical applications in daily operations [6][9]. Applications of AI in Marketing - AI is utilized in marketing through personalized recommendations, customer service chatbots, and predictive analytics, enhancing customer engagement and operational effectiveness [10][12]. Marketing's Impact on Supply Chain - Marketing activities can trigger demand shocks, necessitating a responsive supply chain to avoid stockouts and missed revenue opportunities, highlighting the interconnectedness of marketing and supply chain functions [13][15]. Challenges in Modern Supply Chains - Modern supply chains face challenges such as complexity, uncertainty, speed expectations, and sustainability, driving the need for AI to enhance demand forecasting and proactive measures [19][20]. AI in Demand Forecasting and Planning - AI enhances demand forecasting and planning by integrating time series analysis with machine learning, allowing for more accurate predictions and operational actions [20][22]. AI in Inventory Optimization - AI aids in inventory management by determining optimal stock levels based on real-time data and demand forecasts, balancing availability and cost [24][26]. AI in Logistics and Transportation - AI transforms logistics by optimizing delivery routes, predicting arrival times, and enabling predictive maintenance, thus improving efficiency and reliability [27][29]. AI in Supplier and Risk Management - AI strengthens supplier and risk management through continuous performance analysis and real-time monitoring of external events, allowing for proactive risk mitigation [33][34]. AI in Warehousing and Automation - AI automates and optimizes warehousing processes, improving accuracy and efficiency in inventory handling and order fulfillment [37][38]. AI in Sustainability and ESG - AI supports sustainability efforts by optimizing processes to reduce waste and emissions, facilitating the transition to circular supply chains [38][40]. Unified Perspective on Marketing and Supply Chain - Understanding AI's value requires viewing marketing and supply chain as interconnected systems, where AI synchronizes demand creation and fulfillment [61][63]. Emerging Trends in AI-Driven Supply Chains - New trends in AI include digital twins for simulation, proactive AI agents for planning, and visual models for real-time monitoring, indicating a shift towards more autonomous and intelligent supply chain operations [66][67].
垂直领域小型语言模型的优势
3 6 Ke· 2025-11-04 11:13
Core Insights - The article highlights the shift in artificial intelligence (AI) deployment from large language models (LLMs) to small language models (SLMs), emphasizing that smaller models can outperform larger ones in efficiency and cost-effectiveness [1][4][42] Group 1: Market Trends - The market for agent-based AI is projected to grow from $5.2 billion in 2024 to $200 billion by 2034, indicating a robust demand for efficient AI solutions [5] - Companies are increasingly recognizing that larger models are not always better, with research showing that 40% to 70% of enterprise AI tasks can be handled more efficiently by SLMs [4] Group 2: Technological Innovations - Key technological advancements enabling SLM deployment include smarter model architectures, CPU optimization, and advanced quantization techniques, which significantly reduce memory requirements while maintaining performance [20][27] - The introduction of GGUF (GPT-generated unified format) is revolutionizing AI model deployment by enhancing inference efficiency and allowing for local processing without expensive hardware [25][27] Group 3: Applications and Use Cases - SLMs are particularly advantageous for edge computing and IoT integration, allowing for local processing that ensures data privacy and reduces latency [30][34] - Successful applications of SLMs include real-time diagnostic assistance in healthcare, autonomous decision-making in robotics, and cost-effective fraud detection in financial services [34][38] Group 4: Cost Analysis - Deploying SLMs can save companies 5 to 10 times the costs associated with LLMs, with local deployment significantly reducing infrastructure expenses and response times [35][37] - The cost comparison shows that SLMs can operate with a monthly cost of $300 to $1,200 for local deployment, compared to $3,000 to $6,000 for cloud-based API solutions [36][37] Group 5: Future Outlook - The future of AI is expected to focus on modular AI ecosystems, green AI initiatives, and industry-specific SLMs that outperform general-purpose LLMs in specialized tasks [39][40][41] - The ongoing evolution of SLMs signifies a fundamental rethinking of how AI can be integrated into daily workflows and business processes, moving away from the pursuit of larger models [42]
AI大模型投资比赛落幕,阿里通义千问 Qwen 以 22.32% 收益率夺冠
Sou Hu Cai Jing· 2025-11-04 03:46
Core Insights - The Alpha Arena project conducted by Nof1 tested six leading AI language models (LLMs) in a real trading environment, with the goal of assessing their capabilities in quantitative trading [1][3][12] - The top performer, Alibaba's Tongyi Qianwen Qwen3-Max, achieved a return of 22.32%, securing the investment championship [1] Experiment Design - Each model started with $10,000 (approximately 71,218 RMB) to trade cryptocurrency perpetual contracts on the Hyperliquid platform, focusing on assets like BTC, ETH, SOL, BNB, DOGE, and XRP [11] - The models were restricted to making decisions based solely on numerical market data, without access to news or current events [11] - The primary objective for each model was to maximize profit and loss (PnL), with the Sharpe Ratio provided as a risk-adjusted performance metric [11] Initial Results - The models exhibited significant differences in trading styles, risk preferences, holding durations, and trading frequencies, despite operating under the same structure [9] - Some models engaged in short selling more frequently, while others rarely did so; similarly, some held positions longer with lower trading frequency, while others traded more frequently [9] - The research team noted that the order of data presentation could affect model performance, indicating sensitivity to data format [9] Significance and Observations - The project aims to shift AI research from static benchmark testing to real-world, dynamic, and risk-driven assessments [5][12] - Although the experiment did not determine the strongest model, it highlighted challenges faced by advanced LLMs in actual trading scenarios, including execution of actions, risk management, market state understanding, and sensitivity to prompt formatting [12]
三星加速追赶,台积电毫不在意
半导体芯闻· 2025-10-28 10:34
Core Viewpoint - TSMC remains confident in its leading position in semiconductor manufacturing despite competition from Samsung and Intel, as it continues to excel in 2nm and 3nm processes [2][3] Group 1: Competition and Market Dynamics - TSMC's chairman, Tzu-Hsien Tung, acknowledges that while Samsung is gaining more business in the U.S., TSMC does not feel threatened by this shift [2] - There are rumors that U.S. companies may prefer Samsung, but Tung denies any crisis, asserting that major chip companies still rely on TSMC's capabilities [2] - Elon Musk has praised Samsung's Texas facility, suggesting a potential boost for Samsung in the foundry sector, but Tung believes this discussion is common in the thriving semiconductor industry [2] Group 2: Taiwan's Role in AI Supply Chain - Taiwan has integrated itself into the U.S.-led AI supply chain by supplying semiconductors and critical hardware, despite competition from Japan and South Korea [3] - The U.S. is currently leading in global AI investment, followed by mainland China, with Taiwan's position in AI infrastructure remaining strong [3] - The long-term technological competition between the U.S. and China poses a greater challenge than competition from Samsung or Intel [3] Group 3: AI Model Development - The development of large AI models is limited to a few countries with the necessary funding and talent, with Taiwan playing a crucial role in this ecosystem [3] - The supply chain connecting Japan, South Korea, and Taiwan is still vital, especially as both China and the U.S. pursue their own AI technology stacks [3]