Grok 4
Search documents
X @Elon Musk
Elon Musk· 2025-11-10 05:48
RT DogeDesigner (@cb_doge)BREAKING: Grok 4 Fast just claimed the #1 spot on the TrustAI leaderboard. https://t.co/atJs7Woejs ...
腾讯研究院AI速递 20251110
腾讯研究院· 2025-11-09 16:09
生成式AI 一、Grok 4深夜大升级:200万上下文、五倍GPT-5脑容量? 3. 200万token上下文能力意味着可一次性处理相当于150万英文单词或6000页文本,相当于两部《战争与和平》。 https://mp.weixin.qq.com/s/PkHA-2aXsCg03xpoQOMXLA 二、GPT-5-Codex mini 紧凑版发布,性能相当速率提高 4 倍 1. OpenAI发布GPT-5-Codex-Mini紧凑版,使用量是GPT-5-Codex的约4倍,ChatGPT Plus等用户速率限制提高50%; 2. 代码中发现GPT-5.1系列三个新模型痕迹,包括旗舰模型GPT-5.1、推理模型GPT-5.1 Reasoning和研究级GPT-5.1 Pro; 3. 新模型 或 于 11月 底 发布,其中一个模型可能已以Polaris Alpha名字在OpenRouter等平台测试,在创意写作和基准测试中表现出 色。 https://mp.weixin.qq.com/s/er3zhiYfsyGKqchQuRYl0Q 三、谷歌二代Nano Banana爆出!一键推演微积分终结PS 1. Grok ...
X @Elon Musk
Elon Musk· 2025-11-08 21:37
Model Capabilities - Grok 4 Fast 拥有 200 万 token 的超大上下文窗口 [1] - 这一窗口允许加载大量文档或代码库,作为单个提示进行查询和分析 [1] - Grok 4 Fast 速度极快 [1] Industry Impact - 行业将受益于更大的上下文窗口,从而可以处理更复杂的任务 [1] - 行业可以更高效地分析大量数据 [1]
马斯克把时间给了xAI,却问特斯拉要万亿薪酬
Hua Er Jie Jian Wen· 2025-11-06 01:40
马斯克正将大量时间投入其新创立的人工智能公司xAI,与此同时,他却要求特斯拉股东批准一项旨在 确保其专注度的天价薪酬方案。 美国时间周四,特斯拉将公布一项关键股东投票的初步结果,核心议题是马斯克的新薪酬方案。 该方案若获通过,将在未来十年内把他的持股比例从约15%提升至25%,前提是公司达成包括销售一百 万台Optimus人形机器人和市值达到8.5万亿美元在内的宏大目标。 然而据媒体援引知情人士透露,一些主要特斯拉投资者近几周已私下向公司高管和董事会成员施压,询 问马斯克究竟将多少精力放在特斯拉,以及公司是否有CEO继任计划。两家有影响力的代理咨询公司已 建议股东投票反对该方案。 据前高管和与马斯克共事的人士透露,今年夏天大部分时间,马斯克都"躲在"他的最新创业公司xAI, 通宵达旦地参与会议。他甚至开始在xAI的办公室与特斯拉员工开会,而此时的特斯拉正面临连续两个 季度的销量下滑。 万亿薪酬与"兼职"CEO 特斯拉董事会在9月的委托书中提出了这项巨额薪酬方案。 董事会主席Robyn Denholm上周接受采访时表示,董事会并不担心马斯克如何分配时间。她说: 其他CEO可能喜欢打高尔夫,他喜欢创建公司,而这些 ...
1万美元实盘交易!全球首个AI投资大赛收官:中国大模型全盈利,美国GPT-5亏损超62%垫底【附大模型行业前景分析】
Sou Hu Cai Jing· 2025-11-05 07:41
Group 1 - The "Alpha Arena" competition showcased the capabilities of AI models, with China's Qwen3-Max achieving over 20% return, outperforming all American models, which collectively incurred losses, including GPT-5 with over 60% loss [2] - The competition lasted 17 days and involved six top AI models from China and the US, highlighting the competitive landscape in AI investment [2][3] - The event reflects the rapid development and innovation in China's AI model industry, with significant participation from both established tech giants and startups [3] Group 2 - As of Q1 2024, China has released a total of 478 AI models, ranking second globally after the US, indicating a strong presence in the AI research field [4] - The number of AI researchers in China has grown from under 10,000 in 2015 to 52,000 in 2024, with a compound annual growth rate of 28.7%, showcasing the country's growing research capabilities [4] - The language model sector is identified as a key area for technological breakthroughs and applications across various industries, with projections estimating the market size to exceed 220 billion yuan by 2030, growing at over 40% annually [4]
AI大模型实时投资比赛落幕,阿里千问Qwen夺冠;微信支付为中小商家推出AI菜单识别功能丨AIGC日报
创业邦· 2025-11-05 00:08
Group 1 - The AI model competition "Alpha Arena" concluded with Alibaba's Qwen winning the championship, achieving a return of 22.32% over 17 days, while four major US models incurred losses, with GPT-5 losing over 62% [2] - OpenAI reportedly discussed a merger with competitor Anthropic shortly after Sam Altman's brief departure as CEO, but the talks did not materialize due to practical obstacles [2] - WeChat Pay launched an AI menu recognition feature for small and medium-sized businesses, allowing merchants to upload photos of their menus for automatic content recognition and payment processing [2] Group 2 - The AI glasses market is rapidly growing, with major tech companies like Google and Apple accelerating their investments, as AI glasses are seen as the next generation of human-computer interaction [2] - Reports indicate that global shipments of AI glasses are expected to reach 4.065 million units in the first half of 2025, marking a year-on-year increase of 64.2%, with projections suggesting shipments could exceed 40 million units by 2029 [2]
全球首个AI投资大赛收官:阿里千问夺冠,美国四大模型均亏损
Guan Cha Zhe Wang· 2025-11-04 14:52
Core Insights - The AI investment competition "Alpha Arena" concluded with Alibaba's Qwen model achieving over 20% return, securing the championship [2][5] - DeepSeek ranked second, marking a significant performance for Chinese models, while all four leading American models reported losses, with GPT-5 suffering a loss exceeding 60% [2][7] Competition Overview - The competition lasted 17 days and involved six top AI models, including Qwen3-Max, DeepSeek v3.1, GPT-5, Gemini 2.5 Pro, Claude Sonnet 4.5, and Grok 4, with a total investment of $10,000 and real-time market data provided [2][3] - The models operated under a unified input system, ensuring fairness and transparency, with real-time trading records and account values publicly available [3] Performance Highlights - Qwen3-Max achieved a final account value of $12,232, reflecting a return of +22.32%, while DeepSeek v3.1 reached $10,489 with a +4.89% return [8] - In contrast, Claude Sonnet 4.5, Grok 4, Gemini 2.5 Pro, and GPT-5 reported significant losses, with GPT-5 at -62.66% [7][8] Industry Context - The success of Qwen and DeepSeek in the competition underscores the growing capabilities of Chinese AI models in real-world applications, highlighting their potential to address practical challenges [9] - The competition's results may influence the perception of AI models globally, particularly in the context of the ongoing competition between Chinese and American AI technologies [9]
投资大赛:阿里千问、DeepSeek赚了,GPT-5大亏
Nan Fang Du Shi Bao· 2025-11-04 13:41
Core Insights - The first AI large model trading competition initiated by the American AI research lab nof1 concluded, with six leading models participating in autonomous trading using market data without human intervention [1][5][7] - Two Chinese models, Alibaba's Qwen3 Max and DeepSeek Chat V3.1, achieved positive returns, with Qwen3 Max leading at a return rate of 22.3% and a profit of $2,232 [1][2][3] Performance Summary - Qwen3 Max achieved a return of 22.3%, with an account value of $12,232 and a win rate of 30.2% [3] - DeepSeek Chat V3.1 had a return of 4.89%, with an account value of $10,489 and a win rate of 24.4% [3] - Other models, including Claude Sonnet 4.5, Grok 4, Gemini 2.5 Pro, and GPT 5, experienced significant losses, with GPT 5 losing 62.66% [2][3] Trading Dynamics - The competition involved trading cryptocurrency derivatives, including Bitcoin, Ethereum, and Dogecoin, with each model starting with $10,000 [5] - Models were required to process quantitative data and execute trades without access to news or market information [5] - Qwen3 Max maintained the largest position size throughout the competition, while Grok 4 had the longest holding period [6] Model Behavior - Grok 4, GPT-5, and Gemini 2.5 Pro exhibited a higher frequency of short-selling compared to others, while Claude Sonnet 4.5 rarely engaged in short-selling [6] - Qwen3 Max had the narrowest stop-loss and take-profit distances, indicating a more conservative exit strategy [6] - The competition highlighted the need for dynamic testing of models in real market conditions, as opposed to static benchmark tests [7]
AI被严重低估,AlphaGo缔造者罕见发声:2026年AI自主上岗8小时
3 6 Ke· 2025-11-04 12:11
【导读】当我们还在调侃「AI写错代码」时,实验室里的科学家却看到它能独立完成几个小时的复杂任务。AlphaGo作者Julian罕见发声:公众对AI的认 知,至少落后一个世代。最新数据更显示,AI正以指数速度逼近专家水准,2026或许就是临界点。我们,是在见证未来,还是在自欺欺人? AlphaGo、AlphaZero的核心作者——Julian抛出了一个尖锐的比喻:人们今天对AI的态度,很像当初面对新冠疫情早期的反应。 Julian的意思很直接:我们正在严重低估AI的进展。 很多人还在笑它写错代码,抱怨它没法替代人类;但在实验室里,研究者早已看到另一幅景象——AI已经能独立完成几个小时的复杂任务,并且还在按 指数速度进化。 这就是他决定站出来发声的原因:公众的认知,和前沿的现实,之间至少隔着一个世代的落差。 科学家不忍再沉默:AI为何被大众低估? Julian Schrittwieser的名字,或许不像马斯克、奥特曼那样家喻户晓,但在AI圈,他是响当当的存在。 作为AlphaGo、AlphaZero、MuZero的核心作者之一,他亲历了AI从「围棋科幻」到「现实碾压」的全过程。 也正因如此,当他在个人博客写下那段 ...
首届AI交易大赛落幕,6个AI炒币2周:Qwen、DeepSeek赚钱,GPT-5血亏6000刀
3 6 Ke· 2025-11-04 11:13
Core Insights - The inaugural Nof1 AI Model Trading Competition concluded, designed to measure AI investment capabilities, likened to a "Turing test" for the crypto space [1] - Six AI models participated, representing the latest technology from both Chinese and American developers, with Qwen3 Max emerging as the top performer [1][12] Competition Overview - The competition ran from October 17 to November 3, 2025, with each model starting with $10,000 in initial capital [1] - Trading was conducted on Hyperliquid, focusing on six popular cryptocurrencies: BTC, ETH, SOL, BNB, DOGE, and XRP [3] - The trading strategies were limited to buying, selling, holding, or closing positions, with a focus on mid-frequency trading [3] Performance Results - Qwen3 Max ranked first with a return of 22.3%, total profit of $2,232, and a win rate of 30.2% over 43 trades [2][5] - DeepSeek Chat V3.1 secured second place with a return of 4.89%, total profit of $489.08, and a win rate of 24.4% over 41 trades [2][5] - Other models, including Claude Sonnet 4.5, Grok 4, Gemini 2.5 Pro, and GPT-5, experienced significant losses, with GPT-5 showing the worst performance at -62.66% [4][11] Model Characteristics - Qwen3 Max exhibited an aggressive trading style with a high return and significant trading frequency, reflected in its Sharpe ratio of 0.273 [9] - DeepSeek Chat V3.1 demonstrated a more conservative approach with a higher Sharpe ratio of 0.359, indicating better risk management [9] - Claude Sonnet 4.5 and Grok 4 showed cautious strategies but suffered from low win rates and high losses [10] - Gemini 2.5 Pro and GPT-5 were characterized by high trading activity but poor performance, indicating ineffective strategies [11] Industry Implications - The competition has garnered significant attention, with industry leaders like Binance's founder commenting on the potential impact of AI trading strategies on market dynamics [7] - The results suggest that AI models from China, particularly Qwen3 Max and DeepSeek, are currently outperforming their American counterparts in terms of risk control and trend identification [12]