Programming
Funny picture: "Can this be done?"
程序员的那些事· 2025-12-30 06:03
Core Viewpoint
- The article discusses the challenges programmers face when requirements change frequently, highlighting the impact on their productivity and mental well-being [1].
Group 1
- The article mentions a specific instance where 351 responses were received, raising the question of what happened to the remaining 14 responses [2].
- It includes a comic that illustrates how to explain to outsiders why frequent changes in requirements can drive programmers crazy [2].
- Additionally, there is a humorous graphic emphasizing the importance of not interrupting programmers, since such interruptions are disruptive [2].
X @Tesla Owners Silicon Valley
Grok Rankings Update: Dec 23
🥇 #1 Overall on OpenRouter Leaderboard
🥇 #1 in Programming Market Share
🥇 #1 in Python-Specific Tasks
🥇 #1 in Categories Token Share
🥇 #1 on Kilo Code Leaderboard
🥇 #1 on BLACKBOXAI Leaderboard
🥇 #1 on Roo Code Leaderboard
🥇 #1 on Cline Leaderboard
🥇 #1 on EQ-Bench3
🥇 #1 on Creative Writing v3
🥇 #1 on FActScore
🥇 #1 on CyBench
🥇 #1 on Alpha Arena Season 1.5
🥇 #1 in Predictive Sentiment Arbitrage ...
X @Elon Musk
Elon Musk· 2025-12-17 15:32
Carmack is awesome
Startup Archive (@StartupArchive_): John Carmack on what he admires about Elon Musk
Programming legend John Carmack is asked about his relationship with Elon Musk, to which he replies:
“In some ways we have a similar background. We’re almost exactly the same age, have backgrounds programming personal computers, https://t.co/5mZTgEGRwx ...
X @Elon Musk
Elon Musk· 2025-12-04 11:35
AI Model Performance
- Grok 4.1 Fast claims the top spot for programming use case (Python) [1]
- Grok Code Fast 1 takes second place in programming use case (Python) [1]
- Grok models lead the programming use case chart [1]
X @Tesla Owners Silicon Valley
Grok Rankings Update: December 3
xAI continues to lead across multiple benchmarks and usage categories. Here is the full breakdown:
## Grok 4.1 Fast: The Agentic Model
Specialized for tool-calling, long-context, and high-speed performance.
#1 on τ²-Bench Telecom (Agentic Tool Use Benchmark)
#1 on Berkeley Function Calling Benchmark
#1 in Programming Category (Overall Token Share)
#1 in Multilingual Usage (Overall Token Share)
#2 on OpenRouter Overall Leaderboard (By Token Usage)
## Grok Code Fast 1: The Market Do ...
Breaking: Claude Opus 4.5 ranks first in the world for programming, knocking Google and OpenAI off the throne
36Kr· 2025-11-25 03:33
Core Insights
- The release of Claude Opus 4.5 marks a significant advancement in AI capabilities, particularly in programming and computer usage, surpassing competitors like Gemini 3 Pro and GPT-5.1 [1][3][22]
- Opus 4.5 has achieved state-of-the-art (SOTA) results in various benchmarks, indicating its superiority in coding, tool usage, and reasoning abilities [3][21][22]
Performance Metrics
- In the SWE-bench Verified test, Opus 4.5 scored 80.9%, outperforming Sonnet 4.5 (77.2%) and Opus 4.1 (74.5%), while also exceeding Gemini 3 Pro (76.2%) and GPT-5.1 (77.9%) [2][23]
- Opus 4.5 achieved a 66.3% score in computer use, significantly higher than Opus 4.1 (44.4%) [2][23]
- The model demonstrated a 37.6% score in the ARC-AGI-2 evaluation, showcasing its advanced reasoning capabilities [4][22]
Productivity Enhancements
- Internal evaluations indicated that using Opus 4.5 in conjunction with Claude Code resulted in an average productivity increase of 220%, with 50% of users reporting at least a 100% improvement [9][10]
- Opus 4.5 is described as a "near-complete entry-level researcher replacement" by some users, highlighting its potential to transform research workflows [9][10]
Cost and Accessibility
- The pricing for Opus 4.5 has significantly decreased, with input costs at $5 per million tokens and output costs at $25 per million tokens, making it more accessible for widespread use [11][13][71]
Tool and Feature Enhancements
- Opus 4.5 introduces new features such as the "Plan Mode" for better task planning and execution, and improved capabilities for handling complex tasks in Excel and other applications [47][75]
- The model's ability to manage multiple concurrent tasks has been enhanced, allowing for more efficient workflows [48][56]
Safety and Alignment
- Opus 4.5 is noted for being the most robust and aligned model released by Anthropic, with significant improvements in resisting prompt injection attacks compared to previous models [40][43]
X @Tesla Owners Silicon Valley
AI Model Performance
- Grok continues to dominate benchmarks, leaderboards, and usage metrics [2]
- Grok ranks #1 on τ²-Bench Telecom (Agentic Tool Use) Benchmark [3]
- Grok is #1 on Kilo Code Leaderboard, Cline Leaderboard, BlackBox AI Leaderboard, and Roo Code Leaderboard [3]
OpenRouter Leaderboard & Usage
- Grok ranks #1 on OpenRouter Trending Leaderboard [3]
- Grok ranks #2 on OpenRouter Top Today Leaderboard [3]
- Grok is #2 Most Popular LLMs for English on OpenRouter [3]
- Grok is #1 in Token Usage across models on OpenRouter (Top Today, This Week, This Month) [3]
- xAI holds #1 in Market Share on OpenRouter [3]
- Grok is #1 in Programming Use Case on OpenRouter [3]
- Grok is #1 Most Popular LLMs for English on OpenRouter [3]
X @Andrew Tate
Andrew Tate· 2025-11-10 14:13
Core Message
- The document urges critical examination of strongly held beliefs [1]
- It suggests that beliefs not rooted in personal experience may be the result of external programming [1]
Call to Action
- The document encourages questioning the origin and purpose of these beliefs [1]
- It promotes resisting a "slave mind" and encourages independent thought [1]
X @Elon Musk
Elon Musk· 2025-10-20 10:46
Grok Model Performance
- Grok models rank #1 in token usage on the OpenRouter leaderboard for today, this week, and this month [1]
- Grok models rank #1 for the programming use case on OpenRouter, covering languages such as Python, JavaScript, TypeScript, Java, Ruby, and C [1]
- Grok models are the most popular LLMs across different languages on OpenRouter [1]
- Grok models rank #1 on the KiloCode leaderboard [1]
- Grok models rank #1 on the Cline leaderboard [1]
Coding & Reasoning Benchmarks
- Grok models rank #1 on Terminal-Bench Hard (Agentic Coding & Terminal Use) [1]
- Grok models rank #1 on GPQA Diamond (Scientific Reasoning) [1]
- Grok models rank #1 on SciCode (Coding) [1]
AI Index
- Grok models rank #1 on Artificial Analysis Intelligence Index Tokens Usage [1]
X @MEXC
MEXC· 2025-09-13 07:00
Industry Focus
- MEXC wishes all programmers a happy Programmer's Day [1]
- Programmers keep the modern world running [1]