Workflow
GPT3
icon
Search documents
What's with these OpenAI charts?
The Verge· 2025-08-08 14:07
Hallucination & Deception Rate Analysis - GTP5 scored 50 in deception rate, while 03 scored 47.4%, but the bar chart visually misrepresented the data, making GTP5 appear better when it's actually worse [1] - The report lacks clarity on what constitutes a good deception rate, ideally aiming for zero, especially when the model is used for medical advice [2][3] - Chart inconsistencies include boxes of different sizes representing percentages that don't align with their visual representation, such as 69.1%, 30.8%, and 52% [2] Data Presentation Issues - The charts exhibit a lack of consistency, with the highest numbers not always positioned in the highest spot [1] - Random coloring and inconsistent sizing of elements within the charts further contribute to the confusion [2]
AI浪潮录丨王晟:谋求窗口期,AI初创公司不要跟巨头抢地盘
Bei Ke Cai Jing· 2025-05-30 02:59
Core Insights - Beijing is emerging as a strategic hub in the AI large model sector, driven by technological innovation and a supportive ecosystem for breakthroughs [1] - The role of angel investors is crucial in the AI industry, providing essential support to startups and helping them take their first steps [4] - The AI large model wave has gained momentum globally since 2023, with early investments in generative models proving to be prescient [5][6] Group 1: AI Development and Investment Trends - The AI large model trend is characterized by a shift from previous waves focused on computer vision and autonomous driving to the current emphasis on AI agents and embodied intelligence [5][6] - Investors are increasingly favoring experienced founders with strong academic and research backgrounds, as seen in the case of companies like DeepMind and the Tsinghua NLP team [12][16] - The emergence of open-source models like Llama has accelerated competition among AI companies, allowing them to shorten development timelines [13] Group 2: Investment Strategies and Market Dynamics - Angel investors are focusing on a select number of projects, often operating in a "water under the bridge" manner, avoiding fully marketized projects [14][15] - The investment landscape is divided between long-term oriented funds that prioritize innovation and those focused on immediate revenue generation [21][22] - The success of companies like DeepSeek highlights the challenges faced by startups in competing with established giants, as the consensus around large models has solidified post-ChatGPT [26][27] Group 3: Entrepreneurial Characteristics and Market Challenges - Current AI entrepreneurs are predominantly scientists or technical experts, forming a close-knit community that is easier to identify and engage with [18][19] - The academic foundation of AI startups is critical, as many successful ventures are built on decades of research and development from their respective institutions [16][20] - The market is witnessing a shift where the ability to innovate is becoming more important than merely having financial resources, as the previous model of "buying capability" is no longer sustainable [27][28]