Workflow
Matthew Berman
icon
Search documents
Ex-OpenAI CTO Reveals Plan to Fix LLMs Biggest Problem
Matthew Berman· 2025-09-16 22:34
Get Started with Lindy For Free: https://go.lindy.ai/berman-ai Download Humanities Last Prompt Engineering Guide (free) 👇🏼 https://bit.ly/4kFhajz Download The Matthew Berman Vibe Coding Playbook (free) 👇🏼 https://bit.ly/3I2J0YQ Join My Newsletter for Regular AI Updates 👇🏼 https://forwardfuture.ai Discover The Best AI Tools👇🏼 https://tools.forwardfuture.ai My Links 🔗 👉🏻 X: https://x.com/matthewberman 👉🏻 Forward Future X: https://x.com/forward_future_ 👉🏻 Instagram: https://www.instagram.com/matthewberman_ai 👉 ...
Genie 3 Team: Agents, Training Genie, Simulation Theory, Text vs Video, and more!
Matthew Berman· 2025-09-16 18:18
Download Humanities Last Prompt Engineering Guide (free) 👇🏼 https://bit.ly/4kFhajz Download The Matthew Berman Vibe Coding Playbook (free) 👇🏼 https://bit.ly/3I2J0YQ Join My Newsletter for Regular AI Updates 👇🏼 https://forwardfuture.ai Discover The Best AI Tools👇🏼 https://tools.forwardfuture.ai My Links 🔗 👉🏻 X: https://x.com/matthewberman 👉🏻 Forward Future X: https://x.com/forward_future_ 👉🏻 Instagram: https://www.instagram.com/matthewberman_ai 👉🏻 Discord: https://discord.gg/xxysSXBxFW 👉🏻 TikTok: https://www ...
GPT-5 Codex is nuts...
Matthew Berman· 2025-09-15 22:31
Product Overview - OpenAI releases GPT5 Codeex, optimized for agentic coding, available in various environments like terminal, IDE, GitHub, and ChatGPT iOS app [1][2] - GPT5 Codeex is included with ChatGpt Plus Pro business edu and enterprise plans [3] Performance Benchmarks - GPT5 Codeex achieves 74.5% on SWEBench verified, a slight improvement over GPT5's 72.8% [3] - Code refactoring sees a significant improvement with GPT5 Codeex at 51.3% compared to GPT5's 33.9% [3] - GPT5 Codeex can work independently for over 7 hours on complex tasks [4] - GPT5 Codeex uses 93.7% fewer tokens than GPT5 for simpler tasks but spends twice as long on complex use cases [6] - GPT5 Codeex reduces incorrect comments to 4.4% compared to GPT5's 13.7% and increases high impact comments to 52.4% from 39.4% [7] Features and Capabilities - Codeex is trained for code reviews, identifying critical flaws by navigating codebase, reasoning through dependencies, and running code and tests [6] - Codeex CLI updates include better formatted tool calls and diffs, simplified approval modes, and conversation state compaction [12][13] - Codeex automates environment setup by scanning for setup scripts and fetching dependencies at runtime [15] - Codeex can spin up its own browser, iterate on its builds, and attach screenshots to tasks and GitHub PRs [15] - Codeex reviews PRs by matching stated intent to the actual diff, reasoning over the codebase, and executing code and tests [16] Windsurf Integration - Windsurf is highlighted as a powerful agentic IDE, especially after being acquired by Cognition [9] - Windsurf offers features like deep wiki, vibe, replace, one-click MCP store, sophisticated memory, and deep integration with Devon [10][11] Pricing and Availability - Pro plan at $200 per month can support a full work week across multiple projects, positioning it as an additional developer [19] - Business plans offer credit purchases for exceeding included limits, while enterprise plans provide a shared credit pool [20] Infrastructure Improvements - Cloud infrastructure performance is improved by caching containers, reducing medium completion time for new tasks and follow-ups by 90% [14]
Forward Future Live | 9/12/25
Matthew Berman· 2025-09-12 16:45
Download Humanities Last Prompt Engineering Guide (free) 👇🏼 https://bit.ly/4kFhajz Download The Matthew Berman Vibe Coding Playbook (free) 👇🏼 https://bit.ly/3I2J0YQ Join My Newsletter for Regular AI Updates 👇🏼 https://forwardfuture.ai Discover The Best AI Tools👇🏼 https://tools.forwardfuture.ai My Links 🔗 👉🏻 X: https://x.com/matthewberman 👉🏻 Forward Future X: https://x.com/forward_future_ 👉🏻 Instagram: https://www.instagram.com/matthewberman_ai 👉🏻 Discord: https://discord.gg/xxysSXBxFW 👉🏻 TikTok: https://www ...
Forward Future Live @ Boxworks | 9/11/25
Matthew Berman· 2025-09-11 19:29
AI Tools & Resources - Forward Future AI 提供最佳 AI 工具发现平台 [1] - 提供人文科学 Prompt Engineering 指南下载 [1] - 提供 AI 更新资讯 Newsletter 订阅 [1] Social Media & Community - Matthew Berman 的 X (Twitter) 链接 [1] - Matthew Berman 的 Instagram 链接 [1] - Discord 社区链接 [1] Business & Investment - 提供媒体/赞助咨询 [1] - 声明持有 FactoryAI 的少量投资 [1]
Forward Future Live @ Boxworks | 9/11/25
Matthew Berman· 2025-09-11 17:06
Download (GPT-5 UPDATED) Humanities Last Prompt Engineering Guide (free) 👇🏼 http://bit.ly/4m76knm Join My Newsletter for Regular AI Updates 👇🏼 https://forwardfuture.ai Discover The Best AI Tools👇🏼 https://tools.forwardfuture.ai My Links 🔗 👉🏻 X: https://x.com/matthewberman 👉🏻 Instagram: https://www.instagram.com/matthewberman_ai 👉🏻 Discord: https://discord.gg/xxysSXBxFW Media/Sponsorship Inquiries ✅ https://bit.ly/44TC45V Disclaimer: I am a small investor in FactoryAI ...
Anthropic will pay $1.5b, the biggest copyright settlement in history
Matthew Berman· 2025-09-09 19:43
Anthropic, the company behind Claude, faces at least a $1.5% billion settlement for their copyright infringement. If approved, this would represent the biggest copyright settlement in history and really has major implications for artificial intelligence going forward. So, what happened.What did they do. Did they do it intentionally. And what does this mean for the rest of the industry.Let's get into it. And this video is brought to you by Sokumi. More on them later.So here is the case document. As you can s ...
Did OpenAI just solve hallucinations?
Matthew Berman· 2025-09-08 15:18
Open AAI may have just solved hallucinations. They just put out a paper which identifies the root cause of hallucinations and a potential way to fix it. And when you hear the reason why models hallucinate, it's going to be super obvious in retrospect.And this video is brought to you by notion. More on them later. So, here's a paper just released a couple days ago, why language models hallucinate.I'm going to break down everything you need to know about it. According to the paper, language models are known t ...
Forward Future Live 9.5.25
Matthew Berman· 2025-09-05 16:10
Download Humanities Last Prompt Engineering Guide (free) 👇🏼 https://bit.ly/4kFhajz Download The Matthew Berman Vibe Coding Playbook (free) 👇🏼 https://bit.ly/3I2J0YQ Join My Newsletter for Regular AI Updates 👇🏼 https://forwardfuture.ai Discover The Best AI Tools👇🏼 https://tools.forwardfuture.ai My Links 🔗 👉🏻 X: https://x.com/matthewberman 👉🏻 Forward Future X: https://x.com/forward_future_ 👉🏻 Instagram: https://www.instagram.com/matthewberman_ai 👉🏻 Discord: https://discord.gg/xxysSXBxFW 👉🏻 TikTok: https://www ...
AI News: xAI Sues OpenAI, Microsoft's MAI, Anthropic Funding, OpenAI Acquisition, and more!
Matthew Berman· 2025-09-04 17:46
AI Model Development & Releases - Microsoft released its first in-house AI models, MAI Voice 1 and MAI1 preview, with MAI Voice 1 generating a minute of audio in less than 1 second on a single GPU [4] - Microsoft's MAI1 preview debuted at number 13 on the LM Marina benchmark, below Grok 3 Preview [5] - Hunan, a Chinese company, released Hunan Video Folly, an open-source text-to-video-to-audio framework trained on a 100,000-hour multimodal dataset [17][18] - Microsoft published a paper on RSR 2 agent, a 14 billion parameter model that outperformed a 671 billion parameter model in math reasoning [21] - Korea AI released a real-time video generation model [22] Legal & Ethical Considerations - Elon Musk and XAI are suing a former employee for allegedly stealing confidential information and going to OpenAI [1] - Chinese social media companies are rolling out labels for AI-generated content to comply with new regulations [24] - Anthropic is updating its data retention policy to 5 years and will train models on user chat transcripts, with users able to opt out by September 28th [13][14] Corporate Developments & Investments - Anthropic raised $13 billion in a Series F funding round, valuing the company at $183 billion [10] - Anthropic's run rate revenue grew from approximately $1 billion at the beginning of 2025 to $5 billion by August [12] - OpenAI acquired Statsig for $1.1 billion in an all-stock deal [34][35] AI Applications & Features - Figure robot can now do dishes, showcasing advancements in robotics [14][15] - Chat GPT is rolling out new features to help people in need and parents, including mental health crisis interventions and parental controls [30][33] - AI can detect covert voluntary facial responses in coma patients earlier and more often than clinicians [26] Market Dynamics & Competition - Google avoided being broken up in its antitrust case, causing its stock to increase by 9% [15] - Elon Musk posted a graph showing Grok code usage increased 60% higher than Claude Sonnet on Open Router, but Grok code is essentially free [27][28] - Waymo is expanding to Seattle and Denver [36]