Matthew Berman
The Industry Reacts to GPT-5 (Confusing...)
Matthew Berman· 2025-08-10 15:53
GPT-5 has been the most polarizing model launch I have ever seen. From people saying it's the greatest model they've ever used to saying they're sticking with Claude 3.5 to GraphGate to saying the evals don't even matter anymore. So, I'm going to break down all of the reactions from the industry right now. All right, first from the man himself, Sam Altman, he gives some updates post launch after collecting some of the feedback. Listen to what he has to say. We for sure underestimated how much some of the thin ...
ChatGPT-5: The Rubik's Cube Test
Matthew Berman· 2025-08-08 23:27
I have been playing with GPT-5 for about the last week. Let me show you. Can it solve the Rubik's Cube? Write a complete HTML JavaScript program using Three.js that renders a fully interactive Rubik's Cube simulation of any size up to 20x20x20. Yes, I am going to show you that. Look what we have here. This looks really good so far. Although we've had other models get this far without being able to rotate it. It gets all wonky. And there it is scrambling perfectly. Simulated perfectly. Solve. There it goes. Okay. So ...
GPT-5 Full Breakdown! (Everything You Need to Know)
Matthew Berman· 2025-08-08 19:06
GPT-5 is finally here. I am lucky enough to have been testing it over the last week. And yes, it is truly an incredible model. Let me just break down everything you need to know about this model. So, here's the blog post. Our smartest, fastest, and most useful model yet with built-in thinking that puts expert-level intelligence in everyone's hands. Now, the first thing you're going to notice about this model is it is a hybrid model. It has both thinking and non-thinking versions built into the same exact model. ...
Forward Future Live August 8th, 2025
Matthew Berman· 2025-08-08 16:58
Download (GPT-5 UPDATED) Humanity's Last Prompt Engineering Guide (free) 👇🏼 http://bit.ly/4m76knm Join My Newsletter for Regular AI Updates 👇🏼 https://forwardfuture.ai Discover The Best AI Tools 👇🏼 https://tools.forwardfuture.ai My Links 🔗 👉🏻 X: https://x.com/matthewberman 👉🏻 Instagram: https://www.instagram.com/matthewberman_ai 👉🏻 Discord: https://discord.gg/xxysSXBxFW Media/Sponsorship Inquiries ✅ https://bit.ly/44TC45V ...
Forward Future Live August 8th, 2025
Matthew Berman· 2025-08-08 16:33
GPT-5 LIVESTREAM WATCHPARTY!
Matthew Berman· 2025-08-07 18:20
GPT-5 Fully Tested (INSANE)
Matthew Berman· 2025-08-07 18:00
I have been playing with GPT-5 for about the last week and I am so excited to show you all of the tests that I've put it through. Yes, it is incredibly impressive. Let me show you. And by the way, we just updated Humanity's Last Prompt Engineering Guide. This is my team's very own prompt engineering guide updated for GPT-5. It is completely free. I'll drop a link down below in the description. Check it out. Let me know what you think. Of course, first, can it solve the Rubik's Cube? Write a complete HTML JavaScr ...
GPT-5 LIVESTREAM WATCHPARTY!
Matthew Berman· 2025-08-07 16:41
The Industry Reacts to gpt-oss!
Matthew Berman· 2025-08-06 19:22
Wow, I thought the open-source model from OpenAI was going to be popular, but it really struck a chord in the industry. Let me break down all of the industry reactions for you right now. First, of course, Sam Altman's tweet: gpt-oss is out. We made an open model that performs at the level of o4-mini and runs on a high-end laptop, WTF, and a smaller one that runs on a phone, which is just crazy to think about. Super proud of the team. Big triumph of technology. I'm so happy that they put all of this research and ef ...
Claude Just Got a Big Update (Opus 4.1)
Matthew Berman· 2025-08-05 23:02
Model Release & Performance
- Anthropic released Claude Opus 4.1, an upgrade to Claude Opus 4 with gains chiefly in agentic tasks, real-world coding, and reasoning [1]
- On SWE-bench Verified, Claude Opus 4.1 scores 74.5%, up 2 percentage points from Opus 4's 72.5% [3]
- On Terminal-Bench, which measures terminal use, Claude Opus 4.1 scores 43.3%, up 4.1 percentage points from 39.2% [4]
- On GPQA Diamond (graduate-level reasoning), Claude Opus 4.1 scores 80.9%, up 1.3 percentage points from 79.6% [4]
- On TAU-bench (agentic tool use), Claude Opus 4.1 scores 82.4% on the retail task, up 1 percentage point from 81.4%, but falls to 56% on the airline task, down 3.6 percentage points from 59.6% [5]
- On the multilingual Q&A benchmark, Claude Opus 4.1 scores 89.5%, up 0.7 percentage points from 88.8% [5]
- On AIME 2025, Claude Opus 4.1 gains 2.5 percentage points to reach 78% [5]
Competitive Positioning & Future Outlook
- On SWE-bench and Terminal-Bench, Claude Opus 4.1 outperforms OpenAI's o3 and Gemini 2.5 Pro [5]
- On GPQA Diamond and the agentic tool use benchmark, Claude Opus 4.1 trails OpenAI's o3 and Gemini 2.5 Pro [6]
- On the high school math competition benchmark, Claude Opus 4.1 scores only 78%, below OpenAI's o3 (88.9%) and Gemini 2.5 Pro (88%) [6]
- Claude is currently widely regarded as the best coding model on the market, especially for agentic coding and agent-driven development [7]
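The percentage-point changes quoted in the summary above can be checked with a few lines of arithmetic. The following is a minimal sketch, not part of the original video or summary: the dictionary keys are my own shorthand labels rather than official benchmark identifiers, and `point_deltas` is a hypothetical helper name. AIME 2025 is left out because the summary gives only the new score and the gain, not the starting score.

```python
# Opus 4 -> Opus 4.1 scores (percent) as quoted in the summary above.
# Keys are illustrative labels, not official benchmark identifiers.
SCORES = {
    "SWE-bench Verified": (72.5, 74.5),
    "Terminal-Bench": (39.2, 43.3),
    "GPQA Diamond": (79.6, 80.9),
    "Agentic tool use (retail)": (81.4, 82.4),
    "Agentic tool use (airline)": (59.6, 56.0),
    "Multilingual Q&A": (88.8, 89.5),
}


def point_deltas(scores):
    """Return the change in percentage points per benchmark, rounded to 0.1."""
    return {
        name: round(after - before, 1)
        for name, (before, after) in scores.items()
    }


if __name__ == "__main__":
    # Print each change with an explicit sign so regressions stand out.
    for name, delta in point_deltas(SCORES).items():
        print(f"{name}: {delta:+.1f} points")
```

These are absolute differences in percentage points, not relative improvements; a move from 39.2% to 43.3% is 4.1 points but roughly a 10% relative gain.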