Matthew Berman

Search documents
Forward Future Live 8.22.25
Matthew BermanΒ· 2025-08-22 16:40
Download Humanities Last Prompt Engineering Guide (free) ππΌ https://bit.ly/4kFhajz Download The Matthew Berman Vibe Coding Playbook (free) ππΌ https://bit.ly/3I2J0YQ Join My Newsletter for Regular AI Updates ππΌ https://forwardfuture.ai Discover The Best AI ToolsππΌ https://tools.forwardfuture.ai My Links π ππ» X: https://x.com/matthewberman ππ» Forward Future X: https://x.com/forward_future_ ππ» Instagram: https://www.instagram.com/matthewberman_ai ππ» Discord: https://discord.gg/xxysSXBxFW ππ» TikTok: https://www ...
AI News: Deepseek Update, GPT-6, Qwen-Image, Meta Restructure, New Robots, and more!
Matthew BermanΒ· 2025-08-21 19:07
AI Model & Technology Advancements - Discussion of GPT-6 news, indicating potential future advancements in the GPT model series [1] - Deepseek v3.1 is mentioned, suggesting updates and improvements in the Deepseek AI model [1] - Qwen-Image-Edit is highlighted, pointing to advancements in image editing capabilities within the Qwen AI model [1] - Perplexity SuperMemory is noted, indicating advancements in memory and information recall capabilities for AI [1] AI Applications & Robotics - Mentions of Agentsmd, suggesting developments and discussions around AI agents [1] - Google AI Voice Assistant Opal is introduced, showcasing advancements in voice assistant technology [1] - Boston Dynamics Atlas demo is featured, highlighting progress in robotics and humanoid movement [1] - Figure 02 Robot is mentioned, indicating advancements in robotics and humanoid development [1] - Cursor Stealth Model is noted, suggesting advancements in AI-powered tools for coding and software development [1] Industry Restructuring & Infrastructure - Meta AI is undergoing restructuring, potentially impacting the company's AI development and strategy [1] - OpenAI's infrastructure is discussed, indicating developments and investments in the resources needed to support AI models [1] - Nvidia is reportedly working on a new AI chip for China that outperforms H20 [1] Resources & Links - Links provided for Amazon Bedrock, Humanities Last Prompt Engineering Guide, and The Matthew Berman Vibe Coding Playbook [1] - Links to various social media platforms (X, Instagram, Discord, TikTok) for updates and community engagement [1]
Nano Banana is an INSANE AI Image Editor...
Matthew BermanΒ· 2025-08-21 00:05
Model Capabilities - Nano Banana is a new text-to-image model that excels at image editing and creation based on text prompts [1][2] - The model demonstrates a strong understanding of 3D space within 2D images, enabling accurate 3D meshing and object manipulation [4][5] - Nano Banana performs well in photo restoration and colorization, effectively cleaning up damage and adding accurate color to old or degraded photos [6][7][8][9] - The model can simulate what's behind objects in an image and flip the image [16] - It can isolate elements in an image and change them [22] Applications - The technology has potential applications in marketing, photo correction, and AI product placement [15][24] - It can be used to create montages of sports moments in a specific style [15] - The model can also be used to generate realistic images of people in different scenarios, such as Satya Nadella and Sundar Pichai on a beach [26] Model Origin and Availability - Nano Banana was found on LM Arena under a code name [1][9] - Google is likely the creator of Nano Banana, potentially as part of the Gemini AI models [10][11] - Users can try to find Nano Banana on LM Arena by using the battle mode in the text-to-image section [10][28][29] Model Comparison - Nano Banana is compared to other models like GPT image one and Gemini 2.0% Flash, often showing superior realism and accuracy [23][24][25][27][30] - In AI product placement, Nano Banana can accurately place a product in a person's hand, while other models struggle [25][26]
GPT-5 Prompt Optimization Guide
Matthew BermanΒ· 2025-08-19 16:57
GPT-5 Capabilities and Usage - GPT-5 excels in tool calling, instruction following, and long context understanding, making it suitable for agentic use cases, especially for developers [3][4] - The model's "reasoning effort" can be adjusted to control its thoroughness and efficiency, impacting token usage and cost [7][8][9] - Users can define clear criteria in prompts to guide the model's exploration of the problem space, including context gathering strategies and early stop conditions [9][10][11][12] - Tool preambles provide real-time updates on the model's activities, enhancing transparency and control [22][23][24] - The Responses API is recommended over Chat Completions due to statistically significant improvements in evaluations, improved agentic flows, lower costs, and more efficient token usage [27][28] Prompt Engineering and Optimization - For coding tasks, especially front-end development, GPT-5 performs best with popular languages and frameworks like Nextjs, TypeScript, React, and Tailwind CSS [30][31][32] - The model can be instructed to create a self-constructed rubric to measure its performance, improving output quality in one-shot web application development [33][34][35][36] - Defining code editing rules and guiding principles in the prompt helps the model adhere to existing codebase patterns and design standards [39][40] - GPT-5's verbosity can be controlled to influence the length of the final answer, while reasoning effort controls the length of its thinking process [47] - The prompt optimizer tool in the playground allows users to refine prompts with direct feedback and explanations [60][61][62][63][64][65]
AI News: Sam vs Elon, Claude 1m Context, Situational Awareness $1.5B
Matthew BermanΒ· 2025-08-13 15:56
AI Model Development & Competition - OpenAI's reasoning system achieved a gold medal in the International Olympiad in Informatics, surpassing most human participants and all other AI participants [27][28] - Mistral AI introduced Mistral Medium 3.1, featuring performance boosts and improved web searches [29] - Anthropic has upped the context window of Claude Sonnet 4 to 1 million tokens, costing $3 per million tokens for prompts less than 200,000 tokens and $6 per million tokens for prompts greater than 200,000 tokens [16][18][19] AI Applications & Products - Perplexity launched video generation, offering Pro subscribers five videos per month and Max subscribers 15 videos per month [19] - Lindy launched Lindy 3.0 with autopilot features, enabling AI agents to use computers like humans, offering a $50 credit to new users [13][14][15] - Skywork Matrix Game 2.0 released an open-source real-time long sequence interactive world model, generating 25 frames per second [21][22][23] AI Industry Investment & Talent - Former OpenAI researcher Leopold Ashenbrenner raised $1.5 billion for a hedge fund focused on AI-related investments, including semiconductor, infrastructure, and power companies, with a 47% gain in the first half of the year [24][25][26] Tech Industry Disputes - Elon Musk criticized Apple's App Store for allegedly favoring OpenAI's ChatGPT and disadvantaging X and Grok, claiming antitrust violations [2][3][4] - Sam Altman rebutted Musk's claims, referencing allegations of Musk manipulating X to benefit himself and his companies [7][8]
People are upset that GPT4o is going away...
Matthew BermanΒ· 2025-08-12 15:27
AI Model & User Interaction - OpenAI's initial decision to retire older GPT models in favor of GPT5 caused user backlash, leading to a reversal, highlighting user attachment to specific AI models [1] - Users develop attachments to AI models like GPT-40, learning their strengths, weaknesses, and personalities, making model deprecation disruptive [1] - A small percentage of users experience psychosis due to AI interactions, with examples of individuals losing touch with reality and developing delusions [1] - OpenAI aims to treat adult users like adults, pushing back on harmful requests while acknowledging the subtle concerns around AI's impact on well-being [1] Societal Impact of AI Companionship - Concerns arise about emotional dependency on AI, with examples of users expressing extreme happiness over GPT-40's return and even forming romantic relationships with AI [1][2] - The increasing trend of AI companionship raises concerns about addiction, increased loneliness, and potentially declining birth rates [6] - AI's ability to be customized to user preferences makes it a potentially addictive companion, raising concerns about long-term well-being [5] Proposed Solutions & Future Considerations - OpenAI is exploring ways for its products to assess user well-being and long-term goals, aiming to provide helpful AI experiences [9][10] - Addressing addiction, emotional dependency, and risky behavior resulting from AI interactions is crucial as AI usage increases [10]
The Industry Reacts to GPT-5 (Confusing...)
Matthew BermanΒ· 2025-08-10 15:53
Model Performance & Benchmarks - GPT5 demonstrates varied performance across different reasoning effort configurations, ranging from frontier levels to GPT-4.1 levels [6] - GPT5 achieves a score of 68 on the artificial intelligence index, setting a new standard [7] - Token usage for GPT5 varies significantly, with high reasoning effort using 82 million tokens compared to minimal reasoning effort using only 3.5 million tokens [8] - LM Arena ranks GPT5 as number one across the board, with an ELO score of 1481, surpassing Gemini 2.5 Pro at 1460 [19][20] - Stage Hand's evaluations indicate GPT5 performs worse than Opus 4.1 in both speed and accuracy for browsing use cases [25] - XAI's Grok 4 outperforms GPT5 in the ARC AGI benchmark [34][51] User Experience & Customization - User feedback indicates a preference for the personality and familiarity of GPT-4.0, even if GPT5 performs better in most ways [2][3] - OpenAI plans to focus on making GPT5 "warmer" to address user concerns about its personality [4] - GPT5 introduces reasoning effort configurations (high, medium, low, minimal) to steer the model's thinking process [6] - GPT5 was launched with a model router to route to the most appropriate flavor size of that model speed of that model depending on the prompt and use case [29] Pricing & Accessibility - GPT5 is priced at $1.25 per million input tokens and $10 per million output tokens [36] - GPT5 is more than five times cheaper than Opus 4.1 and greater than 40% cheaper than Sonnet [39]
ChatGPT-5: The Rubik's Cube Test
Matthew BermanΒ· 2025-08-08 23:27
Technology & Innovation - GPT5 demonstrates the capability to create a fully interactive Rubik's Cube simulation using 3JS [1] - The simulation can handle Rubik's Cubes of sizes up to 20x20x20 [1] - GPT5 can scramble and solve Rubik's Cubes of varying sizes, including 5x5x5 and 20x20x20 [2][3] Performance Analysis - The frame rate decreases when simulating a 20x20x20 Rubik's Cube [3] - The model successfully solves the 20x20x20 Rubik's Cube, although it takes time [3]
GPT-5 Full Breakdown! (Everything You Need to Know)
Matthew BermanΒ· 2025-08-08 19:06
Model Overview - GPT5 is a hybrid model with both thinking and non-thinking versions, replacing previous models like GPT-4.0, GPT-4.1, and GPT-4.5 [2] - The model excels in coding, math, writing, health, and visual perception, adapting its response speed based on the complexity of the task [3][6] - GPT5 has three versions: standard, mini, and nano, with the mini version handling queries when usage limits are reached [7] - It features a 400,000 token context window, improving understanding of spacing, typography, and whitespace [9] Performance Benchmarks - GPT5 Pro achieved 100% on the Amy 2025 benchmark [18] - In enterprise metadata extraction, GPT5 shows a 5% to 8% improvement over GPT4.1%, averaging 90% overall accuracy [13] - GPT5 is 45% less likely to contain factual errors than GPT4.0, and when thinking, 80% less likely than OpenAI's 03 [27] - On the SWEBench verified coding benchmark, GPT5 achieved 74.9% compared to 30% with GPT4.0 [25] Availability and Versions - GPT5 is available to all users, with Plus subscribers getting more usage and Pro subscribers accessing GPT5 Pro [5] - GPT5 Pro is designed for challenging tasks, utilizing scaled parallel test time compute [36][37] - Box AI Studio offers GPT5 for enterprise document Q&A and analysis, trusted by over 100,000 organizations [13][14] Safety and Reliability - GPT5 communicates more honestly about its capabilities, recognizing when tasks cannot be completed [28][29] - It employs safe completions, a new safety training method, to provide helpful answers while staying within safety boundaries [32] Coding Capabilities - GPT5 is an excellent coding model, particularly in complex front-end generation and debugging larger repositories [8]
Forward Future Live August 8th, 2025
Matthew BermanΒ· 2025-08-08 16:58
AI Resources & Community - Newsletter provides regular AI updates [1] - Discover the best AI tools [1] - Discord community available [1] Social Media & Contact - X (formerly Twitter) presence [1] - Instagram presence [1] - Media/Sponsorship inquiries link provided [1] Document Focus - Guide focuses on Prompt Engineering for Humanities [1]