Sequoia Capital
OpenAI's Sora 2: Anime, Physics, and World Simulation
Sequoia Capital· 2025-11-13 18:30
We did spend a lot of time really thinking about what the optimal data mix for the world simulator looks like. I think in some cases we'll make decisions that, you know, maybe are for making the model really fun. For example, people love generating anime, but anime does not necessarily perfectly represent the laws of physics that are directly useful for real-world applications. To put it another way, right, I think in anime there are certain primitives that are simplified t ...
How Google’s Nano Banana Achieved Breakthrough Character Consistency
Sequoia Capital· 2025-11-11 10:00
Model Development & Capabilities
- Google's Nano Banana image model, built upon the Gemini model, achieves single image character consistency through high-quality data, long multimodal context windows, and disciplined human evaluations [3][4][32][33]
- The model benefits from Gemini's multimodal foundational capabilities, including a long context window that allows for multiple image inputs and iterative conversations [33][34]
- A key technical breakthrough is the model's ability to generalize well, enabling it to maintain character consistency and edit images while preserving untouched elements [32][33][24]
- Craft and attention to detail in data selection and model design are as important as scale in achieving high-quality results [4][38][39]
Applications & Use Cases
- The model facilitates consistent character and scene preservation in video models, enabling smoother video creation with natural scene cuts [6][7][8]
- Users are creatively "hacking" the model for learning and information digestion, such as creating sketch notes from complex topics [9][10]
- The model allows users to see themselves in new ways, enhancing self-expression and identity through 3D figurines and other creative outputs [14]
- The technology has potential for personalized learning, multimodal creation, and specialized UIs that combine fine-grained control with automation [4][69][70]
Business & Product Strategy
- Google aims to build a single, powerful model capable of handling any modality and transforming it into any other, with specialized models like Imagen and Veo serving as stepping stones [47][48][49]
- The company is focusing on making the technology more accessible and easier to use for consumers, while also developing more precise control and robustness for professional workflows [43][66][67][68]
- Google is exploring new visual creation canvases and UIs to enhance user interaction with the models, moving beyond simple chatbot interfaces [72][73][74]
- Startups have opportunities to develop workflow-based tools for various verticals, leveraging the fundamental technology to address specific client needs [111][112]
Safety & Ethical Considerations
- Google is committed to preventing misuse of the technology, particularly in creating deepfakes and misinformation [89][90]
- The company employs visible watermarks and invisible SynthID to indicate AI-generated content and verify its origin [91][92][95]
- Google invests in ongoing testing and mitigation strategies to address new attack vectors and ensure responsible use of the models [93]
How OpenAI's Sora 2 goes beyond video generation.
Sequoia Capital· 2025-11-06 17:01
When you put enough compute and data into these systems, in order to actually solve this task of predicting the next token, you need to develop an internal representation of how the world functions, right. You need to, like, simulate things. The models make lots of mistakes right now at low compute scales. But as you continue pushing, you know, from 3 to 4 to 5, you just see these internal world models get more and more robust. And it's really analogous for video, right. And in many ways more explici ...
From Early Failures to ‘Clash of Clans’ and ‘Brawl Stars’ - Supercell ft Ilkka Paananen
Sequoia Capital· 2025-11-06 10:00
Oftentimes what tends to happen, especially at successful games companies, is that the game developers who actually build the games sort of lose control, and the control moves to upper management and so forth. And then this idea kind of started to grow on me and my fellow co-founders: what if you founded a completely new type of games company where you would almost flip the organizational chart upside down? Meaning that instead of like the upper ...
Reinventing Delivery with Instant Drone Transport: Zipline's Keller Cliffton
Sequoia Capital· 2025-10-23 16:41
For a decade, it's been clear that, you know, getting permission to fly beyond visual line of sight in the US is really the holy grail from a regulatory perspective. And Zipline in 2023 became the first company in US history to be awarded full approval to fly beyond visual line of sight in all 50 states. And it was a milestone 10 years in the making because, you know, we had started building Zipline in 2013 thinking, you know, that ultimately like will this ever be ...
Securing the AI Frontier: Irregular Founder Dan Lahav
Sequoia Capital· 2025-10-21 09:00
There was a scenario where there was an agent-on-agent interaction. It was a critical security task; that was the simulation that they were in. But after working for a while, one of the models decided that they had worked enough and should stop. It did not stop there. It convinced the other model that they should both take a break. So the model did social engineering on another model. But now try to think about a situation where you actually as an enterprise are delegating an ...
Building the "App Store" for Robots: Hugging Face's Thomas Wolf on Physical AI
Sequoia Capital· 2025-09-09 09:00
Many, many startups are already being built on top of the robot. You know, they want to build something: they have this idea of a manual task they can automate, or they have an idea of something they could do in the physical world. Then they take the robot, they take the basic building blocks we've already shipped, which is just a very simple robotic arm, the SO-100, that we designed basically to be the cheapest robotic arm, at $100, and they're already like trying to start business around thi ...
How Crosby is Building an AI Law Firm on Deal Velocity not Billable Hours
Sequoia Capital· 2025-09-02 09:01
I think lawyers are quite good at learning, but in a law firm structure, as much time goes into apprenticeship. It's a teaching hospital. You don't actually spend that much time getting really good at teaching because you just do it through reps and reps and reps and reps. And so actually explaining things is something that I think is going to be a very prized skill, not just for lawyers but for any domain experts, but in particular lawyers. And we're seeing it like when you can make an AI do this th ...
The $10 Trillion AI Revolution: Why It’s Bigger Than the Industrial Revolution
Sequoia Capital· 2025-08-28 09:01
AI Revolution Thesis
- Sequoia believes the AI revolution is comparable to the industrial revolution, presenting a significant transformation [1][2]
- The cognitive revolution represents a $10 trillion (10 to the 13th power) opportunity [1][8]
- Startups are crucial in specializing general AI technologies for specific applications [6]
Commercial Opportunity
- AI-driven automation currently accounts for approximately $20 billion of the US services market and holds a $10 trillion potential [8]
- The cognitive revolution can expand the market to include large, standalone public companies built around AI in the services space [12]
Investment Trends
- Work is shifting towards higher leverage (100+%) on tasks with less certainty in outcomes, requiring human correction [13][14][15]
- Real-world measurement is becoming the new gold standard for proving AI excellence, surpassing academic benchmarks [15][16][17]
- The industry forecasts at minimum a 10x increase in compute (FLOPs) per knowledge worker, with optimistic views suggesting 1,000x to 10,000x consumption [20]
Investment Themes
- Persistent memory, including long-term memory and consistent AI identity, is critical for AI's expansion into more work functions [21][22][23]
- Seamless communication protocols between AIs, beyond initial protocols like MCP, will yield major applications [24][25]
- AI voice is currently viable due to increased fidelity and decreased latency, with applications in both B2C and enterprise sectors [27][28][29]
- AI security presents a huge opportunity across development, distribution, and user layers, potentially involving numerous AI security agents per person/agent [30][31][32][33]
- Open source's ability to compete with state-of-the-art foundation models is critical for a free, open AI future [34][35][36]
Building in the application layer? Gamma's Jon Noronha gives advice for #founders in #AI
Sequoia Capital· 2025-08-19 19:22
Product Strategy & Market Differentiation
- Gamma's unique perspective focuses on differentiating the presentation medium itself, aiming to replace traditional slide decks [1]
- The company advises application layer founders to identify their unique market lens to navigate competitive spaces [2]
- It cautions against creating similar AI coding startups and suggests exploring neglected areas where AI is not heavily applied [2]
- The industry should consider working on areas that foundation models are not heavily optimizing for to avoid direct competition with larger entities [3]
Technology & Experimentation
- The industry should incorporate experimentation and try different models, avoiding reliance on a single provider [4]
- Rapid and unpredictable innovation requires planning for a dynamic environment with potentially changing best models [4]