Hugging Face
Search documents
X @TechCrunch
TechCrunch· 2025-07-16 22:41
How do you get people to trust robots? Make them cute.@huggingface's co-founder and chief scientist @Thom_Wolf explains why the company is betting on friendly AI hardware.Catch the full conversation on the latest episode of @EquityPod 👇https://t.co/PV5mO844tB https://t.co/hbJrX6ZXpM ...
他们,押注光学AI芯片
半导体芯闻· 2025-07-15 10:04
Core Viewpoint - The rise of artificial intelligence (AI) is driven by advancements in both hardware and algorithms, but current GPU technology is struggling to meet the demands of larger models, leading to energy and thermal challenges in data centers [1][4] Group 1: Company Overview - Arago is a startup focused on developing a hybrid photonic processor named "JEF" that aims to reduce power consumption and integrate with existing AI ecosystems [2][4] - The JEF chip processes AI workloads using photons instead of electrons, achieving ten times lower energy consumption compared to top GPUs without sacrificing throughput or compatibility [4] Group 2: Technology and Innovation - JEF is designed to be compatible with mainstream AI frameworks like PyTorch and TensorFlow, and it utilizes standard semiconductor manufacturing processes [4][7] - Arago has developed a complete software stack called Carlota, which abstracts the complexities of photonic computing and provides a programmable interface for developers [4][7] Group 3: Founding Team and Expertise - The founding team of Arago includes experts in photonics, chip design, machine learning, and software engineering, which is crucial for integrating new computing principles into modern AI workflows [5][6] - Notable advisors and investors include former Nvidia researchers and executives from major tech companies, highlighting the confidence in photonic computing [6] Group 4: Future Plans and Market Potential - With $26 million in seed funding, Arago plans to accelerate commercial deployment, complete silicon photonic integration, and expand its team for early deployment in AI inference, edge computing, and low-power data center applications [6][7] - If successful, Arago's technology could significantly reduce the energy footprint of AI and reshape computing systems in the post-Moore's Law era [7]
X @TechCrunch
TechCrunch· 2025-07-11 20:41
Product & Sales - Hugging Face's Reachy Mini, despite limited out-of-the-box functionality, achieved impressive day-one sales [1] Industry Trends - The discussion includes open-source bots and LangChain's $1 billion pivot [1] Company Focus - The report mentions Rivian's new side hustle [1] Social Media Platform - The report references another unusual week for X (formerly Twitter) [1]
Hugging Face Looks to Open-Source AI Robotics
Bloomberg Technology· 2025-07-11 19:20
Product & Technology - The company is developing an open-source, human-robot interaction platform named Richie, designed with emotional expression capabilities [1][2] - Richie is designed as a kit, similar to Lego, to encourage community tinkering and improvement through open-source contributions [3][4] - The platform includes sensors, cameras, microphones, and speakers, and is compatible with AI models for speech and image processing [2] - The company will provide an SDK to facilitate the creation of new robot behaviors and applications, encouraging community sharing and accessibility [6] Market & Business Model - The company aims to provide an accessible platform for building human-robot interactions, differentiating itself from expensive humanoid robots [2][7][9] - The open-source nature of Richie is intended to maximize the value of community-built behaviors and applications [9] Sales & Distribution - The company has already generated $1 million in revenue from this project within a few days of launch [11] - Approximately 60% of initial sales are from the U S market, with the remaining 40% from international markets, primarily Europe, Asia, and South America [11] Manufacturing & Sourcing - The company is exploring local sourcing and manufacturing options in various locations to leverage design simplicity and navigate trade environments [12][13]
腾讯研究院AI每周关键词Top50
腾讯研究院· 2025-07-11 07:29
Group 1: Models - Grok4 is a new model introduced by Elon Musk [2] - Phi-4 new version launched by Microsoft [2] - OpenAI released an open-weight model [2] - SmolLM3 developed by Hugging Face [2] - Skywork-R1V 3.0 from Kunlun Wanwei [2] - BlueLM-2.5-3B launched by Vivo [2] - DeepSeek-R1 plugin from Shanghai Jiao Tong University [2] - HumanOmniV2 developed by Alibaba [2] - Skywork-Reward-V2 from Kunlun Wanwei [2] - Enhanced version of DeepSeek by German TNG Company [2] - Sekai dataset from Shanghai AILab [2] Group 2: Applications - AI browser Comet developed by Perplexity [2] - MedGemma 27B launched by Google [2] - Zodiac Penguin AI co-creation by Tencent [2] - Veo 3 upgrade from Google [2] - Vidu Q1 launched by Vidu [2] - Deep Research application by Microsoft [2] - PaddleOCR 3.1 developed by Baidu [2] - FiS-VLA from Zhihua Technology [2] - Artistic 3D generation application by Tencent [2] - AlphaFold drug discovery by Isomorphic Labs [2] - Xiao Gao Teacher AI agent from Amap [3] - Claude development application by Apple developers [3] - MemOS utilizing memory tensors [3] - AI factory management by WeChat Work [3] - Gemini CLI update from Google [3] - Excel Agent by Shortcut [3] - 10-year chronic disease identification by ChatGPT [3] Group 3: Technology and Perspectives - Reachy Mini robot from Hugging Face [3] - Lingxi X2-N robot from Zhiyuan Robotics [3] - Mind World Model discussed by Meta [3] - Anti-framework approach by Cursor [3] - Google reports on large model usage [3] - Current state of consumer AI by Menlo Ventures [3] - AI entrepreneurship communication by Manus & YouTube [3] - AI product dissemination insights from Base44 founders [3] - CS education reform in American universities [3] - AGI humanoid robot by Figure [3] - AI company development research by ICONIQ Capital [3] - Context engineering discussed by Karpathy [3] - Market research on AI replacement by a16z [3] - AI entrepreneurship guide for enterprises by a16z [3] Group 4: Capital and Events - OpenAI officially acquired io [3] - Embodied Intelligence went public with Zhiyuan Robotics [3] - Meta poached talent from Apple [3] - AI review inducement by the Shexain team [3]
速递|OpenAI高管押注:25岁工程师重构AI检索底层逻辑,YC新秀ZeroEntropy获420万美元种子轮
Z Potentials· 2025-07-10 04:12
Core Insights - The article discusses the emergence of ZeroEntropy, a startup focused on enhancing data retrieval for AI models, which has raised $4.2 million in seed funding to improve the accuracy of large language models (LLMs) through effective data retrieval [1][2]. Group 1: Company Overview - ZeroEntropy is co-founded by Ghita Houir Alami and Nicholas Pipitone, and is based in San Francisco. The company aims to provide rapid, accurate, and large-scale data retrieval for AI models [1]. - The seed funding round was led by Initialized Capital, with participation from Y Combinator, Transpose Platform, 22 Ventures, a16z Scout, and several angel investors, including executives from OpenAI and Hugging Face [1]. - ZeroEntropy is positioned within a growing wave of infrastructure companies that are enhancing retrieval-augmented generation (RAG) technology for next-generation AI systems [1]. Group 2: Technology and Innovation - RAG technology is highlighted as a critical breakthrough for the next phase of AI development, allowing AI systems to pull data from external documents for various applications [2]. - ZeroEntropy's API is designed to unify data ingestion, index building, result re-ranking, and performance evaluation, distinguishing it from other enterprise-focused search products [2][3]. - The company claims its proprietary re-ranker, ze-rank-1, outperforms similar models from Cohere and Salesforce in both public and private retrieval benchmarks [3]. Group 3: Market Adoption and Impact - Over 10 early-stage companies are already utilizing ZeroEntropy to build AI systems across various sectors, including healthcare, law, customer support, and sales [4]. - The founder, Ghita Houir Alami, has a background in engineering and mathematics, and her previous experiences in AI development inspired her to create ZeroEntropy [4]. Group 4: Diversity and Inspiration - Ghita Houir Alami is noted as one of the few female CEOs in the AI infrastructure space, aiming to inspire more young women to pursue careers in STEM fields [5].
英伟达成为全球首家市值突破4万亿美元公司;Meta豪掷35亿美元入股全球最大眼镜制造商丨全球科技早参
Mei Ri Jing Ji Xin Wen· 2025-07-10 00:07
Group 1: Nvidia's Milestone - Nvidia briefly surpassed a market capitalization of $4 trillion, becoming the first publicly traded company to reach this milestone, driven by surging demand for AI technology [1] - Nvidia's stock price rose by 2.5% to a historical high of $164.42 per share, closing at $162.88 per share with a market cap of $3.97 trillion [1] - This milestone may serve as a confidence indicator for technology stocks, potentially increasing market attention on leading tech companies [1] Group 2: Departure of X CEO - X CEO Linda Yaccarino announced her departure from the company after two years, expressing gratitude to Elon Musk but not disclosing the reasons for her exit [2] - Yaccarino's departure may raise concerns regarding the stability of X's management, which could impact investment expectations [2] Group 3: Meta's Investment in EssilorLuxottica - Meta invested $3.5 billion to acquire approximately 3% of EssilorLuxottica, the world's largest eyewear manufacturer, known for brands like Ray-Ban and Oakley [3] - This investment could increase to 5% over time and aims to enhance collaboration on the Ray-Ban smart glasses project, marking a significant step for Meta in controlling its hardware supply chain [3] - Meta's move signals a positive outlook for the wearable device industry and may drive technological integration [3] Group 4: Perplexity's AI Browser Launch - AI startup Perplexity launched the AI web browser Comet, initially available to subscribers of its $200 monthly Perplexity Max plan, with plans for broader access via an invitation system [4] - Comet utilizes Perplexity as its primary search engine, offering AI-generated query responses and assisting users in purchasing products or booking hotels [4] - The launch of Comet may create competitive pressure on traditional browser manufacturers, affecting their user ecosystem [4] Group 5: Hugging Face's Desktop Robot Orders - Hugging Face announced that its Reachy Mini desktop robot is now available for order, with two versions priced at $499 for the wireless model and $299 for the wired version [5] - The robot comes pre-installed with demonstration programs and integrates with the open-source machine learning platform Hugging Face Hub, allowing users to develop and share custom functionalities [5] - The introduction of Reachy Mini may lower the barriers to entry for AI hardware, potentially stimulating innovation within the open-source community [5]
昆仑万维发布并开源Skywork-R1V 3.0版本;浙江大学发布高精准基因组设计AI模型丨AIGC日报
创业邦· 2025-07-10 00:00
Group 1 - Kunlun Wanwei released and open-sourced Skywork-R1V 3.0, achieving a score of 76.0 in the comprehensive multimodal evaluation MMMU, surpassing closed-source models like Claude-3.7-Sonnet (75.0) and GPT-4.5 (74.4), nearing the level of human junior experts (76.2) [1] - Hugging Face announced the release and open-sourcing of the small parameter model SmolLM3, which supports six languages and features a 128k context window, enabling deep and non-deep reasoning modes [1] - Zhejiang University developed a deep learning AI model named "Nuwa CE" for genomic prediction design, achieving over 90% accuracy in predicting phenotypic changes due to mutations in genomic regulatory regions, with results published in the journal Cell [1] Group 2 - Hugging Face's desktop robot Reachy Mini is now available for order, featuring two versions: Reachy Mini Wireless priced at $449 (approximately 3224 RMB) and Reachy Mini Lite at $299 (approximately 2147 RMB), both designed for developers [1][2] - Both versions of Reachy Mini are open-source DIY kits, comparable in size to a plush toy, equipped with screens and antenna structures, allowing users to program via Python and access over 1.7 million AI models and 400,000 datasets through the Hugging Face Hub [2]
腾讯研究院AI速递 20250710
腾讯研究院· 2025-07-09 14:49
Group 1: Veo 3 Upgrade - The Google Veo 3 upgrade allows audio and video generation from a single image, maintaining high consistency across multiple angles [1] - The new feature is implemented through the Flow platform's "Frames to Video" option, enhancing camera movement capabilities, although the Gemini Veo3 entry is currently unavailable [1] - User tests indicate natural expressions and effective performances, marking a significant breakthrough in AI storytelling applicable in advertising and animation [1] Group 2: Hugging Face 3B Model - Hugging Face has released the open-source 3B parameter model SmolLM3, outperforming Llama-3.2-3B and Qwen2.5-3B, supporting a 128K context window and six languages [2] - The model features a dual-mode system allowing users to switch between deep thinking and non-thinking modes [2] - It employs a three-stage mixed training strategy, trained on 11.2 trillion tokens, with all technical details, including architecture and data mixing methods, made available [2] Group 3: Kunlun Wanwei Skywork-R1V 3.0 - Kunlun Wanwei has open-sourced the Skywork-R1V 3.0 multimodal model, achieving a score of 142 in high school mathematics and 76 in MMMU evaluation, surpassing some closed-source models [3] - The model utilizes a reinforcement learning strategy (GRPO) and key entropy-driven mechanisms, achieving high performance with only 12,000 supervised samples and 13,000 reinforcement learning samples [3] - It excels in physical reasoning, logical reasoning, and mathematical problem-solving, setting a new performance benchmark for open-source models and demonstrating cross-disciplinary generalization capabilities [3] Group 4: Vidu Q1 Video Creation - Vidu Q1's multi-reference video feature allows users to upload up to seven reference images, enabling strong character consistency and zero storyboard video generation [4] - Users can combine multiple subjects with simple prompts, with clarity upgraded to 1080P, and support for character material storage for repeated use [5] - Test results show it is suitable for creating multi-character animation trailers, supporting frame extraction and quality enhancement, reducing video production costs to less than 0.9 yuan per video [5] Group 5: VIVO BlueLM-2.5-3B Model - VIVO has launched the BlueLM-2.5-3B edge multimodal model, which excels in over 20 evaluations and supports GUI interface understanding [6] - The model allows flexible switching between long and short thinking modes, introducing a thinking budget control mechanism to optimize reasoning depth and computational cost [6] - It employs a sophisticated structure (ViT+Adapter+LLM) and a four-stage pre-training strategy, enhancing efficiency and mitigating the text capability forgetting issue in multimodal models [6] Group 6: DeepSeek-R1 System - The X-Masters system, developed by Shanghai Jiao Tong University and DeepMind Technology, has achieved a score of 32.1 in the "Human Last Exam" (HLE), surpassing OpenAI and Google [7] - The system is built on the DeepSeek-R1 model, enabling smooth transitions between internal reasoning and external tool usage, using code as an interactive language [7] - X-Masters employs a decentralized-stacked multi-agent workflow, enhancing reasoning breadth and depth through collaboration among solvers, critics, rewriters, and selectors, with the solution fully open-sourced [7] Group 7: Zhihui Jun's Acquisition - Zhihui Jun's Zhiyuan Robot has acquired control of the listed company Shuangwei New Materials for 2.1 billion yuan, aiming for a 63.62%-66.99% stake [8] - Following the acquisition, Shuangwei New Materials' stock resumed trading with a limit-up, reaching a market value of 3.77 billion yuan, with the actual controller changing to Zhiyuan CEO Deng Taihua and core team members including "Zhihui Jun" Peng Zhihui [8] - This acquisition, conducted through "agreement transfer + active invitation," is seen as a landmark case for new productivity enterprises in A-shares following the implementation of national policies [8] Group 8: AI Model Usage Trends - In the first half of 2025, the Gemini series models captured nearly half of the large model API market, with Google leading at 43.1%, followed by DeepSeek and Anthropic at 19.6% and 18.4% respectively [9] - DeepSeek V3 has maintained a high user retention rate since its launch, ranking among the top five in usage, while OpenAI's model usage has fluctuated significantly [9] - The competitive landscape shows differentiation: Claude-Sonnet-4 leads in programming (44.5%), Gemini-2.0-Flash excels in translation, GPT-4o leads in marketing (32.5%), and role-playing remains highly fragmented [9] Group 9: AI User Trends - A report by Menlo Ventures indicates that there are 1.8 billion AI users globally, with a low paid user rate of only 3%, and a high student usage rate of 85%, while parents are becoming heavy users [10] - AI is primarily used for email writing (19%), researching topics of interest (18%), and managing to-do lists (18%), with no single task dependency exceeding one-fifth [10] - The next 18-24 months are expected to see six major trends in AI: rise of vertical tools, complete process automation, multi-person collaboration, explosion of voice AI, physical AI in households, and diversification of business models [10]
X @TechCrunch
TechCrunch· 2025-07-09 07:01
Hugging Face opens up orders for its Reachy Mini desktop robots | TechCrunch https://t.co/VnfiBSiyT4 ...