REFRAG

Search documents
X @Avi Chawla
Avi Chawla· 2025-10-12 19:29
RT Avi Chawla (@_avichawla)Researchers from Meta built a new RAG approach that:- outperforms LLaMA on 16 RAG benchmarks.- has 30.85x faster time-to-first-token.- handles 16x larger context windows.- and it utilizes 2-4x fewer tokens.Here's the core problem with a typical RAG setup that Meta solves:Most of what we retrieve in RAG setups never actually helps the LLM.In classic RAG, when a query arrives:- You encode it into a vector.- Fetch similar chunks from vector DB.- Dump the retrieved context into the LL ...
【AI产业跟踪-海外】首个 Agent 浏览器Fellou CE发布,微软推出14B数学推理Agent rStar2-Agent
GUOTAI HAITONG SECURITIES· 2025-09-17 12:17
请务必阅读正文之后的免责条款部分 1 of 5 【AI 产业跟踪-海外】首个 Agent 浏览器 Fellou CE 发 产业研究中心 | 布,微软推出 | [Table_Authors] 14B 数学推理 Agent rStar2-Agent 李嘉琪(分析师) | | --- | --- | | 摘要:产业最新趋势跟踪,点评产业最新风向 | 010-83939821 | | | lijiaqi2@gtht.com | | [Table_Summary] 行业资讯 AI | 登记编号 S0880524040001 | | ASML 入股 | Mistral AI | | 微软携手 Nebius | 签 174 亿美元算力协议 刘峰(研究助理) | | AI 应用资讯 | 0755-23976068 | | 首个 Agent | 浏览器 Fellou CE 发布 | | | liufeng6@gtht.com | | AI 大模型资讯 | 登记编号 S0880124060013 | | 微软推出 14B | 数学推理 Agent rStar2-Agent | AI 科技前沿 NVIDIA 发布 Rubin CP ...