RAG
Search documents
湖南发布“古籍修复知识库系统” 打造古籍修复“数字百科”
Zhong Guo Xin Wen Wang· 2025-10-14 09:26
Group 1 - The Hunan Library has launched an "Ancient Book Restoration Knowledge Base System" aimed at serving practitioners nationwide and is open to the public for free [1] - The system integrates ancient book restoration techniques with modern technologies such as artificial intelligence and RAG, addressing challenges in knowledge transmission and talent cultivation [1] - The knowledge base includes over 200 restoration cases, more than 200 types of restoration paper, over 300 professional books, 800+ professional papers, and 100+ restoration techniques, creating a "digital encyclopedia" for ancient book restoration [1] Group 2 - The "Ancient Book Protection Course in Schools" initiative was launched in Hunan, aiming to engage more young people in the preservation of ancient books and promote traditional Chinese culture [2]
X @Avi Chawla
Avi Chawla· 2025-10-12 06:31
Core Innovation - REFRAG - Meta's REFRAG fundamentally rethinks retrieval in RAG setups by compressing and filtering context at a vector level [1] - REFRAG compresses each chunk into a single compressed embedding and uses a relevance policy trained via RL to select the most relevant chunks [1][2] - Only selected chunks are expanded back into full embeddings and passed to the LLM, processing only what matters [2] Technical Details - REFRAG encodes documents and stores them in a vector database [2] - It encodes the full user query, finds relevant chunks, and computes token-level embeddings for both [3] - A relevance policy, trained via RL, selects chunks to keep [3][5] - Token-level representations of the input query are concatenated with selected chunks and a compressed single-vector representation of rejected chunks before being sent to the LLM [3] Performance Metrics - REFRAG outperforms LLaMA on 16 RAG benchmarks [4][6] - It achieves 30.85x faster time-to-first-token, which is 3.75x better than previous state-of-the-art [4][6] - REFRAG handles 16x larger context windows [4][6] - It utilizes 2-4x fewer tokens [4][6] - REFRAG leads to no accuracy loss across RAG, summarization, and multi-turn conversation tasks [6]
很严重了,大家别轻易离职。。
菜鸟教程· 2025-10-10 03:30
Core Insights - The biggest opportunity in the AI industry by 2025 lies in the application layer, with companies like ByteDance rapidly expanding their AI teams and job postings for AI-related positions surging [1][3] - There is a significant demand for large model application development engineers, with over 60% of enterprises pushing for AI product implementation, yet these skilled professionals are extremely scarce [1][3] - The average monthly salary for AI positions is 78,000 yuan, with internships offering daily wages as high as 4,000 yuan, indicating the high value of AI skills in the job market [1][3] Group 1 - Companies are increasingly focusing on three core capabilities for AI application: RAG (Retrieval-Augmented Generation), Agent intelligence, and fine-tuning for specific tasks [1][3] - The rapid growth in job postings for large model-related positions, with over 1,000 companies hiring, highlights the urgent need for skilled professionals in the AI sector [1][3] - The transition to AI roles is lucrative, with some individuals already earning annual salaries exceeding one million yuan after shifting to AI-focused positions [1][3] Group 2 - A specialized course titled "Large Model Application Development Practical Training" is being offered to help developers master essential AI skills, including RAG, Agent, and fine-tuning [3][5] - The course includes live sessions that combine theoretical knowledge with practical project demonstrations, aiming to equip participants with the skills needed for enterprise-level projects [3][5] - Participants will receive a job-seeking package that includes interview question banks and insights into high-paying job opportunities [3][5] Group 3 - The course has already served over 20,000 students, receiving positive feedback for its effectiveness in enhancing learning outcomes and job placement success [8] - The training program emphasizes the importance of building a technical barrier to stand out in the competitive job market and avoid potential layoffs [10][11] - The course also offers opportunities for direct referrals and job placements, increasing the chances of securing high-paying positions in the AI field [13][17]
X @Avi Chawla
Avi Chawla· 2025-10-02 06:31
Technology & Innovation - Airweave builds live, bi-temporal knowledge bases for agents to reason on the freshest facts [1] - Supports fully agentic retrieval with semantic and keyword search, query expansion, and more across 30+ sources [1] - Airweave is 100% open-source [1] Real-time Data Challenge - RAG (Retrieval-Augmented Generation) struggles with real-time data [1]
具身领域的大模型基础部分,都在这里了......
具身智能之心· 2025-09-20 16:03
Core Viewpoint - The article emphasizes the importance of a comprehensive community for learning and sharing knowledge about large models, particularly in the fields of embodied AI and autonomous driving, highlighting the establishment of the "Large Model Heart Tech Knowledge Planet" as a platform for collaboration and technical exchange [1][3]. Group 1: Community and Learning Resources - The "Large Model Heart Tech" community aims to provide a platform for technical exchange related to large models, inviting experts from renowned universities and leading companies in the field [3][67]. - The community offers a detailed learning roadmap for various aspects of large models, including RAG, AI Agents, and multimodal models, making it suitable for beginners and advanced learners [4][43]. - Members can access a wealth of resources, including academic progress, industrial applications, job recommendations, and networking opportunities with industry leaders [7][70]. Group 2: Technical Roadmaps - The community has outlined specific learning paths for RAG, AI Agents, and multimodal large models, detailing subfields and applications to facilitate systematic learning [9][43]. - For RAG, the community provides resources on various subfields such as Graph RAG, Knowledge-Oriented RAG, and applications in AIGC [10][23]. - The AI Agent section includes comprehensive introductions, evaluations, and advancements in areas like multi-agent systems and self-evolving agents [25][39]. Group 3: Future Plans and Engagement - The community plans to host live sessions with industry experts, allowing members to engage with leading figures in academia and industry [66]. - There is a focus on job sharing and recruitment information to empower members in their career pursuits within the large model domain [70].
但我还是想说:建议个人和小团队不要碰大模型训练!
自动驾驶之心· 2025-09-20 16:03
Core Viewpoint - The article emphasizes the importance of utilizing open-source large language models (LLMs) and retrieval-augmented generation (RAG) for businesses, particularly for small teams, rather than fine-tuning models without sufficient original data [2][6]. Group 1: Model Utilization Strategies - For small teams, deploying open-source LLMs combined with RAG can cover 99% of needs without the necessity of fine-tuning [2]. - In cases where open-source models perform poorly in niche areas, businesses should first explore RAG and in-context learning before considering fine-tuning specialized models [3]. - The article suggests assigning more complex tasks to higher-tier models (e.g., o1 series for critical tasks and 4o series for moderately complex tasks) [3]. Group 2: Domestic and Cost-Effective Models - The article highlights the potential of domestic large models such as DeepSeek, Doubao, and Qwen as alternatives to paid models [4]. - It also encourages the consideration of open-source models or cost-effective closed-source models for general tasks [5]. Group 3: AI Agent and RAG Technologies - The article introduces the concept of Agentic AI, stating that if existing solutions do not work, training a model may not be effective [6]. - It notes the rising demand for talent skilled in RAG and AI Agent technologies, which are becoming core competencies for AI practitioners [8]. Group 4: Community and Learning Resources - The article promotes a community platform called "大模型之心Tech," which aims to provide a comprehensive space for learning and sharing knowledge about large models [10]. - It outlines various learning pathways for RAG, AI Agents, and multi-modal large model training, catering to different levels of expertise [10][14]. - The community also offers job recommendations and industry opportunities, facilitating connections between job seekers and companies [13][11].
X @Avi Chawla
Avi Chawla· 2025-09-20 06:33
The ultimate Full-stack AI Engineering roadmap to go from 0 to 100.This is the exact mapped-out path on what it actually takes to go from Beginner → Full-Stack AI Engineer.> Start with Coding Fundamentals.> Learn Python, Bash, Git, and testing.> Every strong AI engineer starts with fundamentals.> Learn how to interact with models by understanding LLM APIs.> This will teach you structured outputs, caching, system prompts, etc.> APIs are great, but raw LLMs still need the latest info to be effective.> Learn h ...
真的花了好久才汇总的大模型技术路线......
具身智能之心· 2025-09-16 00:03
Core Insights - The article emphasizes the transformative impact of large models on various industries, highlighting their role in enhancing productivity and driving innovation in fields such as autonomous driving, embodied intelligence, and generative AI [2][4]. Group 1: Large Model Technology Trends - The large model industry is undergoing significant changes characterized by technological democratization, vertical application, and open-source ecosystems [2]. - There is a growing demand for talent skilled in technologies like RAG (Retrieval-Augmented Generation) and AI Agents, which are becoming core competencies for AI practitioners [2][4]. - The article introduces a comprehensive learning community focused on large models, offering resources such as videos, articles, learning paths, and job exchange opportunities [2][4]. Group 2: Learning Pathways - The community provides detailed learning pathways for various aspects of large models, including RAG, AI Agents, and multimodal models [4][5]. - Specific learning routes include Graph RAG, Knowledge-Oriented RAG, and Reasoning RAG, among others, aimed at both beginners and advanced learners [4][5]. - The pathways are designed to facilitate systematic learning and networking among peers in the field [5]. Group 3: Community Benefits - Joining the community offers benefits such as access to the latest academic advancements, industrial applications, and job opportunities in the large model sector [7][9]. - The community aims to create a collaborative environment for knowledge sharing and professional networking [7][9]. - There are plans for live sessions with industry leaders to further enrich the community's offerings [65][66].
RAG 的概念很糟糕,让大家忽略了应用构建中最关键的问题
Founder Park· 2025-09-14 04:43
Core Viewpoint - The article emphasizes the importance of Context Engineering in AI development, criticizing the current trend of RAG (Retrieval-Augmented Generation) as a misleading concept that oversimplifies complex processes [5][6][7]. Group 1: Context Engineering - Context Engineering is considered crucial for AI startups, as it focuses on effectively managing the information within the context window during model generation [4][9]. - The concept of Context Rot, where the model's performance deteriorates with an increasing number of tokens, highlights the need for better context management [8][12]. - Effective Context Engineering involves two loops: an internal loop for selecting relevant content for the current context and an external loop for learning to improve information selection over time [7][9]. Group 2: Critique of RAG - RAG is described as a confusing amalgamation of retrieval, generation, and combination, which leads to misunderstandings in the AI community [5][6]. - The article argues that RAG has been misrepresented in the market as merely using embeddings for vector searches, which is seen as a shallow interpretation [5][7]. - The author expresses a strong aversion to the term RAG, suggesting that it detracts from more meaningful discussions about AI development [6][7]. Group 3: Future Directions in AI - Two promising directions for future AI systems are continuous retrieval and remaining within the embedding space, which could enhance performance and efficiency [47][48]. - The potential for models to learn to retrieve information dynamically during generation is highlighted as an exciting area of research [41][42]. - The article suggests that the evolution of retrieval systems may lead to a more integrated approach, where models can generate and retrieve information simultaneously [41][48]. Group 4: Chroma's Role - Chroma is positioned as a leading open-source vector database aimed at facilitating the development of AI applications by providing a robust search infrastructure [70][72]. - The company emphasizes the importance of developer experience, aiming for a seamless integration process that allows users to quickly deploy and utilize the database [78][82]. - Chroma's architecture is designed to be modern and efficient, utilizing distributed systems and a serverless model to optimize performance and cost [75][86].
宇树科技官宣IPO后王兴兴首次发声:我最后悔的是以前没有学AI;甲骨文与OpenAI签署3000亿美元的算力协议丨AIGC日报
创业邦· 2025-09-12 00:12
Group 1 - Tencent's Youtu-GraphRAG has been open-sourced, featuring a new graph retrieval-enhanced generation framework that combines large language models with RAG mode, aimed at improving accuracy and traceability in complex Q&A tasks, particularly in knowledge-intensive scenarios like enterprise knowledge base Q&A and personal knowledge management [2] - Yushu Technology's CEO Wang Xingxing expressed regret for not learning AI earlier, highlighting the rapid advancements in AI capabilities and the potential for integrating AI with robotics, especially in light of the company's recent IPO announcement [2] - California's legislature is moving towards regulating AI chatbots with the passage of SB 243, which will require operators to implement safety protocols and hold companies legally accountable if standards are not met, set to take effect on January 1, 2026 [2] - Oracle has reportedly signed a $300 billion computing power agreement with OpenAI, marking one of the largest cloud service contracts in history, requiring 4.5 gigawatts of power capacity [2]