Workflow
LoRA
icon
Search documents
开发者狂喜:Thinking Machines发布首款产品Tinker,后训练麻烦全给包了
机器之心· 2025-10-02 03:12
机器之心报道 机器之心编辑部 对于大模型开发者 / 研究者来说,今天是重要的一天。 因为刚刚,OpenAI 前 CTO Mira Murati 创办的 Thinking Machines 推出了首款产品 ——Tinker。 简单来说,Tinker 是一个 API,用于帮开发者 / 研究人员微调语言模型。重要的是,在此过程中, 你只需要专注于训练数据和算法,而你不擅长的关于 Infra 的部 分 —— 调度、调优、资源管理和 Infra 可靠性 —— 统统由 Tinker 来搞定 ,这将大大简化 LLM 的后训练过程。 | You focus on 您关注 | You write 您编写 | We handle 我们处理 | | --- | --- | --- | | I Datasets and RL | | 4 Efficient distributed | | environments | Simple Python script | training of large models | | ■ 数据集和强化学习环境 | 简单的 Python 脚本 | ♦ 高效的大规模模型分布式 | | Your cus ...
用微信聊天记录来做AI数字的你,开源了
3 6 Ke· 2025-05-16 07:19
Core Insights - The WeClone project has gained significant attention as a solution for creating digital avatars based on WeChat chat records, utilizing large language models and fine-tuning techniques [1][2][3] - The project leverages RAG knowledge base principles to import WeChat chats and fine-tune models, enabling users to generate personalized digital personas [2][3] - The project is open-source and has garnered 8.7k stars on GitHub, indicating strong community interest and engagement [1] Project Overview - WeClone allows users to create digital avatars from their WeChat chat records, which are considered personal and detailed knowledge bases [3][7] - The project employs a default model, Qwen2.5-7B-Instruct, and utilizes LoRA for fine-tuning, requiring approximately 16GB of GPU memory [2] - The project includes features for automatic speech recognition (ASR) and text-to-speech (TTS), enabling the digital avatar to mimic the user's voice [2] Applications and Use Cases - The project can generate digital personas for various roles, including customer service representatives, marketing agents, and financial advisors, by utilizing chat records as knowledge bases [7] - Digital avatars can help reduce costs in customer service by automating responses based on accumulated chat data, thus eliminating the need for separate knowledge base management [7] - The ability to create tailored digital personas for different industries and roles enhances the effectiveness of communication and service delivery [7] Technical Implementation - Users can extract WeChat chat records using PyWxDump, with specific instructions for data migration and export in CSV format [6] - The project supports customization of dialogue names and system prompts, allowing users to personalize their digital avatars further [5] Community Engagement - The project encourages community participation by inviting users to join development groups for sharing product design cases and contributing to the development of digital personas [8]