Workflow
大语言模型
icon
Search documents
沉浸式翻译团队新品:BabelDOC PDF,无损翻译 PDF,免费用户可用
Founder Park· 2025-04-30 12:31
Core Viewpoint - BabelDOC has developed a PDF translation tool that effectively addresses common issues in machine translation, such as formatting errors and layout inconsistencies, allowing for precise PDF output. Group 1: Product Features - BabelDOC achieved a top-three ranking in the GitHub Trending list for all development languages shortly after its release [2] - The tool supports multiple languages, enabling translations from Latin-based languages to Simplified Chinese, Traditional Chinese, Japanese, and Korean, as well as mutual translations among Chinese, Japanese, and Korean [2] - Free users can process up to 1,000 pages per month, while Pro users can process up to 10,000 pages and access advanced translation models [3] Group 2: Technical Implementation - BabelDOC can extract and translate embedded elements in PDFs, such as charts, footnotes, and formulas, ensuring pixel-level layout alignment with the original document [7] - The tool utilizes AI layout recognition technology to identify text layout, paragraph structure, and complex formatting, which is crucial for maintaining the integrity of professional documents [7][9] - After recognizing the layout, the extracted text is translated using a large language model, and the translated text is matched with the original formatting to ensure consistency [8][9] Group 3: Understanding PDF Complexity - PDF (Portable Document Format) was invented by John Warnock in the early 1990s to ensure consistent document display across different devices [13] - PDF documents have unique advantages, such as strong cross-platform compatibility and high-quality printing, but they are less editable compared to DOCX formats [14] - The structure of a PDF is complex, resembling a tree with various components, including a file header, page tree, cross-reference table, and content flow, which complicates the translation process [16][19]
新华财经早报:4月30日
Xin Hua Cai Jing· 2025-04-30 02:13
Group 1: Financial Performance - Guizhou Moutai achieved a record revenue of 51.443 billion yuan in Q1, a year-on-year increase of 10.67%, and a net profit of 26.847 billion yuan, up 11.56% year-on-year [5][8] - Vanke A reported a revenue decline of 38.31% to 37.995 billion yuan in Q1, with a net loss of 6.246 billion yuan compared to a net loss of 362 million yuan in the same period last year [5][8] - Major state-owned banks announced the decision to abolish their supervisory boards, which requires approval from the shareholders' meeting [4][8] Group 2: Market Developments - The National Development and Reform Commission (NDRC) announced the issuance of 81 billion yuan in special long-term bonds to support the consumption upgrade policy [4] - The bond market saw a total issuance of 87,356.6 billion yuan in March, with government bonds accounting for 12,786.3 billion yuan and corporate credit bonds for 13,335.2 billion yuan [4] - The Hong Kong Stock Exchange is preparing to assist Chinese companies that have not yet listed in Hong Kong to return to the market [4] Group 3: Industry Trends - The steel industry reported a total revenue of 1.436 trillion yuan in Q1, a year-on-year decrease of 6.61%, while total profits increased by 108% to 21.583 billion yuan [4] - The real estate sector continues to face challenges, as evidenced by Vanke A's significant revenue drop [5] - The consumer confidence index in the U.S. fell for the fifth consecutive month, indicating potential impacts on global market sentiment [6]
沃尔玛态度转变:恢复中国供应商出货,美国客户承担关税成本;传饿了么加入外卖大战;因未按时公示年报,引望公司被列为经营异常
雷峰网· 2025-04-30 00:30
1. 网传中国半导体设备厂将大规模重组:200多家半导体设备公司或整合为10家大型企业 2.沃尔玛态度转变:恢复中国供应商出货,美国客户承担关税成本 3. 腾讯TEG架构调整:成立大语言和多模态模型部 4.传英伟达将在中国成立合资公司、为DeepSeek定制芯片,官方辟谣 5. 网传饿了么加入外卖大战: 正打印百亿补贴横幅 6.长城要做超跑?长城CTO吴会肖回应:5年前就在做,没想到大家这么关注 7.曝iPhone 2700个零部件:仅30家供应商完全在中国境外 8.OpenAI涉足电商领域!用户可通过ChatGPT购买商品 今日头条 HEADLINE NEWS 网传中国半导体设备厂将大规模重组:200多家半导体设备公司或整合为10家大型企业 据媒体报道,传中国正在推动一项政策,计划将200多家半导体设备公司整合为10家大型企业。这项政策 旨在提升中国半导体设备产业的竞争力,以应对美国的制裁压力。中国半导体自给率目前约为23%,在美 国政府的高压施压下,中国似乎计划采取资源集中策略,扶持具有潜力的企业。 今年3月,中国半导体设备龙头企业北方华创就有类似的动作,该公司以16.9亿元收购涂胶显影设备厂芯 源微9. ...
中科金财(002657) - 002657中科金财投资者关系管理信息20250429
2025-04-29 14:40
Group 1: Financial Performance - The company's AI comprehensive service revenue increased to 208 million in 2024, with a significant growth of 86% in Q4 of the previous year, achieving profitability [1][4] - In Q1 2025, the AI comprehensive service revenue showed a year-on-year increase, although the company experienced a loss [4][8] - The gross margin for AI comprehensive services in 2024 was 20.70% [4] Group 2: AI Business Development - The company aims to enhance its AI Agent capabilities, focusing on multi-task and complex task agents, with existing orders already in place [2] - The AI Agent product line includes various applications such as intelligent customer service agents and intelligent credit agents, enhancing operational efficiency in banking [2] - The company has developed a global distribution platform for AI content, including micro-short films, although these products currently contribute a small percentage to overall revenue [3] Group 3: Research and Development - R&D expenses for Q1 2025 were 46.47 million, a 22.77% increase from 37.85 million in the same period last year [8] - The primary focus of R&D investments includes multi-modal applications, AI Agents, and large language models [8] - The company has established a comprehensive AI service framework, covering computational infrastructure, algorithms, and multi-modal applications [7] Group 4: Strategic Partnerships - The company collaborates with Alibaba Cloud as a partner and service provider for AI large model frameworks, enhancing its capabilities in the financial sector [6] - It has formed extensive partnerships with leading enterprises in the AI field, promoting the application of AI technologies across various industries [7]
对谈 Pokee.ai 朱哲清:强化学习做核心,Agent 的少数派造法
晚点LatePost· 2025-04-29 08:43
可能是更高效、更便宜的 Agent 实现路径。 文 丨 孙海宁 编辑 丨 程曼祺 主流 AI Agent 都把大语言模型(LLM,或者它的多模态版本)当作 "大脑",靠一个或几个 LLM 编 排工作、调用工具。但也有另一条路:Agent 规划、作业靠不依赖自然语言的强化学习模型,LLM 只 充当 Agent 和人类的 "交互层"。 不一样的想法,来自去年 10 月成立,至今只有 4 个正式员工的 Pokee.ai。 Pokee.ai 创始人朱哲清有十余年强化学习研究、落地经验。2017 年起,从杜克大学计算机科学专业毕 业的朱哲清,一边在斯坦福大学攻读强化学习方向博士学位,师从 Benjamin Van Roy;一边在 Meta 工作,曾任 Meta"应用强化学习" 部门负责人,他用强化学习算法改善内容推荐系统,把上任前只剩 3 人,一度要关停的部门扩张至 10 余人,为 Meta 增收 5 亿美元。 靠 LLM 规划、决策,是个自然而主流的想法。OpenAI Operator 和网页交互、操作电脑的能力基于 GPT-4o 模型,Manus 完成任务则是靠 Claude 3.5 Sonnet 模型做长程规划。 ...
阿里Qwen3系列开源:混合推理模式、性能超越DeepSeek R1
Founder Park· 2025-04-29 03:16
以下文章来源于赛博禅心 ,作者金色传说大聪明 赛博禅心 . 拜AI古佛,修赛博禅心 今天凌晨,Qwen3 发布。 本次共开源 8 款模型,包括 2 款 MoE 模型、6 款 Dense 模型。 Qwen3 系列 在代码、数学、通用能力等方面能力表现优异, 其中 235B 版本,在基 准测试上的水平超过了 671B 的 DeepSeek R1 。 同时, Qwen3 引入了「 思考模式/非思考模式 」无缝切换的功能。 在 思考模式下, 模型逐步推理,经过深思熟虑后给出最终答案。非思考模式 下,能够 提供快速的即时响应,适用于简单问题的回答。混合推理的模式平衡了算力和输出效果。 此外, Qwen3 系列提高了 Agent 能力, 同时也加强了对 MCP 的支持。Qwen 配套了一个 Qwen-Agent 项目,可以使用 API 进行工具调用, 或结合现有的工具链进行扩展。 | | | Qwen3 | | | | | | --- | --- | --- | --- | --- | --- | --- | | | | 通义千问最新一代大模型:采用混合专家架构,具备思考与快速回答双模式,支持119种语言 | | | | ...
Qwen3深夜正式开源,小尺寸也能大力出奇迹。
数字生命卡兹克· 2025-04-29 00:05
小道消息一直在说,昨天深夜或者今天凌晨,阿里会发Qwen3。 然后我特意早早的睡了一两小时,凌晨1点起床,就为了等Qwen3发。 结果这一等,就是好几个小时。。。 不过,功夫不负有心人。 凌晨5点,我眼睛都睁不开的时候,终于等到了。 Qwen你赔我睡眠。。。 把报告看完,我总结一下,觉得最大的亮点有6个: 1. 模型能力登顶全球,这个没啥可说的,就是No.1。 2. 第一个开源的混合推理模型。 3. 8个不同尺寸的模型,几乎覆盖了所有场景。 4. 成本很低, 旗舰模型235B参数部署成本只要DeepSeek R1的三分之一。 5. 支持MCP协议。 6. 居然还支持了119种语言。 一起说吧。 就像我们其实都知道,DeepSeek这个深度思考,你打开的时候,是R1模型,但是你关掉,其实用的是v3来给你回答。 但是Qwen3,是一体的。 是一个模型,只不过支持了两种模式,这个不管对于开发者还是使用者,都方便很多。 这次发了8个模型,Qwen3-0.6B、1.7B、4B、8B、14B、32B,这6个都是Dense稠密模型。 还有两个重量级MoE模型,Qwen3-30B-A3B,和旗舰版的Qwen3-235B-A2 ...
阿里Qwen3深夜开源,8款模型、集成MCP,性能超DeepSeek-R1,2小时狂揽16.9k星
3 6 Ke· 2025-04-28 23:23
Core Insights - Alibaba Cloud has officially open-sourced the Qwen3 series models, which include 2 MoE models and 6 dense models, achieving over 16.9k stars on GitHub within 2 hours of release [2][3] Model Features - The Qwen3 series features 8 parameter sizes ranging from 0.6B to 235B, with flagship models like Qwen3-235B-A22B and Qwen3-30B-A3B showcasing significant capabilities in programming, mathematics, and general reasoning [4][12] - The introduction of a hybrid thinking mode allows users to switch between "thinking" and "non-thinking" modes, enabling control over the depth of reasoning [15][16] - Enhanced reasoning capabilities surpass previous models in mathematics, code generation, and common-sense logic [4][15] Performance Metrics - Qwen3 models have demonstrated superior performance in various benchmarks compared to well-known models such as DeepSeek-R1 and OpenAI's models [12][13] - The Qwen3-30B-A3B model achieves performance exceeding that of QwQ-32B while using only 1/10 of the activated parameters [11][12] - The pre-training dataset for Qwen3 has doubled in size to approximately 3600 billion tokens, enhancing its capabilities in STEM and programming tasks [20][21] Deployment and Accessibility - The Qwen3 models are open-sourced on platforms like Hugging Face, ModelScope, and Kaggle, under the Apache 2.0 license [7] - Developers are encouraged to utilize various frameworks and tools for local deployment, including SGLang and vLLM [9] Future Directions - The company aims to continue enhancing model capabilities by optimizing architecture and training methods, focusing on expanding data scale, increasing model size, and improving long-term reasoning through reinforcement learning [24]
全球首个电池AI“分子宇宙”将开放测试
高工锂电· 2025-04-28 12:55
"分子宇宙"的无穷潜能正等待电池产业开掘。 北京时间4 月 29 日 晚 11 时( 美东时间 4 月 29 日中午 11 时) , SES AI 将公开全球首个电池领域专用 " 分子宇宙 " ( Molecular Universe , MU-0 ),并进行公开演示。 " 分子宇宙 " 是 SES AI 推出的一款电池领域 AI4S 解决方案,涵盖 10 的 11 次方个可用于电池的小分子,并电池专用的大语言模型驱动 训练而成的导航系统,让全球顶尖电池科学家的专业知识 " 触手可及 " 。 SES AI 表示, " 分子宇宙 " 可理解为一个专用于电池材料开发的 " 导航地图 " 或 " 参考词典 " 。通过 " 分子宇宙 " 软件,用户可精准 地筛选出所需材料,打开电池材料创新想象空间。 对于电池产业链企业而言,"分子宇宙"的现实价值在于加速或替代R&D,除了现阶段查询未知分子的功能,未来"分子宇宙"还将延伸到到材 料配方、电芯设计、电池测试等多个环节,实现电池开发全流程的加速。 具体而言, " 分子宇宙 " 软件具备三大方面的优势。 首先是庞大且持续扩展的数据库 —— 分子图谱。当前版本( MU-0 ) ...
细扒字节Seed 逆天招人要求!这5%本地顶级大脑做出了首个跨7大语言代码修复基准,让大模型成本狂降83%!
AI前线· 2025-04-28 11:10
作者|冬梅 字节 Top Seed 启动 2026 届招聘,瞄准顶尖博士 4 月 27 日,字节跳动 Seed 在其官微上发布了一则招聘启示,宣布正式启动 2026 届 Top Seed 大模型顶尖人才校招计划, 研究课题包括大语言模型、机器学习算法和系统、多模态生成、多模态理解、语音等方向,基本覆盖大模型研究各个领域, 计划招募约 30 位顶尖应届博士。 值得一提的是,本届 Top Seed 强调不限专业背景,更关注研究潜力,希望寻找具有极强技术信仰与热情、具备出色研究能 力、富有好奇心和驱动力的年轻研究者。 值得注意的是,字节跳动在此次招聘启事中还透露了几位刚毕业的同学已经做出了一些有影响力的研究。 比如,Z 同学构建并开源了首个多语言代码修复基准 Multi-SWE-bench,在 SWE-bench 基础上,首次覆盖 Python 之外的 Java、TypeScript、C、C++、Go、Rust 和 JavaScript 七种编程语言,1632 个真实修复任务,是真正面向"全栈工程"的评测 基准,其数据均来自 GitHub issue,历时近一年构建,以尽可能准确测评和提高大模型高阶编程智能水平。 ...