Workflow
机器翻译
icon
Search documents
腾讯宣布推出混元开源翻译模型1.5 曾拿下多项世界翻译比赛冠军
12月30日,腾讯混元宣布推出并开源翻译模型1.5,共包含两个模型:Tencent-HY-MT1.5-1.8B和Tencent- HY-MT1.5-7B,两个模型均支持33个语种互译以及5种民汉/方言,除了中文、英语、日语等常见语种, 也包含捷克语、马拉地语、爱沙尼亚语、冰岛语等小语种。模型已在腾讯混元官网上线,通过开源社区 也可以直接下载使用。 据介绍,HY-MT1.5-1.8B主要面向手机等消费级设备场景,经过量化,可支持端侧直接部署和离线实时 翻译,仅需1GB内存即可流畅运行,并且在参数量极小的前提下,效果超过了大部分商用翻译API。模 型在效率和性价比也表现突出,与主流商用翻译模型API对比,HY-MT1.5-1.8B推理速度更快,处理50 个tokens的平均耗时只有0.18秒,其他模型的时间在0.4秒左右。 HY-MT1.5-7B模型效果相比前一版本有较大提升,是此前获得WMT25比赛30个语种翻译冠军模型的升 级版,重点提升了翻译准确率,大幅减少了译文中夹带注释和语种混杂的情况,实用性进一步增加。 在部分用户实用场景下,混元翻译1.8B和7B两个尺寸模型同时使用,可以实现端侧和云侧模型的协同 部 ...
腾讯混元开源翻译模型1.5 支持端侧部署和离线实时翻译 效果超越商用API
智通财经网· 2025-12-30 08:01
在部分用户实际使用场景下,混元翻译1.8B和7B两个尺寸模型同时使用,可以实现端侧和云侧模型的 协同部署,提升模型的效果的一致性和稳定性。 据介绍,腾讯混元翻译模型此前不仅在国际机器翻译比赛拿下30个第1名,也在首次开源一周内便登上 了HuggingFace 模型趋势榜第一位。目前,混元翻译模型已经在腾讯内部多个业务场景落地应用,包括 腾讯会议、企业微信、QQ浏览器、客服翻译等。 HY-MT1.5-1.8B主要面向手机等消费级设备场景,经过量化,支持端侧直接部署和离线实时翻译,仅需 1GB内存即可流畅运行。HY-MT1.5-1.8B 的参数规模只有 1.8B,量化后仅需约 1GB 内存,和一款常见 手机应用体量大致相同。在参数量极小的前提下,在多项主流翻译测试集中,它的整体表现已经达到超 大尺寸闭源模型的90 分位水平,明显领先同尺寸开源模型与主流商用翻译 API。同时,模型也体现出 了极致的效率和性价比,与主流商用翻译模型API对比,HY-MT1.5-1.8B 推理速度更快,处理50个 tokens的平均耗时只有0.18秒,其他模型的时间在0.4秒左右。 HY-MT1.5-7B模型效果相比前一版本效果有较大 ...
美媒:“即时翻译”剥夺跨文化交流的乐趣?
Huan Qiu Shi Bao· 2025-11-12 22:51
Core Viewpoint - The article discusses the implications of real-time translation technology on cross-cultural communication, suggesting that while it offers convenience, it may diminish the joy and challenges associated with language learning and cultural exchange [1][3]. Group 1: Impact of Real-Time Translation - Real-time translation technology allows for seamless communication across languages, eliminating the need for intensive focus on language learning [1]. - The convenience of automatic translation may lead to a loss of the unique experiences and emotional connections that come from learning a new language and engaging with a different culture [2][3]. Group 2: Cultural and Linguistic Nuances - Translation is not merely about conveying meaning; it involves understanding cultural, temporal, and expressive differences that machines cannot fully grasp [3][4]. - The richness of language encompasses unique worldviews and historical contexts that automated translation may overlook, potentially leading to a homogenization of communication [3][4]. Group 3: Concerns About Language Perception - There is a concern that reliance on technology may lead individuals to view language solely as a tool for translation, neglecting its broader cultural significance [4]. - The article raises questions about what might be lost in the pursuit of eliminating language barriers through technology, emphasizing the need for a deeper understanding of language beyond mere translation [4].
“翻译界哈佛”倒闭:有学生哭了两晚,AI冲击下译员何去何从?
Di Yi Cai Jing· 2025-11-11 00:23
Core Insights - The Monterey Institute of International Studies, known as the "Harvard of Translation," will close its on-campus graduate programs by summer 2027 due to financial and structural issues, with a significant decline in enrollment attributed to the impact of AI on the translation industry [1] - The translation industry is undergoing a profound structural transformation, with professionals facing a critical juncture as they adapt to the rapid advancements in AI technology [1][2] Industry Trends - The adoption of Machine Translation Post-Editing (MTPE) has surged, with its average usage rate increasing from 26% in 2022 to nearly 46% in 2024, indicating a shift towards integrating AI tools in translation processes [2] - The cost and time efficiency of MTPE compared to pure human translation is significant; for instance, translating 100,000 words can cost over 200,000 yuan with human translation but only 120,000 to 150,000 yuan with MTPE, reducing the turnaround time from one month to about two weeks [3] Impact on Professionals - Many translation professionals are experiencing a decline in income due to AI's encroachment on the market, with reports indicating that over one-third of translators have lost jobs due to generative AI advancements [5] - The demand for translators is shifting, with a growing emphasis on technical skills and the ability to use AI translation tools, as evidenced by job postings requiring proficiency in AI software [6] Educational Responses - Educational institutions are adapting to the changing landscape by introducing new programs that combine translation with technology, such as dual degree programs in translation and computer science [6] - The Shanghai International Studies University has launched a dual bachelor's degree in translation and business management, reflecting the industry's need for professionals who can navigate both language and technology [6] Future Outlook - The translation industry is expected to see a continued reliance on AI for low to mid-level translation tasks, while high-quality, creative translation work remains a domain where human translators excel [7][9] - The unique qualities of human translators, such as emotional intelligence and cultural understanding, are becoming increasingly valuable as AI handles more routine tasks [9][10]
俄开发出分析机器翻译错误的应用程序
Ke Ji Ri Bao· 2025-10-26 23:43
Core Insights - The article discusses the development of a new application by scientists at Surgut State University in Russia, aimed at analyzing machine translation errors to improve translation quality [1][2] - The application offers a more comprehensive analysis compared to standard methods, addressing the limitations of existing evaluation metrics [1] Group 1: Application Features - The new tool provides in-depth analysis of translation quality, focusing on vocabulary accuracy, semantic accuracy, and syntactic correctness [1][2] - It integrates multiple evaluation methods into a single automated tool, enhancing the efficiency of the analysis process [1] Group 2: Performance Analysis - The research team analyzed translations from mainstream online translation services and commercial neural networks, generating detailed reports for each translation [1] - Sentences with low scores in any evaluation metric are highlighted for further analysis, indicating areas for improvement [1][2] - While some translation tools performed well in vocabulary matching, all tested systems struggled with translating complex grammatical structures [1]
阿里国际Marco获WMT机器翻译大赛六项冠军,英中赛道超GPT-4.1与Gemini 2.5 Pro等巨头
Cai Jing Wang· 2025-10-23 05:56
Core Insights - Alibaba's Marco-MT-Algharb translation model achieved significant success at the 2025 WMT competition, winning 6 championships, 4 second places, and 2 third places, particularly excelling in English-to-Chinese translation, surpassing top closed-source AI systems like Gemini 2.5 Pro and GPT-4.1 [1][2][3] Group 1: Competition Overview - The WMT competition is recognized as the "gold standard" in machine translation, combining automatic metrics like COMET and LLM Judge with extensive human evaluations to determine rankings [3] - Marco-MT participated in the more challenging constrained track of the WMT competition, which requires models to handle diverse content while adhering to strict guidelines of using only open-source data and models with a size limit of 20 billion parameters [2] Group 2: Model Performance and Methodology - Marco-MT's success is attributed to its integration of extensive e-commerce translation experience with an original training method called M2PO (Multi-stage Preference Optimization), which applies reinforcement learning to enhance translation quality [2] - The model's training process involves three steps: broadening knowledge through supervised fine-tuning, employing reinforcement learning to evaluate translation quality, and incorporating word alignment and reordering techniques during decoding to improve accuracy and fidelity [2] Group 3: Market Position and Future Prospects - Marco-MT, initially launched in 2024 for e-commerce translation, has expanded its capabilities to support various translation scenarios, including search, product information, dialogue, and images, establishing a strong foundation for its transition to general translation [3] - The model has already demonstrated its competitive edge in multimodal translation, having won 2 championships and 2 second places at the 2025 IWSLT international competition [3]
“翻译界哈佛”倒闭,AI杀死首个世界名校?
虎嗅APP· 2025-09-05 11:27
Core Viewpoint - The closure of the Monterey Institute of International Studies (MIIS), known as the "Harvard of Translation," highlights the significant impact of AI on traditional education and professional fields, particularly in translation and interpretation [3][5][10]. Group 1: Closure Announcement - MIIS officially announced it will stop enrolling graduate students by June 2027 due to financial difficulties [3][8]. - The decision to close was described as purely financial, with a significant drop in enrollment and a $14.1 million deficit reported earlier this year [23][9]. - The closure will affect all on-campus graduate programs and some online degree courses, marking the end of an era for many alumni [15][16]. Group 2: Impact of AI on the Translation Industry - The rise of AI translation tools has drastically changed the landscape, with human translators facing significant job threats; a Microsoft report listed interpreters and translators as the most at-risk profession [11][30]. - The enrollment at MIIS has been declining since 2009, with current numbers at 440 students, less than half of the initial target of 850 [30]. - AI advancements, such as real-time translation capabilities, have diminished the competitive edge of human translators, leading to a perception that the profession is no longer viable [30][56]. Group 3: Financial Struggles and Responses - MIIS faced a financial crisis, with efforts to cut staff benefits and expand enrollment proving ineffective, leading to protests from faculty and students [24][25]. - The institution's financial woes were exacerbated by the inability to adapt to the changing demands of the translation industry due to AI [10][30]. - Faculty members voted overwhelmingly to close MIIS within three years, indicating a consensus on the need to return to a more sustainable model [26][27]. Group 4: Future of Translation Profession - Despite the advancements in AI, there remains a belief that professional translators will still be needed for tasks requiring nuanced understanding and context [61][69]. - AI tools, while efficient, still require human oversight for tasks such as terminology management and quality control [62][66]. - The translation profession is evolving, with AI serving as a tool rather than a complete replacement, emphasizing the importance of human expertise [69].
腾讯混元最新开源成“最强翻译”:国际机器翻译比赛获30个语种第一
量子位· 2025-09-03 05:49
Core Viewpoint - Tencent's Hunyuan-MT-7B model has achieved significant success in international translation competitions, demonstrating its advanced capabilities in translating multiple languages and dialects, while also being open-sourced for broader accessibility [1][2][4]. Group 1: Model Performance and Achievements - Hunyuan-MT-7B won first place in 30 out of 31 language pairs in the WMT2025 competition, showcasing its dominance in both high-resource and low-resource languages [4][29]. - The model supports 33 languages and 5 dialects, making it a comprehensive lightweight translation solution [1]. - In the Flores200 evaluation dataset, Hunyuan-MT-7B outperformed other models of similar size and showed competitive results against larger models [6][9]. Group 2: Technical Innovations - The model is built on a complete training paradigm that includes pre-training, supervised fine-tuning, and reinforcement learning, leading to superior translation performance [11][12]. - The Shy framework, which incorporates synergy-enhanced policy optimization, fundamentally changes traditional optimization approaches by using a systematic design with two main components: foundational model development and ensemble strategies [15][19]. - The GRPO algorithm, a key innovation in the Shy framework, reduces gradient variance and improves sample efficiency, enhancing training stability and model convergence [21][24]. Group 3: Deployment and Usability - Hunyuan-MT-7B is designed for high computational efficiency, allowing for faster inference and lower operational costs compared to larger models [30]. - The model's open-source nature promotes transparency and allows for further improvements by the research community, lowering the technical barriers for participation in machine translation advancements [31]. Group 4: Broader Implications - The methodologies and frameworks developed for Hunyuan-MT-7B can serve as a reference for optimizing other specialized fields, promoting a shift from general to specialized technology applications [33].
全球机器翻译比赛拿下30个语种第1名,腾讯混元翻译模型开源
Sou Hu Cai Jing· 2025-09-02 11:32
Core Insights - Tencent Hunyuan announced the open-source release of its translation model Hunyuan-MT-7B, which has recently won an international translation competition, allowing developers to download and deploy it for free [1][4] - The Hunyuan-MT-7B model supports 33 languages and 5 dialects, showcasing its comprehensive capabilities as a lightweight translation model [1][6] - The model achieved outstanding results in the WMT2025 competition, ranking first in 30 out of 31 languages, demonstrating its superiority over larger models [4][6] Model Features - Hunyuan-MT-7B is characterized by its efficiency, achieving performance that meets or exceeds larger models with only 7 billion parameters [6] - The model's inference speed is significantly faster than that of larger models, allowing it to handle more translation requests under the same hardware conditions [6] - The model can be deployed across various hardware environments, from high-end servers to edge devices, with lower deployment, operational, and maintenance costs [6] Technical Advancements - Tencent Hunyuan has developed a complete training paradigm for translation models, covering pre-training, supervised tuning, and reinforcement learning, which contributes to its industry-leading translation performance [4][6] - The model has been integrated into multiple Tencent services, enhancing user experience across platforms such as Tencent Meeting, WeChat Work, QQ Browser, and more [6] Open Source Commitment - Since its debut in 2023, Tencent Hunyuan has embraced open-source principles, sharing its self-developed technologies and promoting breakthroughs in large model technology [7] - The Hunyuan-MT-7B model is available for experience and download on Tencent Hunyuan's official website, as well as on open-source platforms like Huggingface and GitHub [7]
AI模型终于能翻译“拼多多砍一刀”了
3 6 Ke· 2025-09-02 08:25
Core Insights - Tencent's Hunyuan-MT series translation models, Hunyuan-MT-7B and Hunyuan-MT-Chimera-7B, have been released as open-source, providing advanced machine translation capabilities across 33 languages, including minority languages and dialects [1][7][18] - The models have demonstrated superior performance in various translation benchmarks, outperforming established systems like Google Translate and other models with significantly larger parameters [2][26] - Hunyuan-MT-7B achieved first place in 30 out of 31 language pairs at the WMT 2025 competition, showcasing its effectiveness in both resource-rich and resource-scarce languages [4][22] Model Performance - Hunyuan-MT-7B excels in translating internet slang and gaming terminology, accurately interpreting phrases like "小红薯" as "REDnote" and "砍一刀" as Pinduoduo's pricing mechanism, while Google Translate fails to capture the context [8][10] - The model's understanding of cultural nuances and idiomatic expressions allows it to produce translations that are more natural and contextually appropriate compared to traditional systems [12][13] - In professional translation tests, Hunyuan-MT-7B showed strong capabilities in translating specialized terms, although it still faced challenges in maintaining fluency in sentence structure [15][18] Technical Advancements - The models utilize a "weak-to-strong" reinforcement learning approach, integrating multiple candidate translations to enhance output quality beyond single candidates [5][24] - Hunyuan-MT-7B has been optimized for performance, achieving a 30% improvement in inference speed through FP8 quantization [7][26] - The training data for these models includes a vast dataset of 1.3 trillion tokens from over 112 languages, ensuring a diverse and high-quality training foundation [19][21] Future Implications - The advancements in machine translation technology, particularly through the use of generative AI, are expected to significantly enhance cross-border business operations for companies like Tencent, ByteDance, and Alibaba [28] - The ongoing development of translation models indicates a trend towards more sophisticated and efficient solutions in the field of computational linguistics, potentially leading to broader applications in various industries [28]