Hy-MT2-1.8B:可用于33种语言间的翻译任务，支持通用、商业、特定领域及指令遵循场景。项目包含1.8B等多尺寸模型，采用AngelSlim量化技术实现轻量化部署，性能优于主流开源及商业模型。【此简介由AI生成】

English ｜中文

🖥️ 官方网站 | 💬 GitHub | 🪡 AngelSlim | 📚 Hy-MT2 技术报告

模型介绍

Hy-MT2 是面向复杂真实场景设计的“快思考”多语言翻译模型系列。该系列包含 1.8B、7B 和 30B-A3B（MoE）三种模型规模，均支持 33 种语言互译，并能有效遵循多语言翻译指令。针对端侧部署，AngelSlim 1.25 比特极致量化技术将 1.8B 模型的存储需求降至仅 440 MB，同时推理速度提升 1.5 倍。多维度评估显示，Hy-MT2 在通用翻译、真实业务翻译、领域特定翻译及指令遵循翻译任务中均表现卓越。7B 和 30B-A3B 模型在快思考模式下性能超越 DeepSeek-V4-Pro、Kimi K2.6 等开源模型，而轻量级 1.8B 模型整体表现也优于微软、豆包等主流商业 API。

本次发布中，我们还开源了用于评估翻译指令遵循能力的基准测试集 IFMTBench。

我们也欢迎大家使用我们发布的 Hy-MT2-Translator Skill，以便轻松集成 Hy-MT2 系列模型进行翻译任务。下载链接：ClawHub 和 SkillHub。

目前，腾讯混元已正式与 WMT26 合作举办“视频字幕翻译任务”（https://www2.statmt.org/wmt26/video-subtitle-translation.html）。使用 Hy-MT 模型系列参与“通用机器翻译任务”（https://www2.statmt.org/wmt26/translation-task.html）和“视频字幕翻译任务”的参赛者，将有机会获得由混元赞助的特别奖项。我们诚挚邀请大家参与，共同推动机器翻译技术的边界！

新闻动态

2026年5月21日我们在HuggingFace和ModelScope开源了Hy-MT2-1.8B/Hy-MT2-7B/Hy-MT2-30B-A3B/IFMTBench。
2025年12月30日我们在HuggingFace和ModelScope开源了HY-MT1.5-1.8B和HY-MT1.5-7B。
2025年9月1日我们在HuggingFace和ModelScope开源了Hunyuan-MT-7B和Hunyuan-MT-Chimera-7B。

实验结果

更多实验结果与分析，请参阅我们的报告。

模型链接

模型名称	说明	下载链接
Hy-MT2-1.8B	Hy 1.8B 翻译模型	🤗 模型
Hy-MT2-1.8B-FP8	Hy 1.8B 翻译模型，FP8 量化	🤗 模型
Hy-MT2-1.8B-GGUF	Hy 1.8B 翻译模型，llama.cpp 格式	🤗 模型
Hy-MT2-1.8B-2bit-GGUF	Hy 1.8B 翻译模型，llama.cpp 格式，2bit 量化	🤗 模型
Hy-MT2-1.8B-1.25bit-GGUF	Hy 1.8B 翻译模型，llama.cpp 格式，1.25bit 量化	🤗 模型
Hy-MT2-7B	Hy 7B 翻译模型	🤗 模型
Hy-MT2-7B-FP8	Hy 7B 翻译模型，FP8 量化	🤗 模型
Hy-MT2-7B-GGUF	Hy 7B 翻译模型，llama.cpp 格式	🤗 模型
Hy-MT2-30B-A3B	Hy 30B-A3B 翻译模型	🤗 模型
Hy-MT2-30B-A3B-FP8	Hy 30B-A3B 翻译模型，FP8 量化	🤗 模型

Hy-MT2 翻译任务指令示例（中英文对照）

注：以下示例中，source_lang 和 target_lang 均需使用完整语言名称。中文提示词中使用中文名称，英文提示词中使用英文名称。

类型	中文提示词	英文提示词
默认翻译	将以下文本翻译为 `{target_lang}`，注意只需要输出翻译后的结果，不要额外解释： `{source_text}`	Translate the following text into `{target_lang}`. Note that you should only output the translated result without any additional explanation: `{source_text}`
术语翻译	参考下面的翻译： `{text}` 翻译成 `{text}` `{text}` 翻译成 `{text}` `{text}` 翻译成 `{text}` 将以下文本翻译为 `{target_lang}`，注意只需要输出翻译后的结果，不要额外解释： `{source_text}`	Reference the following translations: `{text}` translates to `{text}` `{text}` translates to `{text}` `{text}` translates to `{text}` Translate the following text into `{target_lang}`. Note that you must ONLY output the translated result without any additional explanation: `{source_text}`
风格翻译	请将以下文本翻译为 `{target_lang}`。注意翻译的风格要严格符合【`{target_style}`】 `{source_text}`	Please translate the following text into `{target_lang}`. Note that the translation style must strictly conform to [`{target_style}`]: `{source_text}`
个性化翻译	【待翻译文本】 `{source_text}` 【翻译任务】 1、`{user_preferences}` 2、`{user_preferences}` 3、…… 4、将【待翻译文本】翻译为 `{target_lang}`。	[Source Text] `{source_text}` [Translation Tasks] 1. `{user_preferences}` 2. `{user_preferences}` 3. ... 4. Translate the [Source Text] into `{target_lang}`.
分隔符保留	请将以下文本准确翻译为 `{target_lang}`。你必须在译文中保留等量的分隔符，绝对不可遗漏、转义或翻译该符号，并注意分隔符的位置。 `{source_text}`	Please accurately translate the following text into `{target_lang}`. You must retain the exact same number of delimiters in the translation. Strictly do not omit, escape, or translate these symbols, and pay close attention to their placement. `{source_text}`
结构化数据 1	# 任务目标将下方 `{source_text}` 中的 `{format_type}` 格式数据翻译为 `{target_lang}`。 # 严格约束 1. 结构锁定：绝对保持原有的 `{format_type}` 数据结构、缩进和层级完全不变。 2. 选择性翻译：仅翻译面向用户展示的可见文本内容。 3. 禁止修改：严禁翻译或更改任何代码标签、键名 (Key)、变量占位符（如 `{{var}}`、`${var}`、`%s`、`%d` 等）或代码属性。 # 数据输入 `{source_text}`	### Task Translate the user-facing text within the following `{format_type}` data into `{target_lang}`. ### Strict Rules 1. Structure Preservation: You MUST preserve the original `{format_type}` data structure, nesting, hierarchy, and indentation exactly as they are. 2. Selective Translation: Translate ONLY the visible, user-facing text content/values. 3. Strict Non-Translation: NEVER translate or alter code tags, keys, properties, object names, or variable placeholders. Leave them exactly in their original English/code form. ### Source Data `{source_text}`
结构化数据 2	【背景信息】 `{background_text}` 请结合背景信息将以下文本翻译为 `{target_lang}`。【待翻译文本】 `{source_text}`	[Background Information] `{background_text}` Please translate the following text into `{target_lang}`, taking the provided background information into consideration. [Source Text] `{source_text}`

推理与部署

对于1.8B和7B模型，我们建议使用以下参数进行推理。请注意，我们的模型没有默认的system_prompt。


{
  "temperature": 0.7,
  "top_p": 0.6,
  "top_k": 20,
  "repetition_penalty": 1.05,
  "max_tokens": 4096
}

对于 30B-A3B，我们建议在推理时使用以下参数。请注意，我们的模型没有默认的 system_prompt。


{
  "temperature": 0.7,
  "top_p": 1.0,
  "top_k": -1,
  "repetition_penalty": 1.0,
  "max_tokens": 4096
}

转换器

transformers>=5.6.0

from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

model_path = "tencent/Hy-MT2-1.8B"

# Load tokenizer
tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)

# Load model
model = AutoModelForCausalLM.from_pretrained(
    model_path,
    dtype=torch.bfloat16,
    device_map="auto",
    trust_remote_code=True,
)

model.eval()

# Example inference
prompt = "将以下文本翻译成英语,注意只需要输出翻译后的结果,不要额外解释:\n\n今天天气真好。"
messages = [{"role": "user", "content": prompt}]
inputs = tokenizer.apply_chat_template(messages, add_generation_prompt=True, return_tensors="pt").to(model.device)

with torch.no_grad():
    outputs = model.generate(
        **inputs,
        max_new_tokens=4096,
    )
response = tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True)
print(response)

vllm

从源码构建 vLLM：

uv venv --python 3.12 --seed --managed-python
source .venv/bin/activate
git clone https://github.com/vllm-project/vllm.git
cd vllm
uv pip install --editable . --torch-backend=auto

启动 vLLM 服务器：

vllm serve tencent/Hy-MT2-1.8B --tensor-parallel-size 1

sglang

从源代码构建 SGLang：

git clone https://github.com/sgl-project/sglang
cd sglang
pip3 install pip --upgrade
pip3 install "transformers>=5.6.0"
pip3 install -e "python"

启动 SGLang 服务器：

python3 -m sglang.launch_server --model tencent/Hy-MT2-1.8B --tp 1

模型训练

Hy-MT2 提供完整的模型训练流程，支持全参数微调与 LoRA 微调，以及多种 DeepSpeed ZeRO 配置和 LLaMA-Factory 集成。

详细的训练文档，请参考：模型训练指南

量化工具

我们提供 AngelSlim，这是一款易用、全面且高效的大模型压缩工具包，涵盖常见量化算法、低比特量化、投机采样等功能。

支持语言

语言	缩写	中文名称
Chinese	zh	中文
English	en	英语
French	fr	法语
Portuguese	pt	葡萄牙语
Spanish	es	西班牙语
Japanese	ja	日语
Turkish	tr	土耳其语
Russian	ru	俄语
Arabic	ar	阿拉伯语
Korean	ko	韩语
Thai	th	泰语
Italian	it	意大利语
German	de	德语
Vietnamese	vi	越南语
Malay	ms	马来语
Indonesian	id	印尼语
Filipino	tl	菲律宾语
Hindi	hi	印地语
Traditional Chinese	zh-Hant	繁体中文
Polish	pl	波兰语
Czech	cs	捷克语
Dutch	nl	荷兰语
Khmer	km	高棉语
Burmese	my	缅甸语
Persian	fa	波斯语
Gujarati	gu	古吉拉特语
Urdu	ur	乌尔都语
Telugu	te	泰卢固语
Marathi	mr	马拉地语
Hebrew	he	希伯来语
Bengali	bn	孟加拉语
Tamil	ta	泰米尔语
Ukrainian	uk	乌克兰语
Tibetan	bo	藏语
Kazakh	kk	哈萨克语
Mongolian	mn	蒙古语
Uyghur	ug	维吾尔语
Cantonese	yue	粤语

引用 Hy-MT2

@misc{zheng2026hymt2familyfastefficient,
      title={Hy-MT2: A Family of Fast, Efficient and Powerful Multilingual Translation Models in the Wild}, 
      author={Mao Zheng and Zheng Li and Tao Chen and Bo Lv and Mingrui Sun and Mingyang Song and Jinlong Song and Hong Huang and Decheng Wu and Hai Wang and Yifan Song and Yanfeng Chen and Guanwei Zhang},
      year={2026},
      eprint={2605.22064},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2605.22064}, 
}

联系我们

如果您希望向我们的研发和产品团队提供反馈，欢迎联系腾讯混元大模型团队。您可以通过邮件与我们取得联系，邮箱地址为：hunyuan_opensource@tencent.com。