A State-of-the-Art Large-Scale Pretrained Response Generation Model (DialoGPT)

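DialoGPT is a state-of-the-art large-scale pretrained dialogue response generation model for multi-turn conversations. Human evaluation results indicate that the responses DialoGPT generates are comparable in quality to human responses under a single-turn conversation Turing test. The model was trained on 147M multi-turn dialogues from Reddit discussion threads.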

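Multi-turn generation examples from an interactive environment:

Role	Response
User	Does money buy happiness?
Bot	Depends how much money you spend on it.
User	What is the best way to buy happiness?
Bot	You just have to be a millionaire by your early 20s, then you can be happy.
User	This is so difficult!
Bot	You have no idea how hard it is to be a millionaire and happy. There is a reason the rich have a lot of money.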

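For details on preprocessing, training, and the full specifics of DialoGPT, see the original DialoGPT repository.

ArXiv paper: https://arxiv.org/abs/1911.00536

How to use

Now we are ready to try out the model as a chatting partner!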

import argparse

import torch
from openmind import AutoTokenizer, AutoModelForCausalLM, is_torch_npu_available

def parse_args():
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--model_name_or_path",
        type=str,
        help="Path to model",
        default="models/DialoGPT-small",
    )
    args = parser.parse_args()
    return args

def main():
    args = parse_args()
    model_path = args.model_name_or_path

    if is_torch_npu_available():
        device = "npu:0"
    else:
        device = "cpu"
    
    tokenizer = AutoTokenizer.from_pretrained(model_path)
    model = AutoModelForCausalLM.from_pretrained(model_path).to(device)

    # Chat for 5 turns
    for step in range(5):
        # Encode the new user input, append the EOS token, and return a PyTorch tensor
        new_user_input_ids = tokenizer.encode(input(">> User:") + tokenizer.eos_token, return_tensors='pt').to(device)

        # Append the new user input tokens to the chat history (there is no history on the first turn)
        bot_input_ids = torch.cat([chat_history_ids, new_user_input_ids], dim=-1) if step > 0 else new_user_input_ids

        # Generate a response; max_length caps the total sequence (history plus reply) at 1000 tokens
        chat_history_ids = model.generate(bot_input_ids, max_length=1000, pad_token_id=tokenizer.eos_token_id)

        # Print only the newly generated tokens, i.e. the bot's latest reply
        print("DialoGPT: {}".format(tokenizer.decode(chat_history_ids[:, bot_input_ids.shape[-1]:][0], skip_special_tokens=True)))
    
if __name__ == "__main__":
    main()
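
The loop above feeds the entire conversation back into the model on every turn, so the history keeps growing toward the max_length=1000 cap. The following is a minimal sketch of one way to keep the history bounded by dropping the oldest turns first. It assumes each turn is stored as a plain list of token ids; the helper name truncate_history and the budget values are hypothetical and not part of the original script.

```python
from typing import List


def truncate_history(turns: List[List[int]], max_tokens: int) -> List[List[int]]:
    """Drop the oldest turns until the total token count fits max_tokens.

    The most recent turn is always kept, even if it alone exceeds the budget.
    (Hypothetical helper for illustration; not part of the DialoGPT script.)
    """
    kept: List[List[int]] = []
    total = 0
    # Walk from newest to oldest so that recent context is preserved first.
    for turn in reversed(turns):
        if kept and total + len(turn) > max_tokens:
            break
        kept.append(turn)
        total += len(turn)
    kept.reverse()  # restore chronological order
    return kept


def flatten(turns: List[List[int]]) -> List[int]:
    """Concatenate the per-turn token ids into one flat sequence."""
    return [tok for turn in turns for tok in turn]
```

In the chat loop, the flattened result could be wrapped in a tensor and passed to model.generate in place of the full history. Whether dropping whole turns or trimming individual tokens works better is a design choice this sketch does not settle.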

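Open LLM Leaderboard Evaluation Results

Detailed results can be found on the Open LLM Leaderboard.

Metric	Value
Avg.	25.02
ARC (25-shot)	25.77
HellaSwag (10-shot)	25.79
MMLU (5-shot)	25.81
TruthfulQA (0-shot)	47.49
Winogrande (5-shot)	50.28
GSM8K (5-shot)	0.0
DROP (3-shot)	0.0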
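
The reported average is consistent with an unweighted mean of the seven benchmark scores in the table. The short check below assumes that is how the figure was computed; the dictionary simply restates the table values.

```python
# Benchmark scores copied from the table above.
scores = {
    "ARC": 25.77,
    "HellaSwag": 25.79,
    "MMLU": 25.81,
    "TruthfulQA": 47.49,
    "Winogrande": 50.28,
    "GSM8K": 0.0,
    "DROP": 0.0,
}

# Assumption: the average is the plain (unweighted) mean of all seven scores.
average = round(sum(scores.values()) / len(scores), 2)
print(average)
```

The two 0.0 scores on GSM8K and DROP pull the mean well below the Winogrande and TruthfulQA results.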
