HuggingFace镜像/billsum_tiny_summarization
模型介绍文件和版本分析
下载使用量0

billsum_tiny_summarization

该模型是 google/t5-efficient-tiny 在 billsum 数据集上的微调版本。 其在评估集上取得了以下结果:

  • 损失:3.5889
  • Rouge1:0.1503
  • Rouge2:0.0412
  • Rougel:0.1244
  • Rougelsum:0.1244
  • 生成长度:19.0

训练过程

训练超参数

训练过程中使用了以下超参数:

  • 学习率:2e-05
  • 训练批次大小:16
  • 评估批次大小:16
  • 种子:42
  • 优化器:Adam,其参数 betas=(0.9,0.999),epsilon=1e-08
  • 学习率调度器类型:线性
  • 训练轮次:4

训练结果

训练损失轮次步数验证损失Rouge1Rouge2RougelRougelsum生成长度
无日志1.0624.28350.14130.03230.11250.112419.0
无日志2.01243.72750.15070.04080.12630.126419.0
无日志3.01863.61540.14990.04070.12440.124419.0
无日志4.02483.58890.15030.04120.12440.124419.0

框架版本

  • Transformers 4.33.3
  • Pytorch 2.0.1+cu118
  • Datasets 2.14.5
  • Tokenizers 0.13.3

模型使用

import argparse
import torch
from openmind import pipeline, is_torch_npu_available

def parse_args():
    parser = argparse.ArgumentParser()
    parser.add_argument("--model_name_or_path", type=str, help="Path to model", default=None)
    args = parser.parse_args()
    return args

if __name__ == '__main__':
    args = parse_args()

    if is_torch_npu_available():
        device = "npu:0"
    else:
        device = "cpu"
   
    print("device",device)
    model_path = args.model_name_or_path
    print("Model path is:", model_path)

    try:
        generator = pipeline('text-generation', model=model_path, device=device)
        print("Generator initialized successfully")
    except Exception as e:
        print(f"Error initializing generator: {e}")
    
    try:
        output = generator("Hello, I'm a language model,", max_length=30, num_return_sequences=5, num_beams=5, truncation=True)
        print("Run successful. Generated outputs:")
        for idx, sequence in enumerate(output):
            print(f"Generated sequence {idx + 1}: {sequence['generated_text']}")
    except Exception as e:
        print(f"Error during generation: {e}")