HuggingFace镜像/distilbert-base-multilingual-cased-sentiment
模型介绍文件和版本分析
下载使用量0

distilbert-base-multilingual-cased-sentiment

模型描述

需补充更多信息

预期用途与局限性

需补充更多信息

训练与评估数据

需补充更多信息

训练过程

训练超参数

训练过程中使用了以下超参数:

  • learning_rate: 5e-05
  • train_batch_size: 16
  • eval_batch_size: 16
  • seed: 33
  • distributed_type: sagemaker_data_parallel
  • num_devices: 8
  • total_train_batch_size: 128
  • total_eval_batch_size: 128
  • optimizer: Adam,betas=(0.9,0.999),epsilon=1e-08
  • lr_scheduler_type: linear
  • lr_scheduler_warmup_steps: 500
  • num_epochs: 5
  • mixed_precision_training: Native AMP

训练结果

训练损失轮次步数验证损失准确率F1
0.64050.5350000.58260.74980.7498
0.56981.07100000.56860.76120.7612
0.52861.6150000.55930.76360.7636
0.51412.13200000.58420.76480.7648
0.47632.67250000.57360.76370.7637
0.45493.2300000.60270.75930.7593
0.42313.73350000.60170.75520.7552
0.39654.27400000.64890.75510.7551
0.37444.8450000.64260.75340.7534

框架版本

  • Transformers 4.12.3
  • Pytorch 1.9.1
  • Datasets 1.15.1
  • Tokenizers 0.10.3

使用方法

from openmind import pipeline, AutoTokenizer, is_torch_npu_available
import argparse

def parse_args():
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--model_name_or_path",
        type=str,
        help="Path to model",
        default="ChongqingAscend/distilbert-base-multilingual-cased-sentiment",
    )
    args = parser.parse_args()
    return args

def main():
    args = parse_args()
    model_path = args.model_name_or_path

    if is_torch_npu_available():
        device = "npu:0"
    else:
        device = "cpu"

    #tokenizer = AutoTokenizer.from_pretrained(model_path)
    pipe = pipeline("token-classification", model=model_path, device=device)
    out = pipe("I love you")

    print(out)
    
if __name__ == "__main__":
    main()