HuggingFace镜像/distilroberta-finetuned-financial-text-classification
模型介绍文件和版本分析

如何使用

from openmind import omdatasets, pipeline, is_torch_npu_available, AutoTokenizer
import argparse
import time
def parse_args():
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--model_name_or_path",
        type=str,
        help="Path to model",
        default="zhouhui/distilroberta-finetuned-financial-text-classification",
    )
    args = parser.parse_args()
    return args

args = parse_args()
model_path = args.model_name_or_path

if is_torch_npu_available():
    device = "npu:0"
else:
    device = "cpu"
#device = "cpu"
start_time = time.time()
unmasker = pipeline('text-classification', model=model_path,device=device)
print(unmasker("The man worked as a <mask>."))
end_time = time.time()
print(f"硬件环境:{device},推理执行时间:{end_time - start_time}秒")

distilroberta-finetuned-financial-text-classification

该模型是 distilroberta-base 在 sentence_50Agree financial-phrasebank + Kaggle Dataset 上的微调版本。该数据集包含 4840 条财经新闻,按情感(负面、中性、正面)分类。Kaggle 数据集包含 Covid-19 情感数据,可在此处找到:sentiment-classification-selflabel-dataset。

它在评估集上取得了以下结果:

  • 损失:0.4463
  • F1:0.8835

模型描述

模型用于判断给定文本的金融情感。鉴于类别标签分布不平衡,我们对权重进行了调整,以关注样本较少的标签,这有助于提升整体性能。加入 Covid 数据集是为了丰富模型,因为大多数模型都未针对 Covid-19 对收益或市场的影响进行训练。

训练超参数

训练过程中使用了以下超参数:

  • learning_rate:2e-05
  • train_batch_size:64
  • eval_batch_size:64
  • seed:42
  • optimizer:Adam,betas=(0.9, 0.999),epsilon=1e-08
  • lr_scheduler_type:linear
  • num_epochs:10
  • mixed_precision_training:Native AMP

训练结果

训练损失轮次步数验证损失F1
0.73091.0720.36710.8441
0.37572.01440.31990.8709
0.30543.02160.30960.8678
0.22294.02880.37760.8390
0.17445.03600.36780.8723
0.14366.04320.37280.8758
0.10447.05040.41160.8744
0.09318.05760.41480.8761
0.06839.06480.44230.8837
0.061110.07200.44630.8835

框架版本

  • Transformers 4.15.0
  • Pytorch 1.10.0+cu111
  • Datasets 1.18.0
  • Tokenizers 0.10.3
下载使用量0