HuggingFace镜像/verysmol_llama-v11-KIx2
模型介绍文件和版本分析
下载使用量0

verysmol_llama-v11-KIx2

模型说明

该模型是 v10 版本(refinedweb-3m 去重)的微调版本,在 KI 数据集上进一步训练了 2 个 epoch。

其在评估集上取得的结果如下:

  • 损失值:2.8876
  • 准确率:0.4502

评估结果

hf-causal-experimental (pretrained=pszemraj/verysmol_llama-v11-KIx2,revision=main,trust_remote_code=True,dtype='float'), limit: None, provide_description: False, num_fewshot: 0, batch_size: 16

任务版本号指标数值标准误差
arc_easy0acc0.4024±0.0101
acc_norm0.3788±0.0100
boolq1acc0.6199±0.0085
lambada_openai0ppl111.9939±4.6906
acc0.2354±0.0059
openbookqa0acc0.1440±0.0157
acc_norm0.2760±0.0200
piqa0acc0.5713±0.0115
acc_norm0.5664±0.0116
winogrande0acc0.5201±0.0140
任务版本号指标数值标准误差
arc_challenge0acc0.1971±0.0116
acc_norm0.2278±0.0123
任务版本号指标数值标准误差
hellaswag0acc0.2618±0.0088
acc_norm0.2797±0.0090
任务版本号指标数值标准误差
truthfulqa_mc1mc10.2509±0.0152
mc20.4492±0.0156

训练过程

训练超参数

训练过程中使用了以下超参数:

  • 学习率:0.00014
  • 训练批次大小:16
  • 评估批次大小:16
  • 随机种子:17514
  • 梯度累积步数:8
  • 总训练批次大小:128
  • 优化器:Adam,betas=(0.9, 0.95),epsilon=1e-06
  • 学习率调度器类型:inverse_sqrt
  • 学习率调度器预热比例:0.05
  • 训练轮数:2.0

训练结果

训练损失轮次步数验证损失准确率
3.06810.031503.06890.4259
3.01130.073003.04330.4278
2.94680.14503.03620.4288
3.01620.136003.01480.4326
2.95310.177503.00120.4341
2.92820.29002.99230.4358
2.94850.2310502.98450.4357
2.93650.2712002.97490.4375

...

训练损失轮次步数验证损失准确率
2.82151.776502.89430.4496
2.77141.7478002.89140.4501
2.81321.7779502.89130.4500
2.85051.881002.89060.4502
2.82941.8482502.89010.4502
2.79771.8784002.88910.4499
2.75011.985502.88780.4505
2.80381.9487002.88830.4504
2.75471.9788502.88760.4502

使用方法(OpenMind)

你可以这样使用该模型:

import argparse
import torch
from openmind import pipeline, is_torch_npu_available

def parse_args():
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--model_name_or_path",
        type=str,
        help="Path to model",
        default=None,
    )
    args = parser.parse_args()
    return args

def main():
    args = parse_args()
    model_path = args.model_name_or_path

    if is_torch_npu_available():
        device = "npu:0"
    else:
        device = "cpu"

    generator = pipeline('text-generation', model=model_path, device=device)
    output = generator("Hello, I'm a language model,")
    print(output)

if __name__ == "__main__":
    main()