魔搭社区:Trelis/multi-qa-MiniLM-L6-dot-v1-ft-pairs-4-cst-epoch-s1 | 码云:Trelis/multi-qa-MiniLM-L6-dot-v1-ft-pairs-4-cst-epoch-s1
| 指标 | 值 |
|---|---|
| Top-1 一致性 | N/A (embedding) |
| 最大 Logit 差异率 | 0.001017 |
| 平均 KL 散度 | 0 |
| 权重大小 | 86.7MB |
| 设备 | Ascend 910B NPU |
from transformers import AutoTokenizer, BertModel
import torch
tok = AutoTokenizer.from_pretrained("/opt/atomgit/~/output/Trelis_multi-qa-MiniLM-L6-dot-v1-ft-pairs-4-cst-epoch-s1/model/Trelis/multi-qa-MiniLM-L6-dot-v1-ft-pairs-4-cst-epoch-s1", trust_remote_code=True)
model = BertModel.from_pretrained("/opt/atomgit/~/output/Trelis_multi-qa-MiniLM-L6-dot-v1-ft-pairs-4-cst-epoch-s1/model/Trelis/multi-qa-MiniLM-L6-dot-v1-ft-pairs-4-cst-epoch-s1", trust_remote_code=True).to("npu:0").eval()
inputs = tok(["测试文本"], return_tensors="pt", padding=True, truncation=True, max_length=128)
inputs = {k: v.to("npu:0") for k, v in inputs.items()}
with torch.no_grad():
outputs = model(**inputs)
print(outputs.last_hidden_state.shape if hasattr(outputs, "last_hidden_state") else outputs.logits.shape)inference.py — 昇腾 NPU 推理脚本eval/run_accuracy.py — 精度验证脚本eval/run_performance.py — 性能测试脚本eval/accuracy.json — 精度验证结果eval/performance.json — 性能测试结果