cd granite-speech-4.1-2b-ascend/
python3 inference.py
Test Verification
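Precision Test Results

| Metric | Measured value | Threshold | Status |
| --- | --- | --- | --- |
| Max Error (mean) | 1.19e-07 | < 1e-5 | ✅ Passed |
| Max Error (std) | 1.86e-09 | < 1e-5 | ✅ Passed |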
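For readers who want to reproduce a check of this kind, the following is a minimal sketch of how a max-error comparison between reference and device outputs could be computed. It is not taken from `inference.py`. It assumes that "Max Error (mean)" and "Max Error (std)" are the mean and standard deviation, across the test tensors, of each tensor's maximum absolute elementwise error, and it uses plain Python lists in place of real tensors. The names `max_abs_error` and `summarize_errors` are hypothetical.

```python
import statistics


def max_abs_error(reference, candidate):
    """Largest absolute elementwise difference between two flat sequences."""
    if len(reference) != len(candidate):
        raise ValueError("reference and candidate must have the same length")
    return max(abs(r - c) for r, c in zip(reference, candidate))


def summarize_errors(reference_tensors, candidate_tensors, threshold=1e-5):
    """Aggregate per-tensor max errors into mean/std and apply the threshold.

    Assumption: both the mean and the std must stay below the threshold,
    mirroring the two rows of the results table.
    """
    errors = [
        max_abs_error(ref, cand)
        for ref, cand in zip(reference_tensors, candidate_tensors)
    ]
    mean_err = statistics.fmean(errors)
    std_err = statistics.pstdev(errors)  # population std over the tensors
    return {
        "mean": mean_err,
        "std": std_err,
        "passed": mean_err < threshold and std_err < threshold,
    }


# Toy data standing in for CPU reference outputs and NPU outputs.
cpu_outputs = [[0.0, 1.0], [2.0, 3.0]]
npu_outputs = [[0.0, 1.0 + 1e-7], [2.0, 3.0]]
report = summarize_errors(cpu_outputs, npu_outputs)
print(report)
```

With real tensors, the same aggregation would be applied to the flattened CPU and NPU outputs of each of the 20 test tensors.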
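Performance Data

| Operation | Time |
| --- | --- |
| Model loading | 9.79s |
| CPU reference computation (20 tensors) | 7.37s |
| NPU inference (20 tensors) | 0.16s |
| Speech inference (3 s audio) | 1.30s |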
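Timings like these can be collected with a small wall-clock helper, and the CPU and NPU rows for the 20-tensor workload imply a ratio of roughly 46x (7.37 s / 0.16 s). The sketch below is an illustration under stated assumptions, not the measurement code used for this report. `timed` and `speedup` are hypothetical helpers, and the demo workload is a placeholder. If the device executes asynchronously, a synchronization call would be needed before the timer stops, and that is not shown here.

```python
import time
from contextlib import contextmanager


@contextmanager
def timed(label, results):
    """Record the wall-clock duration of the enclosed block under `label`."""
    start = time.perf_counter()
    try:
        yield
    finally:
        results[label] = time.perf_counter() - start


def speedup(baseline_seconds, accelerated_seconds):
    """Ratio of baseline time to accelerated time."""
    if accelerated_seconds <= 0:
        raise ValueError("accelerated_seconds must be positive")
    return baseline_seconds / accelerated_seconds


timings = {}
with timed("demo workload", timings):
    # Placeholder work; a real run would invoke the model here.
    sum(i * i for i in range(100_000))

# Values copied from the performance table (20 tensors each).
cpu_vs_npu = speedup(7.37, 0.16)
print(f"demo workload: {timings['demo workload']:.4f}s")
print(f"CPU reference vs NPU: about {cpu_vs_npu:.0f}x")
```

The ratio compares two different code paths (CPU reference computation and NPU inference), so it is best read as an indication of relative cost rather than a controlled benchmark.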
Test Log
2026-05-19 08:47:05,469 - INFO - ============================================================
2026-05-19 08:47:05,469 - INFO - Granite-Speech-4.1-2B ASR Ascend NPU Inference
2026-05-19 08:47:05,469 - INFO - ============================================================
2026-05-19 08:47:05,469 - INFO - Model path: /opt/atomgit/mxy/granite-speech-4.1-2b
2026-05-19 08:47:05,469 - INFO - Device: npu:0
2026-05-19 08:47:09,764 - INFO - Loading model from: /opt/atomgit/mxy/granite-speech-4.1-2b
2026-05-19 08:47:10,441 - INFO - Processor loaded: GraniteSpeechProcessor
2026-05-19 08:47:14,914 - INFO - Model loaded on device: npu:0
2026-05-19 08:47:14,914 - INFO - Model type: GraniteSpeechForConditionalGeneration
2026-05-19 08:47:14,914 - INFO - Running inference...
2026-05-19 08:47:16,111 - INFO - Inference completed in 1.160s
2026-05-19 08:47:23,712 - INFO - PRECISION TEST PASSED
2026-05-19 08:47:23,712 - INFO - ============================================================
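The line layout in the log above (timestamp with milliseconds, level, message) matches what Python's standard `logging` module produces with the format string shown below. This is a sketch of one configuration that yields the same layout. Whether `inference.py` configures logging this way is an assumption, and the logger name `granite_speech_demo` is hypothetical.

```python
import io
import logging

# The default asctime rendering is "YYYY-MM-DD HH:MM:SS,mmm".
LOG_FORMAT = "%(asctime)s - %(levelname)s - %(message)s"


def make_logger(stream, name="granite_speech_demo"):
    """Build an INFO-level logger that writes formatted lines to `stream`."""
    logger = logging.getLogger(name)
    logger.setLevel(logging.INFO)
    logger.propagate = False  # keep output confined to our handler
    logger.handlers.clear()
    handler = logging.StreamHandler(stream)
    handler.setFormatter(logging.Formatter(LOG_FORMAT))
    logger.addHandler(handler)
    return logger


buffer = io.StringIO()
log = make_logger(buffer)
log.info("Device: %s", "npu:0")
print(buffer.getvalue(), end="")
```

Writing to an in-memory buffer keeps the example self-contained. Passing `sys.stderr` or a file object instead would send the same lines to the console or to disk.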