使用 ModelScope 的 snapshot_download 从本地缓存加载模型,而非直接从 HuggingFace 下载。
from modelscope import snapshot_download
model_dir = snapshot_download("iic/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch")模型本地路径:~/.cache/modelscope/hub/models/iic/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch
assets/test.wav,时长约 5.55 秒librosa.load(wav_path, sr=16000, mono=True) 读取并预处理python inference.py依赖安装:
pip install -r requirements.txt欢迎大家来体验达摩院推出的语音识别模型| 指标 | 数值 |
|---|---|
| max_abs_error | 0.000296 |
| mean_abs_error | 0.000003 |
| relative_error | 0.018596% |
| cosine_similarity | 1.000000 |
| threshold | 1.0% |
| result | PASS |
| 指标 | 数值 |
|---|---|
| 平均延迟(毫秒) | 481.08 |
| 最小延迟(毫秒) | 477.04 |
| 最大延迟(毫秒) | 489.74 |
| p50 延迟(毫秒) | 479.88 |
| p90 延迟(毫秒) | 486.13 |
| p95 延迟(毫秒) | 487.94 |
| 音频时长(秒) | 5.55 |
| 实时率 | 0.0867 |
.
├── assets/
│ └── test.wav
├── logs/
├── screenshots/
├── model_utils.py
├── inference.py
├── eval_consistency.py
├── benchmark.py
├── requirements.txt
├── .gitignore
└── README.md依次执行:
python inference.py
python eval_consistency.py
python benchmark.py#NPU