Nunchaku Logo

nunchaku-z-image-turbo模型卡片

本仓库包含经Nunchaku量化的Z-Image-Turbo版本，后者是一款高性能图像生成模型。本版本针对高效推理进行了优化，同时将性能损失控制在最小范围内。

暂无最新消息。敬请关注更新！

模型详情

模型说明

开发团队： Nunchaku团队（感谢@devgdovg）
模型类型： 图像到图像（image-to-image）
许可证： apache-2.0
量化基础模型： Z-Image-Turbo

模型文件

数据类型：适用于非Blackwell GPU（50系列之前）的INT4，适用于Blackwell GPU（50系列）的NVFP4。 秩（Rank）：

r32：推理速度更快，
r128：质量更佳，但推理速度较慢，
r256：质量最高（推理速度最慢）。

基础模型

适用于一般用途的标准推理速度模型

数据类型	秩	模型名称
INT4	r32	`svdq-int4_r32-z-image-turbo.safetensors`
	r128	`svdq-int4_r128-z-image-turbo.safetensors`
	r256	`svdq-int4_r256-z-image-turbo.safetensors`
NVFP4	r32	`svdq-fp4_r32-z-image-turbo.safetensors`
	r128	`svdq-fp4_r128-z-image-turbo.safetensors`

模型来源

推理引擎：nunchaku
量化库：deepcompressor
论文：SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
演示：demo.nunchaku.tech

使用方法

Diffusers 使用：参见 z-image-turbo.py。查看此教程了解更多高级用法。
ComfyUI 使用：参见 nunchaku-z-image-turbo.json。

性能

performance

引用

@inproceedings{
  li2024svdquant,
  title={SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models},
  author={Li*, Muyang and Lin*, Yujun and Zhang*, Zhekai and Cai, Tianle and Li, Xiuyu and Guo, Junxian and Xie, Enze and Meng, Chenlin and Zhu, Jun-Yan and Han, Song},
  booktitle={The Thirteenth International Conference on Learning Representations},
  year={2025}
}