zkx_/openai-community_roberta-base-openai-detector-ascend

openai-community/roberta-base-openai-detector on Ascend NPU

1. 简介

本文档记录 openai-community/roberta-base-openai-detector AI 生成文本检测器在昇腾 NPU（Ascend 910B3）上的迁移适配、精度评测与性能验证结果。

该模型基于 RoBERTa-base（12 层，768 维），通过 GPT-2 生成文本与人类文本的二分类训练，输出 Human（人类撰写）和 AI-generated（AI 生成）两个类别。适用于检测 ChatGPT、GPT-3、GPT-2 等大语言模型生成的文本，在教育防作弊、新闻真实性验证、内容溯源等场景有广泛应用。

2. 验证环境

组件	版本
`torch`	`2.8.0`
`torch_npu`	`2.8.0.post4`
`transformers`	`5.8.1`
`CANN`	`8.5.1`

NPU：8 × Ascend 910B3
精度对比基准：CPU（x86, PyTorch 2.8.0）

3. 部署使用流程

3.1 环境准备

conda create -n openai-community_roberta-base-openai-detector python=3.11 -y
conda activate openai-community_roberta-base-openai-detector

pip install torch==2.8.0 torch_npu==2.8.0.post4 \
    -i https://pypi.tuna.tsinghua.edu.cn/simple
pip install transformers numpy \
    -i https://pypi.tuna.tsinghua.edu.cn/simple

3.2 推理脚本使用

python inference.py --text "This is a sample text that may be AI-generated." --device npu

编程接口：

from inference import AIDetector
detector = AIDetector(
    model_path="./openai-community_roberta-base-openai-detector", device="npu"
)
results, probs = detector.predict(["This text was written by a human."])
# results[0] → {'Human': 0.92, 'AI-generated': 0.08}

4. Smoke 验证

python inference.py --text "This is a sample text." --device npu

预期输出：Human/AI-generated 两类概率（和为 1.0），无运行时错误。

5. 性能参考

测试条件：10 条混合来源文本（人类/AI 各半），batch_size=16，NPU 预热 1 轮。

指标	数值
CPU 吞吐量	`37.9` texts/s
NPU 吞吐量	`416.8` texts/s
CPU/NPU 加速比	`11.0` ×

6. 精度评测

6.1 评测方法

分别在 CPU 和 NPU 上推理，比较二分类 softmax 概率向量的余弦相似度、MAE 和 Top-1 一致性。

6.2 评测结果

指标	数值
平均余弦相似度	`0.999997`
MAE	`0.001061`
精度误差率	`0.0003%`
Top-1 准确率	`100.0%`

结论：精度误差率 0.0003%，Top-1 完全一致，评测通过。

7. 迁移适配说明

7.1 模型结构

Backbone：RobertaModel（12 层，768 维，RoBERTa-base 标准架构）
Classifier Head：线性层（768 → 2），二分类 softmax
Tokenizer：BPE（GPT-2 风格），vocab.json + merges.txt
权重：safetensors + pytorch_model.bin 双格式
参数量：124.7M（RoBERTa-base 标准规模）

7.2 适配要点

AutoModelForSequenceClassification.from_pretrained() 加载
model.to("npu:0") 迁移，RoBERTa 12 层预热约 3-5 秒
BPE tokenizer 在 CPU 分词（GPT-2 风格，空格用 "Ġ" 前缀）
输出 softmax 后 .cpu().numpy() 返回

7.3 关键代码

import torch, torch_npu
from transformers import AutoModelForSequenceClassification, AutoTokenizer

model = AutoModelForSequenceClassification.from_pretrained(
    "roberta-base-openai-detector"
).to("npu:0")
tokenizer = AutoTokenizer.from_pretrained("roberta-base-openai-detector")

text = "This text could be AI-generated."
inputs = tokenizer(text, return_tensors="pt", truncation=True)
inputs = {k: v.to("npu:0") for k, v in inputs.items()}

with torch.no_grad():
    probs = torch.softmax(model(**inputs).logits, dim=-1)
    result = model.config.id2label[int(torch.argmax(probs))]

8. 注意事项

GPT-2 训练数据：该模型在 GPT-2 输出上训练，对 GPT-2 家族（GPT-2, GPT-3 base）的检测效果最佳。对 GPT-3.5/GPT-4/Claude 等新模型的泛化能力有限，实际使用建议结合其他检测方法。
长文本处理：RoBERTa 最大 512 tokens，超长文本（如论文、报告）会被截断。建议将长文本按段落分割后分别检测，取平均置信度。
BPE tokenizer 特点：RoBERTa 使用 GPT-2 风格 BPE，空格前缀用 "Ġ" 字符。decode 时使用 skip_special_tokens=True 避免输出特殊字符。
非内容审核工具：本模型仅区分"人类写 vs AI 生成"，不判断内容质量、真实性或合规性。不应作为唯一的内容决策依据。
首次 NPU 推理：RoBERTa 12 层预热约 3-5 秒，建议生产环境预热后批量推理。