MultiAPI Spoof: Multi-Source Audio Anti-Spoofing Dataset

November 21, 2025

Introduction

MultiAPI-Spoof is a multi-source audio anti-spoofing dataset that contains approximately 230 hours of audio. It includes synthetic audio generated by commercial TTS services, open-source models, and Chinese TTS websites.

This dataset is designed to support research and model training for audio anti-spoofing.

📥 Download link: MultiAPI Spoof dataset
📄 Paper on arXiv (coming soon)
🖥️ Code on github
License: For academic research only.

Spoofed Audio Samples

Data Sources

Our new dataset, MultiAPI Spoof, contains speech samples generated from a variety of API sources, including:

Commercial TTS APIs – speech generated by commercial services.
Open-Source Model Generation – speech generated by open-source models.
TTS Websites – speech on TTS web platforms.

The dataset is organized into 30 API, labeled A0–A29, with each group corresponding to a unique speech generation API source. The duration of speech in each API ranges from 0.2 to 12 hours.

Dataset Split

API NO.	train	dev	eval
A0–A20	70%	10%	20%
A21–A23	/	100%	/
A24–A29	/	/	100%

Metadata

The dataset includes three metadata files: MultiAPI_train.txt, MultiAPI_dev.txt, and MultiAPI_eval.txt.

Each line has four fields:

audio path	api	class_label
XXX.mp3	A0	spoofed
XXX.mp3	-	bonafide

Anti-spoofing Detection Demo

We trained a speech anti-spoofing model using our newly proposed MultiAPI Spoof dataset along with several public datasets. The model has been deployed online. 👉 Try it out through our interactive demo — simply upload an audio sample to get a spoofing score and classification result in real time.

Remark: This model is for reference only. As spoofing technologies evolve rapidly, detection results may contain errors. We will regularly update our model. The model and dataset are provided solely for academic research and are strictly prohibited from commercial use.