logo

MultiAPI Spoof: Multi-Source Audio Anti-Spoofing Dataset

November 21, 2025

trust-framework

Introduction

MultiAPI-Spoof is a multi-source audio anti-spoofing dataset that contains approximately 230 hours of audio. It includes synthetic audio generated by commercial TTS services, open-source models, and Chinese TTS websites.

This dataset is designed to support research and model training for audio anti-spoofing.


Spoofed Audio Samples


Data Sources

Our new dataset, MultiAPI Spoof, contains speech samples generated from a variety of API sources, including:

  1. Commercial TTS APIs – speech generated by commercial services.
  2. Open-Source Model Generation – speech generated by open-source models.
  3. TTS Websites – speech on TTS web platforms.

The dataset is organized into 30 API, labeled A0–A29, with each group corresponding to a unique speech generation API source. The duration of speech in each API ranges from 0.2 to 12 hours.


Dataset Split

API NO.traindeveval
A0–A2070%10%20%
A21–A23/100%/
A24–A29//100%

Metadata

The dataset includes three metadata files: MultiAPI_train.txt, MultiAPI_dev.txt, and MultiAPI_eval.txt.

Each line has four fields:

audio pathapiclass_label
XXX.mp3A0spoofed
XXX.mp3-bonafide

Anti-spoofing Detection Demo

We trained a speech anti-spoofing model using our newly proposed MultiAPI Spoof dataset along with several public datasets. The model has been deployed online. 👉 Try it out through our interactive demo — simply upload an audio sample to get a spoofing score and classification result in real time.

Remark: This model is for reference only. As spoofing technologies evolve rapidly, detection results may contain errors. We will regularly update our model. The model and dataset are provided solely for academic research and are strictly prohibited from commercial use.