Vobile/Pex AI Song Detector - Overview

The AI Song Detector identifies whether an audio file contains AI-generated music. It provides a simple API for submitting an audio file and receiving a classification result with accompanying details.

The detector is optimized for music, specifically individual songs, and produces one prediction per file.

Uploaded files are processed and then discarded. They are not stored on our servers or used for training.


What the API Does

  • Analyzes an uploaded audio file.

  • Determines whether the musical content is AI-generated.

  • Returns a JSON response containing:

    • classification result (AI-generated or not),

    • classification confidence score,

    • additional details.

The API is designed for straightforward integration into existing media workflows and high-volume use cases.
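
As an illustration of consuming the response fields listed above, here is a minimal Python sketch. The key names is_ai_generated, confidence, and details are hypothetical placeholders chosen for this example; the actual field names are defined in the API Documentation.

```python
import json
from dataclasses import dataclass, field
from typing import Any, Dict


@dataclass
class Detection:
    """Local container for the three parts of a result."""
    is_ai_generated: bool          # classification result
    confidence: float              # classification confidence score
    details: Dict[str, Any] = field(default_factory=dict)  # additional details


def parse_detection(raw: str) -> Detection:
    """Parse a JSON response body into a Detection.

    The keys read here are hypothetical, not the documented schema.
    """
    payload = json.loads(raw)
    return Detection(
        is_ai_generated=bool(payload["is_ai_generated"]),
        confidence=float(payload["confidence"]),
        details=payload.get("details", {}),
    )
```

Mapping the response into a small typed object at the integration boundary keeps the rest of a media workflow independent of the exact JSON layout, so only this function changes if the real field names differ.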


Expected Input

The detector is designed to analyze music, not generic audio.

Files should contain:

  • A single musical track

  • Minimal non-music content (speech, silence, etc.)

Files shouldn't contain:

  • User-generated content (UGC) compilations

  • Voiceovers

  • DJ mixes

  • Mashups

  • Audio dominated by silence or speech


Supported Audio Formats & Limits

  • Supported formats: MP3, WAV, FLAC, AAC, M4A/MP4, OGG, WEBM, and other common audio formats

  • Minimum duration: 15 seconds

  • Maximum duration: 15 minutes

  • Max file size: 100 MB

  • Stereo, mono, and variable bitrates are supported

Requests for files that fall outside these limits return a status such as too_short, too_long, or not_enough_music.
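
A client can check the duration and size limits before uploading, which avoids a round trip for files that are certain to be rejected. The sketch below is one possible pre-flight check, not part of the API. It assumes the caller already knows the file's duration and size, treats 100 MB as 100,000,000 bytes (the stricter reading), and uses a hypothetical file_too_large label because the section names no status for oversized files.

```python
from typing import Optional

MIN_DURATION_S = 15            # minimum duration: 15 seconds
MAX_DURATION_S = 15 * 60       # maximum duration: 15 minutes
MAX_SIZE_BYTES = 100_000_000   # assumption: "100 MB" read as decimal megabytes


def preflight_status(duration_s: float, size_bytes: int) -> Optional[str]:
    """Return the status a file is likely to receive, or None if it is within limits."""
    if size_bytes > MAX_SIZE_BYTES:
        return "file_too_large"  # hypothetical label, not a documented status
    if duration_s < MIN_DURATION_S:
        return "too_short"
    if duration_s > MAX_DURATION_S:
        return "too_long"
    return None
```

A check like this cannot predict not_enough_music, since that depends on the detector's own analysis of the content.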


How the API Works

  1. Authenticate using OAuth 2.0 Client Credentials

  2. Submit an audio file in an API request

  3. Receive the classification result in a JSON response

Each request accepts one file and produces one output. Batch submission is not supported.
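
The first two steps above can be sketched with Python's standard library. This is an illustration under stated assumptions, not the documented integration: both URLs are placeholders, sending client credentials in an HTTP Basic header is one of the options OAuth 2.0 allows, and uploading the file as a raw request body is a guess (the real endpoint may expect multipart form data). The functions only build requests; nothing is sent.

```python
import base64
import urllib.parse
import urllib.request
from typing import Iterable, List, Tuple

# Hypothetical URLs: the real token and detector endpoints are in the API Documentation.
TOKEN_URL = "https://auth.example.com/oauth2/token"
DETECT_URL = "https://api.example.com/v1/detect"


def build_token_request(client_id: str, client_secret: str) -> urllib.request.Request:
    """Step 1: OAuth 2.0 Client Credentials token request (RFC 6749, section 4.4)."""
    basic = base64.b64encode(f"{client_id}:{client_secret}".encode("utf-8")).decode("ascii")
    body = urllib.parse.urlencode({"grant_type": "client_credentials"}).encode("ascii")
    return urllib.request.Request(
        TOKEN_URL,
        data=body,
        headers={
            # Assumption: credentials travel in a Basic header, not the form body.
            "Authorization": f"Basic {basic}",
            "Content-Type": "application/x-www-form-urlencoded",
        },
        method="POST",
    )


def build_detect_request(access_token: str, audio: bytes, content_type: str) -> urllib.request.Request:
    """Step 2: one request carrying exactly one audio file."""
    return urllib.request.Request(
        DETECT_URL,
        data=audio,  # assumption: raw bytes as the body; the real API may differ
        headers={
            "Authorization": f"Bearer {access_token}",
            "Content-Type": content_type,
        },
        method="POST",
    )


def build_detect_requests(
    access_token: str, files: Iterable[Tuple[bytes, str]]
) -> List[urllib.request.Request]:
    """No batch submission: each (audio, content_type) pair gets its own request."""
    return [build_detect_request(access_token, audio, ctype) for audio, ctype in files]
```

Each request could then be sent with urllib.request.urlopen. Because batching is not supported, a high-volume client loops over its files and issues one request per file, reusing the same access token until it expires.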


Next Steps

Continue to API Documentation for:

  • Authentication flow

  • Detector endpoint details

  • Response structure

  • Error handling

  • Example implementations
