Overview
Vobile/Pex AI Song Detector - Overview
The AI Song Detector identifies whether an audio file contains AI-generated music. It provides a simple API for submitting an audio file and receiving a classification result with accompanying details.
The detector is optimized for music, specifically individual songs, and produces one prediction per file.
Submitted audio files, whether uploaded directly or retrieved from a provided URL, are processed and then discarded. They are not stored on our servers nor used for training.
What the API Does
Analyzes an audio file, either uploaded directly or fetched from a provided URL.
Determines whether the musical content is AI-generated.
Returns a JSON response containing:
classification result - is it AI-generated or not,
classification confidence score,
the detected AI generation platform (e.g., Suno, Udio), if known and detected with high enough confidence,
additional details.
The API is designed for straightforward integration into existing media workflows and high-volume use cases.
AI Generation Platform Detection
When the detector classifies a file as AI-generated, it may also identify the AI generation platform used to create the music (such as Suno, Udio, and others).
This information is returned only when the detector reaches a sufficiently high confidence for platform attribution. If platform confidence is not high enough, the response will still indicate that the content is AI-generated, but the platform field will be empty.
Expected Input
The detector is designed to analyze music, not generic audio.
Files should contain:
A single musical track
Minimal non-music content (speech, silence, etc.)
Files shouldn't contain:
UGC compilations
Voiceovers
DJ mixes
Mashups
Audio dominated by silence or speech
Supported Audio Formats & Limits
Supported formats: MP3, WAV, FLAC, AAC, M4A/MP4, OGG, WEBM, and other common audio formats
Minimum duration: 10 seconds
Maximum duration: 15 minutes
Max file size: 100 MB
Stereo, mono, and variable bitrates are supported
Files outside these limits will return statuses such as too_short, too_long, or not_enough_music.
Input Methods
The API supports two equivalent ways of providing audio for analysis:
Direct file upload as part of the request.
Providing a publicly accessible URL from which the API will retrieve the audio file.
Both methods are subject to the same format, duration, and size limits, and both produce identical classification results.
How the API Works
Authenticate using OAuth 2.0 Client Credentials
Submit an audio file, either by uploading it directly or by providing a URL to the file
Receive the classification result in a JSON response
Each request accepts one file and produces one output. Batch submission is not supported.
Next Steps
Continue to API Documentation for:
Authentication flow
Detector endpoint details
Response structure
Error handling
Example implementations
Last updated