Skip to main content

Ingest Audio

Preprocess one or more uploaded raw audio blobs to compute acoustic features, split stems, extract motifs, and index them into your target dataset. This operation is idempotent.
  • /trace/v1/ingest/type1 (Forces the legacy model)
  • /trace/v1/ingest/type2 (Forces the new efficient model)

Request Body Fields

FieldTypeRequiredDefaultDescription
dataset_namestringNoneName of your target dataset.
audio_blobsstring[]NoneArray of blob_name values. Max: 10 per request.
tasksstring[]["full_audio_latent"]List of features to compute.
webhook_urlstringURL for completion callbacks.

Valid tasks Options:

  • full_audio_latent: Encodes full audio sequence latents.
  • stem_vocals: Splits vocals stem.
  • stem_bass: Splits bass stem.
  • stem_drums: Splits drums stem.
  • stem_other: Splits accompaniment/other stem.
  • light_stem: Splits 2 stems (vocals + accompaniment).
  • motif: Computes melody contours / motif chroma.
  • section: Runs structural section boundary segmentation.

Ingestion In-App Pricing

Ingestion ProfileEnterprise Price
Track Only (Full Audio Latents)$0.0300
Comprehensive Search Profile (New Efficient)0.0352(3minsong)//0.0352** (3-min song) // **0.0513 (5-min song)
Comprehensive Search Profile (Legacy)0.0388(3minsong)//0.0388** (3-min song) // **0.1554 (5-min song)
Heaviest Custom Ingest$0.1628 (5-min song)

Request Example

{
  "dataset_name": "my-reference-catalog",
  "audio_blobs": [
    "dataset/123e4567-e89b-12d3-a456-426614174000_song_candidate_1.wav"
  ],
  "tasks": [
    "full_audio_latent",
    "light_stem",
    "motif"
  ],
  "webhook_url": "https://my-service.com/webhooks/trace"
}

Response

Status 202 Accepted
{
  "success": true,
  "data": {
    "job_id": "ingest_job_7e8f9a0b-c1d2-3e4f-5a6b-7c8d9e0f1a2b",
    "cost_usd": 0.0513
  }
}