Data

MSGD Dataset (Required)

Download MSGD_dataset_final.json and place it in this directory:

data/MSGD_dataset_final.json

The dataset contains 500 multi-phase sales dialogues with the following structure:

  • Phases: Chitchat → Transition → Task-oriented
  • Intents: FindAttraction, FindRestaurants, FindMovie, LookupMusic, SearchHotel, FindEvents, GetTransportation, SearchFlights
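
As a quick sanity check after downloading, the file can be loaded and its dialogue count compared against the 500 stated above. This is a minimal sketch, not part of the repository: the helper names `load_dialogues` and `check_count` are hypothetical, and it assumes the top level of the JSON is either a list of dialogues or a dict keyed by dialogue ID, which this README does not specify.

```python
import json
from pathlib import Path

EXPECTED_DIALOGUES = 500  # count stated in this README


def load_dialogues(path="data/MSGD_dataset_final.json"):
    """Load the dataset and return its dialogues as a list.

    Assumption: the top level is either a list of dialogues or a dict
    mapping dialogue IDs to dialogues. Adjust if the real layout differs.
    """
    with Path(path).open(encoding="utf-8") as f:
        data = json.load(f)
    if isinstance(data, dict):
        return list(data.values())
    if isinstance(data, list):
        return data
    raise TypeError(f"unexpected top-level JSON type: {type(data).__name__}")


def check_count(dialogues, expected=EXPECTED_DIALOGUES):
    """Return True if the number of dialogues matches the expected count."""
    return len(dialogues) == expected
```

Calling `check_count(load_dialogues())` from the repository root returns `False` if the download is truncated or the layout assumption does not hold.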

User Audio (Generated by Step 1)

After running scripts/1_prepare_user_audio.py, user audio files will be placed here:

data/user_audio/
  merge_0042_turn00.wav    # Dialogue 0042, turn 0 (user turns have even indices)
  merge_0042_turn02.wav    # Dialogue 0042, user turn 2
  merge_0042_turn04.wav    # ...
  ...
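
The naming convention in the listing above can be parsed to group user turns by dialogue. The following is a sketch under assumptions: the pattern `merge_<dialogue>_turn<NN>.wav` is inferred from the example filenames only, and `parse_name` and `user_turns_by_dialogue` are hypothetical helpers, not functions from the repository's scripts.

```python
import re
from collections import defaultdict
from pathlib import Path

# Pattern inferred from the example names, e.g. merge_0042_turn02.wav
NAME_RE = re.compile(r"^merge_(\d+)_turn(\d+)\.wav$")


def parse_name(filename):
    """Return (dialogue_id, turn_index), or None if the name does not match.

    The dialogue ID stays a string so that zero padding is preserved.
    """
    match = NAME_RE.match(filename)
    if match is None:
        return None
    return match.group(1), int(match.group(2))


def user_turns_by_dialogue(audio_dir="data/user_audio"):
    """Map each dialogue ID to its sorted list of even (user) turn indices."""
    turns = defaultdict(list)
    for path in Path(audio_dir).glob("merge_*_turn*.wav"):
        parsed = parse_name(path.name)
        if parsed is None:
            continue
        dialogue_id, turn = parsed
        if turn % 2 == 0:  # even indices are user turns
            turns[dialogue_id].append(turn)
    return {dialogue_id: sorted(t) for dialogue_id, t in turns.items()}
```

For the three files shown above, `user_turns_by_dialogue()` would yield a single entry for dialogue `"0042"` with turns 0, 2, and 4.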

Voice Profiles (Optional)

To use voice cloning with Sesame CSM, place voice profile samples here:

data/voice_profiles/
  user/
    sample_01.wav
    sample_02.wav
    ...
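
Before running voice cloning, it can help to confirm that the profile samples are in place and readable. This sketch uses only Python's standard `wave` module and assumes the samples are uncompressed PCM WAV files, which this README does not state. The helper `list_voice_samples` and its `speaker` parameter are hypothetical; only the `user/` subdirectory appears in the layout above.

```python
import wave
from pathlib import Path


def list_voice_samples(profile_dir="data/voice_profiles", speaker="user"):
    """Return (filename, duration_seconds) for each WAV sample of a speaker.

    Assumption: samples are PCM WAV files, the only kind the standard
    wave module can read.
    """
    samples = []
    for path in sorted((Path(profile_dir) / speaker).glob("*.wav")):
        with wave.open(str(path), "rb") as wav:
            duration = wav.getnframes() / wav.getframerate()
        samples.append((path.name, duration))
    return samples
```

An empty list means no samples were found for that speaker. A `wave.Error` indicates a file that is not PCM WAV.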