You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+32-2Lines changed: 32 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -44,8 +44,38 @@ dataset[0] # (1, 158621)
44
44
dataset[1] # (1, 153757)
45
45
```
46
46
47
+
#### Full API:
48
+
```py
49
+
LJSpeechDataset(
50
+
root: str="./data", # The root where the dataset will be downloaded
51
+
transforms: Optional[Callable] =None, # Transforms to apply to audio files
52
+
)
53
+
```
54
+
55
+
### LibriSpeech Dataset
56
+
Wrapper for the [LibriSpeech](https://www.openslr.org/12) dataset (EN only). Requires `pip install datasets`. Note that this dataset requires several GBs of storage.
57
+
58
+
```py
59
+
from audio_data_pytorch import LibriSpeechDataset
60
+
61
+
dataset = LibriSpeechDataset(
62
+
root="./data",
63
+
)
64
+
65
+
dataset[0] # (1, 222336)
66
+
```
67
+
68
+
#### Full API:
69
+
```py
70
+
LibriSpeechDataset(
71
+
root: str="./data", # The root where the dataset will be downloaded
72
+
with_info: bool=False, # Whether to return info (i.e. text, sampling rate, speaker_id)
73
+
transforms: Optional[Callable] =None, # Transforms to apply to audio files
74
+
)
75
+
```
76
+
47
77
### Common Voice Dataset
48
-
Multilanguage wrapper for the [Common Voice](https://commonvoice.mozilla.org/) dataset with voice-only data. Requires `pip install datasets`. Note that each language requires several GBs of storage, and that you have to confirm access for each distinct version you use e.g. [here](https://huggingface.co/datasets/mozilla-foundation/common_voice_10_0), to validate your Huggingface access token. You can provide a list of `languages` and to avoid an unbalanced dataset the values will be interleaved by downsampling the majority language to have the same number of samples as the minority language.
78
+
Multilanguage wrapper for the [Common Voice](https://commonvoice.mozilla.org/). Requires `pip install datasets`. Note that each language requires several GBs of storage, and that you have to confirm access for each distinct version you use e.g. [here](https://huggingface.co/datasets/mozilla-foundation/common_voice_10_0), to validate your Huggingface access token. You can provide a list of `languages` and to avoid an unbalanced dataset the values will be interleaved by downsampling the majority language to have the same number of samples as the minority language.
0 commit comments