Datasets
Common Voice Spontaneous Speech 3.0 - Georgian
License: CC0-1.0
Locale: ka
Task: ASR
Format: MP3
Size: 11.61 MB
Common Voice Spontaneous Speech 3.0 - Rakhine
License: CC0-1.0
Locale: rki
Task: ASR
Format: MP3
Size: 11.20 MB
Common Voice Spontaneous Speech 3.0 - Bashkir
License: CC0-1.0
Locale: ba
Task: ASR
Format: MP3
Size: 5.10 MB
Common Voice Spontaneous Speech 3.0 - Javanese
License: CC0-1.0
Locale: jv
Task: ASR
Format: MP3
Size: 3.67 MB
Common Voice Spontaneous Speech 3.0 - Sinhala
License: CC0-1.0
Locale: si
Task: ASR
Format: MP3
Size: 2.52 MB
Common Voice Spontaneous Speech 3.0 - Dutch
License: CC0-1.0
Locale: nl
Task: ASR
Format: MP3
Size: 2.42 MB
Common Voice Spontaneous Speech 3.0 - Shona
License: CC0-1.0
Locale: sn
Task: ASR
Format: MP3
Size: 1.53 MB
Common Voice Spontaneous Speech 3.0 - Bodo
License: CC0-1.0
Locale: brx
Task: ASR
Format: MP3
Size: 1.30 MB
Common Voice Spontaneous Speech 3.0 - Thai
License: CC0-1.0
Locale: th
Task: ASR
Format: MP3
Size: 940.22 KB
Common Voice Spontaneous Speech 3.0 - Frisian
License: CC0-1.0
Locale: fy-NL
Task: ASR
Format: MP3
Size: 323.25 KB
Common Voice Spontaneous Speech 3.0 - Croatian
License: CC0-1.0
Locale: hr
Task: ASR
Format: MP3
Size: 285.11 KB
Common Voice Spontaneous Speech 3.0 - Danish
License: CC0-1.0
Locale: da
Task: ASR
Format: MP3
Size: 61.80 KB
Common Voice Spontaneous Speech 3.0 - Ruuli
License: CC0-1.0
Locale: ruc
Task: ASR
Format: MP3
Size: 365.95 MB
Common Voice Spontaneous Speech 3.0 - Irish
License: CC0-1.0
Locale: ga-IE
Task: ASR
Format: MP3
Size: 3.14 MB
Istorima
License: CC BY-NC-ND 4.0
Locale: gr-GR
Task: NLP
Format: PARQUET
Size: 416.02 MB
UP - DSP - Philippine Languages Database (UP-DSP-PLD)
License: CC-BY-NC-4.0
Locale: phi
Task: ASR
Format: WAV, LOG
Size: 45.63 GB
Urdu Multi-Speaker TTS Dataset
License: CC-BY-NC-4.0
Locale: urd
Task: TTS
Format: WEBM, TSV
Size: 514.54 MB
BECO Brahui Literature Corpus
License: CC-BY-NC-SA-4.0
Locale: brh
Task: NLP
Format: TXT
Size: 1.19 MB
Malayalam Time-Aligned Speech Corpus
License: CC-BY-NC-4.0
Locale: mal
Task: ASR
Format: WAV, SRT
Size: 1.50 GB
ddd-kenya-somali-68hrs-asr-part3
License: CC-BY-4.0
Locale: som
Task: ASR
Format: WAV, TSV
Size: 1.33 GB
ddd-kenya-somali-68hrs-asr-part2
License: CC-BY-4.0
Locale: som
Task: ASR
Format: WAV, TSV
Size: 8.07 GB
ddd-kenya-somali-68hrs-asr-part1
License: CC-BY-4.0
Locale: som
Task: ASR
Format: WAV, TSV
Size: 7.68 GB
TODa: Tamazight Open Dataset
License: CC-BY-4.0
Locale: zgh
Task: NLP
Format: CSV
Size: 3.27 MB
TTS Balinese Language
License: CC-BY-SA-4.0
Locale: ban
Task: TTS
Format: WEBM, TSV
Size: 301.05 MB
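Each entry above follows the same layout: a title line followed by License, Locale, Task, Format, and Size fields. A minimal sketch of reading that layout into records and comparing sizes across units is shown below. The function names `parse_listing` and `size_to_bytes` are hypothetical, and the decimal unit convention (1 KB = 1000 bytes) is an assumption, since the listing does not state which convention it uses.

```python
import re

FIELDS = ("License", "Locale", "Task", "Format", "Size")
# Assumption: decimal units (1 KB = 1000 bytes); the listing does not say.
UNITS = {"KB": 1000, "MB": 1000**2, "GB": 1000**3}


def size_to_bytes(text):
    """Convert a string such as '11.61 MB' to an approximate byte count."""
    match = re.fullmatch(r"([\d.]+)\s*(KB|MB|GB)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised size: {text!r}")
    return round(float(match.group(1)) * UNITS[match.group(2)])


def parse_listing(lines):
    """Group lines into records: a title line followed by 'Key: value' lines."""
    records = []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        key, sep, value = line.partition(": ")
        if sep and key in FIELDS:
            # A known field belongs to the most recent title.
            if records:
                records[-1][key] = value
        else:
            # Anything else starts a new record (titles may contain ': ').
            records.append({"Title": line})
    # Drop lines that never received a field, such as the 'Datasets' heading.
    return [r for r in records if len(r) > 1]


if __name__ == "__main__":
    sample = [
        "Datasets",
        "Common Voice Spontaneous Speech 3.0 - Danish",
        "License: CC0-1.0",
        "Locale: da",
        "Task: ASR",
        "Format: MP3",
        "Size: 61.80 KB",
        "TODa: Tamazight Open Dataset",
        "License: CC-BY-4.0",
        "Locale: zgh",
        "Task: NLP",
        "Format: CSV",
        "Size: 3.27 MB",
    ]
    records = parse_listing(sample)
    largest = max(records, key=lambda r: size_to_bytes(r["Size"]))
    print(largest["Title"])
```

Converting every size to bytes first makes entries in KB, MB, and GB directly comparable, which is useful because the listing is not ordered strictly by size.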