Datasets
Anna 1.0
License: CC0-1.0
Locale: hu-HU
Task: TTS
Format: WEBM
Size: 95.27 MB
DataTrust Africa: Speech Corpus of Public Radio Recordings from Northern Uganda
License: NOODL-1.0
Locale: en-US
Task: NLP
Format: MP3
Size: 179.82 MB
Common Voice Scripted Speech 24.0 - Tunen
License: CC0-1.0
Locale: tvu
Task: ASR
Format: MP3
Size: 195.38 MB
Dmitri 1.0
License: CC0-1.0
Locale: ru-RU
Task: TTS
Format: WEBM
Size: 96.63 MB
Central Kurdish TTS dataset 1.0
License: CC-BY-4.0
Locale: ckb
Task: TTS
Format: wav
Size: 293.45 MB
Imre 1.0
License: CC0-1.0
Locale: hu-HU
Task: TTS
Format: WEBM
Size: 99.60 MB
Dimitar 1.0
License: CC0-1.0
Locale: bg-BG
Task: TTS
Format: WEBM
Size: 109.58 MB
Thorsten-Voice Dataset 2021.06 Emotional
License: CC0-1.0
Locale: de-DE
Task: TTS
Format: WAV,CSV
Size: 380.80 MB
Common Voice Spontaneous Speech 2.0 - Thur
License: CC0-1.0
Locale: lth
Task: ASR
Format: MP3
Size: 292.98 MB
Common Voice Scripted Speech 24.0 - Bateri
License: CC0-1.0
Locale: btv
Task: ASR
Format: MP3
Size: 205.82 MB
Mozilla Common Voice Spontaneous Speech ASR Shared Task Train/Dev Data
License: CC0-1.0
Locale: mul
Task: ASR
Format: mp3
Size: 4.30 GB
Common Voice Scripted Speech 24.0 - Nüpode Huitoto
License: CC0-1.0
Locale: hux
Task: ASR
Format: MP3
Size: 229.65 MB
Chitwan 1.0
License: CC0-1.0
Locale: ne-NE
Task: TTS
Format: WEBM
Size: 61.68 MB
Ewondo-TTS-Dataset
License: NOODL-1.0
Locale: ewo
Task: TTS
Format: MP3, TSV
Size: 152.70 MB
Common Voice Scripted Speech 24.0 - Esperanto
License: CC0-1.0
Locale: eo
Task: ASR
Format: MP3
Size: 38.69 GB
Hawrami Kurdish TTS dataset 1.0
License: CC-BY-4.0
Locale: hac
Task: TTS
Format: WAV
Size: 706.11 MB
Common Voice Spontaneous Speech 2.0 - Arvanitika
License: CC0-1.0
Locale: aat
Task: ASR
Format: MP3
Size: 46.68 MB
Common Voice Scripted Speech 24.0 - Losso
License: CC0-1.0
Locale: nmz
Task: ASR
Format: MP3
Size: 205.70 MB
Common Voice Scripted Speech 24.0 - Massa
License: CC0-1.0
Locale: mcn
Task: ASR
Format: MP3
Size: 217.68 MB
Common Voice Scripted Speech 24.0 - Kalenjin
License: CC0-1.0
Locale: kln
Task: ASR
Format: MP3
Size: 1.68 GB
Common Voice Scripted Speech 24.0 - Loja Highland Kichwa
License: CC0-1.0
Locale: qvj
Task: ASR
Format: MP3
Size: 221.72 MB
Common Voice Spontaneous Speech 2.0 - Wixárika
License: CC0-1.0
Locale: hch
Task: ASR
Format: MP3
Size: 198.80 MB
Mozilla Common Voice Spontaneous Speech ASR Shared Task Test Data
License: CC0-1.0
Locale: mul
Task: ASR
Format: MP3, TSV
Size: 784.80 MB
ReRooted: Speech Corpus of Testimonials from Armenian Refugees and Immigrants
License: GPL-3.0
Locale: hy
Task: ASR
Format: WAV, TEXTGRID
Size: 3.25 GB