Datasets

Common Voice

Common Voice Scripted Speech 25.0 - Kabyle

A collection of read speech recordings in Kabyle.

License: CC0-1.0

Locale: kab

Task: ASR

Format: MP3

Size: 17.43 GB

Common Voice

Common Voice Scripted Speech 25.0 - Basque

A collection of read speech recordings in Basque.

License: CC0-1.0

Locale: eu

Task: ASR

Format: MP3

Size: 14.48 GB

Common Voice

Common Voice Scripted Speech 25.0 - Japanese

A collection of read speech recordings in Japanese.

License: CC0-1.0

Locale: ja

Task: ASR

Format: MP3

Size: 14.34 GB

Common Voice

Common Voice Scripted Speech 25.0 - Luganda

A collection of read speech recordings in Luganda.

License: CC0-1.0

Locale: lg

Task: ASR

Format: MP3

Size: 11.06 GB

Common Voice

Common Voice Scripted Speech 25.0 - Czech

A collection of read speech recordings in Czech.

License: CC0-1.0

Locale: cs

Task: ASR

Format: MP3

Size: 5.56 GB

Common Voice

Common Voice Scripted Speech 25.0 - Urdu

A collection of read speech recordings in Urdu.

License: CC0-1.0

Locale: ur

Task: ASR

Format: MP3

Size: 5.78 GB

Common Voice

Common Voice Scripted Speech 25.0 - Georgian

A collection of read speech recordings in Georgian.

License: CC0-1.0

Locale: ka

Task: ASR

Format: MP3

Size: 6.37 GB

Common Voice

Common Voice Scripted Speech 25.0 - Thai

A collection of read speech recordings in Thai.

License: CC0-1.0

Locale: th

Task: ASR

Format: MP3

Size: 8.38 GB

Common Voice

Common Voice Scripted Speech 25.0 - Russian

A collection of read speech recordings in Russian.

License: CC0-1.0

Locale: ru

Task: ASR

Format: MP3

Size: 6.55 GB

Common Voice

Common Voice Scripted Speech 25.0 - Italian

A collection of read speech recordings in Italian.

License: CC0-1.0

Locale: it

Task: ASR

Format: MP3

Size: 9.71 GB

Common Voice

Common Voice Scripted Speech 25.0 - Galician

A collection of read speech recordings in Galician.

License: CC0-1.0

Locale: gl

Task: ASR

Format: MP3

Size: 7.81 GB

Common Voice

Common Voice Scripted Speech 25.0 - Latvian

A collection of read speech recordings in Latvian.

License: CC0-1.0

Locale: lv

Task: ASR

Format: MP3

Size: 5.84 GB

Common Voice

Common Voice Scripted Speech 25.0 - Persian

A collection of read speech recordings in Persian.

License: CC0-1.0

Locale: fa

Task: ASR

Format: MP3

Size: 10.40 GB

Common Voice

Common Voice Scripted Speech 25.0 - Tamil

A collection of read speech recordings in Tamil.

License: CC0-1.0

Locale: ta

Task: ASR

Format: MP3

Size: 8.57 GB

Common Voice

Common Voice Scripted Speech 25.0 - Uyghur

A collection of read speech recordings in Uyghur.

License: CC0-1.0

Locale: ug

Task: ASR

Format: MP3

Size: 9.69 GB

Common Voice

Common Voice Scripted Speech 25.0 - Kabardian

A collection of read speech recordings in Kabardian.

License: CC0-1.0

Locale: kbd

Task: ASR

Format: MP3

Size: 5.52 GB

Common Voice

Common Voice Scripted Speech 25.0 - Frisian

A collection of read speech recordings in Frisian.

License: CC0-1.0

Locale: fy-NL

Task: ASR

Format: MP3

Size: 4.34 GB

Common Voice

Common Voice Scripted Speech 25.0 - Welsh

A collection of read speech recordings in Welsh.

License: CC0-1.0

Locale: cy

Task: ASR

Format: MP3

Size: 3.89 GB

Common Voice

Common Voice Scripted Speech 25.0 - Central Kurdish

A collection of read speech recordings in Central Kurdish.

License: CC0-1.0

Locale: ckb

Task: ASR

Format: MP3

Size: 3.59 GB

Common Voice

Common Voice Scripted Speech 25.0 - Hungarian

A collection of read speech recordings in Hungarian.

License: CC0-1.0

Locale: hu

Task: ASR

Format: MP3

Size: 3.58 GB

Common Voice

Common Voice Scripted Speech 25.0 - Chinese (Hong Kong)

A collection of read speech recordings in Chinese (Hong Kong).

License: CC0-1.0

Locale: zh-HK

Task: ASR

Format: MP3

Size: 3.42 GB

Common Voice

Common Voice Scripted Speech 25.0 - Meadow Mari

A collection of read speech recordings in Meadow Mari.

License: CC0-1.0

Locale: mhr

Task: ASR

Format: MP3

Size: 5.73 GB

Common Voice

Common Voice Scripted Speech 25.0 - Arabic

A collection of read speech recordings in Arabic.

License: CC0-1.0

Locale: ar

Task: ASR

Format: MP3

Size: 3.28 GB

Common Voice

Common Voice Scripted Speech 25.0 - Dutch

A collection of read speech recordings in Dutch.

License: CC0-1.0

Locale: nl

Task: ASR

Format: MP3

Size: 3.16 GB