Kokoro Speech Dataset

License:

LibriVox

Steward:

Community

Task: TTS

Release Date: 3/10/2026

Format: FLAC

Size: 3.98 GB

Description

Kokoro Speech Dataset is a public domain Japanese speech dataset. It contains 43,253 short audio clips of a single speaker reading 14 novels. The metadata format is similar to that of LJ Speech, so the dataset is compatible with modern speech synthesis systems. The texts are from Aozora Bunko and the audio clips are from the LibriVox project; both are in the public domain. Readings are estimated from the kanji-kana mixed text by MeCab and UniDic Lite, then romanized in a format similar to the one used by Julius. The audio clips were split and the transcripts were aligned automatically by Kokoro-Align.
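
Because the metadata follows an LJ Speech-like layout, it can probably be read as a pipe-delimited text file. The following is a minimal sketch, assuming three fields per line (clip ID, kanji-kana transcript, romanized reading). The field order, the `Clip` type, the `read_metadata` helper, and the sample rows are illustrative assumptions and are not taken from the dataset itself; check the actual metadata file before relying on them.

```python
import csv
import io
from typing import Iterator, NamedTuple


class Clip(NamedTuple):
    """One metadata row (assumed layout, see the note above)."""
    clip_id: str     # identifier of the audio clip
    transcript: str  # kanji-kana mixed text
    reading: str     # romanized reading


def read_metadata(stream) -> Iterator[Clip]:
    """Yield Clip rows from an LJ Speech-style, pipe-delimited stream."""
    # LJ Speech-style metadata has no quoting, so disable quote handling.
    reader = csv.reader(stream, delimiter="|", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) != 3:
            continue  # skip blank or malformed lines
        yield Clip(*row)


# Hypothetical sample rows for illustration only, not real dataset entries.
sample = io.StringIO(
    "example-0001|吾輩は猫である。|wagahai wa neko de aru.\n"
    "example-0002|名前はまだ無い。|namae wa mada nai.\n"
)
clips = list(read_metadata(sample))
print(clips[0].clip_id, clips[0].reading)
```

To read a real file, pass an open handle instead of the in-memory sample, for example `open(path, encoding="utf-8", newline="")`, where `path` points at the dataset's metadata file.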

Specifics

Licensing

LibriVox Public domain

https://librivox.org/pages/public-domain/

Considerations

Restrictions/Special Constraints

This dataset is in the public domain in the USA (and most likely other countries as well). There are no restrictions on its use. For more information, please see: librivox.org/pages/public-domain.

Forbidden Usage

None. As a public domain work, this dataset has no forbidden uses. For more information, please see: librivox.org/pages/public-domain.
