Kokoro Speech Dataset
License:
libribox
Steward:
CommunityTask: TTS
Release Date: 3/10/2026
Format: FLAC
Size: 3.98 GB
Share
Description
Kokoro Speech Dataset is a public domain Japanese speech dataset. It contains 43,253 short audio clips of a single speaker reading 14 novel books. The format of the metadata is similar to that of LJ Speech so that the dataset is compatible with modern speech synthesis systems. The texts are from Aozora Bunko, which is in the public domain. The audio clips are from LibriVox project, which is also in the public domain. Readings are estimated by MeCab and UniDic Lite from kanji-kana mixture text. Readings are romanized which are similar to the format used by Julius. The audio clips were split and transcripts were aligned automatically by Kokoro-Align.
Specifics
Considerations
Restrictions/Special Constraints
This dataset is in the public domain in the USA (and most likely other countries as well). There are no restrictions on its use. For more information, please see: librivox.org/pages/public-domain.
Forbidden Usage
This dataset is in the public domain in the USA (and most likely other countries as well). There are no restrictions on its use. For more information, please see: librivox.org/pages/public-domain.