Thorsten-Voice Dataset 2021.06 Emotional
License:
CC0-1.0
Steward:
CommunityTask: TTS
Release Date: 2/27/2026
Format: WAV,CSV
Size: 380.80 MB
Share
Description
Thorsten-Voice Dataset 2021.06 (emotional) is a German emotional speech dataset recorded by Thorsten Müller and audio-optimized by Dominik Kreutz. It contains 2,400 recordings representing eight distinct emotions. The dataset is released under CC0 to enable unrestricted research and commercial use.
Specifics
Considerations
Restrictions/Special Constraints
None. Released under CC0 (public domain dedication).
Forbidden Usage
None from the licensor’s side. Users are responsible for complying with applicable laws and ethical standards.
Processes
Ethical Review
The dataset consists exclusively of voluntary recordings of the contributor’s own voice. No third-party voices or personal data are included. All recordings were created with the explicit intention of unrestricted public release under CC0. No formal institutional ethical review was required, as the dataset contains only self-recorded material. The dataset was released in the spirit of openness, equality, and free access to knowledge. The contributor encourages responsible and socially beneficial use.
Intended Use
Intended for emotional text-to-speech (TTS), expressive speech synthesis, emotion recognition research, benchmarking, and commercial speech technology development.
Metadata
Dataset Structure
300 sentences
8 emotions
2,400 total recordings (300 × 8)
Emotions Included
Neutral (~19 min)
Disgusted (~23 min)
Angry (~20 min)
Amused (~18 min)
Surprised (~18 min)
Sleepy (~30 min)
Drunk-style speech (~25 min)
(Recorded sober; speech style only)Whispering (~22 min)
Technical Details
WAV files (mono)
22,050 Hz sample rate
Normalized to -24 dB
No leading or trailing silence
Sentence length: 59–148 characters
Licensing
Released under CC0 (public domain dedication).
No restrictions apply.
Project Context
The goal of the Thorsten-Voice project is to provide high-quality German voice datasets and TTS models as free and open resources.
Contributor Statement (Thorsten Müller)
“I contribute my personal voice as a person believing in a world where all people are equal, regardless of gender, sexual orientation, religion, skin color, or geographic coordinates of birth. I believe in a global world where everyone is welcome everywhere and where free knowledge and education is accessible to all.”