Thorsten-Voice Dataset 2023.09 Hessisch

Specifics

Licensing

Creative Commons Zero v1.0 Universal (CC0-1.0)

https://spdx.org/licenses/CC0-1.0.html

Considerations

Restrictions/Special Constraints

None. Released under CC0 (public domain dedication).

Forbidden Usage

None from the licensor’s side. Users are responsible for complying with applicable laws and ethical standards.

Processes

Ethical Review

The dataset consists exclusively of voluntary recordings of the contributor’s own voice. No third-party voices or personal data are included. All recordings were created with the explicit intention of unrestricted public release under CC0. No formal institutional ethical review was required, as the dataset contains only self-recorded material. The dataset was released in the spirit of openness, equality, and free access to knowledge. The contributor encourages responsible and socially beneficial use.

Intended Use

Intended for dialectal text-to-speech (TTS), regional speech modeling, speech synthesis research, benchmarking, and commercial speech technology development.

Metadata

Dataset Structure

2,108 recorded phrases
Standard German text pronounced in Hessian dialect (“Hessisch”)
LJSpeech-compatible file and directory structure

Technical Details

WAV files (mono)
22,050 Hz sample rate
Normalized to -24 dB
No leading or trailing silence

Licensing

Released under CC0 (public domain dedication).
No restrictions apply.

Project Context

The goal of the Thorsten-Voice project is to provide high-quality German voice datasets and TTS models as free and open resources.

Contributor Statement (Thorsten Müller)

“I contribute my personal voice as a person believing in a world where all people are equal, regardless of gender, sexual orientation, religion, skin color, or geographic coordinates of birth. I believe in a global world where everyone is welcome everywhere and where free knowledge and education is accessible to all.”

Thorsten-Voice Dataset 2023.09 Hessisch

Description

Specifics

Considerations

Processes

Metadata

Dataset Structure

Technical Details

Licensing

Project Context

Contributor Statement (Thorsten Müller)