Taruen

About us

We are a group developing natural language processing (NLP) tools for Tatar and related languages. We are based in Istanbul.

Datasets

5 Datasets

Name | License | Language | Category | Format | Size
Dolgan Folklore Text Corpus | CC0-1.0 | dlg | NLP | TXT | 57.15 KB
Kyrgyz Folklore Text Corpus | CC0-1.0 | ky | NLP | TXT | 1.28 MB
Polish Public Domain 20th Century Literature Text Corpus | CC0-1.0 | pl | NLP | TXT | 10.86 MB
Tatar Folklore Text Corpus | CC0-1.0 | tt | NLP | TXT | 1.40 MB
World Factbook (JSON) | CC0-1.0 | en | NLP | JSON | 7.10 MB