Punjabi Literature Corpus
License:
CC-BY-NC-4.0
Steward:
Tamahi Suneha Magazine
Task: OTH
Release Date: 11/7/2025
Format: TXT
Size: 1.83 MB
Description
This corpus contains 10,39,430 tokens of Punjabi Shahmukhi script.
Specifics
Licensing
Creative Commons Attribution Non Commercial 4.0 International (CC-BY-NC-4.0)
https://spdx.org/licenses/CC-BY-NC-4.0.htmlMetadata
This corpus includes Punjabi Shahmukhi literary works such as stories, novels, fiction, non-fiction, and poetry.
