This dataset is collection of 140 Turkish poems from 7 poets.
This dataset consists 140 poems, each of them is annotated with the name of its writer, and text id's. Raw dataset includes only the name of the poets as labels.
Label | Description |
---|---|
oveli | Orhan Veli |
nazim | Nazim Hikmet |
iozel | Ismet Ozel |
cahit | Cahit Sitki Taranci |
ataol | Ataol Behramoglu |
ahmettelli | Ahmet Telli |
necip | Necip Fazil Kisakurek |
Explain the fields of the instances.
field | dtype |
---|---|
poet | string |
file_id | integer |
poem | string |
Train/validation/test split sizes are not indicated.
The dataset is motivated by the desire to advance text classification in Turkish language.
The authors gathered the poems from famous Turkish poets.
This dataset is part of an effort to encourage text classification research in Turkish language. Using this dataset an arff document that contains 13878 properties is created.
List the names of the creators of the dataset. Example:
"Published by Banu Diri and M. Fatih Amasyali. "
Please cite the following paper (arXiv) if you found this dataset useful:
"Identifying the poets of the anonymous poems", Diri B., Amasyalı M. F., (2003), TAINN 2003, Çanakkale