140 Poems

This dataset is a collection of 140 Turkish poems by 7 poets.

Dataset Details

The dataset consists of 140 poems, each annotated with the name of its poet and a text id. The raw dataset uses only the poets' names as labels.

Label Description
oveli Orhan Veli
nazim Nazim Hikmet
iozel Ismet Ozel
cahit Cahit Sitki Taranci
ataol Ataol Behramoglu
ahmettelli Ahmet Telli
necip Necip Fazil Kisakurek
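
For classification work it can help to keep the label table above in code. The following is a minimal sketch: the dictionary contents are copied from the table, while the names `LABEL_TO_POET`, `LABEL_TO_ID`, and `poet_name`, and the choice of sorted order for the integer ids, are illustrative assumptions and not part of the dataset.

```python
# Label codes and full poet names, copied from the label table.
LABEL_TO_POET = {
    "oveli": "Orhan Veli",
    "nazim": "Nazim Hikmet",
    "iozel": "Ismet Ozel",
    "cahit": "Cahit Sitki Taranci",
    "ataol": "Ataol Behramoglu",
    "ahmettelli": "Ahmet Telli",
    "necip": "Necip Fazil Kisakurek",
}

# Integer class ids assigned in sorted label order (an illustrative choice,
# not something the dataset defines).
LABEL_TO_ID = {label: i for i, label in enumerate(sorted(LABEL_TO_POET))}


def poet_name(label: str) -> str:
    """Return the full poet name for a label code; raise on an unknown code."""
    try:
        return LABEL_TO_POET[label]
    except KeyError:
        raise ValueError(f"unknown poet label: {label!r}") from None
```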

Samples

{
    "poet": "necip",
    "file_id": "12",
    "poem": "Sanat\r\n \r\n \r\n \r\n \r\nAnladım işi, sanat Allah'ı aramakmış; \r\nMarifet bu, gerisi yalnız çelik-çomakmış...\r\n \r\n \r\n \r\n"
}

Fields

Each instance has the following fields:

field dtype
poet string
file_id integer
poem string
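
The sample shows `file_id` as a quoted string and a poem padded with `\r\n` and whitespace-only lines, while the field table lists `file_id` as an integer. The sketch below is one possible way to normalize a record before use. The function name `normalize_record` and the cleaning rules (collapsing blank-line runs, trimming each line) are assumptions made for illustration, not a documented preprocessing step of the dataset.

```python
def normalize_record(raw: dict) -> dict:
    """Coerce field types and tidy the poem's whitespace.

    Assumes `raw` has the three fields listed in the field table.
    """
    # Use Unix line endings and strip trailing/leading spaces on every line.
    lines = [line.strip() for line in raw["poem"].replace("\r\n", "\n").split("\n")]

    cleaned = []
    for line in lines:
        # Keep text lines; keep a blank line only if it follows a text line,
        # so runs of blank lines collapse to one and leading blanks vanish.
        if line or (cleaned and cleaned[-1]):
            cleaned.append(line)
    # Drop a trailing blank line, if any.
    while cleaned and not cleaned[-1]:
        cleaned.pop()

    return {
        "poet": str(raw["poet"]),
        "file_id": int(raw["file_id"]),  # accepts "12" as well as 12
        "poem": "\n".join(cleaned),
    }
```

Whether the blank lines carry meaning (for example stanza breaks) is not stated, so the sketch keeps a single blank line between blocks of text instead of removing them all.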

Splits

Train/validation/test split sizes are not indicated.
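
Because no official split is given, users have to create their own. The following is a minimal sketch of a per-poet (stratified) train/test split using only the standard library. The function name `stratified_split`, the 20% test fraction, and the fixed seed are illustrative assumptions; the dataset does not prescribe any of them.

```python
import random
from collections import defaultdict


def stratified_split(records, test_fraction=0.2, seed=0):
    """Split records into (train, test), sampling each poet separately.

    `records` is a list of dicts that each have a "poet" key.
    """
    by_poet = defaultdict(list)
    for rec in records:
        by_poet[rec["poet"]].append(rec)

    rng = random.Random(seed)  # fixed seed so the split is reproducible
    train, test = [], []
    for poet in sorted(by_poet):  # sorted for a deterministic order
        group = list(by_poet[poet])
        rng.shuffle(group)
        # Hold out at least one poem per poet when the poet has more than one.
        n_test = max(1, round(len(group) * test_fraction)) if len(group) > 1 else 0
        test.extend(group[:n_test])
        train.extend(group[n_test:])
    return train, test
```

Sampling within each poet keeps every label represented in both parts, which matters for a corpus this small.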

Dataset Creation

Curation Rationale

The dataset is motivated by the desire to advance text classification for the Turkish language.

Data Source

The authors collected poems written by famous Turkish poets.

Annotations

Quality

Personal and Sensitive Information

Considerations

Social Impact of Dataset

This dataset is part of an effort to encourage text classification research for the Turkish language. An ARFF file containing 13878 attributes was created from this dataset.

Additional Information

Dataset Curators

Published by Banu Diri and M. Fatih Amasyali.

Citation Information

Please cite the following paper if you find this dataset useful:

"Identifying the poets of the anonymous poems", Diri B., Amasyalı M. F., (2003), TAINN 2003, Çanakkale

@inproceedings{diri2003identifying,
    title={Identifying the poets of the anonymous poems},
    author={Banu Diri and Mehmet Fatih Amasyali},
    booktitle={TAINN 2003},
    address={Çanakkale},
    year={2003}
}