Author(s): Osman ASLANOĞLU
In linguistics and language processing, the term corpus denotes a large collection of texts gathered in a regular, structured way. Corpora may be monolingual or multilingual. The corpus method makes it possible to analyse texts in a language easily and in many ways. This article, which we have translated, focuses on the results of a comparison between the Kurmanji and Sorani dialects of Kurdish. Resource scarcity and diversity, both in dialect and in script, are the two primary challenges in Kurdish language processing. In this paper we aim to address these two problems by (i) building a text corpus for Sorani and Kurmanji, the two main dialects of Kurdish, and (ii) highlighting some of the orthographic, phonological, and morphological differences between these two dialects from statistical and rule-based perspectives.
The Journal of International Social Research has received 8982 citations according to Google Scholar.