Articolul precedent |
Articolul urmator |
446 12 |
Ultima descărcare din IBN: 2023-11-09 14:14 |
SM ISO690:2012 HLAVCHEVA, Yulia, GLAVCHEV, Maksym, BOBICEV, Victoria, KANISHCHEVA, Olga. Language-independent features for authorship attribution on Ukrainian texts. In: CEUR Workshop Proceedings, 2-3 decembrie 2020, Kiev. Kyiv, Ukraine: Taras Shevchenko National University of Kyiv, 2021, Vol. 2833, pp. 134-143. ISSN 16130073. |
EXPORT metadate: Google Scholar Crossref CERIF DataCite Dublin Core |
CEUR Workshop Proceedings Vol. 2833, 2021 |
||||||
Conferința "7th International Conference "Information Technology and Interactions"" Kiev, Ucraina, 2-3 decembrie 2020 | ||||||
|
||||||
Pag. 134-143 | ||||||
|
||||||
Descarcă PDF | ||||||
Rezumat | ||||||
Authorship attribution is the natural language processing task of the author identification of an input text. The main goal of this task is to define the salient characteristics of documents that capture the author's writing style. In this paper, we analyze language-independent features for authorship attribution. All experiments were realized on the corpus of Ukrainian scientific papers. For the experiments we used Bayes Based Algorithms (Naive Bayes Multinomial), Support Vector Machine (SMO) and Decision Trees (LMT, J48) methods. The experimental results of the scientific text classification demonstrated that Decision Trees method in most cases outperforms other machine learning methods, and the proposed in the paper language-independent features are appropriate for the Ukrainian scientific documents authorship attribution. |
||||||
Cuvinte-cheie authorship attribution, Language-Independent Features, machine learning methods, text classification, Writing Style |
||||||
|