Language-independent features for authorship attribution on Ukrainian texts
Закрыть
Articolul precedent
Articolul urmator
446 12
Ultima descărcare din IBN:
2023-11-09 14:14
SM ISO690:2012
HLAVCHEVA, Yulia, GLAVCHEV, Maksym, BOBICEV, Victoria, KANISHCHEVA, Olga. Language-independent features for authorship attribution on Ukrainian texts. In: CEUR Workshop Proceedings, 2-3 decembrie 2020, Kiev. Kyiv, Ukraine: Taras Shevchenko National University of Kyiv, 2021, Vol. 2833, pp. 134-143. ISSN 16130073.
EXPORT metadate:
Google Scholar
Crossref
CERIF

DataCite
Dublin Core
CEUR Workshop Proceedings
Vol. 2833, 2021
Conferința "7th International Conference "Information Technology and Interactions""
Kiev, Ucraina, 2-3 decembrie 2020

Language-independent features for authorship attribution on Ukrainian texts


Pag. 134-143

Hlavcheva Yulia1, Glavchev Maksym1, Bobicev Victoria2, Kanishcheva Olga1
 
1 National Technical University «Kharkiv Polytechnic Institute»,
2 Technical University of Moldova
 
 
Disponibil în IBN: 8 aprilie 2021


Rezumat

Authorship attribution is the natural language processing task of the author identification of an input text. The main goal of this task is to define the salient characteristics of documents that capture the author's writing style. In this paper, we analyze language-independent features for authorship attribution. All experiments were realized on the corpus of Ukrainian scientific papers. For the experiments we used Bayes Based Algorithms (Naive Bayes Multinomial), Support Vector Machine (SMO) and Decision Trees (LMT, J48) methods. The experimental results of the scientific text classification demonstrated that Decision Trees method in most cases outperforms other machine learning methods, and the proposed in the paper language-independent features are appropriate for the Ukrainian scientific documents authorship attribution. 

Cuvinte-cheie
authorship attribution, Language-Independent Features, machine learning methods, text classification, Writing Style