Analyzing Complex Words in Hindi using Parameters of Classical Readability Formulae (Part 1)
Închide
Conţinutul numărului revistei
Articolul precedent
Articolul urmator
345 16
Ultima descărcare din IBN:
2024-02-26 07:13
Căutarea după subiecte
similare conform CZU
004.912:811.214.21 (1)
Informatică aplicată. Tehnici bazate pe calculator cu aplicații practice (438)
Limbi indiene (2)
SM ISO690:2012
VENUGOPAL, Gayatri, PRAMOD, Dhanya, JATINDERKUMA, R. Saini. Analyzing Complex Words in Hindi using Parameters of Classical Readability Formulae (Part 1). In: Computer Science Journal of Moldova, 2021, nr. 3(87), pp. 366-387. ISSN 1561-4042.
EXPORT metadate:
Google Scholar
Crossref
CERIF

DataCite
Dublin Core
Computer Science Journal of Moldova
Numărul 3(87) / 2021 / ISSN 1561-4042 /ISSNe 2587-4330

Analyzing Complex Words in Hindi using Parameters of Classical Readability Formulae (Part 1)

CZU: 004.912:811.214.21
MSC 2010: 68R10, 68Q25, 05C35, 05C05.

Pag. 366-387

Venugopal Gayatri, Pramod Dhanya, Jatinderkuma R. Saini
 
Symbiosis International (Deemed University) (SIU)
 
 
Disponibil în IBN: 3 decembrie 2021


Rezumat

Readability of a passage indicates the extent to which the meaning of the text can be understood; this could be represented in terms of the age that person should be of, or the grade that a person should be in, to understand the text. Numerous word lists and readability formulae have been devised by researchers who tested the readability of texts by involving children and adults. Most of these resources have been built for the English language. This study aims to analyse the complex words in Hindi sentences that were derived from a Human Intelligence Task (HIT), using variables considered in the widely adopted readability measures that focus on the lexical aspects of a sentence. Although there have been studies that analyse the readability of texts, this study claims to be the first of its kind, that aims to determine whether the parameters of traditional readability measures contribute significantly to context-agnostic models that classify a Hindi word as complex or simple. We report the results of two approaches used to deem a word as complex and determine the best approach out of the two. The model built using this approach was used to identify the most significant features.

Cuvinte-cheie
complex word identification, readability, hindi, binary classification, natural language processing