Toward the Soundness of Sense Structure Definitions in Thesaurus-Dictionaries. Parsing Problems and Solutions
Citation (SM ISO690:2012):
CURTEANU, Neculai, MORUZ, Mihai-Alex. Toward the Soundness of Sense Structure Definitions in Thesaurus-Dictionaries. Parsing Problems and Solutions. In: Computer Science Journal of Moldova, 2012, nr. 3(60), pp. 275-303. ISSN 1561-4042.
Computer Science Journal of Moldova
Issue 3(60) / 2012 / ISSN 1561-4042 / eISSN 2587-4330
UDC: 004.9:81'374.82

Pages 275-303

Curteanu Neculai (1), Moruz Mihai-Alex (1, 2)

(1) Institute for Computer Science, Romanian Academy, Iasi Branch
(2) Alexandru Ioan Cuza University of Iaşi

Available in IBN: 4 December 2013


Abstract

In this paper we point out some difficult problems of thesaurus-dictionary entry parsing, relying on the parsing technology of SCD (Segmentation-Cohesion-Dependency) configurations, successfully applied to six of the largest thesauri: Romanian (2), French, German (2), and Russian. Challenging problems: (a) intricate and/or recursive structures of the lexicographic segments found in the entries of certain thesauri; (b) cyclic (recursive) calls of some sense marker classes on marker sequences; (c) establishing the hypergraph-driven dependencies between all the atomic and non-atomic sense definitions. The classical approach to solving these parsing problems is hard, mainly because of the depth-first search of sense definitions and markers, the substantial complexity of the entries, and the dynamic construction of the sense tree embodied within these parsers. SCD-based parsing solutions: (a) the SCD parsing method is a procedural tool, completely free of formal grammars, that handles the recursive structure of the lexicographic segments through procedural non-recursive calls performed on the SCD parsing configurations of the entry structure; (b) to deal with cyclic (recursive) calls between secondary sense markers and sense enumeration markers, we propose the Enumeration Closing Condition, sometimes coupled with New_Paragraphs typographic markers transformed into numeral sense enumeration; (c) these problems, their lexicographic modeling, and their parsing solutions are addressed both to dictionary parser programmers, who can experiment with the SCD-based parsing method, and to lexicographers and thesaurus designers, who can use them to tailor balanced lexical-semantic granularities and sounder sense tree definitions for dictionary entries.

Keywords
dictionary entry parsing, parsing method of SCD configurations, recursive lexicographic segments, recursive calls of sense markers
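
The abstract contrasts recursive, depth-first sense parsing with procedural non-recursive calls. As a rough illustration of that general idea only, the following minimal sketch builds a sense tree from a flat sequence of sense markers using an explicit stack instead of recursion. It is not the SCD parser and does not reproduce the Enumeration Closing Condition; the marker classes, their ranking, the `SenseNode` type, and the function names are all hypothetical assumptions introduced for this example.

```python
from dataclasses import dataclass, field

# Hypothetical marker classes, ordered from highest to lowest rank.
# Illustrative assumption only: not the marker classes of any real thesaurus.
MARKER_RANK = {"roman": 0, "arabic": 1, "letter": 2}


@dataclass
class SenseNode:
    marker: str
    rank: int
    text: str = ""
    children: list = field(default_factory=list)


def build_sense_tree(tokens):
    """Build a sense tree from (marker_class, marker, text) triples, iteratively."""
    root = SenseNode(marker="ROOT", rank=-1)
    stack = [root]  # path from the root to the most recently opened sense
    for marker_class, marker, text in tokens:
        rank = MARKER_RANK[marker_class]
        # Close every open sense whose rank is the same as or lower than the new one.
        while stack[-1].rank >= rank:
            stack.pop()
        node = SenseNode(marker, rank, text)
        stack[-1].children.append(node)
        stack.append(node)
    return root


def outline(root):
    """Return an indented listing of the tree, using an explicit work stack."""
    lines = []
    work = [(child, 0) for child in reversed(root.children)]
    while work:
        node, depth = work.pop()
        lines.append("  " * depth + f"{node.marker} {node.text}")
        work.extend((c, depth + 1) for c in reversed(node.children))
    return lines


# Hypothetical entry fragment, already segmented into markers and definitions.
demo_tokens = [
    ("roman", "I.", "first"),
    ("arabic", "1.", "a"),
    ("letter", "a)", "x"),
    ("arabic", "2.", "b"),
    ("roman", "II.", "second"),
]
demo_lines = outline(build_sense_tree(demo_tokens))
print("\n".join(demo_lines))
```

Because both the construction and the traversal use explicit stacks, the depth of the sense tree does not translate into call-stack depth. A fixed rank per marker class is a simplification: cyclic calls between marker classes, which the abstract names as a problem, would need additional handling that this sketch does not attempt.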