Optical Character Recognition Applied to Romanian Printed Texts of the 18th{20th Century
Închide
Conţinutul numărului revistei
Articolul precedent
Articolul urmator
1035 29
Ultima descărcare din IBN:
2024-03-13 15:18
Căutarea după subiecte
similare conform CZU
519.7:004.93'1:002.2(498)"18/20"sec. (1)
Cibernetică matematică (93)
Informatică aplicată. Tehnici bazate pe calculator cu aplicații practice (438)
Documentare. Cărți. Scrieri. Autori (67)
SM ISO690:2012
COJOCARU, Svetlana, COLESNICOV, Alexandru, MALAHOV, Ludmila, BUMBU, Tudor. Optical Character Recognition Applied to Romanian Printed Texts of the 18th{20th Century . In: Computer Science Journal of Moldova, 2016, nr. 1(70), pp. 106-117. ISSN 1561-4042.
EXPORT metadate:
Google Scholar
Crossref
CERIF

DataCite
Dublin Core
Computer Science Journal of Moldova
Numărul 1(70) / 2016 / ISSN 1561-4042 /ISSNe 2587-4330

Optical Character Recognition Applied to Romanian Printed Texts of the 18th{20th Century
CZU: 519.7:004.93'1:002.2(498)"18/20"sec.

Pag. 106-117

Cojocaru Svetlana, Colesnicov Alexandru, Malahov Ludmila, Bumbu Tudor
 
Institute of Mathematics and Computer Science ASM
 
 
Disponibil în IBN: 28 aprilie 2016


Rezumat

The paper discusses Optical Character Recognition (OCR) of historical texts of the 18th{20th century in the Romanian language using the Cyrillic script. We differ three epochs (approximately, the 18th, 19th, and 20th centuries), with different usage of the Cyrillic alphabet in Romanian and, correspondingly, different approach to OCR. We developed historical alphabets and sets of glyphs recognition templates specific for each epoch. The dictionaries in proper alphabets and orthographies were also created. In addition, virtual keyboards, fonts, transliteration utilities, etc. were developed. The resulting technology and toolset permit successful recognition of historical Romanian texts in the Cyrillic script. After transliteration to the modern Latin script we obtain no-barrier access to historical documents.