Multilingual Fine-Grained Named Entity Recognition
SM ISO690:2012
LUPANCU, Viorica-Camelia, IFTENE, Adrian. Multilingual Fine-Grained Named Entity Recognition. In: Computer Science Journal of Moldova, 2023, vol. 31, no. 3(93), pp. 321-339. ISSN 1561-4042. DOI: https://doi.org/10.56415/csjm.v31.16
Computer Science Journal of Moldova
Volume 31, Number 3(93) / 2023 / ISSN 1561-4042 / eISSN 2587-4330

DOI: https://doi.org/10.56415/csjm.v31.16
UDC: 004.9

Pages 321-339

Lupancu Viorica-Camelia, Iftene Adrian
 
Alexandru Ioan Cuza University of Iaşi
 
 
Available in IBN: 16 January 2024


Abstract

The “MultiCoNER II Multilingual Complex Named Entity Recognition” task within the SemEval 2023 competition focuses on identifying complex named entities (NEs) in several languages, such as the titles of creative works (e.g., songs, books, movies), people with different titles (e.g., politicians, scientists, artists, athletes), and different categories of products (e.g., food, drinks, clothing). In the context of SemEval, our team, FII_Better, explored the capabilities of a base transformer model on this task, focusing on five languages (English, Spanish, Swedish, German, and Italian). We took DistilBERT (a distilled version of BERT) and BERT (Bidirectional Encoder Representations from Transformers) as two examples of basic transformer models, using DistilBERT as a baseline and BERT as the platform for an improved model. We obtained fair results in the chosen languages: moderate results in the English track (17th out of 34), while our results in the other tracks leave room for improvement (overall, third to last).
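
To make the notion of fine-grained entities concrete, the following is a minimal sketch of how per-token labels could be grouped into typed entity spans. It assumes a BIO tagging scheme, which the abstract does not state. The function name `bio_to_spans`, the example sentence, and the label names `Artist` and `MusicalWork` are illustrative assumptions, not taken from the paper. No model is involved here; the tags are written by hand.

```python
from typing import List, Optional, Tuple


def bio_to_spans(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Group BIO-tagged tokens into (entity_text, entity_type) pairs."""
    spans: List[Tuple[str, str]] = []
    current_tokens: List[str] = []
    current_type: Optional[str] = None

    for token, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            # A B- tag always starts a new entity; flush any open one first.
            if current_tokens:
                spans.append((" ".join(current_tokens), current_type))
            current_tokens, current_type = [token], tag[2:]
        elif tag.startswith("I-") and current_type == tag[2:]:
            # An I- tag of the same type continues the open entity.
            current_tokens.append(token)
        else:
            # "O" or an inconsistent I- tag closes any open entity;
            # the token itself is not added to any span.
            if current_tokens:
                spans.append((" ".join(current_tokens), current_type))
            current_tokens, current_type = [], None

    # Flush an entity that runs to the end of the sentence.
    if current_tokens:
        spans.append((" ".join(current_tokens), current_type))
    return spans


# Hand-written illustrative input (hypothetical labels).
tokens = ["Bob", "Dylan", "wrote", "Blowin'", "in", "the", "Wind"]
tags = ["B-Artist", "I-Artist", "O",
        "B-MusicalWork", "I-MusicalWork", "I-MusicalWork", "I-MusicalWork"]
example_spans = bio_to_spans(tokens, tags)
print(example_spans)
```

In a full system, the tags would come from a token-classification model such as the BERT-based one described above, and a step like this would turn its per-token predictions into the entity mentions that are scored.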