Abstract:
|
The amount of abbreviations used in biomedical literature
increases constantly. Despite the existence of acronym dictionaries, it
is not viable to keep them updated with new creations. Thus, in the
processing of biomedical texts, discovering and disambiguating acronyms
and their expanded forms are essential aspects and this is the objective
proposed by BARR task at IberEval 2017 Workshop. This paper presents
our participation in this task. We propose five systems that deal with the
problem in different ways. Three of the systems are atomic approaches,
while two of them are combinations of the atomic systems. One of the
systems clearly outperforms the others, both in the detection of entities
(F-score of 0.749 in the test set) as well as identifying relations between
short-long forms (F-score of 0.697 in the test set). |