TBXTools: A free, fast and flexible tool for automatic terminology extraction

dc.contributor.author
Oliver González, Antoni
dc.contributor.author
Vàzquez Garcia, Mercè
dc.date
2016-02-04T13:08:57Z
dc.date
2015-09-15
dc.identifier.citation
1313-8502
dc.identifier.uri
http://hdl.handle.net/10609/46021
dc.description.abstract
The manual identification of terminology in specialized corpora is a complex task that needs to be addressed by flexible tools in order to facilitate the construction of multilingual terminologies, which are the main resources for computer-assisted translation tools, machine translation or ontologies. The automatic terminology extraction tools developed so far use either proprietary code or open source code that is limited to certain software functionalities. To automatically extract terms from specialized corpora for purposes such as constructing dictionaries, thesauruses or translation memories, we need open source tools into which new functionalities can easily be integrated to improve term selection. This paper presents TBXTools, a free automatic terminology extraction tool that implements linguistic and statistical methods for multiword term extraction. The tool allows users to easily identify multiword terms in specialized corpora and, if needed, translation candidates in parallel corpora. We present the main features of TBXTools along with evaluation results for term extraction on several corpora, using both statistical and linguistic methodologies.
dc.description.abstract
This paper presents TBXTools, a free automatic terminology extraction tool that implements linguistic and statistical methods for the extraction of multiword terms.
dc.description.abstract
This paper presents TBXTools, a free automatic terminology extraction tool that implements linguistic and statistical methods for the extraction of multiword terms.
dc.language.iso
eng
dc.publisher
Association for Computational Linguistics (ACL)
dc.rights
http://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.subject
Extracció de terminologia
dc.subject
Lingüística computacional
dc.subject
Extracción de terminología
dc.subject
Lingüística computacional
dc.subject
Terminology extraction
dc.subject
Computational linguistics
dc.title
TBXTools: A free, fast and flexible tool for automatic terminology extraction
dc.type
info:eu-repo/semantics/article

