To access the full text documents, please follow this link: http://hdl.handle.net/2117/86942

Discriminative learning within Arabic statistical machine translation
España Bonet, Cristina; Giménez, Jesús; Màrquez Villodre, Lluís
Universitat Politècnica de Catalunya. Departament de Llenguatges i Sistemes Informàtics; Universitat Politècnica de Catalunya. GPLN - Grup de Processament del Llenguatge Natural
Written Arabic is a especially ambiguous due to the lack of diacritisation of texts, and this makes the translation harder for automatic systems that do not take into account the context of phrases. Here, we use a standard Phrase-Based Statistical Machine Translation architecture to build an Arabic-to-English translation system, but we extend it by incorporating a local discriminative phrase selection model which addresses this semantic ambiguity. Local classifiers are trained using both linguistic information and context to translate a phrase, and this significantly increases the accuracy in phrase selection with respect to the most frequent translation traditionally considered. These classifiers are integrated into the translation system so that the global task gets benefits from the discriminative learning. As a result, we obtain improvements in the full translation of Arabic documents at the lexical, syntactic and semantic levels as measured by an heterogeneous set of automatic metrics.
-Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial
-Statistical machine translation
-Discriminative learning
-Arabic
-English
Article - Published version
Report
         

Show full item record

Related documents

Other documents of the same author

España Bonet, Cristina; Màrquez Villodre, Lluís; Labaka, Gorka; Díaz de Ilarraza Sánchez, Arantza; Sarasola Gabiola, Kepa
Labaka, Gorka; Díaz de Ilarraza Sánchez, Arantza; Sarasola Gabiola, Kepa; España Bonet, Cristina; Màrquez Villodre, Lluís
Chechev, Milen; González Bermúdez, Meritxell; Màrquez Villodre, Lluís; España Bonet, Cristina
Muntés Mulero, Víctor; Paladini Adell, Patricia; España Bonet, Cristina; Màrquez Villodre, Lluís
Enache, Ramona; España Bonet, Cristina; Ranta, Aarne; Màrquez Villodre, Lluís
 

Coordination

 

Supporters