Para acceder a los documentos con el texto completo, por favor, siga el siguiente enlace:

Using collocation segmentation to augment the phrase table
Henriquez, Carlos A; Ruiz Costa-Jussà, Marta; Daudaravicius, Vidas; Banchs, Rafael; Mariño, José B.
Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions; Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
This paper describes the 2010 phrase-based statistical machine translation system developed at the TALP Research Center of the UPC1 in cooperation with BMIC2 and VMU3. In phrase-based SMT, the phrase table is the main tool in translation. It is created extracting phrases from an aligned parallel corpus and then computing translation model scores with them. Performing a collocation segmentation over the source and target corpus before the alignment causes that di erent and larger phrases are extracted from the same original documents. We performed this segmentation and used the union of this phrase set with the phrase set extracted from the nonsegmented corpus to compute the phrase table. We present the con gurations considered and also report results obtained with internal and o cial test sets.
Àrees temàtiques de la UPC::Informàtica
Machine translating
Machine translation
Traducció automàtica
Association for Computational Linguistics

Mostrar el registro completo del ítem

Documentos relacionados

Otros documentos del mismo autor/a

Henriquez, Carlos A; Ruiz Costa-Jussà, Marta; Daudaravicius, Vidas; Banchs, Rafael E.; Mariño, José B.
Farrus, Mireia; Ruiz Costa-Jussà, Marta; Popovic, Maya; Henriquez, Carlos A