To access the full text documents, please follow this link: http://hdl.handle.net/2117/83344

Learning agglutinative morphology of indian languages with linguistically motivated adaptor grammars
Kumar, Arun; Padró, Lluís; Oliver González, Antoni
Universitat Politècnica de Catalunya. Departament de Ciències de la Computació; Universitat Politècnica de Catalunya. GPLN - Grup de Processament del Llenguatge Natural
In this paper an automatic morphology learning system for complex and agglutinative languages is presented. We process complex agglutinative morphology of Indian languages using Adaptor Grammars and linguistic rules of morphology. Adaptor Grammars are a compositional Bayesian framework for grammatical inference, where we define a morphological boundaries are inferred from a corpora of plain text. Once it produces morphological segmentation, regular expressions for orthography rules are applied to achieve final segmentation. We test our algorithm in the case of three complex languages from the Dravidian family and evaluate the results comparing to other state of the art unsupervised morphology learning systems and show significant improvements in the results.
Àrees temàtiques de la UPC::Informàtica::Llenguatges de programació
Àrees temàtiques de la UPC::Informàtica
Natural language processing (Computer science)
Tractament del llenguatge natural (Informàtica)
http://creativecommons.org/licenses/by-nc-nd/3.0/es/
info:eu-repo/semantics/publishedVersion
info:eu-repo/semantics/conferenceObject
         

Show full item record

Related documents

Other documents of the same author

Padró, Lluís; Turmo Borras, Jorge; Alegria, Iñaki; Aranberri, Nora; Fresno, Víctor; Samallo, Pablo; San Vicente, Iñaki; Zubiaga, Arkaitz
Ageno Pulido, Alicia; Comas Umbert, Pere Ramon; Padró, Lluís; Turmo Borras, Jorge
Carreras, Xavier; Padró, Lluís; Zhang, Lei; Rettinger, Achim; Li, Zhixing; García Cuesta, Esteban; Agic, Zeljko; Bekavac, Bozo; Fortuna, Blaz; Stajner, Tadej
Padró, Lluís; Agic, Zeljko; Carreras, Xavier; Fortuna, Blaz; García Cuesta, Esteban; Li, Zhixing; Stajner, Tadej; Tadic, Marko
 

Coordination

 

Supporters