Para acceder a los documentos con el texto completo, por favor, siga el siguiente enlace: http://hdl.handle.net/2117/13011

Language technologies: question answering in speech transcripts
Turmo Borras, Jorge; Surdeanu, Mihai; Galibert, Olivier; Rosset, Sophie
Universitat Politècnica de Catalunya. Departament de Llenguatges i Sistemes Informàtics; Universitat Politècnica de Catalunya. GPLN - Grup de Processament del Llenguatge Natural
The Question Answering (QA) task consists of providing short, relevant answers to natural language questions. Most QA research has focused on extracting information from text sources, providing a the shortest relevant text in response to a question. For example, the correct answer to the question ”How many groups participate in the CHIL project?” is ”16”, whereas the response to “who are the partners in CHIL?” is a list of them. This simple example illustrates the two main advantages of QA over current search engines: first, the input is a natural language question rather a keyword query; and second, the answer provides the desired information content and not simply a potentially large set of documents or URLs that the user must plow through. One of the aims of the CHIL project was to provide information about what has been said during interactive seminars. Since the information must be located in speech data, the QA systems have to be able to deal with transcripts (manual or automatic) of spontaneous speech. This is a departure from much of the QA research carried by natural language groups who have typically developed techniques for written texts which are assumed to have a correct syntactic and semantic structure. The structure of spoken language is different from that of written language, and some of the anchor points used in processing such as punctuation must be inferred and are therefore error prone. Other spoken language phenomena include disfluencies, repetitions, restarts and corrections. In the case that automatic processing is used to create the speech transcripts, an additional challenge is dealing with the recognition errors. The response can be a short string, as in text-based QA, or an audio segment containing the response. This chapter summarizes the CHIL efforts devoted to QA for spoken language carried out at UPC and at CNRS-LIMSI. Research at UPC adapted a QA system developed for written texts to manually and automatically created speech transcripts, whereas at LIMSI an interactive oral QA system developed for the French language was adapted to the English language. CHIL organized the pilot track on Question Answering in Speech Transcripts (QAst), as part of CLEF 2007, in order to compare and evaluate QA technology on both manually and automatically produced transcripts of spontaneous speech.
Peer Reviewed
Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial::Llenguatge natural
Question-answering systems
Natural language processing (Computer science)
Oral question answering
Speech transcripts
Tractament del llenguatge natural (Informàtica)
info:eu-repo/semantics/publishedVersion
info:eu-repo/semantics/bookPart
Springer-Verlag
         

Mostrar el registro completo del ítem

Documentos relacionados

Otros documentos del mismo autor/a

Ferrés Domènech, Daniel; Kanaan Izquierdo, Samir; González Pellicer, Edgar; Ageno Pulido, Alicia; Fuentes Fort, Maria; Rodríguez Hontoria, Horacio; Surdeanu, Mihai; Turmo Borras, Jorge
Carreras Pérez, Xavier; Surdeanu, Mihai; Màrquez Villodre, Lluís
Domínguez Sal, David; Aguilar Saborit, Josep; Surdeanu, Mihai; Larriba Pey, Josep
Poveda Poveda, Jordi; Surdeanu, Mihai
Fuentes Fort, Maria; Rodríguez Hontoria, Horacio; Turmo Borras, Jorge; Ferrés Domènech, Daniel