<?xml version="1.0" encoding="UTF-8"?><?xml-stylesheet type="text/xsl" href="static/style.xsl"?><OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd"><responseDate>2026-04-14T03:25:10Z</responseDate><request verb="GetRecord" identifier="oai:www.recercat.cat:2117/103813" metadataPrefix="oai_dc">https://recercat.cat/oai/request</request><GetRecord><record><header><identifier>oai:recercat.cat:2117/103813</identifier><datestamp>2025-07-17T08:51:59Z</datestamp><setSpec>com_2072_1033</setSpec><setSpec>col_2072_452950</setSpec></header><metadata><oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:doc="http://www.lyncode.com/xoai" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
   <dc:title>Improving the robustness of the usual fbe-based asr front-end</dc:title>
   <dc:creator>Nadeu Camprubí, Climent</dc:creator>
   <dc:creator>Macho, D</dc:creator>
   <dc:creator>Hernando Pericás, Francisco Javier</dc:creator>
   <dc:contributor>Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions</dc:contributor>
   <dc:contributor>Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla</dc:contributor>
   <dc:subject>Àrees temàtiques de la UPC::Enginyeria de la telecomunicació</dc:subject>
   <dc:subject>Telecommunication</dc:subject>
   <dc:subject>Telecomunicació</dc:subject>
   <dc:description>All speech recognition systems require some form of signal representation that parametrically models the&#xd;
temporal evolution of the spectral envelope. Current parameterizations involve, either explicitly or implicitly, a&#xd;
set of energies from frequency bands which are often distributed in a mel scale. The computation of those filterbank&#xd;
energies (FBE) always includes smoothing of basic spectral measurements and non-linear amplitude&#xd;
compression. A variety of linear transformations are typically applied to this time-frequency representation prior&#xd;
to the Hidden Markov Model (HMM) pattern-matching stage of recognition. In the paper, we will discuss some&#xd;
robustness issues involved in both the computation of the FBEs and the posterior linear transformations,&#xd;
presenting alternative techniques that can improve robustness in additive noise conditions. In particular, the root&#xd;
non-linearity, a voicing-dependent FBE computation technique and a time&amp;frequency filtering (tiffing)&#xd;
technique will be considered. Recognition results for the Aurora database will be shown to illustrate the potential&#xd;
application of these alternatives techniques for enhancing the robustness of speech recognition systems.</dc:description>
   <dc:description>Peer Reviewed</dc:description>
   <dc:description>Postprint (published version)</dc:description>
   <dc:date>2000</dc:date>
   <dc:type>Conference report</dc:type>
   <dc:identifier>Nadeu, C., Macho, D., Hernando, J. Improving the robustness of the usual fbe-based asr front-end. A: Jornadas en Tecnología del Habla. "Las tecnologías del Habla". Sevilla: Mergablum, 2000, p. 1-20.</dc:identifier>
   <dc:identifier>84-95118-58-0</dc:identifier>
   <dc:identifier>https://hdl.handle.net/2117/103813</dc:identifier>
   <dc:language>spa</dc:language>
   <dc:rights>http://creativecommons.org/licenses/by-nc-nd/3.0/es/</dc:rights>
   <dc:rights>Open Access</dc:rights>
   <dc:rights>Attribution-NonCommercial-NoDerivs 3.0 Spain</dc:rights>
   <dc:format>20 p.</dc:format>
   <dc:format>application/pdf</dc:format>
   <dc:publisher>Mergablum</dc:publisher>
</oai_dc:dc></metadata></record></GetRecord></OAI-PMH>