<?xml version="1.0" encoding="UTF-8"?><?xml-stylesheet type="text/xsl" href="static/style.xsl"?><OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd"><responseDate>2026-04-19T19:13:52Z</responseDate><request verb="GetRecord" identifier="oai:www.recercat.cat:2117/103813" metadataPrefix="qdc">https://recercat.cat/oai/request</request><GetRecord><record><header><identifier>oai:recercat.cat:2117/103813</identifier><datestamp>2025-07-17T08:51:59Z</datestamp><setSpec>com_2072_1033</setSpec><setSpec>col_2072_452950</setSpec></header><metadata><qdc:qualifieddc xmlns:qdc="http://dspace.org/qualifieddc/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:dcterms="http://purl.org/dc/terms/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:doc="http://www.lyncode.com/xoai" xsi:schemaLocation="http://purl.org/dc/elements/1.1/ http://dublincore.org/schemas/xmls/qdc/2006/01/06/dc.xsd http://purl.org/dc/terms/ http://dublincore.org/schemas/xmls/qdc/2006/01/06/dcterms.xsd http://dspace.org/qualifieddc/ http://www.ukoln.ac.uk/metadata/dcmi/xmlschema/qualifieddc.xsd">
   <dc:title>Improving the robustness of the usual FBE-based ASR front-end</dc:title>
   <dc:creator>Nadeu Camprubí, Climent</dc:creator>
   <dc:creator>Macho, D</dc:creator>
   <dc:creator>Hernando Pericás, Francisco Javier</dc:creator>
   <dc:subject>Àrees temàtiques de la UPC::Enginyeria de la telecomunicació</dc:subject>
   <dc:subject>Telecommunication</dc:subject>
   <dc:subject>Telecomunicació</dc:subject>
   <dcterms:abstract>All speech recognition systems require some form of signal representation that parametrically models the
temporal evolution of the spectral envelope. Current parameterizations involve, either explicitly or implicitly, a
set of energies from frequency bands that are often distributed on a mel scale. The computation of those filterbank
energies (FBEs) always includes smoothing of basic spectral measurements and non-linear amplitude
compression. A variety of linear transformations are typically applied to this time-frequency representation prior
to the Hidden Markov Model (HMM) pattern-matching stage of recognition. In this paper, we discuss some
robustness issues involved in both the computation of the FBEs and the subsequent linear transformations,
presenting alternative techniques that can improve robustness in additive noise conditions. In particular, the root
non-linearity, a voicing-dependent FBE computation technique and a time&amp;frequency filtering (tiffing)
technique are considered. Recognition results for the Aurora database are shown to illustrate the potential
application of these alternative techniques for enhancing the robustness of speech recognition systems.</dcterms:abstract>
   <dcterms:abstract>Peer Reviewed</dcterms:abstract>
   <dcterms:abstract>Postprint (published version)</dcterms:abstract>
   <dcterms:issued>2000</dcterms:issued>
   <dc:type>Conference report</dc:type>
   <dc:rights>http://creativecommons.org/licenses/by-nc-nd/3.0/es/</dc:rights>
   <dc:rights>Open Access</dc:rights>
   <dc:rights>Attribution-NonCommercial-NoDerivs 3.0 Spain</dc:rights>
   <dc:publisher>Mergablum</dc:publisher>
</qdc:qualifieddc></metadata></record></GetRecord></OAI-PMH>