<?xml version="1.0" encoding="UTF-8"?><?xml-stylesheet type="text/xsl" href="static/style.xsl"?><OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd"><responseDate>2026-04-17T01:53:38Z</responseDate><request verb="GetRecord" identifier="oai:www.recercat.cat:2117/103813" metadataPrefix="marc">https://recercat.cat/oai/request</request><GetRecord><record><header><identifier>oai:recercat.cat:2117/103813</identifier><datestamp>2025-07-17T08:51:59Z</datestamp><setSpec>com_2072_1033</setSpec><setSpec>col_2072_452950</setSpec></header><metadata><record xmlns="http://www.loc.gov/MARC21/slim" xmlns:dcterms="http://purl.org/dc/terms/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:doc="http://www.lyncode.com/xoai" xsi:schemaLocation="http://www.loc.gov/MARC21/slim http://www.loc.gov/standards/marcxml/schema/MARC21slim.xsd">
   <leader>00925njm 22002777a 4500</leader>
   <datafield ind2=" " ind1=" " tag="042">
      <subfield code="a">dc</subfield>
   </datafield>
   <datafield ind2=" " ind1=" " tag="720">
      <subfield code="a">Nadeu Camprubí, Climent</subfield>
      <subfield code="e">author</subfield>
   </datafield>
   <datafield ind2=" " ind1=" " tag="720">
      <subfield code="a">Macho, D</subfield>
      <subfield code="e">author</subfield>
   </datafield>
   <datafield ind2=" " ind1=" " tag="720">
      <subfield code="a">Hernando Pericás, Francisco Javier</subfield>
      <subfield code="e">author</subfield>
   </datafield>
   <datafield ind2=" " ind1=" " tag="260">
      <subfield code="c">2000</subfield>
   </datafield>
   <datafield ind2=" " ind1=" " tag="520">
      <subfield code="a">All speech recognition systems require some form of signal representation that parametrically models the&#xd;
temporal evolution of the spectral envelope. Current parameterizations involve, either explicitly or implicitly, a&#xd;
set of energies from frequency bands which are often distributed in a mel scale. The computation of those filterbank&#xd;
energies (FBE) always includes smoothing of basic spectral measurements and non-linear amplitude&#xd;
compression. A variety of linear transformations are typically applied to this time-frequency representation prior&#xd;
to the Hidden Markov Model (HMM) pattern-matching stage of recognition. In the paper, we will discuss some&#xd;
robustness issues involved in both the computation of the FBEs and the posterior linear transformations,&#xd;
presenting alternative techniques that can improve robustness in additive noise conditions. In particular, the root&#xd;
non-linearity, a voicing-dependent FBE computation technique and a time&amp;frequency filtering (tiffing)&#xd;
technique will be considered. Recognition results for the Aurora database will be shown to illustrate the potential&#xd;
application of these alternatives techniques for enhancing the robustness of speech recognition systems.</subfield>
   </datafield>
   <datafield ind2=" " ind1=" " tag="520">
      <subfield code="a">Peer Reviewed</subfield>
   </datafield>
   <datafield ind2=" " ind1=" " tag="520">
      <subfield code="a">Postprint (published version)</subfield>
   </datafield>
   <datafield tag="653" ind2=" " ind1=" ">
      <subfield code="a">Àrees temàtiques de la UPC::Enginyeria de la telecomunicació</subfield>
   </datafield>
   <datafield tag="653" ind2=" " ind1=" ">
      <subfield code="a">Telecommunication</subfield>
   </datafield>
   <datafield tag="653" ind2=" " ind1=" ">
      <subfield code="a">Telecomunicació</subfield>
   </datafield>
   <datafield ind2="0" ind1="0" tag="245">
      <subfield code="a">Improving the robustness of the usual fbe-based asr front-end</subfield>
   </datafield>
</record></metadata></record></GetRecord></OAI-PMH>