<?xml version="1.0" encoding="UTF-8"?><?xml-stylesheet type="text/xsl" href="static/style.xsl"?><OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd"><responseDate>2026-04-13T05:54:25Z</responseDate><request verb="GetRecord" identifier="oai:www.recercat.cat:10256/26902" metadataPrefix="didl">https://recercat.cat/oai/request</request><GetRecord><record><header><identifier>oai:recercat.cat:10256/26902</identifier><datestamp>2025-06-13T04:05:34Z</datestamp><setSpec>com_2072_452955</setSpec><setSpec>com_2072_2054</setSpec><setSpec>col_2072_452957</setSpec></header><metadata><d:DIDL xmlns:d="urn:mpeg:mpeg21:2002:02-DIDL-NS" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:doc="http://www.lyncode.com/xoai" xsi:schemaLocation="urn:mpeg:mpeg21:2002:02-DIDL-NS http://standards.iso.org/ittf/PubliclyAvailableStandards/MPEG-21_schema_files/did/didl.xsd">
   <d:DIDLInfo>
      <dcterms:created xmlns:dcterms="http://purl.org/dc/terms/" xsi:schemaLocation="http://purl.org/dc/terms/ http://dublincore.org/schemas/xmls/qdc/dcterms.xsd">2025-06-13T04:05:34Z</dcterms:created>
   </d:DIDLInfo>
   <d:Item id="hdl_10256_26902">
      <d:Descriptor>
         <d:Statement mimeType="application/xml; charset=utf-8">
            <dii:Identifier xmlns:dii="urn:mpeg:mpeg21:2002:01-DII-NS" xsi:schemaLocation="urn:mpeg:mpeg21:2002:01-DII-NS http://standards.iso.org/ittf/PubliclyAvailableStandards/MPEG-21_schema_files/dii/dii.xsd">urn:hdl:10256/26902</dii:Identifier>
         </d:Statement>
      </d:Descriptor>
      <d:Descriptor>
         <d:Statement mimeType="application/xml; charset=utf-8">
            <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:dc="http://purl.org/dc/elements/1.1/" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
               <dc:title>Large-scale web tracking and cookie compliance: Evaluating one million websites under GDPR with AI categorization</dc:title>
               <dc:creator>Martínez Álvarez, David</dc:creator>
               <dc:creator>Molero Grau, Aniol</dc:creator>
               <dc:creator>Calle Ortega, Eusebi</dc:creator>
               <dc:creator>Canals Ametller, Dolors</dc:creator>
               <dc:creator>Jové, Albert</dc:creator>
               <dc:subject>Protecció de dades</dc:subject>
               <dc:subject>Intel·ligència artificial</dc:subject>
               <dc:subject>Data protection</dc:subject>
               <dc:subject>Artificial intelligence</dc:subject>
               <dc:subject>Internet -- Mesures de seguretat</dc:subject>
               <dc:subject>Internet -- Security measures</dc:subject>
               <dc:description>With the increasing prevalence of web-tracking technologies, including tracking cookies, pixel tracking, and browser fingerprinting techniques, there is a pressing need to analyze their impact on user privacy. Despite the growing interest in the scholarly literature, large-scale, fully automatic evaluations of website compliance with privacy regulations remain scarce. In this paper, we present new algorithms, methods, and an AI categorization model designed for massive, fully automatic analyses of web-tracking and cookie compliance and usage with and without valid user consent. Utilizing the recently published Website Evidence Collector (WEC) software from the European Data Protection Supervisor (EDPS), these algorithms are applied to assess over one million websites from Tranco's top list under European GDPR regulation. A novel 22-category multilabel AI model for website categorization provides content-based context to compliance results, achieving 96.56% accuracy and an F1 score of 0.963. Results reveal that nearly half of the websites utilize tracking cookies, while over half employ pixel tracking without user consent, thus highlighting significant differences between websites' content categories. Additionally, our analysis demonstrates how web-tracking power is concentrated among just a few companies, with the top 10 tracking firms being responsible for most compliance violations related to obtaining valid user consent. This paper serves as a foundation for ongoing large-scale web-tracking analyses, essential for understanding trends over time and evaluating the effectiveness of privacy regulations</dc:description>
               <dc:description>The University of Girona Institute of Informatics and Applications researchers thank the Generalitat de Catalunya for their support through a Consolidated Research Group (2021 SGR 01125). David Martínez thanks the University of Girona for his FI fellowship (IFUdG 46 2022)</dc:description>
               <dc:description>Open Access funding provided thanks to the CRUE-CSIC agreement with Elsevier</dc:description>
               <dc:date>2025-06-13T04:05:34Z</dc:date>
               <dc:date>2025-06-13T04:05:34Z</dc:date>
               <dc:date>2025-10</dc:date>
               <dc:type>info:eu-repo/semantics/article</dc:type>
               <dc:type>info:eu-repo/semantics/publishedVersion</dc:type>
               <dc:type>peer-reviewed</dc:type>
               <dc:identifier>http://hdl.handle.net/10256/26902</dc:identifier>
               <dc:relation>info:eu-repo/semantics/altIdentifier/doi/10.1016/j.jnca.2025.104222</dc:relation>
               <dc:relation>info:eu-repo/semantics/altIdentifier/issn/1084-8045</dc:relation>
               <dc:rights>Reconeixement 4.0 Internacional</dc:rights>
               <dc:rights>http://creativecommons.org/licenses/by/4.0</dc:rights>
               <dc:rights>info:eu-repo/semantics/openAccess</dc:rights>
               <dc:publisher>Elsevier</dc:publisher>
               <dc:source>Journal of Network and Computer Applications, 2025, vol. 242, núm. art.núm.104222</dc:source>
               <dc:source>Articles publicats (D-ATC)</dc:source>
               <dc:source>Martínez Álvarez, David Molero Grau, Aniol Calle Ortega, Eusebi Canals Ametller, Dolors Jové, Albert 2025 Large-scale web tracking and cookie compliance: Evaluating one million websites under GDPR with AI categorization Journal of Network and Computer Applications 242 art.núm.104222</dc:source>
            </oai_dc:dc>
         </d:Statement>
      </d:Descriptor>
   </d:Item>
</d:DIDL></metadata></record></GetRecord></OAI-PMH>