<?xml version="1.0" encoding="UTF-8"?><?xml-stylesheet type="text/xsl" href="static/style.xsl"?><OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd"><responseDate>2026-04-17T18:51:23Z</responseDate><request verb="GetRecord" identifier="oai:www.recercat.cat:2117/98304" metadataPrefix="oai_dc">https://recercat.cat/oai/request</request><GetRecord><record><header><identifier>oai:recercat.cat:2117/98304</identifier><datestamp>2026-02-02T10:20:51Z</datestamp><setSpec>com_2072_1033</setSpec><setSpec>col_2072_452950</setSpec></header><metadata><oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:doc="http://www.lyncode.com/xoai" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
   <dc:title>Scaling DBSCAN-like algorithms for event detection systems in Twitter</dc:title>
   <dc:creator>Capdevila Pujol, Joan</dc:creator>
   <dc:creator>Pericacho, Gonzalo</dc:creator>
   <dc:creator>Torres Viñals, Jordi</dc:creator>
   <dc:creator>Cerquides, Jesús</dc:creator>
   <dc:contributor>Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors</dc:contributor>
   <dc:contributor>Barcelona Supercomputing Center</dc:contributor>
   <dc:contributor>Universitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions</dc:contributor>
   <dc:subject>Àrees temàtiques de la UPC::Informàtica::Informàtica teòrica::Algorísmica i teoria de la complexitat</dc:subject>
   <dc:subject>Data mining</dc:subject>
   <dc:subject>Social networks</dc:subject>
   <dc:subject>Event detection</dc:subject>
   <dc:subject>Parallel algorithm</dc:subject>
   <dc:subject>Data clustering</dc:subject>
   <dc:subject>DBSCAN</dc:subject>
   <dc:subject>MapReduce</dc:subject>
   <dc:subject>Apache Spark</dc:subject>
   <dc:subject>Twitter</dc:subject>
   <dc:subject>Mineria de dades</dc:subject>
   <dc:subject>Xarxes socials</dc:subject>
   <dc:description>The increasing use of mobile social networks has lately transformed news media. Real-world events are nowadays reported in social networks much faster than in traditional channels. As a result, the autonomous detection of events from networks like Twitter has gained lot of interest in both research and media groups. DBSCAN-like algorithms constitute a well-known clustering approach to retrospective event detection. However, scaling such algorithms to geographically large regions and temporarily long periods present two major shortcomings. First, detecting real-world events from the vast amount of tweets cannot be performed anymore in a single machine. Second, the tweeting activity varies a lot within these broad space-time regions limiting the use of global parameters. Against this background, we propose to scale DBSCAN-like event detection techniques by parallelizing and distributing them through a novel density-aware MapReduce scheme. The proposed scheme partitions tweet data as per its spatial and temporal features and tailors local DBSCAN parameters to local tweet densities. We implement the scheme in Apache Spark and evaluate its performance in a dataset composed of geo-located tweets in the Iberian peninsula during the course of several football matches. The results pointed out to the benefits of our proposal against other state-of-the-art techniques in terms of speed-up and detection accuracy.</dc:description>
   <dc:description>Peer Reviewed</dc:description>
   <dc:description>Postprint (author's final draft)</dc:description>
   <dc:date>2016-11-25</dc:date>
   <dc:type>Part of book or chapter of book</dc:type>
   <dc:identifier>Capdevila, J., Pericacho, G., Torres, J., Cerquides, J. Scaling DBSCAN-like algorithms for event detection systems in Twitter. A: "Algorithms and architectures for parallel processing: 16th International Conference, ICA3PP 2016: Granada, Spain, December 14-16, 2016: proceedings". Berlín: Springer, 2016, p. 356-373.</dc:identifier>
   <dc:identifier>978-3-319-49582-8</dc:identifier>
   <dc:identifier>https://hdl.handle.net/2117/98304</dc:identifier>
   <dc:identifier>10.1007/978-3-319-49583-5_27</dc:identifier>
   <dc:language>eng</dc:language>
   <dc:relation>http://link.springer.com/chapter/10.1007/978-3-319-49583-5_27</dc:relation>
   <dc:relation>info:eu-repo/grantAgreement/MINECO//TIN2015-65316-P/ES/COMPUTACION DE ALTAS PRESTACIONES VII/</dc:relation>
   <dc:relation>info:eu-repo/grantAgreement/MINECO//SEV-2015-0493/ES/BARCELONA SUPERCOMPUTING CENTER - CENTRO. NACIONAL DE SUPERCOMPUTACION/</dc:relation>
   <dc:rights>Open Access</dc:rights>
   <dc:format>18 p.</dc:format>
   <dc:format>application/pdf</dc:format>
   <dc:publisher>Springer</dc:publisher>
</oai_dc:dc></metadata></record></GetRecord></OAI-PMH>