<?xml version="1.0" encoding="UTF-8"?><?xml-stylesheet type="text/xsl" href="static/style.xsl"?><OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd"><responseDate>2026-04-17T03:27:09Z</responseDate><request verb="GetRecord" identifier="oai:www.recercat.cat:2117/76382" metadataPrefix="oai_dc">https://recercat.cat/oai/request</request><GetRecord><record><header><identifier>oai:recercat.cat:2117/76382</identifier><datestamp>2026-01-26T01:32:15Z</datestamp><setSpec>com_2072_1033</setSpec><setSpec>col_2072_452949</setSpec></header><metadata><oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:doc="http://www.lyncode.com/xoai" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
   <dc:title>Parallel k nearest neighbor graph construction using tree-based data structures</dc:title>
   <dc:creator>Rajani, Nazneen</dc:creator>
   <dc:creator>McArdle, Kate</dc:creator>
   <dc:creator>Dhillon, Inderjit S.</dc:creator>
   <dc:subject>Àrees temàtiques de la UPC::Matemàtiques i estadística</dc:subject>
   <dc:subject>Algorithm</dc:subject>
   <dc:subject>Algorismes</dc:subject>
   <dc:description>Construction of a nearest neighbor graph is often a neces-&#xd;
sary step in many machine learning applications. However,&#xd;
constructing such a graph is computationally expensive, es-&#xd;
pecially when the data is high dimensional. Python's open&#xd;
source machine learning library Scikit-learn uses k-d trees&#xd;
and ball trees to implement nearest neighbor graph construc-&#xd;
tion. However, this implementation is ine cient for large&#xd;
datasets. In this work, we focus on exploiting these under-&#xd;
lying tree-based data structures to optimize parallel execu-&#xd;
tion of the nearest neighbor algorithm. We present parallel&#xd;
implementations of nearest neighbor graph construction us-&#xd;
ing such tree structures, with parallelism provided by the&#xd;
OpenMP and the Galois framework. We empirically show&#xd;
that our parallel and exact approach is e cient as well as&#xd;
scalable, compared to the Scikit-learn implementation. We&#xd;
present the  rst implementation of k-d trees and ball trees&#xd;
using Galois. Our results show that k-d trees are faster when&#xd;
the number of dimensions is small (2d   N); ball trees on&#xd;
the other hand scale well with the number of dimensions.&#xd;
Our implementation of ball trees in Galois has almost linear&#xd;
speedup on a number of datasets irrespective of the size and&#xd;
dimensionality of the data.</dc:description>
   <dc:date>2015</dc:date>
   <dc:type>Conference report</dc:type>
   <dc:identifier>Rajani, Nazneen; McArdle, Kate; Dhillon, Inderjit S. Parallel k nearest neighbor graph construction using tree-based data structures. A: HPGM: High Performance Graph Mining. "1st High Performance Graph Mining workshop, Sydney, 10 August 2015". 2015.</dc:identifier>
   <dc:identifier>https://hdl.handle.net/2117/76382</dc:identifier>
   <dc:identifier>10.5821/hpgm15.1</dc:identifier>
   <dc:language>eng</dc:language>
   <dc:relation>High Performance Graph Mining workshop (1st: 2015: Sydney)</dc:relation>
   <dc:rights>http://creativecommons.org/licenses/by-sa/3.0/es/</dc:rights>
   <dc:rights>Open Access</dc:rights>
   <dc:rights>Attribution-ShareAlike 3.0 Spain</dc:rights>
   <dc:format>8 p.</dc:format>
   <dc:format>application/pdf</dc:format>
   <dc:publisher>Barcelona Supercomputing Center</dc:publisher>
</oai_dc:dc></metadata></record></GetRecord></OAI-PMH>