<?xml version="1.0" encoding="UTF-8"?><?xml-stylesheet type="text/xsl" href="static/style.xsl"?><OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd"><responseDate>2026-04-17T06:42:44Z</responseDate><request verb="GetRecord" identifier="oai:www.recercat.cat:2117/374059" metadataPrefix="qdc">https://recercat.cat/oai/request</request><GetRecord><record><header><identifier>oai:recercat.cat:2117/374059</identifier><datestamp>2026-01-30T07:16:25Z</datestamp><setSpec>com_2072_1033</setSpec><setSpec>col_2072_452950</setSpec></header><metadata><qdc:qualifieddc xmlns:qdc="http://dspace.org/qualifieddc/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:dcterms="http://purl.org/dc/terms/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:doc="http://www.lyncode.com/xoai" xsi:schemaLocation="http://purl.org/dc/elements/1.1/ http://dublincore.org/schemas/xmls/qdc/2006/01/06/dc.xsd http://purl.org/dc/terms/ http://dublincore.org/schemas/xmls/qdc/2006/01/06/dcterms.xsd http://dspace.org/qualifieddc/ http://www.ukoln.ac.uk/metadata/dcmi/xmlschema/qualifieddc.xsd">
   <dc:title>OmpSs@cloudFPGA: An FPGA task-based programming model with message passing</dc:title>
   <dc:creator>De Haro Ruiz, Juan Miguel</dc:creator>
   <dc:creator>Cano, Rubén</dc:creator>
   <dc:creator>Álvarez Martínez, Carlos</dc:creator>
   <dc:creator>Jiménez González, Daniel</dc:creator>
   <dc:creator>Martorell Bofill, Xavier</dc:creator>
   <dc:creator>Ayguadé Parra, Eduard</dc:creator>
   <dc:creator>Labarta Mancho, Jesús José</dc:creator>
   <dc:creator>Abel, François</dc:creator>
   <dc:creator>Ringlein, Burkhard</dc:creator>
   <dc:creator>Weiss, Beat</dc:creator>
   <dc:subject>Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors::Arquitectures paral·leles</dc:subject>
   <dc:subject>Supercomputers -- Energy consumption</dc:subject>
   <dc:subject>Application program interfaces (Computer software)</dc:subject>
   <dc:subject>Parallel processing (Electronic computers)</dc:subject>
   <dc:subject>FPGA</dc:subject>
   <dc:subject>MPI</dc:subject>
   <dc:subject>OpenMP</dc:subject>
   <dc:subject>Programming models</dc:subject>
   <dc:subject>Network-attached FPGA</dc:subject>
   <dc:subject>Stand-alone FPGA</dc:subject>
   <dc:subject>High-level synthesis</dc:subject>
   <dc:subject>Heterogeneous programming</dc:subject>
   <dc:subject>High-performance computing</dc:subject>
   <dc:subject>Supercomputadors -- Consum d'energia</dc:subject>
   <dc:subject>Interfícies de programació d'aplicacions (Programari)</dc:subject>
   <dc:subject>Processament en paral·lel (Ordinadors)</dc:subject>
   <dcterms:abstract>Nowadays, a new parallel paradigm for energy-efficient heterogeneous hardware infrastructures is required to achieve better performance at a reasonable cost on high-performance computing applications. Under this new paradigm, some application parts are offloaded to specialized accelerators that run faster or are more energy-efficient than CPUs.&#xd;
Field-Programmable Gate Arrays (FPGA) are one of those types of accelerators that are becoming widely available in data centers.&#xd;
&#xd;
This paper proposes OmpSs@cloudFPGA, which includes novel extensions to parallel task-based programming models that enable easy and efficient programming of heterogeneous clusters with FPGAs.&#xd;
The programmer only needs to annotate, with OpenMP-like pragmas, the tasks of the application that should be accelerated in the cluster of FPGAs.&#xd;
Next, the proposed programming model framework automatically extracts parts annotated with High-Level Synthesis (HLS) pragmas and synthesizes them into hardware accelerator cores for FPGAs.&#xd;
Additionally, our extensions include and support two novel features: 1) FPGA-to-FPGA direct communication using a Message Passing Interface (MPI) similar Application Programming Interface (API) with one-to-one and collective communications to alleviate host communication channel bottleneck, and 2) creating and spawning work from inside the FPGAs to their own accelerator cores based on an MPI rank-like identification.&#xd;
These features break the classical host-accelerator model, where the host (typically the CPU) generates all the work and distributes it to each accelerator.&#xd;
&#xd;
We also present an evaluation of OmpSs@cloudFPGA for different parallel strategies of the N-Body application on the IBM cloudFPGA research platform.&#xd;
Results show that for cluster sizes up to 56 FPGAs, the performance scales linearly.&#xd;
To the best of our knowledge, this is the best performance obtained for N-body over FPGA platforms, reaching 344 Gpairs/s with 56 FPGAs.&#xd;
Finally, we compare the performance and power consumption of the proposed approach with the ones obtained by a classical execution on the MareNostrum 4 supercomputer, demonstrating that our FPGA approach reduces power consumption by an order of magnitude.</dcterms:abstract>
   <dcterms:abstract>This work has been done in the context of the IBM/BSC Deep Learning Center initiative. This work has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 754337 (EuroEXA), from Spanish Government (PID2019-107255GBC21/AEI/10.13039/501100011033), and from Generalitat de Catalunya (2017-SGR-1414 and 2017-SGR-1328).</dcterms:abstract>
   <dcterms:abstract>Peer Reviewed</dcterms:abstract>
   <dcterms:abstract>Postprint (author's final draft)</dcterms:abstract>
   <dcterms:issued>2022</dcterms:issued>
   <dc:type>Conference report</dc:type>
   <dc:relation>https://ieeexplore.ieee.org/document/9820636</dc:relation>
   <dc:relation>info:eu-repo/grantAgreement/EC/H2020/754337/EU/Co-designed Innovation and System for Resilient Exascale Computing in Europe: From Applications to Silicon/EuroEXA</dc:relation>
   <dc:relation>info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2019-107255GB-C21/ES/BSC - COMPUTACION DE ALTAS PRESTACIONES VIII/</dc:relation>
   <dc:rights>Open Access</dc:rights>
   <dc:publisher>Institute of Electrical and Electronics Engineers (IEEE)</dc:publisher>
</qdc:qualifieddc></metadata></record></GetRecord></OAI-PMH>