<?xml version="1.0" encoding="UTF-8"?><?xml-stylesheet type="text/xsl" href="static/style.xsl"?><OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd"><responseDate>2026-04-13T02:36:22Z</responseDate><request verb="GetRecord" identifier="oai:www.recercat.cat:2117/116807" metadataPrefix="mets">https://recercat.cat/oai/request</request><GetRecord><record><header><identifier>oai:recercat.cat:2117/116807</identifier><datestamp>2026-01-14T06:22:55Z</datestamp><setSpec>com_2072_1033</setSpec><setSpec>col_2072_452950</setSpec></header><metadata><mets xmlns="http://www.loc.gov/METS/" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:doc="http://www.lyncode.com/xoai" ID="&#xa;&#x9;&#x9;&#x9;&#x9;DSpace_ITEM_2117-116807" TYPE="DSpace ITEM" PROFILE="DSpace METS SIP Profile 1.0" xsi:schemaLocation="http://www.loc.gov/METS/ http://www.loc.gov/standards/mets/mets.xsd" OBJID="&#xa;&#x9;&#x9;&#x9;&#x9;hdl:2117/116807">
   <metsHdr CREATEDATE="2026-04-13T04:36:22Z">
      <agent ROLE="CUSTODIAN" TYPE="ORGANIZATION">
         <name>RECERCAT</name>
      </agent>
   </metsHdr>
   <dmdSec ID="DMD_2117_116807">
      <mdWrap MDTYPE="MODS">
         <xmlData xmlns:mods="http://www.loc.gov/mods/v3" xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-1.xsd">
            <mods:mods xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-1.xsd">
               <mods:name>
                  <mods:role>
                     <mods:roleTerm type="text">author</mods:roleTerm>
                  </mods:role>
                  <mods:namePart>Garcia-Gasulla, Marta</mods:namePart>
               </mods:name>
               <mods:name>
                  <mods:role>
                     <mods:roleTerm type="text">author</mods:roleTerm>
                  </mods:role>
                  <mods:namePart>Houzeaux, Guillaume</mods:namePart>
               </mods:name>
               <mods:name>
                  <mods:role>
                     <mods:roleTerm type="text">author</mods:roleTerm>
                  </mods:role>
                  <mods:namePart>Ferrer, Roger</mods:namePart>
               </mods:name>
               <mods:name>
                  <mods:role>
                     <mods:roleTerm type="text">author</mods:roleTerm>
                  </mods:role>
                  <mods:namePart>Artigues, Antoni</mods:namePart>
               </mods:name>
               <mods:name>
                  <mods:role>
                     <mods:roleTerm type="text">author</mods:roleTerm>
                  </mods:role>
                  <mods:namePart>López, Victor</mods:namePart>
               </mods:name>
               <mods:name>
                  <mods:role>
                     <mods:roleTerm type="text">author</mods:roleTerm>
                  </mods:role>
                  <mods:namePart>Labarta Mancho, Jesús José</mods:namePart>
               </mods:name>
               <mods:name>
                  <mods:role>
                     <mods:roleTerm type="text">author</mods:roleTerm>
                  </mods:role>
                  <mods:namePart>Vázquez, Mariano</mods:namePart>
               </mods:name>
               <mods:originInfo>
                  <mods:dateIssued encoding="iso8601">2018</mods:dateIssued>
               </mods:originInfo>
               <mods:identifier type="none"/>
               <mods:abstract>The main computing tasks of a finite element code(FE) for solving partial differential equations (PDE's)&#xd;
are the algebraic system assembly and the iterative solver. This work focuses on the first task, in the context&#xd;
of a hybrid MPI+X paradigm. Although we will describe algorithms in the FE context, a similar strategy&#xd;
can be straightforwardly applied to other discretization methods, like the finite volume method.&#xd;
The matrix assembly consists of a loop over the elements of the MPI partition to compute element&#xd;
matrices and right-hand sides and their assemblies in the local system to each MPI partition. In a MPI+X&#xd;
hybrid parallelism context, X has consisted traditionally of loop parallelism using OpenMP. Several strate-&#xd;
gies have been proposed in the literature to implement this loop parallelism, like coloring or substructuring&#xd;
techniques to circumvent the race condition that appears when assembling the element system into the local&#xd;
system. The main drawback of the first technique is the decrease of the IPC due to bad spatial locality.&#xd;
The second technique avoids this issue but requires extensive changes in the implementation, which can&#xd;
be cumbersome when several element loops should be treated. We propose an alternative, based on the&#xd;
task parallelism of the element loop using some extensions to the OpenMP programming model. The task-&#xd;
ification of the assembly solves both aforementioned problems. In addition, dynamic load balance will be&#xd;
applied using the DLB library, especially efficient in the presence of hybrid meshes, where the relative costs&#xd;
of the different elements is impossible to estimate a priori. This paper presents the proposed methodology,&#xd;
its implementation and its validation through the solution of large computational mechanics problems up&#xd;
to 16k cores.The use of large part of a supercomputer, even more in normal conditions of use, is never an innocuous exercise. The research leading to these results has received funding from: the European Union's Horizon 2020 Programme (2014–2020) and from Brazilian Ministry of Science, Technology and Innovation through Rede Nacional de Pesquisa (RNP), HPC4E Project, grant agreement 689772; the Energy oriented Centre of Excellence (EoCoE), grant agreement number 676629, funded within the Horizon2020 framework of the European Union; The Spanish Government (grant SEV2015-0493 of the Severo Ochoa Program); the Spanish Ministry of Science and Innovation (contract TIN2015-65316-P); the Generalitat de Catalunya (contract 2014-SGR-1051); the Intel-BSC Exascale Lab collaboration project. Comissió Interdepartamental de Recerca i Innovació Tecnológica(Interdepartmental Commission for Research and Technological Innovation)SíPost-print (author's final draft)</mods:abstract>
               <mods:language>
                  <mods:languageTerm authority="rfc3066"/>
               </mods:language>
               <mods:accessCondition type="useAndReproduction">Open Access</mods:accessCondition>
               <mods:subject>
                  <mods:topic>Àrees temàtiques de la UPC::Informàtica</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>OpenMP</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>Finite element code (FE)</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>OpenMP</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>OpenMP</mods:topic>
               </mods:subject>
               <mods:titleInfo>
                  <mods:title>MPI+X: task-based parallelization and dynamic load balance of finite element assembly</mods:title>
               </mods:titleInfo>
               <mods:genre>Article</mods:genre>
            </mods:mods>
         </xmlData>
      </mdWrap>
   </dmdSec>
   <structMap LABEL="DSpace Object" TYPE="LOGICAL">
      <div TYPE="DSpace Object Contents" ADMID="DMD_2117_116807"/>
   </structMap>
</mets></metadata></record></GetRecord></OAI-PMH>