High Performance Computing pp-distance algorithms to generate X-ray spectra from 3D models

dc.contributor.author
González, César
dc.contributor.author
Balocco, Simone
dc.contributor.author
Bosch, Jaume
dc.contributor.author
Haro, Juan Miguel de
dc.contributor.author
Paolini, Maurizio
dc.contributor.author
Filgueras, Antonio
dc.contributor.author
Álvarez, Carlos
dc.contributor.author
Pons, Ramon
dc.date.issued
2022-11-18T09:53:52Z
dc.date.issued
2022-11-18T09:53:52Z
dc.date.issued
2022-09-27
dc.date.issued
2022-11-18T09:53:52Z
dc.identifier
1661-6596
dc.identifier
https://hdl.handle.net/2445/191001
dc.identifier
725361
dc.description.abstract
X-ray crystallography is a powerful method that has significantly contributed to our understanding of the biological function of proteins and other molecules. This method relies on the production of crystals that, however, are usually a bottleneck in the process. For some molecules, no crystallization has been achieved or insufficient crystals were obtained. Some other systems do not crystallize at all, such as nanoparticles which, because of their dimensions, cannot be treated by the usual crystallographic methods. To solve this, whole pair distribution function has been proposed to bridge the gap between Bragg and Debye scattering theories. To execute a fitting, the spectra of several different constructs, composed of millions of particles each, should be computed using a particle-pair or particle-particle (pp) distance algorithm. Using this computation as a test bench for current field-programmable gate array (FPGA) technology, we evaluate how the parallel computation capability of FPGAs can be exploited to reduce the computation time. We present two different solutions to the problem using two state-of-the-art FPGA technologies. In the first one, the main C program uses OmpSs (a high-level programming model developed at the Barcelona Supercomputing Center, that enables task offload to different high-performance computing devices) for task invocation, and kernels are built with OpenCL using reduced data sizes to save transmission time. The second approach uses task and data parallelism to operate on data locally and update data globally in a decoupled task. Benchmarks have been evaluated over an Intel D5005 Programmable Acceleration Card, computing a model of 2 million particles in 81.57 s - 24.5 billion atom pairs per second (bapps)- and over a ZU102 in 115.31 s. In our last test, over an up-to-date Alveo U200 board, the computation lasted for 34.68 s (57.67 bapps). In this study, we analyze the results in relation to the classic terms of speed-up and efficiency and give hints for future improvements focused on reducing the global job time.
dc.format
application/pdf
dc.language
eng
dc.publisher
MDPI
dc.relation
Reproducció del document publicat a: https://doi.org/10.3390/ijms231911408
dc.relation
International Journal of Molecular Sciences, 2022, vol. 23, num. 19
dc.relation
https://doi.org/10.3390/ijms231911408
dc.rights
cc-by (c) González, César et al., 2022
dc.rights
https://creativecommons.org/licenses/by/4.0/
dc.rights
info:eu-repo/semantics/openAccess
dc.source
Articles publicats en revistes (Matemàtiques i Informàtica)
dc.subject
Raigs X
dc.subject
Algorismes computacionals
dc.subject
Espectroscòpia de raigs X
dc.subject
X-rays
dc.subject
Computer algorithms
dc.subject
X-ray spectroscopy
dc.title
High Performance Computing pp-distance algorithms to generate X-ray spectra from 3D models
dc.type
info:eu-repo/semantics/article
dc.type
info:eu-repo/semantics/publishedVersion


Fitxers en aquest element

FitxersGrandàriaFormatVisualització

No hi ha fitxers associats a aquest element.

Aquest element apareix en la col·lecció o col·leccions següent(s)