Mapping parallel loops on multicore systems

Inicio | ¿Qué es? | Contacto

English | Català

Consultar RECERCAT

Por comunidades y
colecciones Por fecha Por autores Por títulos Por temas (CDU)

Consultar departamento

Por fecha Por autores Por títulos Por temas (CDU)

Estadisticas

Del documento Todo RECERCAT

Mi RECERCAT

Entrar Alertas por correo-e

Directorio de otros repositorios

RECERCAT Principal > Universitat Politècnica de Catalunya > Documents de recerca > Visualizar documento

Para acceder a los documentos con el texto completo, por favor, siga el siguiente enlace: http://hdl.handle.net/2117/16116

Título:	Mapping parallel loops on multicore systems
Autor/a:	Tabik, Siham; Romero, Felipe; Utrera Iglesias, Gladys Miriam; Plata, Oscar
Otros autores:	Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors; Universitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions
Abstract:	The compute nodes in contemporary HPC systems contain one or more multicore processors. As a result, these nodes constitute a shared-memory multiprocessor, often combining CMP and SMT concurrency technologies. This configuration introduces different levels of sharing in the cache hierarchy, resulting in non-uniform data sharing overheads. In this paper we analyze the data-sharing patterns that exhibit a real multithreaded application when executing on a multicore system, with emphasis in the use of the shared last level cache (LLC) for the concurrent threads. As a consequence of this study, we explore the loop mapping problem in such systems with the aim of optimizing the shared use of the the LLC by all parallel threads. We propose a three-phase loop mapping strategy that deals with workload imbalances, minimizes cache sharing interferences, and maximizes intra-core and inter-core data reuse in the cache hierarchy. Preliminary results show some benefits of our approach. However, this is a work in progress and much more research is being done.
Materia(s):	-Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors -High performance computing -Multiprocessors -Càlcul intensiu (Informàtica) -Multiprocessadors
Derechos:	Attribution-NonCommercial-NoDerivs 3.0 Spain http://creativecommons.org/licenses/by-nc-nd/3.0/es/
Tipo de documento:	Artículo - Versión presentada Objeto de conferencia
Compartir:

Mostrar el registro completo del ítem

Documentos relacionados

Otros documentos del mismo autor/a

Tareador: a tool to unveil parallelization strategies at undergraduate level

Ayguadé Parra, Eduard; Badia Sala, Rosa Maria; Jiménez González, Daniel; Herrero Zaragoza, José Ramón; Labarta Mancho, Jesús José; Subotic, Vladimir; Utrera Iglesias, Gladys Miriam

Task Packing: Efficient task scheduling in unbalanced parallel programs to maximize CPU utilization

Utrera Iglesias, Gladys Miriam; Farreras Esclusa, Montse; Fornés de Juan, Jordi

Noise inspector tool

Utrera Iglesias, Gladys Miriam; Fornés de Juan, Jordi; Labarta Mancho, Jesús José

Analyzing the impact of communication imbalance in high-speed networks

Utrera Iglesias, Gladys Miriam; Gil, Marisa; Martorell Bofill, Xavier

Accesibilidad | Aviso legal | Política de Cookies | Documentos de uso interno

Coordinación

Patrocinio