Per accedir als documents amb el text complet, si us plau, seguiu el següent enllaç:

Software trace cache
Ramírez Bellido, Alejandro; Larriba Pey, Josep; Valero Cortés, Mateo
Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors; Universitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions; Universitat Politècnica de Catalunya. DAMA-UPC - Data Management Group
We explore the use of compiler optimizations, which optimize the layout of instructions in memory. The target is to enable the code to make better use of the underlying hardware resources regardless of the specific details of the processor/architecture in order to increase fetch performance. The Software Trace Cache (STC) is a code layout algorithm with a broader target than previous layout optimizations. We target not only an improvement in the instruction cache hit rate, but also an increase in the effective fetch width of the fetch engine. The STC algorithm organizes basic blocks into chains trying to make sequentially executed basic blocks reside in consecutive memory positions, then maps the basic block chains in memory to minimize conflict misses in the important sections of the program. We evaluate and analyze in detail the impact of the STC, and code layout optimizations in general, on the three main aspects of fetch performance; the instruction cache hit rate, the effective fetch width, and the branch prediction accuracy. Our results show that layout optimized, codes have some special characteristics that make them more amenable for high-performance instruction fetch. They have a very high rate of not-taken branches and execute long chains of sequential instructions; also, they make very effective use of instruction cache lines, mapping only useful instructions which will execute close in time, increasing both spatial and temporal locality.
Peer Reviewed
Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors
Cache memory
Compilers (Computer programs)
Parallel processing (Electronic computers)
Pipeline processors
Instruction fetch
Compiler optimizations
Branch prediction
Trace cache
Memòria ràpida de treball (Informàtica)
Compiladors (Programes d'ordinador)
Processament en paral·lel (Ordinadors)

Mostra el registre complet del document

Documents relacionats

Altres documents del mateix autor/a

Ramírez Bellido, Alejandro; Santana Jaria, Oliverio J.; Larriba Pey, Josep; Valero Cortés, Mateo
Knijnenburg, Peter M.W.; Ramírez Bellido, Alejandro; Latorre Salinas, Fernando; Larriba Pey, Josep; Valero Cortés, Mateo
Ramírez Bellido, Alejandro; Larriba Pey, Josep; Valero Cortés, Mateo
Ramírez Bellido, Alejandro; Larriba Pey, Josep; Valero Cortés, Mateo
Ramírez Bellido, Alejandro; Barroso, Luiz A; Gharachorloo, Kourosh; Cohn, Robert; Larriba Pey, Josep; Lowney, P. Geoffrey; Valero Cortés, Mateo