dc.contributor.author
Lotfi-Kamran, Pejman
dc.date.accessioned
2025-12-14T02:42:36Z
dc.date.available
2025-12-14T02:42:36Z
dc.date.issued
2021-01-18
dc.identifier
Lotfi-Kamran, P. Virtual BSC RS: Divide and Conquer Frontend Bottleneck. A: Severo Ochoa Research Seminars at BSC. «7th Severo Ochoa Research Seminar Lectures at BSC, Barcelona, 2020-21». 7 th. Barcelona: Severo Ochoa Research Seminars at BSC, 2021, p. 19-20.
dc.identifier
https://hdl.handle.net/2117/449094
dc.identifier.uri
http://hdl.handle.net/2117/449094
dc.description.abstract
The frontend stalls caused by instruction and BTB misses are a significant source of performance degradation. Server processors commonly use prefetchers to mitigate the frontend bottleneck. However, next-line prefetchers, which are available in server processors, are incapable of eliminating many L1 instruction misses. Temporal prefetchers, on the other hand, eliminate most of the misses but impose significant area overhead. Finally, while BTB-directed prefetchers offer low area overhead, as they rely on the BTB content for prefetching, BTB misses stall the prefetcher, which likely leads to costly instruction misses. In this talk, I present a divide-and-conquer approach to address the frontend bottleneck. The proposal, named SN4L+Dis+BTB, imposes the same area overhead as the state-of-the-art BTB-directed prefetcher, and at the same time, outperforms it by 5% on average and up to 16%.
dc.format
application/pdf
dc.publisher
Severo Ochoa Research Seminars at BSC
dc.rights
http://creativecommons.org/licenses/by-nc-nd/4.0/
dc.rights
Attribution-NonCommercial-NoDerivatives 4.0 International
dc.subject
Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors
dc.subject
High performance computing
dc.subject
Càlcul intensiu (Informàtica)
dc.title
Virtual BSC RS: Divide and Conquer Frontend Bottleneck
dc.type
Conference report