<?xml version="1.0" encoding="UTF-8"?><?xml-stylesheet type="text/xsl" href="static/style.xsl"?><OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd"><responseDate>2026-04-13T06:59:25Z</responseDate><request verb="GetRecord" identifier="oai:www.recercat.cat:2117/428986" metadataPrefix="mets">https://recercat.cat/oai/request</request><GetRecord><record><header><identifier>oai:recercat.cat:2117/428986</identifier><datestamp>2025-07-23T06:04:35Z</datestamp><setSpec>com_2072_1033</setSpec><setSpec>col_2072_452951</setSpec></header><metadata><mets xmlns="http://www.loc.gov/METS/" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:doc="http://www.lyncode.com/xoai" ID="&#xa;&#x9;&#x9;&#x9;&#x9;DSpace_ITEM_2117-428986" TYPE="DSpace ITEM" PROFILE="DSpace METS SIP Profile 1.0" xsi:schemaLocation="http://www.loc.gov/METS/ http://www.loc.gov/standards/mets/mets.xsd" OBJID="&#xa;&#x9;&#x9;&#x9;&#x9;hdl:2117/428986">
   <metsHdr CREATEDATE="2026-04-13T08:59:25Z">
      <agent ROLE="CUSTODIAN" TYPE="ORGANIZATION">
         <name>RECERCAT</name>
      </agent>
   </metsHdr>
   <dmdSec ID="DMD_2117_428986">
      <mdWrap MDTYPE="MODS">
         <xmlData xmlns:mods="http://www.loc.gov/mods/v3" xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-1.xsd">
            <mods:mods xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-1.xsd">
               <mods:name>
                  <mods:role>
                     <mods:roleTerm type="text">author</mods:roleTerm>
                  </mods:role>
                  <mods:namePart>Vasabhaktula, Lokananda Hari Babu</mods:namePart>
               </mods:name>
               <mods:originInfo>
                  <mods:dateIssued encoding="iso8601">2025-01-28</mods:dateIssued>
               </mods:originInfo>
               <mods:identifier type="none"/>
               <mods:abstract>GPUs are designed to provide massive computational power to highly parallel threads. They have large register files to support this, which are also responsible for a significant portion of the GPU’s power consumption. The general trend in new GPUs released by vendors is that higher performance comes with higher power consumption. One useful metric to measure the energy efficiency of GPUs is the performance per watt (PPW), which indicates how much performance can be achieved for a single watt of power consumed. In this thesis, I propose strategies to improve this metric by tuning the occupancy and shrinking the register file size of the GPU. In GPUs, occupancy is the ratio of the number of active threads in the GPU to the maximum number of threads the hardware allows for a given time. The scheduling policies of the GPU limit the maximum number of threads possible. High occupancy generally means that threads can hide latency and thus obtain higher throughput. However, high occupancy does not always mean better performance. In this paper, I analyze the occupancy of threads in the GPU for different register file sizes for a collection of ten benchmark applications. I also quantify register file utilization and compare it to the performance and power consumption of the GPU. I show that high occupancy does not always mean better performance. The results prove that performance per watt can be improved simply by tuning the register file size, and the optimal size is significantly lower than the baseline architecture. With a reduction in the size of the register file from baseline 64KB to 40KB, we observe a power reduction of 7.13% with a performance degradation of just 1.2% on average. More improvements can be achieved with application-specific tuning of the register file size.</mods:abstract>
               <mods:language>
                  <mods:languageTerm authority="rfc3066"/>
               </mods:language>
               <mods:accessCondition type="useAndReproduction">Open Access</mods:accessCondition>
               <mods:subject>
                  <mods:topic>Àrees temàtiques de la UPC::Informàtica::Hardware</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>Graphics processing units</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>Computer files</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>GPU</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>VGPR</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>Fitxer de registres</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>Ocupació</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>Potència</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>Eficiència energètica</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>Rendiment</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>Optimització</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>Rendiment per watt</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>Pressió de registres</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>Assignació de registres</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>Restriccions de recursos</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>Arquitectura de Computadors</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>Register File</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>Occupancy</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>Power</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>Energy Efficiency</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>Performance</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>Optimization</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>Performance Per Watt</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>Register Pressure</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>Register Allocation</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>Resource Constraints</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>Computer Architecture</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>Processadors gràfics</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>Fitxers informàtics</mods:topic>
               </mods:subject>
               <mods:titleInfo>
                  <mods:title>Optimization of GPU architecture for power efficiency</mods:title>
               </mods:titleInfo>
               <mods:genre>Master thesis</mods:genre>
            </mods:mods>
         </xmlData>
      </mdWrap>
   </dmdSec>
   <structMap LABEL="DSpace Object" TYPE="LOGICAL">
      <div TYPE="DSpace Object Contents" ADMID="DMD_2117_428986"/>
   </structMap>
</mets></metadata></record></GetRecord></OAI-PMH>