<?xml version="1.0" encoding="UTF-8"?><?xml-stylesheet type="text/xsl" href="static/style.xsl"?><OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd"><responseDate>2026-04-17T01:00:53Z</responseDate><request verb="GetRecord" identifier="oai:www.recercat.cat:2117/346628" metadataPrefix="mets">https://recercat.cat/oai/request</request><GetRecord><record><header><identifier>oai:recercat.cat:2117/346628</identifier><datestamp>2026-01-19T02:19:27Z</datestamp><setSpec>com_2072_1033</setSpec><setSpec>col_2072_452949</setSpec></header><metadata><mets xmlns="http://www.loc.gov/METS/" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:doc="http://www.lyncode.com/xoai" ID="&#xa;&#x9;&#x9;&#x9;&#x9;DSpace_ITEM_2117-346628" TYPE="DSpace ITEM" PROFILE="DSpace METS SIP Profile 1.0" xsi:schemaLocation="http://www.loc.gov/METS/ http://www.loc.gov/standards/mets/mets.xsd" OBJID="&#xa;&#x9;&#x9;&#x9;&#x9;hdl:2117/346628">
   <metsHdr CREATEDATE="2026-04-17T03:00:53Z">
      <agent ROLE="CUSTODIAN" TYPE="ORGANIZATION">
         <name>RECERCAT</name>
      </agent>
   </metsHdr>
   <dmdSec ID="DMD_2117_346628">
      <mdWrap MDTYPE="MODS">
         <xmlData xmlns:mods="http://www.loc.gov/mods/v3" xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-1.xsd">
            <mods:mods xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-1.xsd">
               <mods:name>
                  <mods:role>
                     <mods:roleTerm type="text">author</mods:roleTerm>
                  </mods:role>
                  <mods:namePart>Pou Mulet, Bartomeu</mods:namePart>
               </mods:name>
               <mods:name>
                  <mods:role>
                     <mods:roleTerm type="text">author</mods:roleTerm>
                  </mods:role>
                  <mods:namePart>Quiñones, Eduardo</mods:namePart>
               </mods:name>
               <mods:name>
                  <mods:role>
                     <mods:roleTerm type="text">author</mods:roleTerm>
                  </mods:role>
                  <mods:namePart>Martín Muñoz, Mario</mods:namePart>
               </mods:name>
               <mods:originInfo>
                  <mods:dateIssued encoding="iso8601">2021-05</mods:dateIssued>
               </mods:originInfo>
               <mods:identifier type="none"/>
               <mods:abstract>When planar wavefronts from distant stars traverse the&#xd;
atmosphere, they become distorted due to the atmosphere’s inhomogeneous&#xd;
temperature distribution. Adaptive Optics (AO)&#xd;
is the field in charge of correcting those distortions allowing&#xd;
high-quality observations of distant targets. The AO solution&#xd;
is composed of three main components: a deformable mirror&#xd;
(DM) that corrects the deformation in the wavefront, a&#xd;
wavefront sensor (WFS) that allows characterising the current&#xd;
turbulence in the wavefront and a real time controller (RTC)&#xd;
that issues commands to, via the deformation of the DM,&#xd;
correct the wavefront. Usually, the operations are performed&#xd;
on closed-loop with stringent real-time requirements (in the&#xd;
order of 103 􀀀 104 actions per second). At each iteration, the&#xd;
WFS observes the wavefront after being corrected by the DM&#xd;
and the RTC issues the commands to correct for the evolution&#xd;
of turbulence and previous uncorrected errors (Figure 1 left).&#xd;
One of the primary sources of error for an AO control&#xd;
algorithm is the temporal error. The delay between characterising&#xd;
the turbulence with the WFS and setting the desired&#xd;
commands in the DM creates the need that any successful&#xd;
control approach must take into account past commands and&#xd;
the probable evolution of the atmosphere in this gap of time.&#xd;
To do that, the most common approach in AO are variants&#xd;
of Linear Quadratic Gaussian (LQG) with Kalman filters with&#xd;
one of its initial iterations presented in [1]. Usually, a linear&#xd;
model of the system’s evolution is built with a set of parameters&#xd;
that are usually fitted based on observations or on theoretical&#xd;
assumptions, which limits the capability of the system to&#xd;
correct the turbulence.&#xd;
In this paper, we present a novel solution based on Reinforcement&#xd;
Learning (RL), based on a reward signal to be&#xd;
optimised, that does not need any previously built model (as&#xd;
LQG) and is non-linear. RL has been already applied in the&#xd;
domain of AO, however, it has been limited to WFS-less&#xd;
systems (e.g. [2]) or, more recently, to control a very limited&#xd;
number of actuators [3]. This work’s main practical objective&#xd;
is to be applied in the 8.2 m Subaru telescope (located in&#xd;
Hawaii), which includes thousands of actuators.&#xd;
B. AO Control: Integrator with gain</mods:abstract>
               <mods:language>
                  <mods:languageTerm authority="rfc3066"/>
               </mods:language>
               <mods:accessCondition type="useAndReproduction">Open Access</mods:accessCondition>
               <mods:subject>
                  <mods:topic>Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>High performance computing</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>Reinforcement Learning</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>Adaptive Optics</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>Nonlinear Control</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>Machine Learning</mods:topic>
               </mods:subject>
               <mods:subject>
                  <mods:topic>Càlcul intensiu (Informàtica)</mods:topic>
               </mods:subject>
               <mods:titleInfo>
                  <mods:title>Adaptive optics control with reinforcement learning: first steps</mods:title>
               </mods:titleInfo>
               <mods:genre>Conference report</mods:genre>
            </mods:mods>
         </xmlData>
      </mdWrap>
   </dmdSec>
   <structMap LABEL="DSpace Object" TYPE="LOGICAL">
      <div TYPE="DSpace Object Contents" ADMID="DMD_2117_346628"/>
   </structMap>
</mets></metadata></record></GetRecord></OAI-PMH>