Breaking through the ML/AI computational barrier

dc.contributor.author
Allen, Randy
dc.date.accessioned
2026-01-14T02:13:51Z
dc.date.available
2026-01-14T02:13:51Z
dc.date.issued
2023-03-08
dc.identifier
Allen, R. Breaking through the ML/AI computational barrier. A: Severo Ochoa Research Seminars at BSC. «8th Severo Ochoa Research Seminar Lectures at BSC, Barcelona, 2022-23». Barcelona: Barcelona Supercomputing Center, 2023, p. 66-67.
dc.identifier
https://hdl.handle.net/2117/450311
dc.identifier.uri
http://hdl.handle.net/2117/450311
dc.description.abstract
It is a surprise to many, but the frenetic activity spawned in ML and AI over the past decade has not been driven by theoretical advances. The fundamental underpinnings of current ML/AI approaches were published by Arthur Samuelson in 1952, and while there have certainly been improvements, the approach is basically the same. What has instead driven the activity has been available computational power. In 1952, computers would require months to evaluate a simple network. Over the last couple of decades, computational power has reached the point where interesting networks can be evaluated in useful time. “Useful” time is an interesting concept, but it is clear from the number of startups (and failed startups) striving to develop higher-performance lower-power AI accelerators that VCs and entrepreneurs believe that real future for AI involves achieving another 10- 100x acceleration. Whether that speedup is possible is an open question, but there are some definitive statements that that can be made on the question: - The problems that must be solved to achieve this speed up are not new problems. “Many have tried; none have succeeded.” - The solution, if there is one, will involve parallelism. - The solution, if there is one, will not be the development of a new, “magical” parallel hardware architecture. - Following the lessons of Linpack versus LAPACK, data reuse will be a key part of the solution. - Following lessons garnered from the vector world, compilermanaged memory (i.e. registers) will almost certainly be a key part of the solution. This talk will discuss these points and what they suggest about a possible solution to this problem. Not surprisingly, compilers will be a critical tenet of the solution.
dc.format
2 p.
dc.format
application/pdf
dc.language
eng
dc.publisher
Barcelona Supercomputing Center
dc.rights
http://creativecommons.org/licenses/by-nc-nd/4.0/
dc.rights
Open Access
dc.rights
Attribution-NonCommercial-NoDerivatives 4.0 International
dc.subject
Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors
dc.subject
High performance computing
dc.subject
Càlcul intensiu (Informàtica)
dc.title
Breaking through the ML/AI computational barrier
dc.type
Conference report


Fitxers en aquest element

FitxersGrandàriaFormatVisualització

No hi ha fitxers associats a aquest element.

Aquest element apareix en la col·lecció o col·leccions següent(s)

Congressos [11159]