Autor/a

Allen, Randy

Fecha de publicación

2023-03-08



Resumen

It is a surprise to many, but the frenetic activity spawned in ML and AI over the past decade has not been driven by theoretical advances. The fundamental underpinnings of current ML/AI approaches were published by Arthur Samuelson in 1952, and while there have certainly been improvements, the approach is basically the same. What has instead driven the activity has been available computational power. In 1952, computers would require months to evaluate a simple network. Over the last couple of decades, computational power has reached the point where interesting networks can be evaluated in useful time. “Useful” time is an interesting concept, but it is clear from the number of startups (and failed startups) striving to develop higher-performance lower-power AI accelerators that VCs and entrepreneurs believe that real future for AI involves achieving another 10- 100x acceleration. Whether that speedup is possible is an open question, but there are some definitive statements that that can be made on the question: - The problems that must be solved to achieve this speed up are not new problems. “Many have tried; none have succeeded.” - The solution, if there is one, will involve parallelism. - The solution, if there is one, will not be the development of a new, “magical” parallel hardware architecture. - Following the lessons of Linpack versus LAPACK, data reuse will be a key part of the solution. - Following lessons garnered from the vector world, compilermanaged memory (i.e. registers) will almost certainly be a key part of the solution. This talk will discuss these points and what they suggest about a possible solution to this problem. Not surprisingly, compilers will be a critical tenet of the solution.

Tipo de documento

Conference report

Lengua

Inglés

Publicado por

Barcelona Supercomputing Center

Citación recomendada

Esta citación se ha generado automáticamente.

Derechos

http://creativecommons.org/licenses/by-nc-nd/4.0/

Open Access

Attribution-NonCommercial-NoDerivatives 4.0 International

Este ítem aparece en la(s) siguiente(s) colección(ones)

Congressos [11156]