Efficient Dual-Numbers Reverse AD via Well-Known Program Transformations
Where dual-numbers forward-mode automatic differentiation (AD) pairs each scalar value with its tangent value, dual-numbers \emph{reverse-mode} AD attempts to achieve reverse AD with a similarly simple idea: pairing each scalar value with a backpropagator function. Its correctness and efficiency on higher-order input languages have been analysed by Brunel, Mazza and Pagani, but their analysis relied on a custom operational semantics, and it is unclear whether that semantics can be implemented efficiently. We take inspiration from their use of \emph{linear factoring} to optimise dual-numbers reverse-mode AD into an algorithm that has the correct complexity and enjoys an efficient implementation in a standard functional language with support for mutable arrays, such as Haskell. Aside from the linear factoring ingredient, our optimisation steps consist of well-known ideas from the functional programming community. We demonstrate the use of our technique by providing a practical implementation that differentiates most of Haskell98.
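To make the pairing idea concrete, the following is a minimal Haskell sketch of naive dual-numbers reverse AD for a function of a single input variable. The names (Dual, constD, varD, addD, mulD, derivative) are invented for this illustration; the sketch deliberately omits the linear-factoring optimisation and the mutable-array implementation that the paper contributes, so it does not attain the complexity the paper targets.

-- Naive dual-numbers reverse AD (illustrative only, not the paper's algorithm):
-- each scalar carries a backpropagator that maps the cotangent of this scalar
-- to the cotangent of the (single, for simplicity) input variable.
newtype Dual = Dual (Double, Double -> Double)

-- A constant contributes nothing to the input's cotangent.
constD :: Double -> Dual
constD x = Dual (x, \_ -> 0)

-- The input variable passes its cotangent straight through.
varD :: Double -> Dual
varD x = Dual (x, \d -> d)

-- Arithmetic pairs the primal operation with the chain rule.
addD :: Dual -> Dual -> Dual
addD (Dual (x, bx)) (Dual (y, by)) = Dual (x + y, \d -> bx d + by d)

mulD :: Dual -> Dual -> Dual
mulD (Dual (x, bx)) (Dual (y, by)) = Dual (x * y, \d -> bx (d * y) + by (d * x))

-- Differentiate a scalar-to-scalar function by seeding the result's
-- backpropagator with cotangent 1.
derivative :: (Dual -> Dual) -> Double -> Double
derivative f x = let Dual (_, bp) = f (varD x) in bp 1

-- Example: d/dx (x * (x + 3)) at x = 2 prints 7.0.
main :: IO ()
main = print (derivative (\x -> x `mulD` (x `addD` constD 3)) 2)

In this naive form a backpropagator may be invoked many times when an intermediate value is shared, which is the kind of blow-up that the paper's linear-factoring-based optimisation is designed to avoid.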
Thu 19 Jan, 15:10 - 16:25 (Eastern Time, US & Canada)

15:10 (25m, Talk): You Only Linearize Once: Tangents Transpose to Gradients. POPL. Alexey Radul (Google Research), Adam Paszke (Google Research), Roy Frostig (Google Research), Matthew J. Johnson (Google Research), Dougal Maclaurin (Google Research). DOI

15:35 (25m, Talk): Efficient Dual-Numbers Reverse AD via Well-Known Program Transformations. POPL. DOI | Pre-print

16:00 (25m, Talk): ADEV: Sound Automatic Differentiation of Expected Values of Probabilistic Programs (Distinguished Paper). POPL. Alexander K. Lew (Massachusetts Institute of Technology), Mathieu Huot (University of Oxford), Sam Staton (University of Oxford), Vikash K. Mansinghka (Massachusetts Institute of Technology). DOI | Pre-print