ALPyNA: Acceleration of Loops in Python for Novel Architectures (ARRAY 2019)

Sat 22 - Wed 26 June 2019 Phoenix, Arizona, United States

Who

Dejice Jacob, Jeremy Singer

Track

ARRAY 2019

Time Zone

The program is currently displayed in (GMT-07:00) Tijuana, Baja California.

Use conference time zone: (GMT-07:00) Tijuana, Baja CaliforniaSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Sat 22 Jun 2019 16:00 - 16:30 at 106C - Session 5 Chair(s): Lenore Mullin

Abstract

We present ALPyNA, an automatic loop parallelization framework for Python, which analyzes data dependences within nested loops and dynamically generates CUDA kernels for GPU execution. The ALPyNA system applies classical dependence analysis techniques to discover and exploit potential parallelism. The skeletal structure of the dependence graph is determined statically; this is combined with type and bounds information discovered at runtime, to auto-generate high-performance kernels for offload to GPU. We demonstrate speedups of up to 1000x relative to the native CPython interpreter across four array-intensive numerical Python benchmarks. Performance improvement is related to iteration domain sizes and the complexity of the dependence graph. Nevertheless, this approach promises to bring the benefits of manycore parallelism to end-user developers.

Dejice JacobAuthor

Jeremy SingerAuthor

University of Glasgow

United Kingdom