Skip to content

inducer/loopy

This branch is up to date with main.

Folders and files

NameName
Last commit message
Last commit date
Mar 25, 2022
Feb 12, 2025
Mar 29, 2016
Dec 4, 2024
Feb 12, 2025
Jan 31, 2025
Feb 20, 2025
Dec 19, 2024
Feb 20, 2025
Jun 23, 2020
Aug 2, 2024
Jan 8, 2025
Jul 9, 2024
Jun 26, 2022
Mar 11, 2022
Feb 16, 2024
Sep 23, 2018
Jul 18, 2024
Feb 19, 2025
Apr 9, 2015
Apr 25, 2021
Feb 18, 2025
Sep 1, 2022
Jun 28, 2022
Oct 14, 2022

Repository files navigation

Loopy: Transformation-Based Generation of High-Performance CPU/GPU Code

Gitlab Build Status Github Build Status Python Package Index Release Page Zenodo DOI for latest release

Loopy lets you easily generate the tedious, complicated code that is necessary to get good performance out of GPUs and multi-core CPUs. Loopy's core idea is that a computation should be described simply and then transformed into a version that gets high performance. This transformation takes place under user control, from within Python.

It can capture the following types of optimizations:

  • Vector and multi-core parallelism in the OpenCL/CUDA model
  • Data layout transformations (structure of arrays to array of structures)
  • Loop unrolling
  • Loop tiling with efficient handling of boundary cases
  • Prefetching/copy optimizations
  • Instruction level parallelism
  • and many more!

Loopy targets array-type computations, such as the following:

  • dense linear algebra,
  • convolutions,
  • n-body interactions,
  • PDE solvers, such as finite element, finite difference, and Fast-Multipole-type computations.

It is not (and does not want to be) a general-purpose programming language.

Loopy is licensed under the liberal MIT license and free for commercial, academic, and private use. All of Loopy's dependencies can be automatically installed from the package index after using:

pip install loopy

In addition, Loopy is compatible with and enhances pyopencl.

---

Places on the web related to Loopy: