diff --git a/.gitignore b/.gitignore index 894a44cc06..6c7fef03eb 100644 --- a/.gitignore +++ b/.gitignore @@ -102,3 +102,60 @@ venv.bak/ # mypy .mypy_cache/ + +# Windows +Thumbs.db + +# Ignore files built by Visual Studio [code] +*.obj +*.exe +*.pdb +*.user +*.dot +*.jpg +*.aps +*.pch +*.vspscc +*_i.c +*_p.c +*.ncb +*.suo +*.tlb +*.tlh +*.bak +*.cache +*.ilk +[Bb]in +[Dd]ebug*/ +*.lib +*.sbr +obj/ +[Rr]elease*/ +_ReSharper*/ +[Tt]est[Rr]esult* +.vs/ +.vscode/ +src.VC.db +src.VC.VC.opendb +*.exp + +# DaCe +.dacecache/ +out.sdfg +*.dot +*.out +results.log +perf.json +perf*.csv +/dace/frontend/octave/parsetab.py + +# Xilinx +xilinx_vcu1525_* +sdaccel_profile_* +sdaccel_timeline_* + +# NVIDIA +*.nvprof + +# Miscellaneous +*~ diff --git a/.gitmodules b/.gitmodules new file mode 100644 index 0000000000..bebc84450f --- /dev/null +++ b/.gitmodules @@ -0,0 +1,10 @@ +[submodule "dace/external/cub"] + path = dace/external/cub + url = https://github.com/NVlabs/cub.git + branch = 1.8.0 +[submodule "dace/external/moodycamel"] + path = dace/external/moodycamel + url = https://github.com/cameron314/concurrentqueue.git +[submodule "dace/external/hlslib"] + path = dace/external/hlslib + url = https://github.com/definelicht/hlslib.git diff --git a/LICENSE b/LICENSE index 30b39653b3..a1997075fc 100644 --- a/LICENSE +++ b/LICENSE @@ -1,6 +1,6 @@ BSD 3-Clause License -Copyright (c) 2019, SPCL +Copyright (c) 2019, Scalable Parallel Computing Lab, ETH Zurich All rights reserved. Redistribution and use in source and binary forms, with or without diff --git a/README.md b/README.md index ccc37d3460..b1947d00b6 100644 --- a/README.md +++ b/README.md @@ -1,2 +1,90 @@ -# dace -DaCe - Data Centric Parallel Programming +![D](dace.svg)aCe - Data-Centric Parallel Programming +===================================================== + +_Decoupling domain science from performance optimization._ + +DaCe compiles code in various programming languages and paradigms (Python/Numpy, MATLAB, TensorFlow) and maps it efficiently to **CPUs, GPUs, and FPGAs** with high utilization, on par with the state-of-the-art. The key feature driving DaCe is its Stateful DataFlow multiGraph (SDFG) *data-centric intermediate representation*: A transformable, interactive representation of code based on data movement. +With data-centric parallel programming, we enable **direct knowledge transfer** of performance optimization, regardless of the scientific application or the target processor. + +DaCe can be written inline in Python and transformed in the command-line, or SDFGs can be interactively modified using the Data-centric Interactive Optimization Development Environment (DIODE). + +For more information, see our [paper](http://www.arxiv.org/abs/1902.10345). 
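+
+As a minimal sketch of a DaCe program written inline in Python (the names, sizes,
+and exact explicit-dataflow syntax here are illustrative; see the tutorials below
+for the authoritative version):
+
+```python
+import numpy as np
+import dace
+
+N = dace.symbol('N')  # symbolic size, resolved when the program is called
+
+@dace.program
+def axpy(A: dace.float64, X: dace.float64[N], Y: dace.float64[N]):
+    # A map creates N parallel tasklet instances; << and >> declare the
+    # data movement (memlets) into and out of each tasklet.
+    @dace.map(_[0:N])
+    def multiply_add(i):
+        in_x << X[i]
+        in_y << Y[i]
+        out >> Y[i]
+        out = A * in_x + in_y
+
+# One possible invocation: keyword arguments (including symbol values) are
+# accepted by the compiled-SDFG wrapper in dace/codegen/compiler.py.
+x, y = np.random.rand(1024), np.random.rand(1024)
+axpy(A=2.0, X=x, Y=y, N=1024)
+```
+
+Calling the decorated function generates an SDFG, optionally runs it through the
+optimizer interface described under Configuration below, and compiles the result
+into a shared library that is loaded back into Python.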
+
+Tutorials
+---------
+
+* _Implicit Dataflow in Python (coming soon)_
+* [Explicit Dataflow in Python](tutorials/explicit.ipynb)
+* [SDFG API](tutorials/sdfg_api.ipynb)
+* [Transformations](tutorials/transformations.ipynb)
+
+Installation and Dependencies
+-----------------------------
+
+To install: `pip install dace`
+
+Runtime dependencies:
+ * A C++14-capable compiler (e.g., gcc 5.3+)
+ * Python 3.5 or newer
+
+Running DIODE may require additional dependencies:
+ * `sudo apt-get install libgtksourceviewmm-3.0-dev libyaml-dev`
+ * `sudo apt-get install python3-cairo python3-gi-cairo libgirepository1.0-dev xdot libwebkitgtk-dev libwebkitgtk-3.0-dev libwebkit2gtk-4.0-dev`
+ * `pip install pygobject matplotlib`
+
+To run DIODE on Windows, use MSYS2:
+ * Download from http://www.msys2.org/
+ * In the MSYS2 console, install all dependencies: `pacman -S mingw-w64-i686-gtk3 mingw-w64-i686-python2-gobject mingw-w64-i686-python3-gobject mingw-w64-i686-python3-cairo mingw-w64-i686-python3-pip mingw-w64-i686-gtksourceviewmm3 mingw-w64-i686-gcc mingw-w64-i686-boost mingw-w64-i686-python3-numpy mingw-w64-i686-python3-scipy mingw-w64-i686-python3-matplotlib`
+ * Update MSYS2: `pacman -Syu`, close and restart MSYS2, then run `pacman -Su` to update the rest of the packages.
+
+Publication
+-----------
+
+If you use DaCe, cite us:
+```bibtex
+@article{dace,
+ author = {Ben-Nun, Tal and de Fine Licht, Johannes and Ziogas, Alexandros Nikolaos and Schneider, Timo and Hoefler, Torsten},
+ title = {Stateful Dataflow Multigraphs: A Data-Centric Model for High-Performance Parallel Programs},
+ journal = {CoRR},
+ volume = {abs/1902.10345},
+ year = {2019},
+ url = {http://arxiv.org/abs/1902.10345},
+ archivePrefix = {arXiv},
+ eprint = {1902.10345}
+}
+```
+
+Configuration
+-------------
+
+DaCe creates a file called `.dace.conf` in the user's home directory. It provides useful settings that can be modified either directly in the file (YAML), within DIODE, or overridden on a case-by-case basis using environment variables that begin with `DACE_` and specify the setting (where categories are separated by underscores). The full configuration schema is located [here](dace/config_schema.yml).
+
+Useful environment variable configurations include:
+
+* `DACE_CONFIG` (default: `~/.dace.conf`): Overrides the DaCe configuration file choice.
+
+Context configuration:
+ * `DACE_use_cache` (default: False): Uses the DaCe program cache instead of re-optimizing and recompiling programs.
+ * `DACE_debugprint` (default: True): Prints debugging information.
+
+CPU target configuration:
+ * `DACE_compiler_cpu_executable` (default: g++): Chooses the default C++ compiler for CPU code.
+ * `DACE_compiler_cpu_additional_args` (default: None): Additional compiler flags (separated by spaces).
+
+SDFG processing:
+ * `DACE_optimizer_interface` (default: `dace.transformation.optimizer.SDFGOptimizer`): Controls the SDFG optimization process. If empty or if the class name is invalid, the optimization process is skipped. By default, the transformation command-line interface is used.
+ * `DACE_optimizer_visualize` (default: False): Visualizes the optimization process by saving .dot (GraphViz) files after each pattern replacement.
+
+Profiling:
+ * `DACE_profiling` (default: False): Enables profiling measurement of the DaCe program runtime in milliseconds. Produces a log file and prints out the median runtime.
+ * `DACE_treps` (default: 100): Number of repetitions to run a DaCe program when profiling is enabled.
+
+
+Contributing
+------------
+DaCe is an open-source project.
We are happy to accept Pull Requests with your contributions! + +License +------- +DaCe is published under the New BSD license, see LICENSE. + diff --git a/dace.svg b/dace.svg new file mode 100644 index 0000000000..c744ac88de --- /dev/null +++ b/dace.svg @@ -0,0 +1,84 @@ + + + + + + + image/svg+xml + + + + + + + + + + + + + diff --git a/dace/__init__.py b/dace/__init__.py new file mode 100644 index 0000000000..3e0aa6f8dc --- /dev/null +++ b/dace/__init__.py @@ -0,0 +1,14 @@ +from .types import * + +# Python frontend +from .frontend.python.decorators import * +from .frontend.python.ndarray import * +from .frontend.python.ndloop import ndrange +from .frontend.python.simulator import simulate + +from .config import Config +from .frontend.operations import * +from .sdfg import compile, SDFG, SDFGState +from .memlet import Memlet, EmptyMemlet +from .graph.edges import InterstateEdge +from .symbolic import symbol, eval diff --git a/dace/codegen/CMakeLists.txt b/dace/codegen/CMakeLists.txt new file mode 100644 index 0000000000..916e42eae9 --- /dev/null +++ b/dace/codegen/CMakeLists.txt @@ -0,0 +1,315 @@ +cmake_minimum_required(VERSION 2.8.12) +project(dace_program) + +# General options +set(DACE_PROGRAM_NAME "dace_program" CACHE STRING "Name of DaCe program") +set(DACE_FILES "" CACHE STRING "Host code files") +set(DACE_LIBS "" CACHE STRING "Extra libraries") +set(HLSLIB_PART_NAME "${DACE_XILINX_PART_NAME}") + +# Allow passing flags to various stages of Xilinx compilation process +set(DACE_XILINX_MODE "simulation" CACHE STRING "Type of compilation/execution [simulation/software_emulation/hardware_emulation/hardware].") +set(DACE_XILINX_HOST_FLAGS "" CACHE STRING "Extra flags to host code") +set(DACE_XILINX_SYNTHESIS_FLAGS "" CACHE STRING "Extra flags for performing high-level synthesis") +set(DACE_XILINX_BUILD_FLAGS "" CACHE STRING "Extra flags to xocc build phase") +set(DACE_XILINX_TARGET_CLOCK 200 CACHE STRING "Target clock frequency of FPGA kernel") +set(DACE_XILINX_PART_NAME "xcvu9p-fsgd2104-2l-e" CACHE STRING "Xilinx chip to target from HLS") +set(DACE_XILINX_TARGET_PLATFORM "xilinx_vcu1525_dynamic_5_1" CACHE STRING "SDAccel platform to target") +set(DACE_XILINX_ENABLE_DEBUGGING OFF CACHE STRING "Inject debugging cores to kernel build (always on for simulation/emulation)") + +# Target detection +set(DACE_ENABLE_MPI OFF) +set(DACE_ENABLE_CUDA OFF) +set(DACE_ENABLE_XILINX OFF) + +# Split list by target +foreach(DACE_FILE ${DACE_FILES}) + # Extract the target from the folder name + get_filename_component(DACE_FILE_NAME ${DACE_FILE} NAME_WE) + get_filename_component(DACE_FILE_TARGET ${DACE_FILE} DIRECTORY) + get_filename_component(DACE_FILE_TARGET ${DACE_FILE_TARGET} NAME) + if(${DACE_FILE_TARGET} STREQUAL "cuda") + set(DACE_ENABLE_CUDA ON) + set(DACE_CUDA_FILES ${DACE_CUDA_FILES} ${DACE_FILE}) + elseif(${DACE_FILE_TARGET} STREQUAL "xilinx") + set(DACE_ENABLE_XILINX ON) + if(DACE_FILE_NAME MATCHES ".+_host") + set(DACE_XILINX_HOST_FILES ${DACE_XILINX_HOST_FILES} ${DACE_FILE}) + else() + set(DACE_XILINX_KERNEL_FILES ${DACE_XILINX_KERNEL_FILES} ${DACE_FILE}) + endif() + elseif(${DACE_FILE_TARGET} STREQUAL "mpi") + set(DACE_ENABLE_MPI ON) + set(DACE_CPP_FILES ${DACE_CPP_FILES} ${DACE_FILE}) + else() + set(DACE_CPP_FILES ${DACE_CPP_FILES} ${DACE_FILE}) + endif() +endforeach() + +# Internal dependencies +set(DACE_RUNTIME_DIR ${CMAKE_SOURCE_DIR}/../runtime) +include_directories(${DACE_RUNTIME_DIR}/include) + +# External dependencies +find_package(Threads REQUIRED) +find_package(OpenMP REQUIRED 
COMPONENTS CXX) +file(TO_NATIVE_PATH "${CMAKE_BINARY_DIR}/" DACE_BINARY_DIR) +set(CMAKE_CXX_FLAGS "${CMAKE_CXX_FLAGS} ${OpenMP_CXX_FLAGS} -DDACE_BINARY_DIR='\"${DACE_BINARY_DIR}\"'") +set(DACE_LIBS ${DACE_LIBS} ${CMAKE_THREAD_LIBS_INIT} ${OpenMP_CXX_LIBRARIES}) +if(DACE_ENABLE_MPI) + find_package(MPI REQUIRED) + include_directories(${MPI_CXX_INCLUDE_PATH}) + set(DACE_LIBS ${DACE_LIBS} ${MPI_CXX_LIBRARIES}) +endif() +if(DACE_ENABLE_CUDA) + find_package(CUDA REQUIRED) + set(CUDA_PROPAGATE_HOST_FLAGS OFF) + include_directories(${CUDA_INCLUDE_DIRS}) + set(DACE_LIBS ${DACE_LIBS} ${CUDA_LIBRARIES}) + add_definitions(-DWITH_CUDA) +endif() +if(DACE_ENABLE_XILINX) + set(DACE_HLSLIB_DIR ${CMAKE_SOURCE_DIR}/../external/hlslib) + set(CMAKE_MODULE_PATH ${CMAKE_MODULE_PATH} ${DACE_HLSLIB_DIR}/cmake) + find_package(SDAccel REQUIRED) + + include_directories(SYSTEM ${SDAccel_INCLUDE_DIRS} ${DACE_HLSLIB_DIR}/include) + add_definitions(-DDACE_XILINX) + set(DACE_LIBS ${DACE_LIBS} ${SDAccel_LIBRARIES}) + +endif() + +# Create CUDA object files +if(DACE_ENABLE_CUDA) + # Get local CUDA architectures + if (NOT DEFINED LOCAL_CUDA_ARCHITECTURES) + execute_process(COMMAND "${CUDA_NVCC_EXECUTABLE}" "--run" + "${CMAKE_SOURCE_DIR}/tools/get_cuda_arch.cpp" + OUTPUT_VARIABLE _arch_out RESULT_VARIABLE _arch_res + ERROR_QUIET OUTPUT_STRIP_TRAILING_WHITESPACE) + + if(_arch_res EQUAL 0) + string(REGEX REPLACE "\n" ";" _arch_out "${_arch_out}") + list(GET _arch_out -1 _local_arch) + string(REGEX REPLACE " " ";" _local_arch "${_local_arch}") + set(LOCAL_CUDA_ARCHITECTURES "${_local_arch}" CACHE STRING "Detected local GPUs for compilation") + message("-- Local CUDA architectures detected: ${LOCAL_CUDA_ARCHITECTURES}") + else() + set(LOCAL_CUDA_ARCHITECTURES "" CACHE STRING "Detected local GPUs for compilation") + message("-- No local CUDA-capable GPUs found") + endif() + endif() + + # Add flags to compile for local CUDA architectures + foreach(var ${LOCAL_CUDA_ARCHITECTURES}) + list(APPEND CUDA_NVCC_FLAGS -gencode arch=compute_${var},code=sm_${var}) + endforeach() + + cuda_include_directories(${DACE_RUNTIME_DIR}/include) + cuda_compile(DACE_CUDA_OBJECTS ${DACE_CUDA_FILES}) + set(DACE_OBJECTS ${DACE_OBJECTS} ${DACE_CUDA_OBJECTS}) +endif() # DACE_ENABLE_CUDA + +# Create Xilinx object files +if(DACE_ENABLE_XILINX) + if((NOT (DACE_XILINX_MODE STREQUAL "hardware")) OR DACE_XILINX_ENABLE_DEBUGGING) + set(DACE_XILINX_HOST_FLAGS "${DACE_XILINX_HOST_FLAGS} -g") + set(DACE_XILINX_SYNTHESIS_FLAGS "${DACE_XILINX_SYNTHESIS_FLAGS} -g") + endif() + + set_source_files_properties(${DACE_XILINX_KERNEL_FILES} ${DACE_XILINX_HOST_FILES} PROPERTIES COMPILE_FLAGS "${DACE_XILINX_HOST_FLAGS}") + set_source_files_properties(${DACE_XILINX_KERNEL_FILES} PROPERTIES COMPILE_FLAGS "-DDACE_XILINX_DEVICE_CODE ${DACE_XILINX_HOST_FLAGS}") + set(DACE_OBJECTS ${DACE_OBJECTS} ${DACE_XILINX_KERNEL_FILES} ${DACE_XILINX_HOST_FILES}) + + if(((${SDAccel_MAJOR_VERSION} LESS 2018) AND + (${SDAccel_MINOR_VERSION} LESS 3)) OR ${SDAccel_MAJOR_VERSION} LESS 2017) + add_definitions(-DHLSLIB_LEGACY_SDX=1) + else() + add_definitions(-DHLSLIB_LEGACY_SDX=0) + endif() + + if(DACE_XILINX_MODE STREQUAL "simulation") + # This will cause the OpenCL calls to instead call a simulation code + # running on the host + add_definitions(-DHLSLIB_SIMULATE_OPENCL) + endif() + + set(DACE_XILINX_SYNTHESIS_FLAGS "${DACE_XILINX_SYNTHESIS_FLAGS} -DDACE_SYNTHESIS -DDACE_XILINX -DDACE_XILINX_DEVICE_CODE -DHLSLIB_SYNTHESIS -std=c++11") + + # Add synthesis and build commands + set(DACE_SYNTHESIS_TARGETS) 
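+  # For each generated kernel_<name>.cpp below, a per-kernel Vivado HLS
+  # synthesis target is created from Xilinx_HLS.tcl.in, and the corresponding
+  # --kernel flags are collected for the xocc build targets defined further down.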
+ foreach(DACE_KERNEL_FILE ${DACE_XILINX_KERNEL_FILES}) + get_filename_component(DACE_KERNEL_NAME ${DACE_KERNEL_FILE} NAME) + string(REGEX REPLACE "kernel_(.+).cpp" "\\1" DACE_KERNEL_NAME "${DACE_KERNEL_NAME}") + string(REPLACE " " ";" DACE_XILINX_SYNTHESIS_FLAGS_INTERNAL ${DACE_XILINX_SYNTHESIS_FLAGS}) + set(DACE_XOCC_KERNEL_FILES ${DACE_XOCC_KERNEL_FILES} ${DACE_KERNEL_FILE}) + set(DACE_XOCC_KERNELS ${DACE_XOCC_KERNELS} --kernel ${DACE_KERNEL_NAME} --xp prop:kernel.${DACE_KERNEL_NAME}.kernel_flags=\"${DACE_XILINX_SYNTHESIS_FLAGS_INTERNAL}\") + + configure_file(${CMAKE_SOURCE_DIR}/Xilinx_HLS.tcl.in Synthesize_${DACE_KERNEL_NAME}.tcl) + add_custom_target(xilinx_synthesis_${DACE_KERNEL_NAME} COMMAND ${SDAccel_VIVADO_HLS} -f Synthesize_${DACE_KERNEL_NAME}.tcl) + set(DACE_SYNTHESIS_TARGETS ${DACE_SYNTHESIS_TARGETS} xilinx_synthesis_${DACE_KERNEL_NAME}) + + endforeach() + + add_custom_target(xilinx_synthesis DEPENDS ${DACE_SYNTHESIS_TARGETS}) + + string(REPLACE " " ";" DACE_XILINX_BUILD_FLAGS_INTERNAL + "${DACE_XILINX_BUILD_FLAGS}") + + set(XOCC_BUILD_FLAGS + -s + -O3 + -I${CMAKE_SOURCE_DIR}/include + -I${CMAKE_SOURCE_DIR}/../external/hlslib/include + -I${CMAKE_SOURCE_DIR}/../runtime/include + -I${CMAKE_BINARY_DIR} + "${DACE_XOCC_KERNELS}" + --platform ${DACE_XILINX_TARGET_PLATFORM} + ${DACE_XILINX_BUILD_FLAGS_INTERNAL} + --kernel_frequency ${DACE_XILINX_TARGET_CLOCK} + --max_memory_ports all) + + if((NOT (DACE_XILINX_MODE STREQUAL "hardware")) OR DACE_XILINX_ENABLE_DEBUGGING) + # TODO: add Chipscope debugging on memory interfaces. Need to pass + # interfaces from codegen to CMake in order to do this. + message(STATUS "Enabled debugging/profiling for Xilinx targets.") + set(XOCC_BUILD_FLAGS ${XOCC_BUILD_FLAGS} + --profile_kernel "data:all:all:all" + --profile_kernel "stall:all:all" + --profile_kernel "exec:all:all") + endif() + + if(SDAccel_MAJOR_VERSION LESS 2018 AND SDAccel_MINOR_VERSION LESS 3) + + add_custom_target( + xilinx_build_${DACE_PROGRAM_NAME}_software_emulation + COMMAND + XILINX_PATH=${CMAKE_BINARY_DIR} ${SDAccel_XOCC} + ${XOCC_BUILD_FLAGS} + -t sw_emu + ${DACE_XOCC_KERNEL_FILES} + -o ${DACE_PROGRAM_NAME}_sw_emu.xclbin) + + add_custom_target( + xilinx_build_${DACE_PROGRAM_NAME}_hardware_emulation + COMMAND + XILINX_PATH=${CMAKE_BINARY_DIR} ${SDAccel_XOCC} + ${XOCC_BUILD_FLAGS} + -t hw_emu + ${DACE_XOCC_KERNEL_FILES} + -o ${DACE_PROGRAM_NAME}_hw_emu.xclbin) + + add_custom_target( + xilinx_build_${DACE_PROGRAM_NAME}_hardware + COMMAND + XILINX_PATH=${CMAKE_BINARY_DIR} ${SDAccel_XOCC} + ${XOCC_BUILD_FLAGS} + -t hw + ${DACE_XOCC_KERNEL_FILES} + -o ${DACE_PROGRAM_NAME}_hw.xclbin) + + else() + + add_custom_target( + xilinx_compile_${DACE_PROGRAM_NAME}_software_emulation + COMMAND + XILINX_PATH=${CMAKE_BINARY_DIR} ${SDAccel_XOCC} + ${XOCC_BUILD_FLAGS} + -c + -t sw_emu + ${DACE_XOCC_KERNEL_FILES} + -o ${DACE_PROGRAM_NAME}_sw_emu.xo) + + add_custom_target( + xilinx_compile_${DACE_PROGRAM_NAME}_hardware_emulation + COMMAND + XILINX_PATH=${CMAKE_BINARY_DIR} ${SDAccel_XOCC} + ${XOCC_BUILD_FLAGS} + -c + -t hw_emu + ${DACE_XOCC_KERNEL_FILES} + -o ${DACE_PROGRAM_NAME}_hw_emu.xo) + + add_custom_target( + xilinx_compile_${DACE_PROGRAM_NAME}_hardware + COMMAND + XILINX_PATH=${CMAKE_BINARY_DIR} ${SDAccel_XOCC} + ${XOCC_BUILD_FLAGS} + -c + -t hw + ${DACE_XOCC_KERNEL_FILES} + -o ${DACE_PROGRAM_NAME}_hw.xo) + + add_custom_target( + xilinx_build_${DACE_PROGRAM_NAME}_software_emulation + COMMAND + XILINX_PATH=${CMAKE_BINARY_DIR} ${SDAccel_XOCC} + ${XOCC_BUILD_FLAGS} + -l + -t sw_emu + 
${DACE_PROGRAM_NAME}_sw_emu.xo + -o ${DACE_PROGRAM_NAME}_sw_emu.xclbin) + + add_custom_target( + xilinx_build_${DACE_PROGRAM_NAME}_hardware_emulation + COMMAND + XILINX_PATH=${CMAKE_BINARY_DIR} ${SDAccel_XOCC} + ${XOCC_BUILD_FLAGS} + -l + -t hw_emu + ${DACE_PROGRAM_NAME}_hw_emu.xo + -o ${DACE_PROGRAM_NAME}_hw_emu.xclbin) + + add_custom_target( + xilinx_build_${DACE_PROGRAM_NAME}_hardware + COMMAND + XILINX_PATH=${CMAKE_BINARY_DIR} ${SDAccel_XOCC} + ${XOCC_BUILD_FLAGS} + -l + -t hw + ${DACE_PROGRAM_NAME}_hw.xo + -o ${DACE_PROGRAM_NAME}_hw.xclbin) + + endif() + +endif() # DACE_ENABLE_XILINX + +# Create DaCe library file +add_library(${DACE_PROGRAM_NAME} SHARED ${DACE_CPP_FILES} ${DACE_OBJECTS}) +target_link_libraries(${DACE_PROGRAM_NAME} ${DACE_LIBS}) + +# Create DaCe loader stub +add_library(dacestub_${DACE_PROGRAM_NAME} SHARED "${CMAKE_SOURCE_DIR}/tools/dacestub.cpp") +target_link_libraries(dacestub_${DACE_PROGRAM_NAME} ${CMAKE_THREAD_LIBS_INIT} ${OpenMP_CXX_LIBRARIES}) + +# Windows-specific fixes +if (MSVC_IDE) + # Copy output DLL from the "Debug" and "Release" directories CMake adds + # NOTE: The "|| (exit 0)" is added because copy sometimes fails due to the + # stub library being already loaded. + add_custom_target(CopyDLL ALL + COMMAND ${CMAKE_COMMAND} -E copy_if_different + $ "${CMAKE_BINARY_DIR}/lib${DACE_PROGRAM_NAME}.dll" + COMMAND ${CMAKE_COMMAND} -E copy_if_different + $ "${CMAKE_BINARY_DIR}/libdacestub_${DACE_PROGRAM_NAME}.dll" || (exit 0) + DEPENDS ${DACE_PROGRAM_NAME} + COMMENT "Copying binaries" VERBATIM) + + # Replace /MD with /MT so that CUDA links properly + # https://stackoverflow.com/a/14172871/6489142 + set(CompilerFlags + CMAKE_CXX_FLAGS + CMAKE_CXX_FLAGS_DEBUG + CMAKE_CXX_FLAGS_RELEASE + CMAKE_CXX_FLAGS_RELWITHDEBINFO + CMAKE_CXX_FLAGS_MINSIZEREL + CMAKE_C_FLAGS + CMAKE_C_FLAGS_DEBUG + CMAKE_C_FLAGS_RELEASE + CMAKE_C_FLAGS_RELWITHDEBINFO + CMAKE_C_FLAGS_MINSIZEREL + ) + foreach(CompilerFlag ${CompilerFlags}) + string(REPLACE "/MD" "/MT" ${CompilerFlag} "${${CompilerFlag}}") + endforeach() +endif() diff --git a/dace/codegen/Xilinx_HLS.tcl.in b/dace/codegen/Xilinx_HLS.tcl.in new file mode 100644 index 0000000000..7261b513a9 --- /dev/null +++ b/dace/codegen/Xilinx_HLS.tcl.in @@ -0,0 +1,14 @@ +open_project ${DACE_KERNEL_NAME} +open_solution ${DACE_XILINX_PART_NAME} +add_files -cflags "${DACE_XILINX_SYNTHESIS_FLAGS} -I${DACE_RUNTIME_DIR}/include -I${DACE_HLSLIB_DIR}/include -I${CMAKE_BINARY_DIR}" "${DACE_KERNEL_FILE}" +set_top ${DACE_KERNEL_NAME} +set_part ${DACE_XILINX_PART_NAME} +create_clock -period ${DACE_XILINX_TARGET_CLOCK}MHz -name default +# SDAccel default options +config_rtl -register_reset +config_interface -m_axi_addr64 +config_schedule -relax_ii_for_timing +config_compile -pipeline_loops 64 +config_compile -name_max_length 256 +csynth_design +exit diff --git a/dace/codegen/__init__.py b/dace/codegen/__init__.py new file mode 100644 index 0000000000..e69de29bb2 diff --git a/dace/codegen/codegen.py b/dace/codegen/codegen.py new file mode 100644 index 0000000000..406b0409cf --- /dev/null +++ b/dace/codegen/codegen.py @@ -0,0 +1,67 @@ +import numpy as np + +from typing import List + +from dace import symbolic +from dace.codegen.targets import framecode +from dace.codegen.codeobject import CodeObject + +from dace.codegen.instrumentation.perfsettings import PerfSettings, PerfMetaInfoStatic, PerfMetaInfo + +# Import all code generation targets +from dace.codegen.targets import cpu, cuda, immaterial, mpi, xilinx + + +class CodegenError(Exception): + pass + + 
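+# Maps code-generation target names to their backend classes. The names match
+# the per-target source subfolders (e.g. src/cpu, src/cuda) written by
+# generate_program_folder() in dace/codegen/compiler.py, which is how
+# configure_and_compile() later resolves each generated file to its target.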
+STRING_TO_TARGET = { + "cpu": cpu.CPUCodeGen, + "cuda": cuda.CUDACodeGen, + "immaterial": immaterial.ImmaterialCodeGen, + "mpi": mpi.MPICodeGen, + "xilinx": xilinx.XilinxCodeGen, +} + +_TARGET_REGISTER_ORDER = ['cpu', 'cuda', 'immaterial', 'mpi', 'xilinx'] + + +def generate_code(sdfg) -> List[CodeObject]: + """ Generates code as a list of code objects for a given SDFG. + @param sdfg: The SDFG to use + @return: List of code objects that correspond to files to compile. + """ + # Before compiling, validate SDFG correctness + sdfg.validate() + + frame = framecode.DaCeCodeGenerator() + # Instantiate all targets (who register themselves with framecodegen) + targets = { + name: STRING_TO_TARGET[name](frame, sdfg) + for name in _TARGET_REGISTER_ORDER + } + + # Generate frame code (and the rest of the code) + global_code, frame_code, used_targets = frame.generate_code(sdfg, None) + target_objects = [ + CodeObject( + sdfg.name, + global_code + frame_code, + 'cpp', + cpu.CPUCodeGen, + 'Frame', + meta_info=PerfMetaInfoStatic.info + if PerfSettings.perf_enable_vectorization_analysis() else + PerfMetaInfo()) + ] + PerfMetaInfoStatic.info = PerfMetaInfo() + + # Create code objects for each target + for tgt in used_targets: + target_objects.extend(tgt.get_generated_codeobjects()) + + return target_objects + + +################################################################## diff --git a/dace/codegen/codeobject.py b/dace/codegen/codeobject.py new file mode 100644 index 0000000000..cb25c3884a --- /dev/null +++ b/dace/codegen/codeobject.py @@ -0,0 +1,51 @@ +import ctypes +import numpy as np + +from dace import symbolic, types +from dace.config import Config +from dace.frontend import operations +from dace.properties import Property, make_properties +from dace.codegen.targets.target import TargetCodeGenerator + +from dace.codegen.instrumentation.perfsettings import PerfMetaInfo + + +@make_properties +class CodeObject(object): + name = Property(dtype=str, desc="Filename to use") + code = Property(dtype=str, desc="The code attached to this object") + perf_meta_info = Property( + dtype=PerfMetaInfo, desc="Meta information used to map nodes to LOC") + language = Property( + dtype=str, + desc="Language used for this code (same " + + "as its file extension)") # dtype=types.Language? + target = Property(dtype=type, desc="Target to use for compilation") + title = Property(dtype=str, desc="Title of code for GUI") + extra_compiler_kwargs = Property( + dtype=dict, + desc="Additional compiler argument " + "variables to add to template") + linkable = Property( + dtype=bool, desc='Should this file participate in ' + 'overall linkage?') + + def __init__(self, + name, + code, + language, + target, + title, + additional_compiler_kwargs={}, + linkable=True, + meta_info=PerfMetaInfo()): + super(CodeObject, self).__init__() + + self.name = name + self.code = code + self.language = language + self.target = target + self.title = title + self.extra_compiler_kwargs = additional_compiler_kwargs + self.linkable = linkable + self.perf_meta_info = meta_info diff --git a/dace/codegen/compiler.py b/dace/codegen/compiler.py new file mode 100644 index 0000000000..16dab734b2 --- /dev/null +++ b/dace/codegen/compiler.py @@ -0,0 +1,512 @@ +#!/usr/bin/python3 +""" Handles compilation of code objects. Creates the proper folder structure, + compiles each target separately, links all targets to one binary, and + returns the corresponding CompiledSDFG object. 
""" + +from __future__ import print_function + +import ctypes +import os +import re +import six +import shutil +import subprocess +import string +import subprocess as sp +import re +from typing import List +import numpy as np + +import dace +from dace.frontend import operations +from dace.frontend.python import ndarray +from dace import symbolic, types +from dace.config import Config +from dace.codegen import codegen +from dace.codegen.codeobject import CodeObject +from dace.codegen.targets.cpu import CPUCodeGen +from dace.codegen.targets.target import make_absolute + +from dace.codegen.instrumentation.perfsettings import PerfSettings, PerfMetaInfoStatic + + +# Specialized exception classes +class DuplicateDLLError(Exception): + """ An exception that is raised whenever a library is loaded twice. """ + pass + + +class CompilerConfigurationError(Exception): + """ An exception that is raised whenever CMake encounters a configuration + error. """ + pass + + +class CompilationError(Exception): + """ An exception that is raised whenever a compilation error occurs. """ + pass + + +class ReloadableDLL(object): + """ A reloadable shared object (or dynamically linked library), which + bypasses Python's dynamic library reloading issues. """ + + def __init__(self, library_filename, program_name): + """ Creates a new reloadable shared object. + @param library_filename: Path to library file. + @param program_name: Name of the DaCe program (for use in finding + the stub library loader). + """ + self._stub_filename = os.path.join( + os.path.dirname(os.path.realpath(library_filename)), + 'libdacestub_%s.%s' % + (program_name, Config.get('compiler', 'library_extension'))) + self._library_filename = library_filename + self._stub = None + self._lib = None + + def get_symbol(self, name, restype=ctypes.c_int): + """ Returns a symbol (e.g., function name) in the loaded library. """ + + if self._lib is None or self._lib.value is None: + raise ReferenceError( + 'ReloadableDLL can only be used with a ' + + '"with" statement or with load() and unload()') + + func = self._stub.get_symbol(self._lib, ctypes.c_char_p(name.encode())) + if func is None: + raise KeyError('Function %s not found in library %s' % + (name, os.path.basename(self._library_filename))) + + return ctypes.CFUNCTYPE(restype)(func) + + def load(self): + """ Loads the internal library using the stub. 
""" + + # If internal library is already loaded, skip + if self._lib is not None and self._lib.value is not None: + return + self._stub = ctypes.CDLL(self._stub_filename) + + # Set return types of stub functions + self._stub.load_library.restype = ctypes.c_void_p + self._stub.get_symbol.restype = ctypes.c_void_p + + # Convert library filename to string according to OS + if os.name == 'nt': + # As UTF-16 + lib_cfilename = ctypes.c_wchar_p(self._library_filename) + else: + # As UTF-8 + lib_cfilename = ctypes.c_char_p( + self._library_filename.encode('utf-8')) + + # Check if library is already loaded + is_loaded = self._stub.is_library_loaded(lib_cfilename) + if is_loaded == 1: + raise DuplicateDLLError( + 'Library %s is already loaded somewhere else, ' % + os.path.basename(self._library_filename) + + 'either unload it or use a different name ' + + 'for the SDFG/program.') + + # Actually load the library + self._lib = ctypes.c_void_p(self._stub.load_library(lib_cfilename)) + + if self._lib.value is None: + raise RuntimeError('Could not load library %s' % os.path.basename( + self._library_filename)) + + def unload(self): + """ Unloads the internal library using the stub. """ + + if self._stub is None: + return + + self._stub.unload_library(self._lib) + self._lib = None + del self._stub + self._stub = None + + def __enter__(self, *args, **kwargs): + self.load() + return self + + def __exit__(self, *args, **kwargs): + self.unload() + + +class CompiledSDFG(object): + """ A compiled SDFG object that can be called through Python. """ + + def __init__(self, sdfg, lib: ReloadableDLL): + self._sdfg = sdfg + self._lib = lib + self._initialized = False + self._lastargs = () + lib.load() # Explicitly load the library + self._init = lib.get_symbol('__dace_init') + self._exit = lib.get_symbol('__dace_exit') + self._cfunc = lib.get_symbol('__program_{}'.format(sdfg.name)) + + @property + def sdfg(self): + return self._sdfg + + def __del__(self): + if self._initialized == True: + self.finalize(*self._lastargs) + self._initialized = False + self._lib.unload() + + def _construct_args(self, *args, **kwargs): + """ Main function that controls argument construction for calling + the C prototype of the SDFG. + + Organizes arguments first by `sdfg.arglist`, then data descriptors + by alphabetical order, then symbols by alphabetical order. + """ + + if len(kwargs) > 0 and len(args) > 0: + raise AttributeError( + 'Compiled SDFGs can only be called with either arguments ' + + '(e.g. 
"program(a,b,c)") or keyword arguments ' + + '("program(A=a,B=b)"), but not both') + + # Argument construction + sig = [] + if len(kwargs) > 0: + # Construct mapping from arguments to signature + sig = self._sdfg.signature_arglist(with_types=False) + arglist = [] + for a in sig: + try: + arglist.append(kwargs[a]) + except KeyError: + raise KeyError("Missing kernel argument \"{}\"".format(a)) + elif len(args) > 0: + arglist = list(args) + else: + arglist = [] + + sdfg = self._sdfg + + # As in compilation, add symbols used in array sizes to parameters + symparams = {} + for symname in sdfg.undefined_symbols(False): + # Ignore arguments (as they may not be symbols but constants, + # see below) + if symname in sdfg.arg_types: continue + try: + symval = symbolic.symbol(symname) + symparams[symname] = symval.get() + except UnboundLocalError: + try: + symparams[symname] = kwargs[symname] + except KeyError: + raise UnboundLocalError('Unassigned symbol %s' % symname) + + arglist.extend( + [symparams[k] for k in sorted(symparams.keys()) if k not in sig]) + + # Obtain SDFG constants + constants = sdfg.constants + + # Remove symbolic constants from arguments + callparams = tuple( + arg for arg in arglist if not symbolic.issymbolic(arg) or ( + hasattr(arg, 'name') and arg.name not in constants)) + + # Replace symbols with their values + callparams = tuple( + symbolic.eval(arg) if symbolic.issymbolic(arg, constants) else arg + for arg in callparams) + + # Replace arrays with their pointers + newargs = tuple( + ctypes.c_void_p(arg.__array_interface__['data'][0]) if (isinstance( + arg, ndarray.ndarray) or isinstance(arg, np.ndarray)) else arg + for arg in callparams) + + newargs = tuple(types._FFI_CTYPES[type(arg)](arg) + if type(arg) in types._FFI_CTYPES else arg + for arg in newargs) + + self._lastargs = newargs + return self._lastargs + + def initialize(self, *argtuple): + if self._init is not None: + res = self._init(*argtuple) + if res != 0: + raise RuntimeError('DaCe application failed to initialize') + + self._initialized = True + + def finalize(self, *argtuple): + if self._exit is not None: + self._exit(*argtuple) + + def __call__(self, *args, **kwargs): + argtuple = self._construct_args(*args, **kwargs) + + # Call initializer function if necessary, then SDFG + if self._initialized == False: + self.initialize(*argtuple) + + # PROFILING + if Config.get_bool('profiling'): + operations.timethis(self._sdfg.name, 'DaCe', 0, self._cfunc, + *argtuple) + else: + return self._cfunc(*argtuple) + + +def unique_flags(flags): + pattern = '[^ ]+[`\'"][^"\'`]+["\'`]|[^ ]+' + if not isinstance(flags, str): + flags = " ".join(flags) + return set(re.findall(pattern, flags)) + + +def generate_program_folder(code_objects: List[CodeObject], out_path): + """ Writes all files required to configure and compile the DaCe program + into the specified folder. + + @param code_objects: List of generated code objects. + @param out_path: The folder in which the build files should be written. + @return: Path to the program folder. 
+ """ + + src_path = os.path.join(out_path, "src") + + try: + os.makedirs(src_path) + except FileExistsError: + pass + + filelist = [] + # Write each code object to a file + for code_object in code_objects: + + name = code_object.name + extension = code_object.language + target_name = code_object.target.target_name + + # Create target folder + target_folder = os.path.join(src_path, target_name) + try: + os.makedirs(target_folder) + except FileExistsError: + pass + + # Write code to file + basename = "{}.{}".format(name, extension) + code_path = os.path.join(target_folder, basename) + with open(code_path, "w") as code_file: + clean_code = re.sub(r'[ \t]*////__DACE:[^\n]*', '', + code_object.code) + + if PerfSettings.perf_enable_vectorization_analysis(): + # Generate line number information from the code + # TODO: Make per code stream + code_object.perf_meta_info.resolve(clean_code) + code_file.write(clean_code) + + filelist.append("{},{}".format(target_name, basename)) + + # Write list of files + with open(os.path.join(out_path, "dace_files.csv"), "w") as filelist_file: + filelist_file.write("\n".join(filelist)) + + # Copy snapshot of configuration script + Config.save(os.path.join(out_path, "dace.conf")) + + return out_path + + +def configure_and_compile(program_folder, program_name=None): + """ Configures and compiles a DaCe program in the specified folder into a + shared library file. + + @param program_folder: Folder containing all files necessary to build, + equivalent to what was passed to + `generate_program_folder`. + @return: Path to the compiled shared library file. + """ + + if program_name is None: + program_name = os.path.basename(program_folder) + program_folder = os.path.abspath(program_folder) + src_folder = os.path.join(program_folder, "src") + + # Prepare build folder + build_folder = os.path.join(program_folder, "build") + try: + os.makedirs(build_folder) + except FileExistsError: + pass + + # Read list of DaCe files to compile. + # We do this instead of iterating over source files in the directory to + # avoid globbing files from previous compilations, such that we don't need + # to wipe the directory for every compilation. 
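+    # Each line of dace_files.csv has the form "<target name>,<file basename>"
+    # (e.g. "cpu,program.cpp"), as written by generate_program_folder() above.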
+ file_list = [ + line.strip().split(",") + for line in open(os.path.join(program_folder, "dace_files.csv"), "r") + ] + + # Get absolute paths and targets for all source files + files = [] + targets = {} # {target name: target class} + for target_name, file_name in file_list: + path = os.path.join(src_folder, target_name, file_name) + files.append(path) + targets[target_name] = codegen.STRING_TO_TARGET[target_name] + + # Start forming CMake command + dace_path = os.path.dirname(os.path.dirname(os.path.abspath(__file__))) + cmake_command = [ + "cmake", + "-A x64" if os.name == 'nt' else "", # Windows-specific flag + '"' + os.path.join(dace_path, "codegen") + '"', + "-DDACE_FILES=\"{}\"".format(";".join(files)), + "-DDACE_PROGRAM_NAME={}".format(program_name), + ] + + # Replace backslashes with forward slashes + cmake_command = [cmd.replace('\\', '/') for cmd in cmake_command] + + # Generate CMake options for each compiler + libraries = set() + for target_name, target in targets.items(): + cmake_command += target.cmake_options() + try: + libraries |= unique_flags( + Config.get("compiler", target_name, "libs")) + except KeyError: + pass + + # TODO: it should be possible to use the default arguments/compilers + # found by CMake + cmake_command += [ + "-DDACE_LIBS=\"{}\"".format(" ".join(libraries)), + "-DCMAKE_LINKER=\"{}\"".format( + make_absolute(Config.get('compiler', 'linker', 'executable'))), + "-DCMAKE_SHARED_LINKER_FLAGS=\"{}\"".format( + Config.get('compiler', 'linker', 'args') + + Config.get('compiler', 'linker', 'additional_args')), + ] + + ############################################## + # Configure + try: + _run_liveoutput(" ".join(cmake_command), shell=True, cwd=build_folder) + except subprocess.CalledProcessError as ex: + # Clean CMake directory and try once more + if Config.get_bool('debugprint'): + print('Cleaning CMake build folder and retrying...') + shutil.rmtree(build_folder) + os.makedirs(build_folder) + try: + _run_liveoutput( + " ".join(cmake_command), shell=True, cwd=build_folder) + except subprocess.CalledProcessError as ex: + # If still unsuccessful, print results + if Config.get_bool('debugprint'): + raise CompilerConfigurationError('Configuration failure') + else: + raise CompilerConfigurationError('Configuration failure:\n' + + ex.output) + + # Compile and link + try: + _run_liveoutput( + "cmake --build . 
--config %s" % (Config.get( + 'compiler', 'build_type')), + shell=True, + cwd=build_folder) + except subprocess.CalledProcessError as ex: + # If unsuccessful, print results + if Config.get_bool('debugprint'): + raise CompilationError('Compiler failure') + else: + raise CompilationError('Compiler failure:\n' + ex.output) + + shared_library_path = os.path.join( + build_folder, "lib{}.{}".format( + program_name, Config.get('compiler', 'library_extension'))) + + return shared_library_path + + +def get_program_handle(library_path, sdfg): + lib = ReloadableDLL(library_path, sdfg.name) + # Load and return the compiled function + return CompiledSDFG(sdfg, lib) + + +def load_from_file(sdfg, binary_filename): + if not os.path.isfile(binary_filename): + raise FileNotFoundError('File not found: ' + binary_filename) + + # Load the generated library + lib = ReloadableDLL(binary_filename, sdfg.name) + + # Load and return the compiled function + return CompiledSDFG(sdfg, lib) + + +def get_binary_name(object_name, + object_hash=None, + lib_extension=Config.get('compiler', 'library_extension')): + name = None + if object_hash is None: + name = os.path.join('.dacecache', object_name, "build", + 'lib%s.%s' % (object_name, lib_extension)) + else: + name = os.path.join( + '.dacecache', object_name, "build", + 'lib%s_%s.%s' % (object_name, object_hash, lib_extension)) + return name + + +def _run_liveoutput(command, **kwargs): + process = subprocess.Popen( + command, stderr=subprocess.STDOUT, stdout=subprocess.PIPE, **kwargs) + output = six.StringIO() + while True: + line = process.stdout.readline().rstrip() + if not line: + break + output.write(line.decode('utf-8') + '\n') + if Config.get_bool('debugprint'): + print(line.decode('utf-8'), flush=True) + stdout, stderr = process.communicate() + if Config.get_bool('debugprint'): + print(stdout.decode('utf-8'), flush=True) + if stderr is not None: + print(stderr.decode('utf-8'), flush=True) + output.write(stdout.decode('utf-8')) + if stderr is not None: + output.write(stderr.decode('utf-8')) + + # An error occurred, raise exception + if process.returncode != 0: + raise subprocess.CalledProcessError(process.returncode, command, + output.getvalue()) + + +# Allow configuring and compiling a prepared build folder from the commandline. +# This is useful for remote execution. +if __name__ == "__main__": + import argparse + + argparser = argparse.ArgumentParser() + argparser.add_argument("path", type=str) + argparser.add_argument("outname", type=str) + args = vars(argparser.parse_args()) + + Config.load(os.path.join(args["path"], "dace.conf")) + + configure_and_compile(args["path"], args["outname"]) diff --git a/dace/codegen/cppunparse.py b/dace/codegen/cppunparse.py new file mode 100644 index 0000000000..904057a62b --- /dev/null +++ b/dace/codegen/cppunparse.py @@ -0,0 +1,1093 @@ +# This module is derived from astunparse: https://github.com/simonpercivall/astunparse +########################################################################## +### astunparse LICENSES +# LICENSE +# ================== +# +# Copyright (c) 2014, Simon Percivall +# All rights reserved. +# +# Redistribution and use in source and binary forms, with or without modification, are permitted provided that the following conditions are met: +# +# * Redistributions of source code must retain the above copyright notice, this list of conditions and the following disclaimer. 
+# +# * Redistributions in binary form must reproduce the above copyright notice, this list of conditions and the following disclaimer in the documentation and/or other materials provided with the distribution. +# +# * Neither the name of AST Unparser nor the names of its contributors may be used to endorse or promote products derived from this software without specific prior written permission. +# +# THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE. +# +# +# PYTHON SOFTWARE FOUNDATION LICENSE VERSION 2 +# -------------------------------------------- +# +# 1. This LICENSE AGREEMENT is between the Python Software Foundation +# ("PSF"), and the Individual or Organization ("Licensee") accessing and +# otherwise using this software ("Python") in source or binary form and +# its associated documentation. +# +# 2. Subject to the terms and conditions of this License Agreement, PSF hereby +# grants Licensee a nonexclusive, royalty-free, world-wide license to reproduce, +# analyze, test, perform and/or display publicly, prepare derivative works, +# distribute, and otherwise use Python alone or in any derivative version, +# provided, however, that PSF's License Agreement and PSF's notice of copyright, +# i.e., "Copyright (c) 2001, 2002, 2003, 2004, 2005, 2006, 2007, 2008, 2009, 2010, +# 2011, 2012, 2013, 2014 Python Software Foundation; All Rights Reserved" are retained +# in Python alone or in any derivative version prepared by Licensee. +# +# 3. In the event Licensee prepares a derivative work that is based on +# or incorporates Python or any part thereof, and wants to make +# the derivative work available to others as provided herein, then +# Licensee hereby agrees to include in any such work a brief summary of +# the changes made to Python. +# +# 4. PSF is making Python available to Licensee on an "AS IS" +# basis. PSF MAKES NO REPRESENTATIONS OR WARRANTIES, EXPRESS OR +# IMPLIED. BY WAY OF EXAMPLE, BUT NOT LIMITATION, PSF MAKES NO AND +# DISCLAIMS ANY REPRESENTATION OR WARRANTY OF MERCHANTABILITY OR FITNESS +# FOR ANY PARTICULAR PURPOSE OR THAT THE USE OF PYTHON WILL NOT +# INFRINGE ANY THIRD PARTY RIGHTS. +# +# 5. PSF SHALL NOT BE LIABLE TO LICENSEE OR ANY OTHER USERS OF PYTHON +# FOR ANY INCIDENTAL, SPECIAL, OR CONSEQUENTIAL DAMAGES OR LOSS AS +# A RESULT OF MODIFYING, DISTRIBUTING, OR OTHERWISE USING PYTHON, +# OR ANY DERIVATIVE THEREOF, EVEN IF ADVISED OF THE POSSIBILITY THEREOF. +# +# 6. This License Agreement will automatically terminate upon a material +# breach of its terms and conditions. +# +# 7. Nothing in this License Agreement shall be deemed to create any +# relationship of agency, partnership, or joint venture between PSF and +# Licensee. 
This License Agreement does not grant permission to use PSF +# trademarks or trade name in a trademark sense to endorse or promote +# products or services of Licensee, or any third party. +# +# 8. By copying, installing or otherwise using Python, Licensee +# agrees to be bound by the terms and conditions of this License +# Agreement. +########################################################################## +### END OF astunparse LICENSES + +from __future__ import print_function, unicode_literals +import inspect +import six +import sys +import ast +import os +import tokenize +from six import StringIO + +# Large float and imaginary literals get turned into infinities in the AST. +# We unparse those infinities to INFSTR. +INFSTR = "1e" + repr(sys.float_info.max_10_exp + 1) + +_py2c_nameconst = {True: "true", False: "false", None: "nullptr"} + +_py2c_reserved = {"True": "true", "False": "false", "None": "nullptr"} + + +def interleave(inter, f, seq): + """Call f on each item in seq, calling inter() in between. + """ + seq = iter(seq) + try: + f(next(seq)) + except StopIteration: + pass + else: + for x in seq: + inter() + f(x) + + +class LocalScheme(object): + def is_defined(self, local_name, current_depth): + raise NotImplementedError('Abstract class') + + def define(self, local_name, lineno, depth): + raise NotImplementedError('Abstract class') + + def clear_scope(self, from_indentation): + raise NotImplementedError('Abstract class') + + +class CPPLocals(LocalScheme): + def __init__(self): + # Maps local name to a 2-tuple of line number and scope (measured in indentation) + self.locals = {} + + def is_defined(self, local_name, current_depth): + return local_name in self.locals + + def define(self, local_name, lineno, depth): + self.locals[local_name] = (lineno, depth) + + def clear_scope(self, from_indentation): + """Clears all locals defined in indentation 'from_indentation' and deeper""" + toremove = set() + for local_name, (lineno, depth) in self.locals.items(): + if depth >= from_indentation: + toremove.add(local_name) + + for var in toremove: + del self.locals[var] + + +# Python scheme: All global variables can be read, but not written to (unless defined as "global") +class PythonLocals(LocalScheme): + def __init__(self): + # Maps local name to a 2-tuple of line number and scope (measured in indentation) + self.locals = {} + + def is_defined(self, local_name, current_depth): + return local_name in self.locals and self.locals[local_name][1] == current_depth + + def define(self, local_name, lineno, depth): + self.locals[local_name] = (lineno, depth) + + def clear_scope(self, from_indentation): + """Clears all locals defined in indentation 'from_indentation' and deeper""" + toremove = set() + for local_name, (lineno, depth) in self.locals.items(): + if depth >= from_indentation: + toremove.add(local_name) + for var in toremove: + del self.locals[var] + + +class CPPUnparser: + """Methods in this class recursively traverse an AST and + output C++ source code for the abstract syntax; original formatting + is disregarded. 
""" + + def __init__(self, + tree, + depth, + locals, + file=sys.stdout, + indent_output=True, + expr_semicolon=True, + indent_offset=0): + + self.f = file + self.future_imports = [] + self._indent = depth + self.indent_output = indent_output + self.indent_offset = indent_offset + self.expr_semicolon = expr_semicolon + if not isinstance(locals, LocalScheme): + raise TypeError('Locals must be a LocalScheme object') + self.locals = locals + self.firstfill = True + + self.dispatch(tree) + print("", file=self.f) + self.f.flush() + + def fill(self, text=""): + """Indent a piece of text, according to the current indentation level""" + if self.firstfill: + if self.indent_output: + self.f.write(" " * (self._indent + self.indent_offset) + + text) + else: + self.f.write(text) + self.firstfill = False + else: + if self.indent_output: + self.f.write("\n" + " " * + (self._indent + self.indent_offset) + text) + else: + self.f.write("\n" + text) + + def write(self, text): + """Append a piece of text to the current line.""" + self.f.write(six.text_type(text)) + + def enter(self): + """Print '{', and increase the indentation.""" + self.write(" {") + self._indent += 1 + + def leave(self): + """Decrease the indentation and print '}'.""" + self._indent -= 1 + self.fill() + self.write("}") + # Clear locals defined inside scope + self.locals.clear_scope(self._indent + 1) + + def dispatch(self, tree): + """Dispatcher function, dispatching tree type T to method _T.""" + try: + tree = iter(tree) + for t in tree: + self.dispatch(t) + except TypeError: + meth = getattr(self, "_" + tree.__class__.__name__) + meth(tree) + + ############### Unparsing methods ###################### + # There should be one method per concrete grammar type # + # Constructors should be grouped by sum type. Ideally, # + # this would follow the order in the grammar, but # + # currently doesn't. # + ######################################################## + + def _Module(self, tree): + for stmt in tree.body: + self.dispatch(stmt) + + def _Interactive(self, tree): + for stmt in tree.body: + self.dispatch(stmt) + + def _Expression(self, tree): + self.dispatch(tree.body) + + # stmt + def _Expr(self, tree): + self.fill() + self.dispatch(tree.value) + if self.expr_semicolon: + self.write(';') + + def _Import(self, t): + raise SyntaxError('Invalid C++') + + def _ImportFrom(self, t): + raise SyntaxError('Invalid C++') + + def dispatch_lhs_tuple(self, targets): + # Decide whether to use the C++17 syntax for undefined variables or std::tie for defined variables + if all( + self.locals.is_defined(target.id, self._indent) + for target in targets): + defined = True + elif any( + self.locals.is_defined(target.id, self._indent) + for target in targets): + raise SyntaxError( + 'Invalid C++ (some variables in tuple were already defined)') + else: + defined = False + + if not defined: # C++17 syntax: auto [a,b,...,z] = ... + self.write("auto [") + else: # C++14 syntax: std::tie(a,b,...,z) = ... 
+ self.write("std::tie(") + + first = True + for target in targets: + if not first: + self.write(', ') + self.locals.define(target.id, target.lineno, self._indent) + self.dispatch(target) + first = False + + if not defined: + self.write("]") + else: + self.write(")") + + def _Assign(self, t): + self.fill() + + # Handle the case of a tuple output + if len(t.targets) > 1: + self.dispatch_lhs_tuple(t.targets) + else: + target = t.targets[0] + if isinstance(target, ast.Tuple): + if len(target.elts) > 1: + self.dispatch_lhs_tuple(target.elts) + target = target.elts[0] + + if not isinstance(target, + ast.Subscript) and not self.locals.is_defined( + target.id, self._indent): + self.locals.define(target.id, t.lineno, self._indent) + self.write('auto ') + self.dispatch(target) + + self.write(" = ") + self.dispatch(t.value) + self.write(';') + + def _AugAssign(self, t): + self.fill() + self.dispatch(t.target) + # Operations that require a function call + if t.op.__class__.__name__ in self.funcops: + separator, func = self.funcops[t.op.__class__.__name__] + self.write(" = " + func + "(") + self.dispatch(t.target) + self.write(separator + " ") + self.dispatch(t.value) + self.write(")") + else: + self.write(" " + self.binop[t.op.__class__.__name__] + "= ") + self.dispatch(t.value) + self.write(';') + + def _AnnAssign(self, t): + self.fill() + + if isinstance(t.target, ast.Tuple): + if len(t.target.elts) > 1: + self.dispatch_lhs_tuple(t.target.elts) + else: + target = target.elts[0] + else: + target = t.target + + # Assignment of the form x: int = 0 is converted to int x = (int)0; + if not self.locals.is_defined(target.id, self._indent): + self.locals.define(target.id, t.lineno, self._indent) + self.dispatch(t.annotation) + self.write(' ') + if not t.simple: + self.write("(") + self.dispatch(t.target) + if not t.simple: + self.write(")") + if t.value: + self.write(" = (") + self.dispatch(t.annotation) + self.write(")") + self.dispatch(t.value) + self.write(';') + + def _Return(self, t): + self.fill("return") + if t.value: + self.write(" ") + self.dispatch(t.value) + self.write(';') + + def _Pass(self, t): + raise SyntaxError('Invalid C++') + + def _Break(self, t): + self.fill("break;") + + def _Continue(self, t): + self.fill("continue;") + + def _Delete(self, t): + raise SyntaxError('Invalid C++') + + def _Assert(self, t): + self.fill("assert(") + self.dispatch(t.test) + if t.msg: + self.write(", ") + self.dispatch(t.msg) + self.write(");") + + def _Exec(self, t): + raise SyntaxError('Invalid C++') + + def _Print(self, t): + do_comma = False + if t.dest: + self.fill("fprintf(") + self.dispatch(t.dest) + do_comma = True + else: + self.fill("printf(") + + for e in t.values: + if do_comma: self.write(", ") + else: do_comma = True + self.dispatch(e) + if not t.nl: + self.write(",") + + self.write(');') + + def _Global(self, t): + raise SyntaxError('Invalid C++') + + def _Nonlocal(self, t): + raise SyntaxError('Invalid C++') + + def _Yield(self, t): + raise SyntaxError('Invalid C++') + + def _YieldFrom(self, t): + raise SyntaxError('Invalid C++') + + def _Raise(self, t): + self.fill("throw") + if six.PY3: + if not t.exc: + assert not t.cause + return + self.write(" ") + self.dispatch(t.exc) + if t.cause: + raise SyntaxError('Invalid C++') + else: + self.write(" ") + if t.type: + self.dispatch(t.type) + if t.inst: + self.write(", ") + self.dispatch(t.inst) + if t.tback: + self.write(", ") + self.dispatch(t.tback) + self.write(';') + + def _Try(self, t): + self.fill("try") + self.enter() + 
self.dispatch(t.body) + self.leave() + for ex in t.handlers: + self.dispatch(ex) + if t.orelse: + raise SyntaxError('Invalid C++') + if t.finalbody: + self.fill("finally") + self.enter() + self.dispatch(t.finalbody) + self.leave() + + def _TryExcept(self, t): + self.fill("try") + self.enter() + self.dispatch(t.body) + self.leave() + + for ex in t.handlers: + self.dispatch(ex) + if t.orelse: + raise SyntaxError('Invalid C++') + + def _TryFinally(self, t): + if len(t.body) == 1 and isinstance(t.body[0], ast.TryExcept): + # try-except-finally + self.dispatch(t.body) + else: + self.fill("try") + self.enter() + self.dispatch(t.body) + self.leave() + + self.fill("finally") + self.enter() + self.dispatch(t.finalbody) + self.leave() + + def _ExceptHandler(self, t): + self.fill("catch (") + if t.type: + self.dispatch(t.type) + if t.name: + if six.PY3: + self.write(t.name) + else: + self.dispatch(t.name) + self.write(')') + self.enter() + self.dispatch(t.body) + self.leave() + + def _ClassDef(self, t): + raise NotImplementedError('Classes are unsupported') + + # Original class definition from astunparse + #self.write("\n") + #for deco in t.decorator_list: + # self.fill("@") + # self.dispatch(deco) + #self.fill("class "+t.name) + #if six.PY3: + # self.write("(") + # comma = False + # for e in t.bases: + # if comma: self.write(", ") + # else: comma = True + # self.dispatch(e) + # for e in t.keywords: + # if comma: self.write(", ") + # else: comma = True + # self.dispatch(e) + # if sys.version_info[:2] < (3, 5): + # if t.starargs: + # if comma: self.write(", ") + # else: comma = True + # self.write("*") + # self.dispatch(t.starargs) + # if t.kwargs: + # if comma: self.write(", ") + # else: comma = True + # self.write("**") + # self.dispatch(t.kwargs) + # self.write(")") + #elif t.bases: + # self.write("(") + # for a in t.bases: + # self.dispatch(a) + # self.write(", ") + # self.write(")") + #self.enter() + #self.dispatch(t.body) + #self.leave() + + def _generic_FunctionDef(self, t, is_async=False): + self.write("\n") + for deco in t.decorator_list: + self.fill("// Decorator: ") + self.dispatch(deco) + if is_async: + self.write('/* async */ ') + + if getattr(t, "returns", False): + if isinstance(t.returns, ast.NameConstant): + if t.returns.value is None: + self.write('void') + else: + self.dispatch(t.returns) + else: + self.dispatch(t.returns) + + self.fill(" " + t.name + "(") + else: + self.fill("auto " + t.name + "(") + + self.dispatch(t.args) + + self.write(")") + self.enter() + self.dispatch(t.body) + self.leave() + + def _FunctionDef(self, t): + self._generic_FunctionDef(t) + + def _AsyncFunctionDef(self, t): + self._generic_FunctionDef(t, is_async=True) + + def _generic_For(self, t, is_async=False): + if is_async: + self.fill("/* async */ for (") + else: + self.fill("for (") + if isinstance(t.target, ast.Tuple): + self.write("auto ") + if len(t.target.elts) == 1: + (elt, ) = t.target.elts + self.locals.define(elt.id, t.lineno, self._indent + 1) + self.dispatch(elt) + else: + self.write("[") + interleave(lambda: self.write(", "), self.dispatch, + t.target.elts) + for elt in t.target.elts: + self.locals.define(elt.id, t.lineno, self._indent + 1) + self.write("]") + + else: + if not self.locals.is_defined(t.target.id, self._indent): + self.locals.define(t.target.id, t.lineno, self._indent + 1) + self.write('auto ') + self.dispatch(t.target) + + self.write(" : ") + self.dispatch(t.iter) + self.write(")") + self.enter() + self.dispatch(t.body) + self.leave() + if t.orelse: + raise SyntaxError('Invalid 
C++') + + def _For(self, t): + self._generic_For(t) + + def _AsyncFor(self, t): + self._generic_For(t, is_async=True) + + def _If(self, t): + self.fill("if (") + self.dispatch(t.test) + self.write(')') + self.enter() + self.dispatch(t.body) + self.leave() + # collapse nested ifs into equivalent elifs. + while (t.orelse and len(t.orelse) == 1 + and isinstance(t.orelse[0], ast.If)): + t = t.orelse[0] + self.fill("else if (") + self.dispatch(t.test) + self.write(')') + self.enter() + self.dispatch(t.body) + self.leave() + # final else + if t.orelse: + self.fill("else") + self.enter() + self.dispatch(t.orelse) + self.leave() + + def _While(self, t): + self.fill("while (") + self.dispatch(t.test) + self.write(')') + self.enter() + self.dispatch(t.body) + self.leave() + if t.orelse: + raise SyntaxError('Invalid C++') + + def _generic_With(self, t, is_async=False): + raise SyntaxError('Invalid C++') + + def _With(self, t): + self._generic_With(t) + + def _AsyncWith(self, t): + self._generic_With(t, is_async=True) + + # expr + def _Bytes(self, t): + self.write(repr(t.s)) + + def _Str(self, tree): + result = '' + if six.PY3: + result = repr(tree.s) + else: + # if from __future__ import unicode_literals is in effect, + # then we want to output string literals using a 'b' prefix + # and unicode literals with no prefix. + if "unicode_literals" not in self.future_imports: + result = repr(tree.s) + elif isinstance(tree.s, str): + result = "b" + repr(tree.s) + elif isinstance(tree.s, unicode): + result = repr(tree.s).lstrip("u") + else: + assert False, "shouldn't get here" + + self.write(result.replace('\'', '\"')) + + format_conversions = {97: 'a', 114: 'r', 115: 's'} + + def _FormattedValue(self, t): + # FormattedValue(expr value, int? conversion, expr? format_spec) + self.write("{") + self.dispatch(t.value) + if t.conversion is not None and t.conversion != -1: + self.write("!") + self.write(self.format_conversions[t.conversion]) + #raise NotImplementedError(ast.dump(t, True, True)) + if t.format_spec is not None: + self.write(":") + if isinstance(t.format_spec, ast.Str): + self.write(t.format_spec.s) + else: + self.dispatch(t.format_spec) + self.write("}") + + def _JoinedStr(self, t): + # JoinedStr(expr* values) + self.write("f'''") + for value in t.values: + if isinstance(value, ast.Str): + self.write(value.s) + else: + self.dispatch(value) + self.write("'''") + + def _Name(self, t): + if t.id in _py2c_reserved: + self.write(_py2c_reserved[t.id]) + else: + self.write(t.id) + + def _NameConstant(self, t): + self.write(_py2c_nameconst[t.value]) + + def _Repr(self, t): + raise SyntaxError('Invalid C++') + + def _Num(self, t): + repr_n = repr(t.n) + if six.PY3: + if repr_n.endswith("j"): + # FIXME: Complex is not a native type in C++, this type-hack should deduce the target type + self.write( + "dace::complexJ()*%s" % repr_n.replace("inf", INFSTR)[:-1]) + else: + self.write(repr_n.replace("inf", INFSTR)) + else: + # Parenthesize negative numbers, to avoid turning (-1)**2 into -1**2. + if repr_n.startswith("-"): + self.write("(") + if "inf" in repr_n and repr_n.endswith("*j"): + repr_n = repr_n.replace("*j", "j") + + if repr_n.endswith("j"): + # FIXME: Complex is not a native type in C++, this type-hack should deduce the target type + self.write( + "dace::complexJ()*%s" % repr_n.replace("inf", INFSTR)[:-1]) + else: + # Substitute overflowing decimal literal for AST infinities. 
+ self.write(repr_n.replace("inf", INFSTR)) + + if repr_n.startswith("-"): + self.write(")") + + def _List(self, t): + raise SyntaxError('Invalid C++') + #self.write("[") + #interleave(lambda: self.write(", "), self.dispatch, t.elts) + #self.write("]") + + def _ListComp(self, t): + raise SyntaxError('Invalid C++') + #self.write("[") + #self.dispatch(t.elt) + #for gen in t.generators: + # self.dispatch(gen) + #self.write("]") + + def _GeneratorExp(self, t): + raise SyntaxError('Invalid C++') + #self.write("(") + #self.dispatch(t.elt) + #for gen in t.generators: + # self.dispatch(gen) + #self.write(")") + + def _SetComp(self, t): + raise SyntaxError('Invalid C++') + #self.write("{") + #self.dispatch(t.elt) + #for gen in t.generators: + # self.dispatch(gen) + #self.write("}") + + def _DictComp(self, t): + raise SyntaxError('Invalid C++') + #self.write("{") + #self.dispatch(t.key) + #self.write(": ") + #self.dispatch(t.value) + #for gen in t.generators: + # self.dispatch(gen) + #self.write("}") + + def _comprehension(self, t): + raise SyntaxError('Invalid C++') + #if getattr(t, 'is_async', False): + # self.write(" async") + #self.write(" for ") + #self.dispatch(t.target) + #self.write(" in ") + #self.dispatch(t.iter) + #for if_clause in t.ifs: + # self.write(" if ") + # self.dispatch(if_clause) + + def _IfExp(self, t): + self.write("(") + self.dispatch(t.test) + self.write(" ? ") + self.dispatch(t.body) + self.write(" : ") + self.dispatch(t.orelse) + self.write(")") + + def _Set(self, t): + raise SyntaxError('Invalid C++') + #assert(t.elts) # should be at least one element + #self.write("{") + #interleave(lambda: self.write(", "), self.dispatch, t.elts) + #self.write("}") + + def _Dict(self, t): + raise SyntaxError('Invalid C++') + #self.write("{") + #def write_pair(pair): + # (k, v) = pair + # self.dispatch(k) + # self.write(": ") + # self.dispatch(v) + #interleave(lambda: self.write(", "), write_pair, zip(t.keys, t.values)) + #self.write("}") + + def _Tuple(self, t): + self.write("std::make_tuple(") + if len(t.elts) == 1: + (elt, ) = t.elts + self.dispatch(elt) + self.write(",") + else: + interleave(lambda: self.write(", "), self.dispatch, t.elts) + self.write(")") + + unop = {"Invert": "~", "Not": "!", "UAdd": "+", "USub": "-"} + + def _UnaryOp(self, t): + self.write("(") + self.write(self.unop[t.op.__class__.__name__]) + self.write(" ") + if six.PY2 and isinstance(t.op, ast.USub) and isinstance( + t.operand, ast.Num): + # If we're applying unary minus to a number, parenthesize the number. + # This is necessary: -2147483648 is different from -(2147483648) on + # a 32-bit machine (the first is an int, the second a long), and + # -7j is different from -(7j). (The first has real part 0.0, the second + # has real part -0.0.) 
+ self.write("(") + self.dispatch(t.operand) + self.write(")") + else: + self.dispatch(t.operand) + self.write(")") + + binop = { + "Add": "+", + "Sub": "-", + "Mult": "*", + "Div": "/", + "Mod": "%", + "LShift": "<<", + "RShift": ">>", + "BitOr": "|", + "BitXor": "^", + "BitAnd": "&" + } + funcops = { + "FloorDiv": (" /", "dace::math::ifloor"), + "MatMult": (",", "dace::gemm") + } + + def _BinOp(self, t): + # Operations that require a function call + if t.op.__class__.__name__ in self.funcops: + separator, func = self.funcops[t.op.__class__.__name__] + self.write(func + "(") + self.dispatch(t.left) + self.write(separator + " ") + self.dispatch(t.right) + self.write(")") + # Special case for integer power + elif t.op.__class__.__name__ == 'Pow': + if (isinstance(t.right, ast.Num) and int(t.right.n) == t.right.n + and t.right.n >= 0): + self.write("(") + if t.right.n == 0: + self.write("1") + else: + self.dispatch(t.left) + for i in range(int(t.right.n) - 1): + self.write(" * ") + self.dispatch(t.left) + self.write(")") + else: + self.write("dace::math::pow(") + self.dispatch(t.left) + self.write(", ") + self.dispatch(t.right) + self.write(")") + else: + self.write("(") + self.dispatch(t.left) + self.write(" " + self.binop[t.op.__class__.__name__] + " ") + self.dispatch(t.right) + self.write(")") + + cmpops = { + "Eq": "==", + "NotEq": "!=", + "Lt": "<", + "LtE": "<=", + "Gt": ">", + "GtE": ">=", + "Is": "==", + "IsNot": "!=", + #"In":"in", "NotIn":"not in" + } + + def _Compare(self, t): + self.write("(") + self.dispatch(t.left) + for o, e in zip(t.ops, t.comparators): + if o.__class__.__name__ not in self.cmpops: + raise SyntaxError('Invalid C++') + + self.write(" " + self.cmpops[o.__class__.__name__] + " ") + self.dispatch(e) + self.write(")") + + boolops = {ast.And: '&&', ast.Or: '||'} + + def _BoolOp(self, t): + self.write("(") + s = " %s " % self.boolops[t.op.__class__] + interleave(lambda: self.write(s), self.dispatch, t.values) + self.write(")") + + def _Attribute(self, t): + self.dispatch(t.value) + # Special case: 3.__abs__() is a syntax error, so if t.value + # is an integer literal then we need to either parenthesize + # it or add an extra space to get 3 .__abs__(). 
+ if isinstance(t.value, ast.Num) and isinstance(t.value.n, int): + self.write(" ") + self.write(".") + self.write(t.attr) + + def _Call(self, t): + self.dispatch(t.func) + self.write("(") + comma = False + for e in t.args: + if comma: self.write(", ") + else: comma = True + self.dispatch(e) + for e in t.keywords: + if comma: self.write(", ") + else: comma = True + self.dispatch(e) + if sys.version_info[:2] < (3, 5): + if t.starargs: + raise SyntaxError('Invalid C++') + if t.kwargs: + raise SyntaxError('Invalid C++') + self.write(")") + + def _Subscript(self, t): + self.dispatch(t.value) + self.write("[") + self.dispatch(t.slice) + self.write("]") + + def _Starred(self, t): + raise SyntaxError('Invalid C++') + + # slice + def _Ellipsis(self, t): + self.write("...") + + def _Index(self, t): + self.dispatch(t.value) + + def _Slice(self, t): + if t.lower: + self.dispatch(t.lower) + self.write(":") + if t.upper: + self.dispatch(t.upper) + if t.step: + self.write(":") + self.dispatch(t.step) + + def _ExtSlice(self, t): + interleave(lambda: self.write(', '), self.dispatch, t.dims) + + # argument + def _arg(self, t): + if t.annotation: + self.dispatch(t.annotation) + self.write(' ') + else: + self.write("auto ") + self.write(t.arg) + self.locals.define(t.arg, t.lineno, self._indent) + + # others + def _arguments(self, t): + first = True + # normal arguments + defaults = [None] * (len(t.args) - len(t.defaults)) + t.defaults + for a, d in zip(t.args, defaults): + if first: first = False + else: self.write(", ") + + # ast.arg does not exist in python2 + if six.PY2: + self.write("auto ") + self.locals.define(a.id, a.lineno, self._indent) + + self.dispatch(a) + if d: + self.write("=") + self.dispatch(d) + + # varargs, or bare '*' if no varargs but keyword-only arguments present + if t.vararg or getattr(t, "kwonlyargs", False): + raise SyntaxError('Invalid C++') + + # keyword-only arguments + if getattr(t, "kwonlyargs", False): + raise SyntaxError('Invalid C++') + + # kwargs + if t.kwarg: + raise SyntaxError('Invalid C++') + + def _keyword(self, t): + raise SyntaxError('Invalid C++') + + def _Lambda(self, t): + self.write("(") + self.write("[] (") + self.dispatch(t.args) + self.write(") { return ") + self.dispatch(t.body) + self.write("; } )") + + def _alias(self, t): + self.write('using ') + self.write(t.name) + if t.asname: + self.write(" = " + t.asname) + self.write(';') + + def _withitem(self, t): + raise SyntaxError('Invalid C++') + + def _Await(self, t): + raise SyntaxError('Invalid C++') + + +def cppunparse(node, expr_semicolon=True): + strio = StringIO() + CPPUnparser(node, 0, CPPLocals(), strio, expr_semicolon=expr_semicolon) + return strio.getvalue().strip() + + +# Code can either be a string or a function +def py2cpp(code, expr_semicolon=True): + if isinstance(code, str): + return cppunparse(ast.parse(code), expr_semicolon) + elif code.__class__.__name__ == 'function': + try: + code_str = inspect.getsource(code) + + # Remove leading indentation + lines = code_str.splitlines() + leading_spaces = len(lines[0]) - len(lines[0].lstrip()) + code_str = '' + for line in lines: + code_str += line[leading_spaces:] + '\n' + + except: # Can be different exceptions coming from Python's AST module + raise TypeError('Invalid function given') + return cppunparse(ast.parse(code_str), expr_semicolon) + + else: + raise TypeError('Unsupported type for py2cpp') + + +def pyexpr2cpp(expr): + return py2cpp(expr, expr_semicolon=False) diff --git a/dace/codegen/instrumentation/__init__.py 
b/dace/codegen/instrumentation/__init__.py new file mode 100644 index 0000000000..e69de29bb2 diff --git a/dace/codegen/instrumentation/perfsettings.py b/dace/codegen/instrumentation/perfsettings.py new file mode 100644 index 0000000000..00982de3d3 --- /dev/null +++ b/dace/codegen/instrumentation/perfsettings.py @@ -0,0 +1,1587 @@ +from dace.graph.nodes import MapEntry, MapExit, Tasklet +from dace.graph.graph import SubgraphView +from dace.memlet import Memlet +from dace.data import Array + +from dace.config import Config + +from dace.types import ScheduleType + +import re + +import sympy as sp + +# Helper function to get the module path +if __name__ == "__main__": + import os + print("path: " + os.path.dirname(__file__)) + + +class PerfSettings(object): + + _unique_counter = 0 + + _perf_enable_instrumentation = True + perf_enable_override_config = True + + #default_papi_counters = ["PAPI_TOT_INS", "PAPI_TOT_CYC", "PAPI_L1_TCM", "PAPI_L2_TCM", "PAPI_L3_TCM"] + default_papi_counters = [ + "PAPI_TOT_INS", "PAPI_TOT_CYC", "PAPI_L2_TCM", "PAPI_L3_TCM" + ] + + @staticmethod + def get_unique_number(): + ret = PerfSettings._unique_counter + PerfSettings._unique_counter = PerfSettings._unique_counter + 1 + return ret + + @staticmethod + def perf_multirun_num(): + """ Amount of iterations with different PAPI configurations to run. (1 means no multirun) """ + if not PerfSettings.perf_enable_instrumentation(): + return 1 + return 4 + + @staticmethod + def perf_multirun_options(): + """ Specifies the options for "multirunning": running the same program + multiple times with different performance counters. """ + ret = [] + + if PerfSettings.perf_multirun_num() == 1: + return ret # Don't specify these options by default + + for i in range(0, 4): + ret.append(("omp_num_threads", i + 1)) + return ret + + @staticmethod + def perf_default_papi_counters(): + return eval(Config.get("instrumentation", "default_papi_counters")) + + @staticmethod + def perf_enable_instrumentation(): + return Config.get_bool("instrumentation", "enable_papi") + + @staticmethod + def perf_enable_instrumentation_for(sdfg, node=None): + return PerfSettings.perf_enable_instrumentation( + ) and not sdfg.has_instrumented_parent() + + @staticmethod + def perf_supersection_emission_debug(): + return True + + @staticmethod + def perf_enable_counter_sanity_check(): + return Config.get_bool("instrumentation", + "enable_papi_counter_sanity_check") + + @staticmethod + def perf_print_instrumentation_output(): + return False + + @staticmethod + def perf_enable_vectorization_analysis(): + return Config.get_bool("instrumentation", + "enable_vectorization_analysis") + + @staticmethod + def perf_max_scope_depth(): + # This variable selects the maximum depth inside a scope. For example, + # "map { map {}}" with max_scope_depth 0 will result in + # "map { profile(map{}) }", while max_scope_depth >= 1 result in + # "map { map { profile() }}" + return Config.get("instrumentation", "max_scope_depth") + + perf_debug_profile_innermost = False # innermost = False implies outermost + perf_debug_annotate_scopes = True + perf_debug_annotate_memlets = False + perf_debug_hard_error = False # If set to true, untreated cases cause program abort. 
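+    # Editorial note (not part of the original patch): the perf_whitelist_schedules
+    # list defined a few lines below restricts instrumentation to maps scheduled as
+    # Default, CPU_Multicore, or Sequential. Maps with any other schedule are
+    # reported as non-instrumentable by PerfUtils.instrument_entry and receive an
+    # invalid depth in PerfUtils.set_map_depth, so no PAPI sections are emitted
+    # for them.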
+ + #TODO: There should be a variable per MAP-Element that overrides the scope depth + perf_tasklets = False + + perf_whitelist_schedules = [ + ScheduleType.Default, ScheduleType.CPU_Multicore, + ScheduleType.Sequential + ] + + +class PerfUtils(object): + @staticmethod + def unified_id(node_id, state_id): + if node_id > 0x0FFFF: + raise ValueError("Nodeid is too larget to fit in 16 bits!") + if state_id > 0x0FFFF: + raise ValueError("Stateid is too large to fit in 16 bits!") + return (int(state_id) << 16) | int(node_id) + + @staticmethod + def gather_remote_metrics(): + """ Returns a dictionary of metrics collected by instrumentation. """ + + # Run the tools/membench file on remote. + remote_workdir = Config.get("execution", "general", "workdir") + from diode.remote_execution import Executor + from string import Template + import subprocess + executor = Executor(None, True, None) + + remote_filepath = remote_workdir + "/" + "membench.cpp" + + executor.copy_file_to_remote("tools/membench.cpp", remote_filepath) + + libs = Config.get("compiler", "cpu", "libs").split(" ") + + libflags = map(lambda x: "-l" + x, libs) + + libflagstring = "".join(libflags) + + path_resolve_command = "python3 -m dace.codegen.instrumentation.perfsettings" + # Get the library path + s = Template(Config.get("execution", "general", "execcmd")) + cmd = s.substitute( + host=Config.get("execution", "general", "host"), + command=path_resolve_command) + + p = subprocess.Popen( + cmd, + shell=True, + stdout=subprocess.PIPE, + stderr=subprocess.STDOUT, + universal_newlines=True) + + stdout, _ = p.communicate(timeout=60) + + remote_dace_path = re.search(r"path: (?P.*)", str(stdout)) + if remote_dace_path: + remote_dace_path = remote_dace_path['dace_path'] + print("Remote dace path: %s" % remote_dace_path) + + # Now create the include path from that + include_path = "\"" + remote_dace_path + "/" + "runtime/include" + "\"" + + print("remote_workdir: " + remote_workdir) + compile_and_run_command = "cd " + remote_workdir + " && " + " pwd && " + Config.get( + "compiler", "cpu", "executable" + ) + " " + Config.get( + "compiler", "cpu", "args" + ) + " " + "-fopenmp" + " " + Config.get( + "compiler", "cpu", "additional_args" + ) + " -I" + include_path + " " + "membench.cpp -o membench" + " " + libflagstring + " && " + "./membench" + + # Wrap that into a custom shell because ssh will not keep context. + # The HEREDOC is needed because we already use " and ' inside the command. 
+ compile_and_run_command = "<< EOF\nsh -c '" + compile_and_run_command + "'" + "\nEOF" + + print("Compile command is " + compile_and_run_command) + + # run this command + s = Template(Config.get("execution", "general", "execcmd")) + cmd = s.substitute( + host=Config.get("execution", "general", "host"), + command=compile_and_run_command) + + p2 = subprocess.Popen( + cmd, + shell=True, + stdout=subprocess.PIPE, + stderr=subprocess.STDOUT, + universal_newlines=True) + + stdout2, _ = p2.communicate(timeout=60) + + #print("stdout2: " + str(stdout2)) + + bytes_per_cycle = re.search(r"result: (?P.*?$)", + str(stdout2)) + if bytes_per_cycle: + bytes_per_cycle = bytes_per_cycle['bytes_per_cycle'] + print("Bytes per cycle: %s" % bytes_per_cycle) + + executor.remote_delete_file(remote_workdir + "/membench.cpp") + executor.remote_delete_file(remote_workdir + "/membench") + + return bytes_per_cycle + + @staticmethod + def reduce_iteration_count(begin, end, step, retparams: dict): + + from dace.symbolic import symbols_in_sympy_expr, SymExpr + + # There are different rules when expanding depending on where the expand should happen + start_syms = symbols_in_sympy_expr(begin) + end_syms = symbols_in_sympy_expr(end) + step_syms = symbols_in_sympy_expr(step) + + def intersection(lista, listb): + return [x for x in lista if x in listb] + + start_dyn_syms = intersection(start_syms, retparams.keys()) + end_dyn_syms = intersection(end_syms, retparams.keys()) + step_dyn_syms = intersection(step_syms, retparams.keys()) + + def replace_func(element, dyn_syms, retparams): + print("Dynamic element symbols symbols: %s (out of %s)!" % + (str(element), str(dyn_syms))) + print("(srepr): " + sp.srepr(element)) + # Resolve all symbols using the retparams-dict + + for x in dyn_syms: + print("Replacing " + str(x)) + target = sp.functions.Min( + retparams[x] * (retparams[x] - 1) / 2, 0) + print("\twith target " + str(target)) + bstr = str(element) + #print(bstr) + element = sp.sympify(bstr, sp.abc._clash) + #print("\t(new srepr): " + sp.srepr(element)) + element = element.subs( + x, target) # Add the classic sum formula; going upwards + + # To not have hidden elements that get added again later, we also replace the values in the other itvars... + for k, v in retparams.items(): + newv = sp.sympify(str(v), sp.abc._clash) + + itsyms = symbols_in_sympy_expr(newv) + tarsyms = symbols_in_sympy_expr(target) + if x in map(str, tarsyms): + continue + # assert not x in itsyms # We never want to have the replaced symbol in its own expression. This can happen when applying 2 SMs + + tmp = newv.subs(x, target) + if tmp != v: + print("Replacing %s with %s" % (str(newv), str(tmp))) + retparams[k] = tmp + + print("\t New element: " + str(element)) + return element + + if len(start_dyn_syms) > 0: + pass + begin = replace_func(begin, start_dyn_syms, retparams) + + if len(end_dyn_syms) > 0: + pass + end = replace_func(end, end_dyn_syms, retparams) + + if len(step_dyn_syms) > 0: + pass + print("Dynamic step symbols %s!" % str(step)) + raise NotImplementedError + + return (begin, end, step) + + @staticmethod + def get_iteration_count(mapEntry: MapEntry, vars: dict): + """ Get the number of iterations for this map, allowing other variables as bounds. 
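+            For example, a plain range i=0:N:1 (with no dependence on other
+            map variables) produces the entry {'i': N}: the inclusive end is
+            incremented by one and the resulting extent is divided by the
+            step. Entries already present in `vars` are copied into the
+            result as well.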
""" + from dace.symbolic import symbols_in_sympy_expr, SymExpr + + _map = mapEntry.map + _it = _map.params + + retparams = dict() + for k, v in vars.items(): + retparams[k] = v + + #print("Params: " + str(_it)) + for i, r in enumerate(_map.range): + begin, end, step = r + + end = end + 1 # end is inclusive, but we want it exclusive + + if isinstance(begin, SymExpr): + begin = begin.expr + if isinstance(end, SymExpr): + end = end.expr + if isinstance(step, SymExpr): + step = step.expr + + begin, end, step = PerfUtils.reduce_iteration_count( + begin, end, step, retparams) + num = (end - begin) / step # The count of iterations + retparams[_it[i]] = num + + return retparams + + @staticmethod + def all_maps(mapEntry: MapEntry, dfg: SubgraphView): + children = [ + x for x in dfg.scope_dict(True)[mapEntry] + if isinstance(x, MapEntry) + ] + + sub = [] + for x in children: + sub.extend(PerfUtils.all_maps(x, dfg)) + + children.extend(sub) + #children.extend([PerfUtils.all_maps(x, dfg) for x in children]) + return children + + @staticmethod + def map_depth(mapEntry: MapEntry): + # Returns the depth of this entry node. + # For now, the depth is stored inside the MapEntry node. + return mapEntry._map_depth + + @staticmethod + def set_map_depth(mapEntry: MapEntry, DFG: SubgraphView): + from dace.graph.nodes import Reduce, AccessNode, NestedSDFG + + # Set the depth for the mapEntry + + # We do not use mapEntry for now, but it might be required for different implementations + + # Get the sorted graph + dfg_sorted = DFG.topological_sort() + depth = 0 + following_nodes_invalid = False # Set to True when a fencing map is encountered + invalid_scope = -1 + invalid_index = PerfSettings.perf_max_scope_depth() + 1 + # Iterate and get the depth for every node, breaking when the specified node has been found + for e in dfg_sorted: + # Set the depth for every node on the way + if isinstance(e, MapEntry): + if not following_nodes_invalid and not e.map.schedule in PerfSettings.perf_whitelist_schedules: + print( + "Cannot instrument node %s, as it is running on a GPU (schedule %s)" + % (str(mapEntry), e.map.schedule)) + following_nodes_invalid = True # Invalidate all following maps + invalid_scope = depth + 1 # Mark this depth as invalid. 
Once the depth drops below this threshold, the invalid-mark will be removed + if following_nodes_invalid and depth: + e._map_depth = invalid_index # Set an invalid index (this will never be instrumented) + else: + e._map_depth = max(e._map_depth, depth) + if e.fence_instrumentation: + following_nodes_invalid = True # After a fence there must not be any instrumentation happening + + depth += 1 + elif isinstance(e, MapExit): + depth -= 1 + if depth < invalid_scope: + invalid_scope = -1 + following_nodes_invalid = False + elif isinstance(e, NestedSDFG): + e.sdfg.set_instrumented_parent() + #depth += 1 # Not sure if we should add a depth here + + pass + else: + if isinstance(e, Reduce): + pass + elif isinstance(e, AccessNode): + pass + elif isinstance(e, Tasklet): + pass + else: + print("Error-Type: " + type(e).__name__) + assert False + + @staticmethod + def is_deepest_node(check: MapEntry, DFG: SubgraphView): + nodes = DFG.nodes() + checkdepth = PerfUtils.map_depth(check) + return all( + not isinstance(x, MapEntry) or PerfUtils.map_depth(x) <= checkdepth + for x in nodes) + + @staticmethod + def instrument_entry(mapEntry: MapEntry, DFG: SubgraphView): + depth = PerfUtils.map_depth(mapEntry) + cond1 = PerfSettings.perf_enable_instrumentation( + ) and depth <= PerfSettings.perf_max_scope_depth() and ( + PerfUtils.is_deepest_node(mapEntry, DFG) + or depth == PerfSettings.perf_max_scope_depth()) + cond2 = mapEntry.map.schedule in PerfSettings.perf_whitelist_schedules + cond3 = not mapEntry.fence_instrumentation + if not cond2: + print("Cannot instrument node %s, as it is running on a GPU" % + str(mapEntry)) + return cond1 and cond2 and cond3 + + @staticmethod + def has_surrounding_perfcounters(node, DFG: SubgraphView): + """ Returns true if there is a possibility that this node is part of a + section that is profiled. """ + parent = DFG.scope_dict()[node] + + if isinstance(parent, MapEntry): + if parent.map._has_papi_counters or PerfUtils.map_depth( + parent) > PerfSettings.perf_max_scope_depth(): + return True + + return False + + @staticmethod + def get_memlet_byte_size(sdfg, memlet: Memlet): + pass + memdata = sdfg.arrays[memlet.data] + # For now, deal with arrays only + if isinstance(memdata, Array): + elems = [str(memdata.dtype.bytes)] + # The following for-loop is not relevant here, it just describes the shape of the source... + #for x in memdata.shape: + # elems.append(str(x)) + try: + if (memlet.num_accesses >= 0): + elems.append( + str(memlet.num_accesses) + ) # num_accesses seems to be the amount of accesses per tasklet execution + else: + print( + "Refusing to add negative accesses (%d) in get_memlet_byte_size!" 
+ % memlet.num_accesses) + except: + print("Unsupported memlet.num_accesses type, %s (%s)" % (str( + type(memlet.num_accesses)), str(memlet.num_accesses))) + + return "(" + "*".join(elems) + ")" + + else: + print("Untreated data type: ", type(memdata).__name__) + if PerfSettings.perf_debug_hard_error: + assert False + else: + return "0" + + @staticmethod + def get_out_memlet_costs(sdfg, state_id, node, dfg): + from dace.graph import nodes + from dace.sdfg import ScopeSubgraphView, SDFG, scope_contains_scope + scope_dict = sdfg.nodes()[state_id].scope_dict() + + out_costs = 0 + for edge in dfg.out_edges(node): + _, uconn, v, _, memlet = edge + dst_node = dfg.memlet_path(edge)[-1].dst + + # Target is neither a data nor a tasklet node + if (isinstance(node, nodes.AccessNode) + and (not isinstance(dst_node, nodes.AccessNode) + and not isinstance(dst_node, nodes.CodeNode))): + continue + + # Skip array->code (will be handled as a tasklet input) + if isinstance(node, nodes.AccessNode) and isinstance( + v, nodes.CodeNode): + continue + + # code->code (e.g., tasklet to tasklet) + if isinstance(v, nodes.CodeNode): + shared_data_name = 's%d_n%d%s_n%d%s' % ( + state_id, dfg.node_id(edge.src), edge.src_conn, + dfg.node_id(edge.dst), edge.dst_conn) + #result.write('__%s = %s;' % (shared_data_name, edge.src_conn), + # sdfg, state_id, [edge.src, edge.dst]) + # TODO: Check how to deal with this... + #raise NotImplementedError + continue + + # If the memlet is not pointing to a data node (e.g. tasklet), then + # the tasklet will take care of the copy + if not isinstance(dst_node, nodes.AccessNode): + continue + # If the memlet is pointing into an array in an inner scope, then the + # inner scope (i.e., the output array) must handle it + if (scope_dict[node] != scope_dict[dst_node] + and scope_contains_scope(scope_dict, node, dst_node)): + continue + + # Array to tasklet (path longer than 1, handled at tasklet entry) + if node == dst_node: + continue + + # Tasklet -> array + if isinstance(node, nodes.CodeNode): + if not uconn: + print("This would normally raise a syntax error!") + return 0 # We don't error-out because the error will be raised later + + try: + positive_accesses = bool(memlet.num_accesses >= 0) + except TypeError: + positive_accesses = False + + if memlet.subset.data_dims() == 0 and positive_accesses: + + if memlet.wcr is not None: + # write_and_resolve + # We have to assume that every reduction costs 3 accesses of the same size + out_costs += 3 * sp.sympify( + PerfUtils.get_memlet_byte_size(sdfg, memlet), + sp.abc._clash) + else: + #'%s.write(%s);\n' + # This standard operation is already counted + out_costs += sp.sympify( + PerfUtils.get_memlet_byte_size(sdfg, memlet), + sp.abc._clash) + # Dispatch array-to-array outgoing copies here + elif isinstance(node, nodes.AccessNode): + pass + return out_costs + + @staticmethod + def get_tasklet_byte_accesses(tasklet: Tasklet, dfg: SubgraphView, sdfg, + state_id): + """ Get the amount of bytes processed by `tasklet`. 
The formula is + sum(inedges * size) + sum(outedges * size) """ + in_accum = [] + out_accum = [] + in_edges = dfg.in_edges(tasklet) + out_edges = dfg.out_edges(tasklet) + + for ie in in_edges: + # type ie.data == Memlet + # type ie.data.data == Data + in_accum.append(PerfUtils.get_memlet_byte_size(sdfg, ie.data)) + + out_accum.append( + str(PerfUtils.get_out_memlet_costs(sdfg, state_id, tasklet, dfg))) + + # Merge (kept split to be able to change the behavior easily) + full = in_accum + full.extend(out_accum) + + return "(" + "+".join(full) + ")" + + @staticmethod + def get_map_exit_byte_accesses(mapexit: MapExit, dfg: SubgraphView, sdfg, + state_id): + """ Get the amount of bytes processed by mapexit. The formula is + sum(inedges * size) + sum(outedges * size) """ + in_accum = [] + out_accum = [] + in_edges = dfg.in_edges(mapexit) + out_edges = dfg.out_edges(mapexit) + + out_connectors = mapexit.out_connectors + + for ie in in_edges: + # type ie.data == Memlet + # type ie.data.data == Data + in_accum.append(PerfUtils.get_memlet_byte_size(sdfg, ie.data)) + + for oe in out_edges: + out_accum.append(PerfUtils.get_memlet_byte_size(sdfg, oe.data)) + + # Merge (kept split to be able to change the behavior easily) + full = in_accum + full.extend(out_accum) + + return "(" + "+".join(full) + ")" + + @staticmethod + def get_parents(outermost_node, node, sdfg, state_id): + + parent = None + # Because dfg is only a subgraph view, it does not contain the entry + # node for a given entry. This O(n) solution is suboptimal + for state in sdfg.nodes(): + s_d = state.scope_dict(node_to_children=False) + try: + scope = s_d[node] + except KeyError as e: + continue + + if (scope != None): + parent = scope + break + if (parent == None): + return [] + if (parent == outermost_node): + return [parent] + + return PerfUtils.get_parents(outermost_node, parent, sdfg, + state_id) + [parent] + + @staticmethod + def accumulate_byte_movements_v2(outermost_node, node, dfg: SubgraphView, + sdfg, state_id): + + itvars = dict() # initialize an empty dict + + # First, get a list of children + if isinstance(node, MapEntry): + children = dfg.scope_dict(node_to_children=True)[node] + else: + children = [] + assert not (node in children) + + # If there still are children, descend recursively (dfs is fine here) + if len(children) > 0: + size = 0 + for x in children: + size = size + PerfUtils.accumulate_byte_movements_v2( + outermost_node, x, dfg, sdfg, state_id) + + return size + else: + if isinstance(node, MapExit): + return 0 # We can ignore this. + + # If we reached the deepest node, get all parents + parent_list = PerfUtils.get_parents(outermost_node, node, sdfg, + state_id) + #print("Parents are " + str(parent_list)) + if isinstance(node, MapEntry): + map_list = parent_list + [node] + else: + #print("node is of type " + type(node).__name__) + map_list = parent_list + + # From all iterations, get the iteration count, replacing inner + # iteration variables with the next outer variables. + for x in map_list: + itvars = PerfUtils.get_iteration_count(x, itvars) + + #print("itvars: " + str(itvars)) + + itcount = 1 + for x in itvars.values(): + itcount = itcount * x + #print("Probable itcount: " + str(itcount)) + + #print("constants: " + str(sdfg.constants)) + + if isinstance(node, MapEntry): + raise ValueError( + "Unexpected node" + ) # A map entry should never be the innermost node + elif isinstance(node, MapExit): + return 0 # We can ignore this. 
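+            # Editorial note (illustrative, not part of the original patch): for the
+            # innermost tasklet, the branch below returns
+            #     itcount * get_tasklet_byte_accesses(node, ...)
+            # i.e. the product of the iteration counts of all enclosing maps times
+            # the bytes a single tasklet execution reads and writes. A tasklet that
+            # moves, say, 8 bytes per execution inside maps over i=0:N and j=0:M
+            # would contribute N*M*8 logically moved bytes.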
+ elif isinstance(node, Tasklet): + return itcount * sp.sympify( + PerfUtils.get_tasklet_byte_accesses( + node, dfg, sdfg, state_id)) + else: + if PerfSettings.perf_debug_hard_error: + raise NotImplementedError + else: + return 0 + + @staticmethod + def accumulate_byte_movements(node, dfg: SubgraphView, sym2cpp, sdfg, + state_id): + """ Loops over all sub-iterations and calculates the number of bytes + moved (logically). """ + + # The coefficient consists of multipliers (i.e. maps) and bytes (i.e. + # memlet/tasklet movements) + coeff_this_node = "" + + if isinstance(node, MapEntry): + # get the iteration count for this entry + coeff_this_node = '*'.join([ + '((%s - %s) / %s)' % (sym2cpp(re + 1), sym2cpp(rb), + sym2cpp(rs)) + for rb, re, rs in node.map.range + ]) + + # Create a list to contain all suboperations (for this scope) + subops = [coeff_this_node] + + for edge in dfg.edges(): + source = dfg.scope_dict()[edge.src] + destination = dfg.scope_dict()[edge.dst] + if source == node and edge.dst != node: + subops.append( + PerfUtils.accumulate_byte_movements( + edge.dst, dfg, sym2cpp, sdfg, state_id)) + if destination == node and edge.src != node: + subops.append( + PerfUtils.accumulate_byte_movements( + edge.src, dfg, sym2cpp, sdfg, state_id)) + + # We can just simplify that directly + if any(x == "0" for x in subops): + return "0" + coeff_this_node = ' * '.join([x for x in subops if x != ""]) + return coeff_this_node + elif isinstance(node, MapExit): + # Ignore this type, we already dealt with it when we processed + # MapEntry + return "" + elif isinstance(node, Tasklet): + # Exact data movement costs depend on the tasklet code + return PerfUtils.get_tasklet_byte_accesses(node, dfg, sdfg, + state_id) + + else: + if PerfSettings.perf_debug_hard_error: + raise NotImplementedError + else: + return "0" + + class ParseStates: + CONTROL = 0 + VALUES = 1 + SECTION_SIZE = 2 + + class Entry: + def __init__(self): + pass + self.values = {} + self.nodeid = 0 + self.coreid = 0 + self.iteration = 0 + self.flags = 0 + + def is_valid(self): + return len(self.values) != 0 + + def add(self, counter, value): + self.values[counter] = value + + def get(self, name: str): + try: + return self.values[name] + except: + return None + + def toJSON(self): + return '{{ "node": "{node}",\n"thread": "{thread}",\n"iteration": "{iteration}",\n"flags": {flags},\n"values": [{values}]\n}}\n'.format( + node=str(self.nodeid), + thread=str(self.coreid), + iteration=str(self.iteration), + flags=str(self.flags), + values=", ".join([ + '{{ "{code}": {value} }}'.format( + code=str(code), value=str(value)) + for code, value in self.values.items() + ])) + + def toCSVsubstring(self, delim=','): + return delim.join([ + self.nodeid, self.coreid, self.iteration, + *self.values.values() + ]) # * == ... in other languages + + class Section: + def __init__(self, nodeid=0, threadid=0): + pass + self.entries = [] + self.nodeid = nodeid + self.datasize = 0 + self.bytes_moved = 0 + self.was_collapsed = False + self.threadid = threadid + + def is_complete(self): + """ Checks if all iterations are in this section. This might not + always be the case, e.g. in filtered sections. 
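+                The sorted iteration numbers must be exactly 0, 1, ..., n-1;
+                e.g. iterations [0, 1, 3] make the section incomplete because
+                index 2 holds the value 3.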
""" + itlist = [int(x.iteration) for x in self.entries] + sortitlist = sorted(itlist) + for i, e in enumerate(sortitlist): + if (i != int(e)): + print("list: %s\n" % sortitlist) + return False + return True + + def is_valid(self): + return len(self.entries) != 0 + + def add(self, e): + self.entries.append(e) + + def addSection(self, sec): + """ Merges another section into this section. """ + assert self.nodeid == sec.nodeid + + # We allow collapsing at most once. + if self.was_collapsed: + return + if sec.was_collapsed: + return + # Add all entries + for x in sec.entries: + self.add(x) + + # merge meta + #self.datasize += sec.datasize + self.bytes_moved += sec.bytes_moved + self.was_collapsed = True + sec.was_collapsed = True + + def select_event(self, event: str): + """ Selects all values of 'event' in correct order from all + entries. """ + return [ + int(x.get(event)) for x in self.entries if x.get(event) != None + ] + + def select_thread(self, thread: int): + """ Returns a section that only contains entries of `self` that + were obtained in the given thread. """ + ret = PerfUtils.Section(self.nodeid) + + for x in self.entries: + if int(x.coreid) == int(thread): + ret.entries.append(x) + + return ret + + def select_node(self, node: int): + """ Returns a section that only contains entries of `self` that + were obtained for the given node """ + ret = PerfUtils.Section(self.nodeid) + + for x in self.entries: + if int(x.nodeid) == int(node): + ret.entries.append(x) + + return ret + + def filter(self, predicate): + """ Returns a section that only contains entries `e` for which + `predicate(e)` returns true""" + ret = PerfUtils.Section(self.nodeid) + + for x in self.entries: + if predicate(x): + ret.entries.append(x) + + return ret + + def get_max_thread_num(self): + """ Returns the maximal thread number in at most O(n) + complexity. """ + max = 0 + for x in self.entries: + if int(x.coreid) > max: + max = int(x.coreid) + return max + + def toCSVsubstring(self, prepend="", delim=',', linedelim='\n'): + ret = "" + for x in self.entries: + ret += delim.join([ + prepend, "node" + self.nodeid, self.threadid, + x.toCSVsubstring(delim) + ]) + linedelim + return ret + + def toJSON(self): + return '{{ "entry_node": {entry_node}, "static_movement": {datasize}, "entry_core": {core}, "entries": ['.format( + entry_node=self.nodeid, + datasize=self.datasize, + core=self.threadid) + ", ".join( + [x.toJSON() for x in self.entries]) + "]}" + + class SuperSection: + """ Contains multiple Sections. + @see Section + """ + + def __init__(self, supernode=0): + self.sections = {} + self.supernode = supernode + + def is_valid(self): + return len(self.sections.values()) > 0 + + def addSection(self, section): + if int(section.threadid) in self.sections: + self.sections[int(section.threadid)].append(section) + else: + self.sections[int(section.threadid)] = [section] + + def addEntry(self, entry): + + if not entry.is_valid(): + # ignore invalid entries + return + + # We have 2 cases - either: + # (a) the section starts outside of a parallel block: + # Every entry needs to be assigned to this block. There will only + # be one block with threadid == 0 in this case. + # or (b) the section starts in a parallel block: + # Entries can be assigned by thread_id. 
+ if int(entry.coreid) in self.sections: + # Assign by thread id + try: + self.sections[int(entry.coreid)][-1].add(entry) + except: + print("Sections has keys " + str(self.sections.keys())) + raise + else: + # Ideally, we can only add nodes to a section if they have the + # same core id. However, in nested omp constructs, the + # lower-level sections are usually just run on core 0. + # So if a section starts on core 1, its entries might still + # report core 0. + try: + self.sections[0][-1].add(entry) + except Exception as e: + print("error, contained sections:") + print(str(self.sections)) + print(str(self.sections.values())) + + mitigated = False + # Find the section that matches by nodeid... + for x in self.sections.values(): + # Find the correct section and append to that + # (start with oldest entry) + for y in reversed(x): + if y.nodeid == entry.nodeid: + y.add(entry) + print( + "Warning: Mitigation successful, but you should probably enable OMP_NESTED" + ) + mitigated = True + break + + if not mitigated: # Only complain if we could not mitigate + raise e + + def getSections(self): + l = [] + for x in self.sections.values(): + l.extend(x) + return [x for x in l] + + def toCSVstring(self, delim=',', linedelim='\n'): + """ Create a CSV string from the data. """ + + # Squashes everything into a row, duplicating data. + ret = "" + for x in self.sections.values(): + for y in x: + ret += y.toCSVsubstring("supernode" + str(self.supernode), + delim, linedelim) + ret += "ENDSUPERSECTION" + linedelim + return ret + + def toJSON(self): + return '{{ "hint": "supersection", "supernode": {supernode},\n "sections": [{sections}] }}'.format( + supernode=self.supernode, + sections=",\n".join([x.toJSON() for x in self.getSections()])) + + @staticmethod + def perf_counter_store_string(counterlist: [str]): + """ Creates a performance counter typename string. """ + return "PAPIValueStore<" + ", ".join(counterlist) + ">" + + @staticmethod + def perf_counter_string_from_string_list(counterlist: [str]): + """ Creates a performance counter typename string. """ + if isinstance(counterlist, str): + print("Wrong format") + counterlist = eval(counterlist) + return "PAPIPerfLowLevel<" + ", ".join(counterlist) + ">" + + @staticmethod + def perf_counter_string(node): + """ Creates a performance counter typename string. 
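+            For a node whose papi_counters attribute is, e.g.,
+            ['PAPI_TOT_INS', 'PAPI_TOT_CYC'], the result is
+            "PAPIPerfLowLevel<PAPI_TOT_INS, PAPI_TOT_CYC>"; nodes without such
+            a list fall back to the configured default counters.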
""" + try: + assert isinstance(node.papi_counters, list) + return PerfUtils.perf_counter_string_from_string_list( + node.papi_counters) + except Exception as e: + return PerfUtils.perf_counter_string_from_string_list( + PerfSettings.perf_default_papi_counters()) + + @staticmethod + def read_available_perfcounters(): + from string import Template + import subprocess + + papi_avail_str = "papi_avail -a" + s = Template(Config.get("execution", "general", "execcmd")) + cmd = s.substitute( + host=Config.get("execution", "general", "host"), + command=papi_avail_str) + p = subprocess.Popen( + cmd, + shell=True, + stdout=subprocess.PIPE, + stderr=subprocess.STDOUT, + universal_newlines=True) + + stdout, _ = p.communicate(timeout=60) + + counter_num = re.search( + r"Number Hardware Counters[\s.]*:\s(?P[0-9]+)", + str(stdout)) + if counter_num: + counter_num = int(counter_num['num_cntr']) + print("Hardware counters: %s" % counter_num) + + print("PAPI preset events:") + # Find non-derived events first + non_derived = re.findall( + r"(?PPAPI_[0-9A-Z_]+)\s+0x[0-9a-zA-Z]+\s+No", + str(stdout)) + print("Non-Derived: ", non_derived) + + # Now all derived events + derived = re.findall( + r"(?PPAPI_[0-9A-Z_]+)\s+0x[0-9a-zA-Z]+\s+Yes", + str(stdout)) + print("Derived: ", derived) + + return (non_derived, derived, counter_num) + + @staticmethod + def collapse_sections(sections: list): + """ Combine sections with the same ID into one single section. """ + + seen = [] # Nodeids that were already collapsed + collapsed = [ + ] # The return value, consisting of all collapsed sections + + # Add all elements that were already collapsed + collapsed = [x for x in sections if x.was_collapsed] + + print("%d sections were already collapsed" % len(collapsed)) + + for _ in sections: + preselection = [ + x for x in sections + if not (x.nodeid, x.threadid) in seen and not x.was_collapsed + ] + if preselection == []: + break + target = preselection[0] + seen.append((target.nodeid, target.threadid)) + selection = [ + x for x in sections + if x.nodeid == target.nodeid and x.threadid == target.threadid + and x != target and not x.was_collapsed + ] + for y in selection: + target.addSection(y) + collapsed.append(target) + + target.was_collapsed = True # If selection is [] + + assert target.was_collapsed + + # Debug + removed_nodes = [x for x in sections if not (x in collapsed)] + print("Removed nodes: " + str([x.toJSON() for x in removed_nodes])) + print( + "Reduced from %d sections to %d" % (len(sections), len(collapsed))) + return collapsed + + @staticmethod + def print_instrumentation_output(data: str): + import json + print("print_instrumentation_output start") + # Regex for Section start + bytes: # Section start \(node (?P[0-9]+)\)\nbytes: (?P[0-9]+) + # Regex for general entries: # entry \((?P[0-9]+), (?P[0-9]+), (?P[0-9]+), (?P[0-9]+)\)\n((?P[0-9-]+): (?P[0-9-]+)\n)* + + print_values = False + + multirun_results = [] + multirun_supersections = [] + current_multirun_line = "" + sections = [] + supersection_node_id = None + supersections = [] + current_supersection = PerfUtils.SuperSection() + current_section = PerfUtils.Section() + current_entry = PerfUtils.Entry() + + state = PerfUtils.ParseStates.CONTROL + if isinstance(data, str): + lines = data.split('\n') + is_string_input = True + else: + lines = data + is_string_input = False + + line_num = 0 + for line in lines: + line_num = line_num + 1 + if not is_string_input: + line = line[:-1] # Chomp trailing newline + + if "multirun" in line: + # Multirun result + + try: + 
current_supersection.addEntry(current_entry) + except Exception as e: + print("Error occurred in line " + str(line_num) + "!") + raise e + + if current_section.is_valid(): + pass + + # Reset variables + current_section = PerfUtils.Section() + current_entry = PerfUtils.Entry() + + sections.extend(current_supersection.getSections()) + supersections.append(current_supersection) + + current_supersection = PerfUtils.SuperSection() + + if current_multirun_line != "" and sections != []: + multirun_results.append((current_multirun_line.replace( + "\n", ""), sections)) + if current_multirun_line != "" and supersections != []: + multirun_supersections.append( + (current_multirun_line.replace("\n", ""), + supersections)) + + current_multirun_line = line + sections = [] + supersections = [] + continue + if len(line) == 0: + continue + if line[0] == '#': + state = PerfUtils.ParseStates.CONTROL + if state == PerfUtils.ParseStates.CONTROL: + # First try: Entry + match = re.search( + r"# entry \((?P[0-9]+), (?P[0-9]+), (?P[0-9]+), (?P[0-9]+)\)", + line) + if match: + d = match.groupdict() + + try: + current_supersection.addEntry(current_entry) + except Exception as e: + print("Error occurred in line " + str(line_num) + "!") + raise e + + current_entry = PerfUtils.Entry() + + current_entry.nodeid = d['entry_node'] + current_entry.coreid = d['entry_thread'] + current_entry.iteration = d['entry_iteration'] + current_entry.flags = d['entry_flags'] + state = PerfUtils.ParseStates.VALUES + continue + + # Next try: Section header + match = re.search( + r"# Section start \(node (?P[0-9]+), core (?P[0-9]+)\)", + line) + if match: + #print("Matched Section Start") + d = match.groupdict() + + try: + current_supersection.addEntry(current_entry) + except Exception as e: + print("Error occurred in line " + str(line_num) + "!") + raise e + + current_entry = PerfUtils.Entry() + if (current_section.is_valid()): + #sections.append(current_section) + pass + current_section = PerfUtils.Section( + d['section_start_node'], d['section_start_core']) + current_supersection.addSection(current_section) + state = PerfUtils.ParseStates.SECTION_SIZE + continue + # Next try: Supersection header + match = re.search( + r"# Supersection start \(node (?P[0-9]+)\)", + line) + if match: + d = match.groupdict() + + supersection_node_id = d['section_start_node'] + + try: + current_supersection.addEntry(current_entry) + except Exception as e: + print("Error occurred in line " + str(line_num) + "!") + raise e + current_entry = PerfUtils.Entry() + + if (current_section.is_valid()): + #sections.append(current_section) + pass + + sections.extend(current_supersection.getSections()) + + supersections.append(current_supersection) + current_supersection = PerfUtils.SuperSection( + d['section_start_node']) + + current_section = PerfUtils.Section() # Clear the record + + state = PerfUtils.ParseStates.CONTROL + continue + # Next try: Section data moved + match = re.search(r"# moved_bytes: (?P[0-9]+)", + line) + if match: + d = match.groupdict() + current_section.bytes_moved = d['moved_bytes'] + continue + # Next try: Section data moved + match = re.search(r"# contention: (?P[0-9]+)", + line) + if match: + d = match.groupdict() + if int(d['contention']) != 0: + print( + "Contention: {cont}".format(cont=d['contention'])) + continue + # Next try: Entry (anonymous) + # (Should not happen) + print("Error, unexpected: anonymous entry %s" % line) + print(str(match)) + elif state == PerfUtils.ParseStates.VALUES: + match = re.search(r"(?P[0-9-]+): (?P[0-9-]+)", + 
line) + if match: + #print("Matched Value") + d = match.groupdict() + current_entry.add(d['counter'], d['value']) + else: + print("Failed to match expected values!") + continue + elif state == PerfUtils.ParseStates.SECTION_SIZE: + match = re.search(r"bytes: (?P[0-9-]+)", line) + if match: + #print("Matched Section Size") + d = match.groupdict() + current_section.datasize = d['bytes'] + else: + pass + continue + + try: + current_supersection.addEntry(current_entry) + except Exception as e: + print("Error occurred in line " + str(line_num) + "!") + raise e + + if current_section.is_valid(): + #sections.append(current_section) + pass + + #sections = PerfUtils.collapse_sections(sections) + #sections.extend(PerfUtils.collapse_sections(current_supersection.getSections())) + sections.extend(current_supersection.getSections()) + supersections.append(current_supersection) + multirun_results.append((current_multirun_line, sections)) + multirun_supersections.append((current_multirun_line, supersections)) + + # We'll filter invalid supersections later... + + print("Multirun length: " + str(len(multirun_results))) + + for o, s in multirun_results: + print("\tSection size: " + str(len(s))) + print("\t\tSection size: " + str(s[0].datasize)) + + try: + totstr = '{ "type": "PerfInfo", "payload": [' + ", ".join([ + '{"runopts": "%s", "data": [%s]}' % (o, ", ".join( + [x.toJSON() for x in r_supersections if x.is_valid()])) + for o, r_supersections in multirun_supersections + ]) + "]}" + + #totstr = '{ "type": "PerfInfo", "payload": [' + ", ".join([x.toJSON() for x in sections]) + "]}" + with open("perf.json", "w") as out: + out.write(totstr) + + # Debug CSV output + for idx, v in enumerate(multirun_supersections): + o, r_supersections = v + with open("perf%d.csv" % idx, "w") as out: + for x in r_supersections: + out.write(x.toCSVstring()) + + except: + import traceback + print("[Error] Failed to jsonify") + print(traceback.format_exc()) + + # Check if this runs + try: + for s in sections: + json.loads(s.toJSON()) + except: + print("[Error] JSON contains syntax errors!") + + if print_values: + print("==== ANALYSIS ====") + print("Got %d sections" % len(sections)) + for i, section in enumerate(sections): + print("Section %d (node %s)" % (i, section.nodeid)) + print("static memory movement (estimation): %s" % str( + section.datasize)) + print("runtime memory movement (measured): %s" % str( + section.bytes_moved)) + + max_thread_num = section.get_max_thread_num() + print("max_thread_num: %d" % max_thread_num) + tot_cyc = list() + tot_l3_miss = list() + tot_l2_miss = list() + for t in range(0, max_thread_num + 1): + ts = section.select_thread(t) + tc = ts.select_event('-2147483589') + # print("tc: %s\nsum(tc): %s" % (str(tc), str(sum(tc)))) + tot_cyc.append(sum(tc)) + + tl3 = ts.select_event('-2147483640') + tot_l3_miss.append(sum(tl3)) + + tl2 = ts.select_event('-2147483641') + tot_l2_miss.append(sum(tl2)) + + # Now we can get the balance + for i, t in enumerate(tot_cyc): + print("Thread %d took %d cycles" % (i, t)) + from statistics import stdev, mean + if len(tot_cyc) > 1 and mean(tot_cyc) != 0: + + print("stdev: %d" % stdev(tot_cyc)) + print("Balance: %f" % + (float(stdev(tot_cyc)) / float(mean(tot_cyc)))) + + for i, t in enumerate(tot_l3_miss): + print("Thread %d had %d L3 misses" % (i, t)) + sum_l3 = sum(tot_l3_miss) + print( + "%d bytes (presumably) accessed\n%d L3 misses over all threads\n%d bytes loaded from memory" + % (int(section.datasize), int(sum_l3), int(sum_l3) * 64)) + + for i, t in 
enumerate(tot_l2_miss): + print("Thread %d had %d L2 misses" % (i, t)) + sum_l2 = sum(tot_l2_miss) + print( + "%d bytes (presumably) accessed\n%d L2 misses over all threads\n%d bytes loaded from L3" + % (int(section.datasize), int(sum_l2), int(sum_l2) * 64)) + + +class PAPIUtil: + @staticmethod + def fallback_dict(available_events): + """ + Defines potential fallbacks for unavailable PAPI (preset) events + """ + d = dict() + #TCM => DCM + d['PAPI_L1_TCM'] = [ + x for x in ['PAPI_L1_DCM'] if x in available_events + ] + d['PAPI_L2_TCM'] = [ + x for x in ['PAPI_L2_DCM'] if x in available_events + ] + d['PAPI_L3_TCM'] = [ + x for x in ['PAPI_L3_DCM'] if x in available_events + ] + #DCM => TCM + d['PAPI_L1_DCM'] = [ + x for x in ['PAPI_L1_TCM'] if x in available_events + ] + d['PAPI_L2_DCM'] = [ + x for x in ['PAPI_L2_TCM'] if x in available_events + ] + d['PAPI_L3_DCM'] = [ + x for x in ['PAPI_L3_TCM'] if x in available_events + ] + + return d + + @staticmethod + def get_fallback(event, available_events): + """ + Returns a string identifying the most appropriate fallback for 'event', + or None if no such fallback exists. + """ + fbd = PAPIUtil.fallback_dict(available_events) + fb = fbd[event] + if (len(fb) == 0): + return None + else: + return fb[0] + + +class PerfMetaInfo: + """ Class dedicated to keep meta information about the generated code, in + particular line numbers. """ + + def __init__(self): + self.nodes = dict() # Maps nodes to their strings + self.lines = dict() # Maps nodes to their line number + + def add_node(self, node, string): + self.nodes[node] = string + + def has_node(self, node): + return node in self.nodes.keys() + + def resolve(self, codestr: str): + """ Maps all entries in self.node to line numbers """ + index = 0 + line = 1 + print("self.nodes: %s\ntype: %s" % (self.nodes, type(self.nodes))) + for key, value in self.nodes.items(): + pos = codestr.find(value, index) + if pos == -1: + # We will not accept this. This should only ever occur if some + # part of the program pretty-prints code. + assert False + sublines = codestr.count('\n', index, pos) + line += sublines + index = pos + # We store the current line back to self.lines + self.lines[key] = line + + def analyze(self, vectorizer_output: str): + """ Checks if a certain operation or a segment within a region of an + operation was vectorized. """ + # We only match calls originating from ./src/cpu/*, but it might still + # include some of the instrumentation. 
Consider running this on + # non-instrumented code instead + data = re.findall( + r".*?src/cpu/(?P[^:]*):(?P[\d]*):(?P[\d]*): (?P[^\n]*)", + vectorizer_output) + + print("data is:\n%s" % data) + + print("Node information is\n%s\n" % self.nodes) + print("Line information is\n%s\n" % self.lines) + + ret = dict( + ) # We return a dict of node -> [(file, line, col, Message)] + + first = True + tmp = (None, None) + for key, value in self.lines.items(): + # We now find for each key the value of their respective start + # (exception: MapExit, where the end counts) + # Then, we associate the message to that key + if not first: + prevkey, prevval = tmp + for file, line, col, message in data: + if int(prevval) <= int(line) and int(line) < int(value): + # Valid entry + if not (prevkey in ret.keys()): + ret[prevkey] = list() + ret[prevkey].append((file, line, col, message)) + else: + first = False + + tmp = (key, value) + + # For the last entry: + prevkey, prevval = tmp + if prevkey != None: + for file, line, col, message in data: + if int(prevval) <= int(line): + # Valid entry + if not (prevkey in ret.keys()): + ret[prevkey] = list() + ret[prevkey].append((file, line, col, message)) + + print("ret:\n%s" % ret) + + return ret + + +class PerfMetaInfoStatic: + info = PerfMetaInfo() + + +class PerfPAPIInfo: + """ Class used to keep information about the remote, most notably the + allowed configurations. """ + + def __init__(self): + self.num_hw_counters = -1 + self.preset_cost = dict() # event: str -> num_counters: int + self.cached_host = "" + self.memspeed = 20.0 # B/c + + def set_memspeed(self, speed): + self.memspeed = speed + + def load_info(self): + """ Load information about the counters from remote. """ + from string import Template + import subprocess + + print("Loading counter info from remote...") + + if self.cached_host == Config.get("execution", "general", "host"): + return # Do not run this every time, just the first time + else: + # else reset + self.num_hw_counters = -1 + self.preset_cost = dict() + + non_derived, derived, num_ctrs = PerfUtils.read_available_perfcounters( + ) + self.num_hw_counters = num_ctrs + + # Having these events, the non_derived (by definition) use 1 counter + for x in non_derived: + self.preset_cost[x] = 1 + + # For the others, we have to request some more information. + # NOTE: This could be moved into a shell script and run on remote + # if issuing many commands is too slow + for index, x in enumerate(derived): + print("%d/%d Elements...\r" % (index + 1, len(derived)), end='') + papi_avail_str = 'papi_avail -e %s | grep --color=never "Number of Native Events"' % x + s = Template(Config.get("execution", "general", "execcmd")) + cmd = s.substitute( + host=Config.get("execution", "general", "host"), + command=papi_avail_str) + p = subprocess.Popen( + cmd, + shell=True, + stdout=subprocess.PIPE, + stderr=subprocess.STDOUT, + universal_newlines=True) + + stdout, _ = p.communicate(timeout=60) + + counter_num_grp = re.search( + r"Number of Native Events:\s*(?P\d+)", str(stdout)) + if counter_num_grp != None: + self.preset_cost[x] = int(counter_num_grp['num']) + else: + print("\nError: Expected to find a number here...") + + self.cached_host = Config.get("execution", "general", "host") + print("\nDone") + + def check_counters(self, counter_lists: list): + """ Checks if the specified counter groups can be used. 
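+            A group is accepted if the summed cost of its events (in hardware
+            counters, as stored in self.preset_cost) does not exceed the number
+            of counters reported by papi_avail; an unknown event code makes the
+            whole check fail.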
""" + assert self.cached_host != "" + + counter_lists_set = list() + + for x in counter_lists: + if not x in counter_lists_set: + counter_lists_set.append(x) + for counter_list in counter_lists_set: + sum_counters = 0 + for c in counter_list: + try: + sum_counters += self.preset_cost[c] + except: + # This should only happen with Native Events + print( + "check_counters failed with reason: Unknown/unsupported event code specified: %s" + % c) + return False + if sum_counters > self.num_hw_counters: + print( + "check_counters failed with reason: Not enough hardware counters to support specified events" + ) + return False + return True + + +class PerfPAPIInfoStatic: + info = PerfPAPIInfo() diff --git a/dace/codegen/prettycode.py b/dace/codegen/prettycode.py new file mode 100644 index 0000000000..f58e4a707b --- /dev/null +++ b/dace/codegen/prettycode.py @@ -0,0 +1,70 @@ +""" Code I/O stream that automates indentation and mapping of code to SDFG + nodes. """ + +from six import StringIO +from dace.config import Config + + +class CodeIOStream(StringIO): + """ Code I/O stream that automates indentation and mapping of code to SDFG + nodes. """ + + def __init__(self, base_indentation=0): + super(CodeIOStream, self).__init__() + self._indent = 0 + self._spaces = int(Config.get('compiler', 'indentation_spaces')) + + def write(self, contents, sdfg=None, state_id=None, node_id=None): + # Delete single trailing newline, as this will be implicitly inserted + # anyway + if contents: + if contents[-1] == "\n": + lines = contents[:-1].split("\n") + else: + lines = contents.split('\n') + else: + lines = contents + + # If SDFG/state/node location is given, annotate this line + if sdfg is not None: + location_identifier = ' ////__DACE:%s' % sdfg.name + if state_id is not None: + location_identifier += ':' + str(state_id) + if node_id is not None: + if not isinstance(node_id, list): + node_id = [node_id] + for i, nid in enumerate(node_id): + if not isinstance(nid, int): + node_id[i] = sdfg.nodes()[state_id].node_id(nid) + location_identifier += ':' + ','.join( + [str(nid) for nid in node_id]) + else: + location_identifier = '' + + # Write each line separately + for line in lines: + opening_braces = line.count('{') + closing_braces = line.count('}') + brace_balance = opening_braces - closing_braces + + # Write line and then change indentation + if brace_balance < 0: + self._indent += brace_balance + + codeline = self._indent * self._spaces * ' ' + line.strip() + + # Location identifier is written at character 81 and on, find out + # how many spaces we need to add for that + loc_spaces = max(80 - len(codeline), 2) + + super(CodeIOStream, self).write(codeline + loc_spaces * ' ' + + location_identifier + '\n') + if brace_balance > 0: + self._indent += brace_balance + + # If indentation failed, warn user + if self._indent < -1: + super(CodeIOStream, self).write( + '///WARNING: Indentation failure! 
This probably ' + + 'indicates an error in the SDFG.\n') + self._indent = 0 diff --git a/dace/codegen/targets/__init__.py b/dace/codegen/targets/__init__.py new file mode 100644 index 0000000000..8b13789179 --- /dev/null +++ b/dace/codegen/targets/__init__.py @@ -0,0 +1 @@ + diff --git a/dace/codegen/targets/cpu.py b/dace/codegen/targets/cpu.py new file mode 100644 index 0000000000..65002bc74b --- /dev/null +++ b/dace/codegen/targets/cpu.py @@ -0,0 +1,2618 @@ +import ast +import copy +import functools +import itertools +import sympy as sp +from six import StringIO + +from dace.codegen import cppunparse + +import dace +from dace.config import Config +from dace.frontend import operations +from dace import data, subsets, symbolic, types, memlet as mmlt +from dace.codegen.prettycode import CodeIOStream +from dace.codegen.codeobject import CodeObject +from dace.codegen.targets import framecode +from dace.codegen.targets.target import (TargetCodeGenerator, make_absolute, + DefinedType) +from dace.graph import nodes, nxutil +from dace.sdfg import ScopeSubgraphView, SDFG, scope_contains_scope, find_input_arraynode, find_output_arraynode, is_devicelevel + +from dace.frontend.python.astutils import ExtNodeTransformer, rname, unparse +from dace.properties import LambdaProperty + +from dace.codegen.instrumentation.perfsettings import PerfSettings, PerfUtils, PerfMetaInfo, PerfMetaInfoStatic + +_REDUCTION_TYPE_TO_OPENMP = { + types.ReductionType.Max: 'max', + types.ReductionType.Min: 'min', + types.ReductionType.Sum: '+', + types.ReductionType.Product: '*', + types.ReductionType.Bitwise_And: '&', + types.ReductionType.Logical_And: '&&', + types.ReductionType.Bitwise_Or: '|', + types.ReductionType.Logical_Or: '||', + types.ReductionType.Bitwise_Xor: '^', +} + + +class CPUCodeGen(TargetCodeGenerator): + """ SDFG CPU code generator. """ + + title = 'CPU' + target_name = 'cpu' + language = 'cpp' + + def __init__(self, frame_codegen, sdfg): + self._frame = frame_codegen + self._dispatcher = frame_codegen.dispatcher + dispatcher = self._dispatcher + + self._locals = cppunparse.CPPLocals() + # Scope depth (for use of the 'auto' keyword when + # defining locals) + self._ldepth = 0 + + # FIXME: this allows other code generators to change the CPU + # behavior to assume that arrays point to packed types, thus dividing + # all addresess by the vector length. 
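+ # Illustrative sketch (hypothetical; not used here, since this
+ # generator keeps _packed_types == False): with packed types, a
+ # scalar element offset would be rescaled by the vector length,
+ # e.g. for veclen == 4 the scalar offset 12 addresses packed
+ # element 12 // 4 == 3.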
+ self._packed_types = False + + # Keep track of traversed nodes + self._generated_nodes = set() + self._allocated_arrays = set() + # Keeps track of generated connectors, so we know how to access them in + # nested scopes + for name, arg_type in sdfg.arglist().items(): + if (isinstance(arg_type, dace.data.Scalar) + or isinstance(arg_type, dace.types.typeclass)): + self._dispatcher.defined_vars.add(name, DefinedType.Scalar) + elif isinstance(arg_type, dace.data.Array): + self._dispatcher.defined_vars.add(name, DefinedType.Pointer) + elif isinstance(arg_type, dace.data.Stream): + if arg_type.is_stream_array(): + self._dispatcher.defined_vars.add(name, + DefinedType.StreamArray) + else: + self._dispatcher.defined_vars.add(name, DefinedType.Stream) + else: + raise TypeError("Unrecognized argument type: {}".format( + type(arg_type).__name__)) + + # Register dispatchers + dispatcher.register_node_dispatcher(self) + dispatcher.register_map_dispatcher( + [types.ScheduleType.CPU_Multicore, types.ScheduleType.Sequential], + self) + + cpu_storage = [ + types.StorageType.CPU_Heap, types.StorageType.CPU_Pinned, + types.StorageType.CPU_Stack, types.StorageType.Register + ] + dispatcher.register_array_dispatcher(cpu_storage, self) + + # Register CPU copies (all internal pairs) + for src_storage, dst_storage in itertools.product( + cpu_storage, cpu_storage): + dispatcher.register_copy_dispatcher(src_storage, dst_storage, None, + self) + + @staticmethod + def cmake_options(): + compiler = make_absolute(Config.get("compiler", "cpu", "executable")) + flags = Config.get("compiler", "cpu", "args") + flags += Config.get("compiler", "cpu", "additional_args") + + # Args for vectorization output + if PerfSettings.perf_enable_vectorization_analysis(): + flags += " -fopt-info-vec-optimized-missed=vecreport.txt " + + options = [ + "-DCMAKE_CXX_COMPILER=\"{}\"".format(compiler), + "-DCMAKE_CXX_FLAGS=\"{}\"".format(flags), + ] + return options + + def get_generated_codeobjects(self): + # CPU target generates inline code + return [] + + @property + def has_initializer(self): + return False + + @property + def has_finalizer(self): + return False + + def generate_scope(self, sdfg: SDFG, dfg_scope: ScopeSubgraphView, + state_id, function_stream, callsite_stream): + entry_node = dfg_scope.source_nodes()[0] + presynchronize_streams(sdfg, dfg_scope, state_id, entry_node, + callsite_stream) + + self.generate_node(sdfg, dfg_scope, state_id, entry_node, + function_stream, callsite_stream) + self._dispatcher.dispatch_subgraph( + sdfg, + dfg_scope, + state_id, + function_stream, + callsite_stream, + skip_entry_node=True) + + def generate_node(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + # Dynamically obtain node generator according to class name + gen = getattr(self, '_generate_' + type(node).__name__) + + gen(sdfg, dfg, state_id, node, function_stream, callsite_stream) + + # Mark node as "generated" + self._generated_nodes.add(node) + + self._locals.clear_scope(self._ldepth + 1) + + def allocate_array(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + name = node.data + nodedesc = node.desc(sdfg) + if ((state_id, node.data) in self._allocated_arrays + or (None, node.data) in self._allocated_arrays + or nodedesc.transient == False): + return + self._allocated_arrays.add((state_id, node.data)) + + # Compute array size + arrsize = ' * '.join([sym2cpp(s) for s in nodedesc.strides]) + + if isinstance(nodedesc, data.Scalar): + callsite_stream.write("%s %s;\n" % (nodedesc.dtype.ctype, 
name), + sdfg, state_id, node) + self._dispatcher.defined_vars.add(name, DefinedType.Scalar) + elif isinstance(nodedesc, data.Stream): + ################################################################### + # Stream directly connected to an array + + if is_array_stream_view(sdfg, dfg, node): + if state_id is None: + raise SyntaxError( + 'Stream-view of array may not be defined ' + 'in more than one state') + + arrnode = sdfg.arrays[nodedesc.sink] + state = sdfg.nodes()[state_id] + edges = state.out_edges(node) + if len(edges) > 1: + raise NotImplementedError('Cannot handle streams writing ' + 'to multiple arrays.') + + memlet_path = state.memlet_path(edges[0]) + # Allocate the array before its stream view, if necessary + self.allocate_array(sdfg, dfg, state_id, memlet_path[-1].dst, + function_stream, callsite_stream) + + array_expr = self.copy_expr(sdfg, nodedesc.sink, edges[0].data) + threadlocal = '' + threadlocal_stores = [ + types.StorageType.CPU_Stack, types.StorageType.Register + ] + if (sdfg.arrays[nodedesc.sink].storage in threadlocal_stores + or nodedesc.storage in threadlocal_stores): + threadlocal = 'Threadlocal' + callsite_stream.write( + 'dace::ArrayStreamView%s<%s> %s (%s);\n' % + (threadlocal, arrnode.dtype.ctype, name, array_expr), sdfg, + state_id, node) + self._dispatcher.defined_vars.add(name, DefinedType.Stream) + return + + ################################################################### + # Regular stream + + dtype = "dace::vec<{}, {}>".format(nodedesc.dtype.ctype, + sym2cpp(nodedesc.veclen)) + + if nodedesc.buffer_size != 0: + definition = "dace::Stream<{}> {}({});".format( + dtype, name, nodedesc.buffer_size) + else: + definition = "dace::Stream<{}> {};".format(dtype, name) + + callsite_stream.write(definition, sdfg, state_id, node) + self._dispatcher.defined_vars.add(name, DefinedType.Stream) + + elif (nodedesc.storage == types.StorageType.CPU_Heap + or nodedesc.storage == types.StorageType.Immaterial + ): # TODO: immaterial arrays should not allocate memory + callsite_stream.write( + "%s *%s = new %s DACE_ALIGN(64)[%s];\n" % + (nodedesc.dtype.ctype, name, nodedesc.dtype.ctype, arrsize), + sdfg, state_id, node) + self._dispatcher.defined_vars.add(name, DefinedType.Pointer) + if node.setzero: + callsite_stream.write('memset(%s, 0, sizeof(%s)*%s);' % + (name, nodedesc.dtype.ctype, arrsize)) + return + elif (nodedesc.storage == types.StorageType.CPU_Stack + or nodedesc.storage == types.StorageType.Register): + if node.setzero: + callsite_stream.write( + "%s %s[%s] DACE_ALIGN(64) = {0};\n" % + (nodedesc.dtype.ctype, name, arrsize), sdfg, state_id, + node) + self._dispatcher.defined_vars.add(name, DefinedType.Pointer) + return + callsite_stream.write( + "%s %s[%s] DACE_ALIGN(64);\n" % + (nodedesc.dtype.ctype, name, arrsize), sdfg, state_id, node) + self._dispatcher.defined_vars.add(name, DefinedType.Pointer) + return + else: + raise NotImplementedError('Unimplemented storage type ' + + str(nodedesc.storage)) + + def initialize_array(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + if isinstance(dfg, SDFG): + result = StringIO() + for sid, state in enumerate(dfg.nodes()): + if node in state.nodes(): + self.initialize_array(sdfg, state, sid, node, + function_stream, callsite_stream) + break + return + + parent_node = dfg.scope_dict()[node] + nodedesc = node.desc(sdfg) + name = node.data + + # Traverse the DFG, looking for WCR with an identity element + def traverse(u, uconn, v, vconn, d): + if d.wcr: + if d.data == name: + if d.wcr_identity is 
not None: + return d.wcr_identity + return None + + identity = None + if parent_node is not None: + for u, uconn, v, vconn, d, s in nxutil.traverse_sdfg_scope( + dfg, parent_node): + identity = traverse(u, uconn, v, vconn, d) + if identity is not None: break + else: + for u, uconn, v, vconn, d in dfg.edges(): + identity = traverse(u, uconn, v, vconn, d) + if identity is not None: break + + if identity is None: + return + + # If we should generate an initialization expression + if isinstance(nodedesc, data.Scalar): + callsite_stream.write('%s = %s;\n' % (name, sym2cpp(identity)), + sdfg, state_id, node) + return + + params = [name, sym2cpp(identity)] + shape = [sym2cpp(s) for s in nodedesc.shape] + params.append(' * '.join(shape)) + + # Faster + if identity == 0: + params[-1] += ' * sizeof(%s[0])' % name + callsite_stream.write('memset(%s);\n' % (', '.join(params)), sdfg, + state_id, node) + return + + callsite_stream.write('dace::InitArray(%s);\n' % (', '.join(params)), + sdfg, state_id, node) + + def deallocate_array(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + nodedesc = node.desc(sdfg) + if isinstance(nodedesc, data.Scalar): + return + elif isinstance(nodedesc, data.Stream): + return + elif nodedesc.storage == types.StorageType.CPU_Heap: + callsite_stream.write("delete[] %s;\n" % node.data, sdfg, state_id, + node) + else: + return + + def copy_memory(self, sdfg, dfg, state_id, src_node, dst_node, edge, + function_stream, callsite_stream): + if isinstance(src_node, nodes.Tasklet): + src_storage = types.StorageType.Register + try: + src_parent = dfg.scope_dict()[src_node] + except KeyError: + src_parent = None + dst_schedule = (None + if src_parent is None else src_parent.map.schedule) + else: + src_storage = src_node.desc(sdfg).storage + + if isinstance(dst_node, nodes.Tasklet): + dst_storage = types.StorageType.Register + else: + dst_storage = dst_node.desc(sdfg).storage + + try: + dst_parent = dfg.scope_dict()[dst_node] + except KeyError: + dst_parent = None + dst_schedule = None if dst_parent is None else dst_parent.map.schedule + + state_dfg = sdfg.nodes()[state_id] + + # Emit actual copy + self._emit_copy(sdfg, state_id, src_node, src_storage, dst_node, + dst_storage, dst_schedule, edge, state_dfg, + callsite_stream) + + def _emit_copy(self, sdfg, state_id, src_node, src_storage, dst_node, + dst_storage, dst_schedule, edge, dfg, stream): + u, uconn, v, vconn, memlet = edge + + ############################################################# + # Instrumentation: Pre-copy + + # For perfcounters, we have to make sure that: + # 1) No other measurements are done for the containing scope (no map operation containing this copy is instrumented) + src_instrumented = PerfUtils.has_surrounding_perfcounters( + src_node, dfg) + dst_instrumented = PerfUtils.has_surrounding_perfcounters( + dst_node, dfg) + + # From cuda.py + cpu_storage_types = [ + types.StorageType.CPU_Heap, types.StorageType.CPU_Stack, + types.StorageType.CPU_Pinned, types.StorageType.Register + ] + + perf_cpu_only = (src_storage in cpu_storage_types) and ( + dst_storage in cpu_storage_types) + + perf_should_instrument = PerfSettings.perf_enable_instrumentation_for( + sdfg) and (not src_instrumented) and ( + not dst_instrumented) and perf_cpu_only + + ############################################################# + + # Determine memlet directionality + if (isinstance(src_node, nodes.AccessNode) + and memlet.data == src_node.data): + write = True + elif (isinstance(dst_node, nodes.AccessNode) + and 
memlet.data == dst_node.data): + write = False + elif isinstance(src_node, nodes.CodeNode) and isinstance( + dst_node, nodes.CodeNode): + # Code->Code copy (not read nor write) + raise RuntimeError( + 'Copying between code nodes is only supported as' + ' part of the participating nodes') + else: + raise LookupError('Memlet does not point to any of the nodes') + + if isinstance(dst_node, nodes.Tasklet): + # Copy into tasklet + stream.write( + ' ' + self.memlet_definition(sdfg, memlet, False, vconn), + sdfg, state_id, [src_node, dst_node]) + return + elif isinstance(src_node, nodes.Tasklet): + # Copy out of tasklet + stream.write( + ' ' + self.memlet_definition(sdfg, memlet, True, uconn), + sdfg, state_id, [src_node, dst_node]) + return + else: # Copy array-to-array + src_nodedesc = src_node.desc(sdfg) + dst_nodedesc = dst_node.desc(sdfg) + + if write: + vconn = dst_node.data + ctype = 'dace::vec<%s, %d>' % (dst_nodedesc.dtype.ctype, + memlet.veclen) + + ############################################# + # Corner cases + + # Writing one index + if isinstance(memlet.subset, + subsets.Indices) and memlet.wcr is None: + stream.write( + '%s = %s;' % (vconn, self.memlet_ctor( + sdfg, memlet, False)), sdfg, state_id, + [src_node, dst_node]) + return + # Writing from/to a stream + if (isinstance(sdfg.arrays[memlet.data], data.Stream) or \ + (isinstance(src_node, nodes.AccessNode) and isinstance(src_nodedesc, + data.Stream))): + # Identify whether a stream is writing to an array + if (isinstance(dst_nodedesc, (data.Scalar, data.Array)) + and isinstance(src_nodedesc, data.Stream)): + return # Do nothing (handled by ArrayStreamView) + + # Array -> Stream - push bulk + if (isinstance(src_nodedesc, (data.Scalar, data.Array)) + and isinstance(dst_nodedesc, data.Stream)): + if hasattr(src_nodedesc, 'src'): # ArrayStreamView + stream.write( + '{s}.push({arr});'.format( + s=dst_node.data, arr=src_nodedesc.src), sdfg, + state_id, [src_node, dst_node]) + else: + copysize = ' * '.join( + [sym2cpp(s) for s in memlet.subset.size()]) + stream.write( + '{s}.push({arr}, {size});'.format( + s=dst_node.data, + arr=src_node.data, + size=copysize), sdfg, state_id, + [src_node, dst_node]) + return + else: + # Unknown case + raise NotImplementedError + + ############################################# + + state_dfg = sdfg.nodes()[state_id] + + copy_shape, src_strides, dst_strides, src_expr, dst_expr = ( + self.memlet_copy_to_absolute_strides(sdfg, memlet, src_node, + dst_node)) + + # Which numbers to include in the variable argument part + dynshape, dynsrc, dyndst = 1, 1, 1 + + # Dynamic copy dimensions + if any(symbolic.issymbolic(s, sdfg.constants) for s in copy_shape): + copy_tmpl = 'Dynamic<{type}, {veclen}, {aligned}, {dims}>'.format( + type=ctype, + veclen=1, # Taken care of in "type" + aligned='false', + dims=len(copy_shape)) + else: # Static copy dimensions + copy_tmpl = '<{type}, {veclen}, {aligned}, {dims}>'.format( + type=ctype, + veclen=1, # Taken care of in "type" + aligned='false', + dims=', '.join(sym2cpp(copy_shape))) + dynshape = 0 + + # Constant src/dst dimensions + if not any( + symbolic.issymbolic(s, sdfg.constants) + for s in dst_strides): + # Constant destination + shape_tmpl = 'template ConstDst<%s>' % ', '.join( + sym2cpp(dst_strides)) + dyndst = 0 + elif not any( + symbolic.issymbolic(s, sdfg.constants) + for s in src_strides): + # Constant source + shape_tmpl = 'template ConstSrc<%s>' % ', '.join( + sym2cpp(src_strides)) + dynsrc = 0 + else: + # Both dynamic + shape_tmpl = 'Dynamic' + + # Parameter 
pack handling + stride_tmpl_args = [0] * ( + dynshape + dynsrc + dyndst) * len(copy_shape) + j = 0 + for shape, src, dst in zip(copy_shape, src_strides, dst_strides): + if dynshape > 0: + stride_tmpl_args[j] = shape + j += 1 + if dynsrc > 0: + stride_tmpl_args[j] = src + j += 1 + if dyndst > 0: + stride_tmpl_args[j] = dst + j += 1 + + copy_args = ([src_expr, dst_expr] + ([] if memlet.wcr is None else + [unparse_cr(memlet.wcr)]) + + sym2cpp(stride_tmpl_args)) + + ############################################################# + # Instrumentation: Pre-copy 2 + unique_cpy_id = PerfSettings.get_unique_number() + + if perf_should_instrument: + fac3 = ' * '.join(sym2cpp(copy_shape)) + " / " + '/'.join( + sym2cpp(dst_strides)) + copy_size = "sizeof(%s) * %s * (%s)" % (ctype, memlet.veclen, + fac3) + node_id = PerfUtils.unified_id(dfg.node_id(dst_node), state_id) + # Mark a section start (this is not really a section in itself (it would be a section with 1 entry)) + stream.write( + "__perf_store.markSectionStart(%d, (long long)%s, PAPI_thread_id());\n" + % (node_id, copy_size), sdfg, state_id, + [src_node, dst_node]) + stream.write(( + "dace_perf::{pcs} __perf_cpy_{nodeid}_{unique_id};\n" + + "auto& __vs_cpy_{nodeid}_{unique_id} = __perf_store.getNewValueSet(__perf_cpy_{nodeid}_{unique_id}, {nodeid}, PAPI_thread_id(), {size}, dace_perf::ValueSetType::Copy);\n" + + "__perf_cpy_{nodeid}_{unique_id}.enterCritical();\n" + ).format( + pcs=PerfUtils.perf_counter_string(dst_node), + nodeid=node_id, + unique_id=unique_cpy_id, + size=copy_size), sdfg, state_id, [src_node, dst_node]) + ############################################################# + + nc = True + if memlet.wcr is not None: + nc = not is_write_conflicted(dfg, edge) + if nc: + stream.write( + """ + dace::CopyND{copy_tmpl}::{shape_tmpl}::{copy_func}( + {copy_args});""".format( + copy_tmpl=copy_tmpl, + shape_tmpl=shape_tmpl, + copy_func='Copy' + if memlet.wcr is None else 'Accumulate', + copy_args=', '.join(copy_args)), sdfg, state_id, + [src_node, dst_node]) + else: # Conflicted WCR + if dynshape == 1: + raise NotImplementedError( + 'Accumulation of dynamically-shaped ' + 'arrays not yet implemented') + elif copy_shape == [ + 1 + ]: # Special case: accumulating one element + dst_expr = self.memlet_view_ctor(sdfg, memlet, True) + stream.write( + write_and_resolve_expr(memlet, nc, dst_expr, + '*(' + src_expr + ')'), sdfg, + state_id, [src_node, dst_node]) + else: + raise NotImplementedError('Accumulation of arrays ' + 'with WCR not yet implemented') + + ############################################################# + # Instrumentation: Post-copy + if perf_should_instrument: + stream.write(("__perf_cpy_%d_%d.leaveCritical(__vs_cpy_%d_%d);\n") + % (node_id, unique_cpy_id, node_id, unique_cpy_id), + sdfg, state_id, [src_node, dst_node]) + ############################################################# + + ########################################################################### + # Memlet handling + + def process_out_memlets(self, sdfg, state_id, node, dfg, dispatcher, + result, locals_defined, function_stream): + + scope_dict = sdfg.nodes()[state_id].scope_dict() + + for edge in dfg.out_edges(node): + _, uconn, v, _, memlet = edge + dst_node = dfg.memlet_path(edge)[-1].dst + + # Target is neither a data nor a tasklet node + if (isinstance(node, nodes.AccessNode) + and (not isinstance(dst_node, nodes.AccessNode) + and not isinstance(dst_node, nodes.CodeNode))): + continue + + # Skip array->code (will be handled as a tasklet input) + if isinstance(node, 
nodes.AccessNode) and isinstance( + v, nodes.CodeNode): + continue + + # code->code (e.g., tasklet to tasklet) + if isinstance(v, nodes.CodeNode): + shared_data_name = 's%d_n%d%s_n%d%s' % ( + state_id, dfg.node_id(edge.src), edge.src_conn, + dfg.node_id(edge.dst), edge.dst_conn) + result.write('__%s = %s;' % (shared_data_name, edge.src_conn), + sdfg, state_id, [edge.src, edge.dst]) + continue + + # If the memlet is not pointing to a data node (e.g. tasklet), then + # the tasklet will take care of the copy + if not isinstance(dst_node, nodes.AccessNode): + continue + # If the memlet is pointing into an array in an inner scope, then + # the inner scope (i.e., the output array) must handle it + if (scope_dict[node] != scope_dict[dst_node] + and scope_contains_scope(scope_dict, node, dst_node)): + continue + + # Array to tasklet (path longer than 1, handled at tasklet entry) + if node == dst_node: + continue + + # Tasklet -> array + if isinstance(node, nodes.CodeNode): + if not uconn: + raise SyntaxError( + 'Cannot copy memlet without a local connector: {} to {}' + .format(str(edge.src), str(edge.dst))) + + try: + positive_accesses = bool(memlet.num_accesses >= 0) + except TypeError: + positive_accesses = False + + if memlet.subset.data_dims() == 0 and positive_accesses: + out_local_name = ' __' + uconn + in_local_name = uconn + if not locals_defined: + out_local_name = self.memlet_ctor(sdfg, memlet, True) + in_memlets = [ + d for _, _, _, _, d in dfg.in_edges(node) + ] + assert len(in_memlets) == 1 + in_local_name = self.memlet_ctor( + sdfg, in_memlets[0], False) + + state_dfg = sdfg.nodes()[state_id] + + if memlet.wcr is not None: + nc = not is_write_conflicted(dfg, edge) + result.write( + write_and_resolve_expr(memlet, nc, out_local_name, + in_local_name), sdfg, + state_id, node) + else: + result.write( + '%s.write(%s);\n' % (out_local_name, + in_local_name), sdfg, + state_id, node) + # Dispatch array-to-array outgoing copies here + elif isinstance(node, nodes.AccessNode): + if dst_node != node and not isinstance(dst_node, + nodes.Tasklet): + dispatcher.dispatch_copy(node, dst_node, edge, sdfg, dfg, + state_id, function_stream, result) + + def memlet_view_ctor(self, sdfg, memlet, is_output): + memlet_params = [] + + memlet_name = memlet.data + def_type = self._dispatcher.defined_vars.get(memlet_name) + + if def_type == DefinedType.Pointer: + memlet_expr = memlet_name # Common case + elif (def_type == DefinedType.Scalar + or def_type == DefinedType.ScalarView): + memlet_expr = '&' + memlet_name + elif def_type == DefinedType.ArrayView: + memlet_expr = memlet_name + ".ptr()" + else: + raise TypeError("Unsupported connector type {}".format(def_type)) + + if isinstance(memlet.subset, subsets.Indices): + + # FIXME: _packed_types influences how this offset is + # generated from the FPGA codegen. We should find a nicer solution. + if self._packed_types is True: + offset = cpp_array_expr( + sdfg, memlet, False, packed_veclen=memlet.veclen) + else: + offset = cpp_array_expr(sdfg, memlet, False) + + # Compute address + memlet_params.append(memlet_expr + ' + ' + offset) + dims = 0 + + else: + + if isinstance(memlet.subset, subsets.Range): + + dims = len(memlet.subset.ranges) + + # FIXME: _packed_types influences how this offset is + # generated from the FPGA codegen. We should find a nicer + # solution. 
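+ # For intuition (assumed example): for a row-major 2D array of
+ # shape [M, N] and a subset starting at (i, j), the offset
+ # expression is a strided linear index, roughly "i * N + j";
+ # under packed types it is additionally divided by the vector
+ # length.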
+ if self._packed_types is True: + offset = cpp_offset_expr( + sdfg.arrays[memlet.data], + memlet.subset, + packed_veclen=memlet.veclen) + else: + offset = cpp_offset_expr(sdfg.arrays[memlet.data], + memlet.subset) + if offset == "0": + memlet_params.append(memlet_expr) + else: + if (def_type not in [ + DefinedType.Pointer, DefinedType.ArrayView + ]): + raise dace.codegen.codegen.CodegenError( + "Cannot offset address of connector {} of type {}". + format(memlet_name, def_type)) + memlet_params.append(memlet_expr + ' + ' + offset) + + # Dimensions to remove from view (due to having one value) + indexdims = [] + + # Figure out dimensions for scalar version + for dim, (rb, re, rs) in enumerate(memlet.subset.ranges): + try: + if (re - rb) == 0: + indexdims.append(dim) + except TypeError: # cannot determine truth value of Relational + pass + + # Remove index (one scalar) dimensions + dims -= len(indexdims) + + if dims > 0: + strides = memlet.subset.absolute_strides( + sdfg.arrays[memlet.data].strides) + # Filter out index dims + strides = [ + s for i, s in enumerate(strides) if i not in indexdims + ] + # FIXME: _packed_types influences how this offset is + # generated from the FPGA codegen. We should find a nicer + # solution. + if self._packed_types and memlet.veclen > 1: + for i in range(len(strides) - 1): + strides[i] /= memlet.veclen + memlet_params.extend(sym2cpp(strides)) + dims = memlet.subset.data_dims() + + else: + raise RuntimeError( + 'Memlet type "%s" not implemented' % memlet.subset) + + if memlet.num_accesses == 1: + num_accesses_str = "1" + else: # symbolic.issymbolic(memlet.num_accesses, sdfg.constants): + num_accesses_str = 'dace::NA_RUNTIME' + + return 'dace::ArrayView%s<%s, %d, %s, %s> (%s)' % ( + "Out" + if is_output else "In", sdfg.arrays[memlet.data].dtype.ctype, dims, + sym2cpp(memlet.veclen), num_accesses_str, ', '.join(memlet_params)) + + def memlet_definition(self, sdfg, memlet, output, local_name): + result = ('auto __%s = ' % local_name + self.memlet_ctor( + sdfg, memlet, output) + ';\n') + + # Allocate variable type + memlet_type = 'dace::vec<%s, %s>' % ( + sdfg.arrays[memlet.data].dtype.ctype, sym2cpp(memlet.veclen)) + + var_type = self._dispatcher.defined_vars.get(memlet.data) + + # ** Concerning aligned vs. non-aligned values: + # We prefer aligned values, so in every case where we are assigning to + # a local _value_, we explicitly assign to an aligned type + # (memlet_type). In all other cases, where we need either a pointer or + # a reference, typically due to variable number of accesses, we have to + # use the underlying type of the ArrayView, be it aligned or unaligned, + # to avoid runtime crashes. We use auto for this, so the ArrayView can + # return whatever it supports. 
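+ # Illustrative examples of the C++ emitted below, for a connector
+ # named "a" (name assumed) with veclen 1:
+ #   scalar read          ->  dace::vec<double, 1> a = __a.val<1>();
+ #   pointer (array view) ->  auto *a = __a.ptr<1>();
+ #   stream pop           ->  auto a = __a.pop();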
+ + if var_type == DefinedType.Scalar: + if memlet.num_accesses == 1: + if not output: + # We can pre-read the value + result += "{} {} = __{}.val<{}>();".format( + memlet_type, local_name, local_name, memlet.veclen) + else: + # The value will be written during the tasklet, and will be + # automatically written out after + result += "{} {};".format(memlet_type, local_name) + self._dispatcher.defined_vars.add(local_name, + DefinedType.Scalar) + elif memlet.num_accesses == -1: + if output: + # Variable number of writes: get reference to the target of + # the view to reflect writes at the data + result += "auto &{} = __{}.ref<{}>();".format( + local_name, local_name, memlet.veclen) + else: + # Variable number of reads: get a const reference that can + # be read if necessary + result += "auto const &{} = __{}.ref<{}>();".format( + local_name, local_name, memlet.veclen) + self._dispatcher.defined_vars.add(local_name, + DefinedType.Scalar) + else: + raise dace.codegen.codegen.CodegenError( + "Unsupported number of accesses {} for scalar {}".format( + memlet.num_accesses, local_name)) + elif var_type == DefinedType.Pointer: + if memlet.num_accesses == 1: + if output: + result += "{} {};".format(memlet_type, local_name) + else: + result += "{} {} = __{}.val<{}>();".format( + memlet_type, local_name, local_name, memlet.veclen) + self._dispatcher.defined_vars.add(local_name, + DefinedType.Scalar) + else: + if memlet.subset.data_dims() == 0: + # Forward ArrayView + result += "auto &{} = __{}.ref<{}>();".format( + local_name, local_name, memlet.veclen) + self._dispatcher.defined_vars.add(local_name, + DefinedType.Scalar) + else: + result += "auto *{} = __{}.ptr<{}>();".format( + local_name, local_name, memlet.veclen) + self._dispatcher.defined_vars.add(local_name, + DefinedType.Pointer) + elif (var_type == DefinedType.Stream + or var_type == DefinedType.StreamArray): + if memlet.num_accesses == 1: + if output: + result += "{} {};".format(memlet_type, local_name) + else: + result += "auto {} = __{}.pop();".format( + local_name, local_name) + self._dispatcher.defined_vars.add(local_name, + DefinedType.Scalar) + else: + # Just forward actions to the underlying object + result += "auto &{} = __{};".format(local_name, local_name) + self._dispatcher.defined_vars.add(local_name, + DefinedType.Stream) + else: + raise TypeError("Unknown variable type: {}".format(var_type)) + + return result + + def memlet_stream_ctor(self, sdfg, memlet): + stream = sdfg.arrays[memlet.data] + dtype = "dace::vec<{}, {}>".format(stream.dtype.ctype, + symbolic.symstr(memlet.veclen)) + return "dace::make_streamview({})".format(memlet.data + ( + "[{}]".format(cpp_offset_expr(stream, memlet.subset)) + if isinstance(stream, dace.data.Stream) + and stream.is_stream_array() else "")) + + def memlet_ctor(self, sdfg, memlet, is_output): + + def_type = self._dispatcher.defined_vars.get(memlet.data) + + if (def_type == DefinedType.Stream + or def_type == DefinedType.StreamArray): + return self.memlet_stream_ctor(sdfg, memlet) + + elif (def_type == DefinedType.Pointer or def_type == DefinedType.Scalar + or def_type == DefinedType.ScalarView + or def_type == DefinedType.ArrayView): + return self.memlet_view_ctor(sdfg, memlet, is_output) + + else: + raise NotImplementedError( + "Connector type {} not yet implemented".format(def_type)) + + def copy_expr(self, + sdfg, + dataname, + memlet, + offset=None, + relative_offset=True, + packed_types=False): + datadesc = sdfg.arrays[dataname] + if relative_offset: + s = memlet.subset + o = offset + 
else: + if offset is None: + s = None + elif not isinstance(offset, subsets.Subset): + s = subsets.Indices(offset) + else: + s = offset + o = None + if s != None: + offset_cppstr = cpp_offset_expr( + datadesc, s, o, memlet.veclen if packed_types else 1) + else: + offset_cppstr = '0' + dt = '' + + if memlet.veclen != 1 and not packed_types: + offset_cppstr = '(%s) / %s' % (offset_cppstr, sym2cpp( + memlet.veclen)) + dt = '(dace::vec<%s, %s> *)' % (datadesc.dtype.ctype, + sym2cpp(memlet.veclen)) + + expr = dataname + + def_type = self._dispatcher.defined_vars.get(dataname) + + add_offset = (offset_cppstr != "0") + + if def_type == DefinedType.Pointer: + return "{}{}{}".format( + dt, expr, " + {}".format(offset_cppstr) if add_offset else "") + + elif def_type == DefinedType.ArrayView: + return "{}{}.ptr(){}".format( + dt, expr, " + {}".format(offset_cppstr) if add_offset else "") + + elif def_type == DefinedType.StreamArray: + return "{}[{}]".format(expr, offset_cppstr) + + elif (def_type == DefinedType.Scalar + or def_type == DefinedType.ScalarView + or def_type == DefinedType.Stream): + + if add_offset: + raise TypeError( + "Tried to offset address of scalar {}: {}".format( + dataname, offset_cppstr)) + + if (def_type == DefinedType.Scalar + or def_type == DefinedType.ScalarView): + return "{}&{}".format(dt, expr) + else: + return dataname + + else: + raise NotImplementedError( + "copy_expr not implemented " + "for connector type: {}".format(def_type)) + + def memlet_copy_to_absolute_strides(self, + sdfg, + memlet, + src_node, + dst_node, + packed_types=False): + # Ignore vectorization flag is a hack to accommmodate FPGA behavior, + # where the pointer type is changed to a vector type, and addresses + # thus shouldn't take vectorization into account. 
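+ # Worked example (assumed shapes, for intuition): copying a [2, 3]
+ # subset out of a row-major [10, 10] array into a freshly allocated
+ # [2, 3] array yields copy_shape == [2, 3], src_strides == [10, 1],
+ # and dst_strides == [3, 1].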
+ copy_shape = memlet.subset.size() + copy_shape = [symbolic.overapproximate(s) for s in copy_shape] + src_nodedesc = src_node.desc(sdfg) + dst_nodedesc = dst_node.desc(sdfg) + + if memlet.data == src_node.data: + src_expr = self.copy_expr( + sdfg, src_node.data, memlet, packed_types=packed_types) + dst_expr = self.copy_expr( + sdfg, + dst_node.data, + memlet, + None, + False, + packed_types=packed_types) + if memlet.other_subset is not None: + dst_expr = self.copy_expr( + sdfg, + dst_node.data, + memlet, + memlet.other_subset, + False, + packed_types=packed_types) + dst_subset = memlet.other_subset + else: + dst_subset = subsets.Range.from_array(dst_nodedesc) + src_subset = memlet.subset + + else: + src_expr = self.copy_expr( + sdfg, + src_node.data, + memlet, + None, + False, + packed_types=packed_types) + dst_expr = self.copy_expr( + sdfg, dst_node.data, memlet, packed_types=packed_types) + if memlet.other_subset is not None: + src_expr = self.copy_expr( + sdfg, + src_node.data, + memlet, + memlet.other_subset, + False, + packed_types=packed_types) + src_subset = memlet.other_subset + else: + src_subset = subsets.Range.from_array(src_nodedesc) + dst_subset = memlet.subset + + src_strides = src_subset.absolute_strides(src_nodedesc.strides) + dst_strides = dst_subset.absolute_strides(dst_nodedesc.strides) + + # Try to turn into degenerate/strided ND copies + result = ndcopy_to_strided_copy(copy_shape, src_nodedesc.strides, + src_strides, dst_nodedesc.strides, + dst_strides, memlet.subset) + if result is not None: + copy_shape, src_strides, dst_strides = result + else: + # If other_subset is defined, reduce its dimensionality by + # removing the "empty" dimensions (size = 1) and filter the + # corresponding strides out + src_strides = [ + stride for stride, s in zip(src_strides, src_subset.size()) + if s != 1 + ] + src_strides[len(src_subset):] # Include tiles + if not src_strides: + src_strides = [1] + dst_strides = [ + stride for stride, s in zip(dst_strides, dst_subset.size()) + if s != 1 + ] + dst_strides[len(dst_subset):] # Include tiles + if not dst_strides: + dst_strides = [1] + copy_shape = [s for s in copy_shape if s != 1] + if not copy_shape: + copy_shape = [1] + + # Extend copy shape to the largest among the data dimensions, + # and extend other array with the appropriate strides + if (len(dst_strides) != len(copy_shape) + or len(src_strides) != len(copy_shape)): + if memlet.data == src_node.data: + copy_shape, dst_strides = _reshape_strides( + src_subset, src_strides, dst_strides, copy_shape) + elif memlet.data == dst_node.data: + copy_shape, src_strides = _reshape_strides( + dst_subset, dst_strides, src_strides, copy_shape) + + if memlet.veclen != 1: + int_floor = sp.Function('int_floor') + src_strides[:-1] = [ + int_floor(s, memlet.veclen) for s in src_strides[:-1] + ] + dst_strides[:-1] = [ + int_floor(s, memlet.veclen) for s in dst_strides[:-1] + ] + if not packed_types: + copy_shape[-1] = int_floor(copy_shape[-1], memlet.veclen) + + return copy_shape, src_strides, dst_strides, src_expr, dst_expr + + ######################################################################### + # Dynamically-called node dispatchers + + def _generate_Tasklet(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + callsite_stream.write('{\n', sdfg, state_id, node) + + # Add code to init and exit functions + self._frame._initcode.write(node.code_init, sdfg) + self._frame._exitcode.write(node.code_exit, sdfg) + + state_dfg = sdfg.nodes()[state_id] + + 
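+ # Rough shape of the code emitted by this method (connector names
+ # "a"/"b" assumed):
+ #     {
+ #         auto __a = dace::ArrayViewIn<...>(...);  // input memlets
+ #         double a = __a.val<1>();
+ #         ///////////////////
+ #         <unparsed tasklet code>
+ #         ///////////////////
+ #         __b.write(b);                            // output memlets
+ #     }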
self._dispatcher.defined_vars.enter_scope(node) + + arrays = set() + for edge in state_dfg.in_edges(node): + u = edge.src + memlet = edge.data + + if edge.dst_conn: # Not (None or "") + if edge.dst_conn in arrays: # Disallow duplicates + raise SyntaxError('Duplicates found in memlets') + # Special case: code->code + if isinstance(edge.src, nodes.CodeNode): + shared_data_name = 's%d_n%d%s_n%d%s' % ( + state_id, dfg.node_id(edge.src), edge.src_conn, + dfg.node_id(edge.dst), edge.dst_conn) + + # Read variable from shared storage + callsite_stream.write( + 'const dace::vec<%s, %s>& %s = __%s;' % + (sdfg.arrays[memlet.data].dtype.ctype, + sym2cpp(memlet.veclen), edge.dst_conn, + shared_data_name), sdfg, state_id, + [edge.src, edge.dst]) + self._dispatcher.defined_vars.add(edge.dst_conn, + DefinedType.Scalar) + + else: + src_node = find_input_arraynode(state_dfg, edge) + + self._dispatcher.dispatch_copy( + src_node, node, edge, sdfg, state_dfg, state_id, + function_stream, callsite_stream) + + # Also define variables in the C++ unparser scope + self._locals.define(edge.dst_conn, -1, self._ldepth + 1) + arrays.add(edge.dst_conn) + + callsite_stream.write('\n', sdfg, state_id, node) + + # Use outgoing edges to preallocate output local vars + for edge in state_dfg.out_edges(node): + v = edge.dst + memlet = edge.data + + if edge.src_conn: + if edge.src_conn in arrays: # Disallow duplicates + continue + # Special case: code->code + if isinstance(edge.dst, nodes.CodeNode): + callsite_stream.write( + 'dace::vec<%s, %s> %s;' % + (sdfg.arrays[memlet.data].dtype.ctype, + sym2cpp(memlet.veclen), edge.src_conn), sdfg, + state_id, [edge.src, edge.dst]) + self._dispatcher.defined_vars.add(edge.src_conn, + DefinedType.Scalar) + else: + dst_node = find_output_arraynode(state_dfg, edge) + + self._dispatcher.dispatch_copy( + node, dst_node, edge, sdfg, state_dfg, state_id, + function_stream, callsite_stream) + + # Also define variables in the C++ unparser scope + self._locals.define(edge.src_conn, -1, self._ldepth + 1) + arrays.add(edge.src_conn) + + callsite_stream.write('\n ///////////////////\n', sdfg, state_id, + node) + + unparse_tasklet(sdfg, state_id, dfg, node, function_stream, + callsite_stream, self._locals, self._ldepth) + + callsite_stream.write(' ///////////////////\n\n', sdfg, state_id, + node) + + # Process outgoing memlets + self.process_out_memlets(sdfg, state_id, node, state_dfg, + self._dispatcher, callsite_stream, True, + function_stream) + + ############################################################# + # Instrumentation: Post-tasklet + if PerfSettings.perf_enable_instrumentation( + ) and PerfUtils.has_surrounding_perfcounters(node, dfg): + # Add bytes moved + callsite_stream.write( + "__perf_store.addBytesMoved(%s);" % + PerfUtils.get_tasklet_byte_accesses(node, dfg, sdfg, state_id)) + ############################################################# + + callsite_stream.write('}\n', sdfg, state_id, node) + + self._dispatcher.defined_vars.exit_scope(node) + + def _generate_EmptyTasklet(self, sdfg, dfg, state_id, node, + function_stream, callsite_stream): + self._generate_Tasklet(sdfg, dfg, state_id, node, function_stream, + callsite_stream) + + def _generate_NestedSDFG(self, sdfg, dfg: ScopeSubgraphView, state_id, + node, function_stream: CodeIOStream, + callsite_stream: CodeIOStream): + + self._dispatcher.defined_vars.enter_scope(sdfg) + + # If SDFG parent is not set, set it + node.sdfg._parent = sdfg + state_dfg = sdfg.nodes()[state_id] + + # Take care of nested SDFG I/O + for _, _, _, vconn, 
in_memlet in state_dfg.in_edges(node): + callsite_stream.write( + self.memlet_definition(sdfg, in_memlet, False, vconn), sdfg, + state_id, node) + for _, uconn, _, _, out_memlet in state_dfg.out_edges(node): + callsite_stream.write( + self.memlet_definition(sdfg, out_memlet, True, uconn), sdfg, + state_id, node) + + callsite_stream.write('\n ///////////////////\n', sdfg, state_id, + node) + + sdfg_label = '_%d_%d' % (state_id, dfg.node_id(node)) + # Generate code for internal SDFG + global_code, local_code, used_targets = \ + self._frame.generate_code(node.sdfg, node.schedule, sdfg_label) + + # Write generated code in the proper places (nested SDFG writes + # location info) + function_stream.write(global_code) + callsite_stream.write(local_code) + + callsite_stream.write(' ///////////////////\n\n', sdfg, state_id, + node) + + # Process outgoing memlets with the internal SDFG + self.process_out_memlets(sdfg, state_id, node, state_dfg, + self._dispatcher, callsite_stream, True, + function_stream) + + self._dispatcher.defined_vars.exit_scope(sdfg) + + def _generate_MapEntry(self, sdfg, dfg, state_id, node: nodes.MapEntry, + function_stream, callsite_stream): + map_params = node.map.params + map_name = '__DACEMAP_' + str(state_id) + '_' + str(dfg.node_id(node)) + + unified_id = PerfUtils.unified_id(dfg.node_id(node), state_id) + + ############################################################# + # Instrumentation: Pre-MapEntry + + # Intrusively set the depth + PerfUtils.set_map_depth(node, dfg) + + result = callsite_stream + + map_header = '' + + if PerfSettings.perf_enable_instrumentation(): + idstr = "// (Node %d)\n" % unified_id + map_header += idstr # Used to identify line numbers later + PerfMetaInfoStatic.info.add_node(node, idstr) + + if node.map.schedule == types.ScheduleType.CPU_Multicore: + # We have to find out if we should mark a section start here or later. + children = PerfUtils.all_maps(node, dfg) + + for x in children: + if PerfUtils.map_depth( + x) > PerfSettings.perf_max_scope_depth(): + break # We have our relevant nodes. + if x.map.schedule == types.ScheduleType.CPU_Multicore: + # nested SuperSections are not well-supported + # We have to mark the outermost section, + # which also means that we have to somehow tell the + # lower nodes to not mark the section start. + x.map._can_be_supersection_start = False + + if PerfSettings.perf_enable_instrumentation_for( + sdfg, node + ) and PerfUtils.map_depth( + node + ) <= PerfSettings.perf_max_scope_depth( + ) and node.map._can_be_supersection_start and not dfg.is_parallel( + ): + map_header += "__perf_store.markSuperSectionStart(%d);\n" % unified_id + elif PerfSettings.perf_supersection_emission_debug(): + reasons = [] + if not node.map._can_be_supersection_start: + reasons.append("CANNOT_BE_SS") + if dfg.is_parallel(): + reasons.append("CONTAINER_IS_PARALLEL") + if PerfUtils.map_depth( + node) > PerfSettings.perf_max_scope_depth(): + reasons.append("EXCEED_MAX_DEPTH") + if not PerfSettings.perf_enable_instrumentation_for( + sdfg, node): + reasons.append("MISC") + + map_header += "// SuperSection start not emitted. 
Reasons: " + ",".join( + reasons) + "\n" + + elif PerfSettings.perf_enable_instrumentation_for( + sdfg, node + ) and PerfUtils.map_depth(node) == PerfSettings.perf_max_scope_depth( + ) and node.map._can_be_supersection_start and not dfg.is_parallel(): + # even if the schedule is sequential, we can serialize to + # keep buffer usage low + map_header += "__perf_store.markSuperSectionStart(%d);\n" % unified_id + + if PerfUtils.instrument_entry( + node, dfg) and PerfSettings.perf_enable_instrumentation_for( + sdfg, node): + + size = PerfUtils.accumulate_byte_movements_v2( + node, node, dfg, sdfg, state_id) + size = sp.simplify(size) + + used_symbols = symbolic.symbols_in_sympy_expr(size) + defined_symbols = sdfg.symbols_defined_at(node) + undefined_symbols = [ + x for x in used_symbols if x not in defined_symbols + ] + if len(undefined_symbols) > 0: + # We cannot statically determine the size at this point + print( + "Failed to determine size because of undefined symbols (\"" + + str(undefined_symbols) + "\") in \"" + str(size) + + "\", falling back to 0") + size = 0 + + size = sym2cpp(size) + + map_header += "__perf_store.markSectionStart(%d, (long long)%s, PAPI_thread_id());\n" % ( + unified_id, size) + + ############################################################# + + if node.map.schedule == types.ScheduleType.CPU_Multicore: + map_header += '#pragma omp parallel for' + openmp_parallel_for_defined = True + + # The code below is disabled since we now use pragma omp atomic + # TODO(later): set up register outside loop + #exit_node = dfg.exit_nodes(node)[0] + reduction_stmts = [] + #for outedge in dfg.in_edges(exit_node): + # if (isinstance(outedge.src, nodes.CodeNode) + # and outedge.data.wcr is not None): + # redt = operations.detect_reduction_type(outedge.data.wcr) + # if redt != types.ReductionType.Custom: + # reduction_stmts.append('reduction({typ}:{var})'.format( + # typ=_REDUCTION_TYPE_TO_OPENMP[redt], + # var=outedge.src_conn)) + # reduced_variables.append(outedge) + + map_header += ' %s\n' % ', '.join(reduction_stmts) + + # TODO: Explicit map unroller + if node.map.unroll: + if node.map.schedule == types.ScheduleType.CPU_Multicore: + raise ValueError('An Multicore CPU map cannot be unrolled (' + + node.map.label + ')') + + constsize = all([ + not symbolic.issymbolic(v, sdfg.constants) for r in node.map.range + for v in r + ]) + + # Construct (EXCLUSIVE) map range as a list of comma-delimited C++ + # strings. 
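+ # For example, a map parameter over the inclusive SDFG range 0:N-1
+ # with step 1 becomes the string "0, N, 1": adding 1 to the
+ # inclusive end bound makes the emitted C++ range exclusive.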
+ maprange_cppstr = [ + '%s, %s, %s' % (sym2cpp(rb), sym2cpp(re + 1), sym2cpp(rs)) + for rb, re, rs in node.map.range + ] + + # Map flattening + if node.map.flatten: + + ############################################################# + # Instrumentation: Post-MapEntry (pre-definitions) + perf_entry_string = ( + 'dace_perf::%s __perf_%d;\n' + + 'auto& __vs_%d = __perf_store.getNewValueSet(__perf_%d, %d, PAPI_thread_id(), %%s);\n' + + '__perf_%d.enterCritical();\n') % ( + PerfUtils.perf_counter_string(node), unified_id, + unified_id, unified_id, unified_id, unified_id) + ############################################################# + + # If the integer set is constant-sized, emit const_int_range + if constsize: + # Generate the loop + result.write( + """ +typedef dace::const_int_range<{range}> {mapname}_rng; +{map_header} +for (int {mapname}_iter = 0; {mapname}_iter < {mapname}_rng::size; ++{mapname}_iter) {{ + """.format( + range=', '.join(maprange_cppstr), + map_header=map_header, + mapname=map_name), sdfg, state_id, node) + + ############################################################# + # Instrumentation: Post-MapEntry (pre-definitions) + # Perfcounters for flattened maps include the calculations + # made to obtain the different axis indices + if PerfUtils.instrument_entry( + node, + dfg) and PerfSettings.perf_enable_instrumentation_for( + sdfg, node): + result.write(perf_entry_string % (map_name + "_iter"), + sdfg, state_id, node) + # remember which map has the counters enabled + node.map._has_papi_counters = True + ############################################################# + + # Generate the variables + for ind, var in enumerate(map_params): + result.write( + ('auto {var} = {mapname}_rng' + + '::index_value({mapname}_iter, ' + '{ind});').format( + ind=ind, var=var, + mapname=map_name), sdfg, state_id, node) + else: # Runtime-size integer range set + # Generate the loop + result.write( + """ +auto {mapname}_rng = dace::make_range({tuplerange}); +{map_header} +for (int {mapname}_iter = 0; {mapname}_iter < {mapname}_rng.size(); ++{mapname}_iter) {{ + """.format( + tuplerange=', '.join([ + 'std::make_tuple(%s)' % cppr + for cppr in maprange_cppstr + ]), + map_header=map_header, + mapname=map_name), sdfg, state_id, node) + + ############################################################# + # Instrumentation: Post-MapEntry (pre-definitions) + # Perfcounters for flattened maps include the calculations + # made to obtain the different axis indices + if PerfUtils.instrument_entry( + node, + dfg) and PerfSettings.perf_enable_instrumentation_for( + sdfg, node): + result.write(perf_entry_string % (map_name + "_iter"), + sdfg, state_id, node) + # remember which map has the counters enabled + node.map._has_papi_counters = True + ############################################################# + + # Generate the variables + for ind, var in enumerate(map_params): + result.write( + ('auto {var} = {mapname}_rng' + + '.index_value({mapname}_iter, ' + '{ind});').format( + ind=ind, var=var, + mapname=map_name), sdfg, state_id, node) + + else: # Nested loops + result.write(map_header, sdfg, state_id, node) + for i, r in enumerate(node.map.range): + #var = '__DACEMAP_%s_%d' % (node.map.label, i) + var = map_params[i] + begin, end, skip = r + + if node.map.unroll: + result.write('#pragma unroll', sdfg, state_id, node) + + result.write( + 'for (auto %s = %s; %s < %s; %s += %s) {\n' % + (var, sym2cpp(begin), var, sym2cpp(end + 1), var, + sym2cpp(skip)), sdfg, state_id, node) + + 
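+ # e.g. (illustrative), a two-dimensional CPU_Multicore map over
+ # i in 0:M-1 and j in 0:N-1 emits:
+ #     #pragma omp parallel for
+ #     for (auto i = 0; i < M; i += 1) {
+ #         for (auto j = 0; j < N; j += 1) {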
############################################################# + # Instrumentation: Post-MapEntry (pre-definitions) + if PerfUtils.instrument_entry(node, dfg) and ( + (not PerfSettings.perf_debug_profile_innermost and i == 0) + or (PerfSettings.perf_debug_profile_innermost + and i == len(node.map.range) - 1) + ) and PerfSettings.perf_enable_instrumentation_for(sdfg, node): + result.write( + ('dace_perf::%s __perf_%d;\n' + + 'auto& __vs_%d = __perf_store.getNewValueSet(__perf_%d, %d, PAPI_thread_id(), %s);\n' + + '__perf_%d.enterCritical();\n') % + (PerfUtils.perf_counter_string(node), unified_id, + unified_id, unified_id, unified_id, var, unified_id), + sdfg, state_id, node) + # remember which map has the counters enabled + node.map._has_papi_counters = True + ############################################################# + + # Emit internal transient array allocation + to_allocate = dace.sdfg.local_transients(sdfg, dfg, node) + allocated = set() + for child in dfg.scope_dict(node_to_children=True)[node]: + if not isinstance(child, nodes.AccessNode): + continue + if child.data not in to_allocate or child.data in allocated: + continue + allocated.add(child.data) + self._dispatcher.dispatch_allocate(sdfg, dfg, state_id, child, + None, result) + self._dispatcher.dispatch_initialize(sdfg, dfg, state_id, child, + None, result) + + # Generate register definitions for inter-tasklet memlets + scope_dict = dfg.scope_dict() + for edge in dfg.edges(): + # Only interested in edges within current scope + if scope_dict[edge.src] != node or scope_dict[edge.dst] != node: + continue + if (isinstance(edge.src, nodes.CodeNode) + and isinstance(edge.dst, nodes.CodeNode)): + local_name = '__s%d_n%d%s_n%d%s' % ( + state_id, dfg.node_id(edge.src), edge.src_conn, + dfg.node_id(edge.dst), edge.dst_conn) + # Allocate variable type + code = 'dace::vec<%s, %s> %s;' % ( + sdfg.arrays[edge.data.data].dtype.ctype, + sym2cpp(edge.data.veclen), local_name) + result.write(code, sdfg, state_id, [edge.src, edge.dst]) + + def _generate_MapExit(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + result = callsite_stream + + # Obtain start of map + scope_dict = dfg.scope_dict() + map_node = scope_dict[node] + + if map_node is None: + raise ValueError('Exit node ' + str(node.map.label) + + ' is not dominated by a scope entry node') + + ############################################################# + # Instrumentation: Pre-MapExit + unified_id = PerfUtils.unified_id(dfg.node_id(map_node), state_id) + ############################################################# + + # Emit internal transient array deallocation + to_allocate = dace.sdfg.local_transients(sdfg, dfg, map_node) + deallocated = set() + for child in dfg.scope_dict(node_to_children=True)[map_node]: + if not isinstance(child, nodes.AccessNode): + continue + if child.data not in to_allocate or child.data in deallocated: + continue + deallocated.add(child.data) + self._dispatcher.dispatch_deallocate(sdfg, dfg, state_id, child, + None, result) + + # If there are other non-visited map exits, they are responsible for + # closing braces + map_exits = [ + k for k, v in scope_dict.items() + if v == map_node and isinstance(k, nodes.ExitNode) + and k not in self._generated_nodes + ] + if len(map_exits) > 1: + return + + # Map flattening + if map_node.map.flatten: + ############################################################# + # Instrumentation: Pre-MapExit + if PerfSettings.perf_enable_instrumentation( + ) and map_node.map._has_papi_counters: + result.write( + 
'__perf_%d.leaveCritical(__vs_%d);\n' % + (unified_id, unified_id), sdfg, state_id, node) + if PerfSettings.perf_debug_annotate_scopes: + result.write('// %s\n' % str(map_node), sdfg, state_id, node) + ############################################################# + result.write('}', sdfg, state_id, node) + else: + for i, r in enumerate(map_node.map.range): + ############################################################# + # Instrumentation: Pre-MapExit + if PerfSettings.perf_enable_instrumentation( + ) and map_node.map._has_papi_counters and ( + (PerfSettings.perf_debug_profile_innermost and i == 0) or + (not PerfSettings.perf_debug_profile_innermost + and i == len(map_node.map.range) - 1)): + result.write( + '__perf_%d.leaveCritical(__vs_%d);\n' % + (unified_id, unified_id), sdfg, state_id, node) + if PerfSettings.perf_debug_annotate_scopes and i == len( + map_node.map.range) - 1: + result.write('// %s\n' % str(map_node), sdfg, state_id, + node) + ############################################################# + result.write('}', sdfg, state_id, node) + + ############################################################# + # Instrumentation: Post-MapExit + if PerfSettings.perf_enable_vectorization_analysis(): + idstr = "// end (Node %d)\n" % unified_id + result.write(idstr, sdfg, state_id, node) + PerfMetaInfoStatic.info.add_node(node, idstr) + ############################################################# + + def _generate_ConsumeEntry(self, sdfg, dfg, state_id, node: nodes.MapEntry, + function_stream, callsite_stream): + result = callsite_stream + + constsize = all([ + not symbolic.issymbolic(v, sdfg.constants) for r in node.map.range + for v in r + ]) + state_dfg = sdfg.nodes()[state_id] + + input_sedge = next( + e for e in state_dfg.in_edges(node) if e.dst_conn == 'IN_stream') + output_sedge = next( + e for e in state_dfg.out_edges(node) if e.src_conn == 'OUT_stream') + input_stream = state_dfg.memlet_path(input_sedge)[0].src + input_streamdesc = input_stream.desc(sdfg) + + # Take chunks into account + if node.consume.chunksize == 1: + chunk = 'const %s& %s' % (input_streamdesc.dtype.ctype, + node.consume.label + '_element') + self._dispatcher.defined_vars.add(node.consume.label + "_element", + DefinedType.Scalar) + else: + chunk = 'const %s *%s, size_t %s' % ( + input_streamdesc.dtype.ctype, node.consume.label + '_elements', + node.consume.label + '_numelems') + self._dispatcher.defined_vars.add(node.consume.label + "_elements", + DefinedType.Pointer) + self._dispatcher.defined_vars.add(node.consume.label + "_numelems", + DefinedType.Scalar) + + # Take quiescence condition into account + if node.consume.condition is not None: + condition_string = ( + '[&]() { return %s; }, ' % cppunparse.cppunparse( + node.consume.condition, False)) + else: + condition_string = '' + + result.write( + 'dace::Consume<{chunksz}>::template consume{cond}({stream_in}, ' + '{num_pes}, {condition}' + '[&](int {pe_index}, {element_or_chunk}) {{'.format( + chunksz=node.consume.chunksize, + cond='' if node.consume.condition is None else '_cond', + condition=condition_string, + stream_in=input_stream.data, # TODO: stream arrays + element_or_chunk=chunk, + num_pes=sym2cpp(node.consume.num_pes), + pe_index=node.consume.pe_index), + sdfg, + state_id, + node) + + # Since consume is an alias node, we create an actual array for the + # consumed element and modify the outgoing memlet path ("OUT_stream") + # TODO: do this before getting to the codegen + if node.consume.chunksize == 1: + consumed_element = sdfg.add_scalar( + 
node.consume.label + '_element', + input_streamdesc.dtype, + transient=True, + storage=types.StorageType.Register) + ce_node = nodes.AccessNode(node.consume.label + '_element', + types.AccessType.ReadOnly) + else: + consumed_element = sdfg.add_array( + node.consume.label + '_elements', [node.consume.chunksize], + input_streamdesc.dtype, + transient=True, + storage=types.StorageType.Register) + ce_node = nodes.AccessNode(node.consume.label + '_elements', + types.AccessType.ReadOnly) + state_dfg.add_node(ce_node) + out_memlet_path = state_dfg.memlet_path(output_sedge) + state_dfg.remove_edge(out_memlet_path[0]) + state_dfg.add_edge( + out_memlet_path[0].src, out_memlet_path[0].src_conn, ce_node, None, + mmlt.Memlet.from_array(ce_node.data, ce_node.desc(sdfg))) + state_dfg.add_edge( + ce_node, None, out_memlet_path[0].dst, out_memlet_path[0].dst_conn, + mmlt.Memlet.from_array(ce_node.data, ce_node.desc(sdfg))) + for e in out_memlet_path[1:]: + e.data.data = ce_node.data + ## END of SDFG-rewriting code + + # Emit internal transient array allocation + to_allocate = dace.sdfg.local_transients(sdfg, dfg, node) + allocated = set() + for child in dfg.scope_dict(node_to_children=True)[node]: + if not isinstance(child, nodes.AccessNode): + continue + if child.data not in to_allocate or child.data in allocated: + continue + allocated.add(child.data) + self._dispatcher.dispatch_allocate(sdfg, dfg, state_id, child, + None, result) + self._dispatcher.dispatch_initialize(sdfg, dfg, state_id, child, + None, result) + + # Generate register definitions for inter-tasklet memlets + scope_dict = dfg.scope_dict() + for edge in dfg.edges(): + # Only interested in edges within current scope + if scope_dict[edge.src] != node or scope_dict[edge.dst] != node: + continue + if (isinstance(edge.src, nodes.CodeNode) + and isinstance(edge.dst, nodes.CodeNode)): + local_name = '__s%d_n%d%s_n%d%s' % ( + state_id, dfg.node_id(edge.src), edge.src_conn, + dfg.node_id(edge.dst), edge.dst_conn) + # Allocate variable type + code = 'dace::vec<%s, %s> %s;' % ( + sdfg.arrays[edge.data.data].dtype.ctype, + sym2cpp(edge.data.veclen), local_name) + result.write(code, sdfg, state_id, [edge.src, edge.dst]) + + def _generate_ConsumeExit(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + result = callsite_stream + + # Obtain start of map + scope_dict = dfg.scope_dict() + entry_node = scope_dict[node] + + if entry_node is None: + raise ValueError('Exit node ' + str(node.consume.label) + + ' is not dominated by a scope entry node') + + # Emit internal transient array deallocation + to_allocate = dace.sdfg.local_transients(sdfg, dfg, entry_node) + deallocated = set() + for child in dfg.scope_dict(node_to_children=True)[entry_node]: + if not isinstance(child, nodes.AccessNode): + continue + if child.data not in to_allocate or child.data in deallocated: + continue + deallocated.add(child.data) + self._dispatcher.dispatch_deallocate(sdfg, dfg, state_id, child, + None, result) + + result.write('});', sdfg, state_id, node) + + def _generate_Reduce(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + + unified_id = PerfUtils.unified_id(dfg.node_id(node), state_id) + + # Try to autodetect reduction type + redtype = operations.detect_reduction_type(node.wcr) + + loop_header = '' + + perf_should_instrument = PerfSettings.perf_enable_instrumentation( + ) and not PerfUtils.has_surrounding_perfcounters( + node, dfg) and PerfSettings.perf_enable_instrumentation_for( + sdfg, node) + + if node.schedule == 
types.ScheduleType.CPU_Multicore: + if PerfSettings.perf_enable_vectorization_analysis(): + idstr = "// (Node %d)\n" % dfg.node_id(node) + loop_header += idstr + PerfMetaInfoStatic.info.add_node(node, idstr) + loop_header += '#pragma omp parallel for' + + end_braces = 0 + + axes = node.axes + state_dfg = sdfg.nodes()[state_id] + input_memlet = state_dfg.in_edges(node)[0].data + output_edge = state_dfg.out_edges(node)[0] + output_memlet = output_edge.data + + output_type = 'dace::vec<%s, %s>' % ( + sdfg.arrays[output_memlet.data].dtype.ctype, output_memlet.veclen) + + # If axes were not defined, use all input dimensions + input_dims = input_memlet.subset.dims() + output_dims = output_memlet.subset.data_dims() + if axes is None: + axes = tuple(range(input_dims)) + + # Obtain variable names per output and reduction axis + axis_vars = [] + octr = 0 + for d in range(input_dims): + if d in axes: + axis_vars.append('__i%d' % d) + else: + axis_vars.append('__o%d' % octr) + octr += 1 + + ############################################################# + # Instrumentation: Pre-reduce + # For measuring the memory bandwidth, we analyze the amount of data + # moved. + if perf_should_instrument: + perf_expected_data_movement_sympy = 1 + + for axis in range(output_dims): + ao = output_memlet.subset[axis] + perf_expected_data_movement_sympy *= ( + (ao[1] + 1 - ao[0]) / ao[2]) + + for axis in axes: + ai = input_memlet.subset[axis] + perf_expected_data_movement_sympy *= ( + (ai[1] + 1 - ai[0]) / ai[2]) + + if not dfg.is_parallel(): + # Now we put a start marker, but only if we are in a serial state + callsite_stream.write( + '__perf_store.markSuperSectionStart(%d);\n' % (unified_id)) + + callsite_stream.write( + '__perf_store.markSectionStart(%d, (long long)%s, PAPI_thread_id());\n' + % (unified_id, + str(sp.simplify(perf_expected_data_movement_sympy)) + + (" * (sizeof(%s) + sizeof(%s))" % + (sdfg.arrays[output_memlet.data].dtype.ctype, + sdfg.arrays[input_memlet.data].dtype.ctype))), sdfg, + state_id, node) + ############################################################# + + # Write OpenMP loop pragma if there are output dimensions + if output_dims > 0: + callsite_stream.write(loop_header, sdfg, state_id, node) + + # Generate outer loops + output_subset = output_memlet.subset + for axis in range(output_dims): + callsite_stream.write( + 'for (int {var} = {begin}; {var} < {end}; {var} += {skip}) {{'. 
+ format( + var='__o%d' % axis, + begin=output_subset[axis][0], + end=output_subset[axis][1] + 1, + skip=output_subset[axis][2]), sdfg, state_id, node) + + ############################################################# + # Instrumentation: Reduce (part 1) + # This could prevent the compiler from parallelizing/vectorizing + if perf_should_instrument: + if ((end_braces == 0 + and not PerfSettings.perf_debug_profile_innermost) + or (end_braces == output_dims - 1 + and PerfSettings.perf_debug_profile_innermost)): + callsite_stream.write( + 'dace_perf::%s __perf_%d;\n' % + (PerfUtils.perf_counter_string(node), unified_id), + sdfg, state_id, node) + callsite_stream.write( + 'auto& __perf_%d_vs = __perf_store.getNewValueSet(__perf_%d, %d, PAPI_thread_id(), __o%d);\n' + % (unified_id, unified_id, unified_id, axis), sdfg, + state_id, node) + callsite_stream.write( + '__perf_%d.enterCritical();\n' % unified_id, sdfg, + state_id, node) + ############################################################# + end_braces += 1 + + ############################################################# + # Instrumentation: Reduce (part 2) + if end_braces == 0 and perf_should_instrument: + callsite_stream.write( + 'dace_perf::%s __perf_%d;\n' % + (PerfUtils.perf_counter_string(node), unified_id), sdfg, + state_id, node) + callsite_stream.write( + 'auto& __perf_%d_vs = __perf_store.getNewValueSet(__perf_%d, %d, PAPI_thread_id(), 0);\n' + % (unified_id, unified_id, unified_id), sdfg, state_id, node) + callsite_stream.write('__perf_%d.enterCritical();\n' % unified_id, + sdfg, state_id, node) + ############################################################# + + use_tmpout = False + if len(axes) == input_dims: + # Add OpenMP reduction clause if reducing all axes + if (redtype != types.ReductionType.Custom + and node.schedule == types.ScheduleType.CPU_Multicore): + loop_header += ' reduction(%s: __tmpout)' % ( + _REDUCTION_TYPE_TO_OPENMP[redtype]) + + # Output initialization + identity = '' + if node.identity is not None: + identity = ' = %s' % sym2cpp(node.identity) + callsite_stream.write( + '{\n%s __tmpout%s;' % (output_type, identity), sdfg, state_id, + node) + callsite_stream.write(loop_header, sdfg, state_id, node) + end_braces += 1 + use_tmpout = True + + # Generate inner loops (reducing) + input_subset = input_memlet.subset + for axis in axes: + callsite_stream.write( + 'for (int {var} = {begin}; {var} < {end}; {var} += {skip}) {{'. 
+ format( + var='__i%d' % axis, + begin=input_subset[axis][0], + end=input_subset[axis][1] + 1, + skip=input_subset[axis][2]), sdfg, state_id, node) + end_braces += 1 + + # Generate reduction code + credtype = 'dace::ReductionType::' + str( + redtype)[str(redtype).find('.') + 1:] + + # Use index expressions + outvar = ('__tmpout' if use_tmpout else cpp_array_expr( + sdfg, + output_memlet, + offset=['__o%d' % i for i in range(output_dims)], + relative_offset=False)) + invar = cpp_array_expr( + sdfg, input_memlet, offset=axis_vars, relative_offset=False) + + if redtype != types.ReductionType.Custom: + callsite_stream.write( + 'dace::wcr_fixed<%s, %s>::reduce_atomic(&%s, %s);' % + (credtype, output_type, outvar, invar), sdfg, state_id, + node) #cpp_array_expr(), cpp_array_expr() + else: + callsite_stream.write( + 'dace::wcr_custom<%s>::template reduce_atomic(%s, &%s, %s);' % + (output_type, unparse_cr(node.wcr), outvar, invar), sdfg, + state_id, node) #cpp_array_expr(), cpp_array_expr() + + ############################################################# + # Instrumentation: Post-Reduce (pre-braces) + byte_moved_measurement = "__perf_store.addBytesMoved(%s);\n" + + # For reductions, we assume Read-Modify-Write for all operations + # Every reduction statement costs sizeof(input) + sizeof(output). + # This is wrong with some custom reductions or extending operations + # (e.g., i32 * i32 => i64) + # It also is wrong for write-avoiding min/max (min/max that only + # overwrite the reduced variable when it needs to be changed) + + if perf_should_instrument: + callsite_stream.write( + byte_moved_measurement % ("(sizeof(%s) + sizeof(%s))" % + (outvar, invar)), sdfg, state_id, + node) + ############################################################# + + # Generate closing braces + for i in range(end_braces): + # Store back tmpout into the true output + if i == end_braces - 1 and use_tmpout: + callsite_stream.write( + '%s = __tmpout;' % cpp_array_expr(sdfg, output_memlet), + sdfg, state_id, node) + ############################################################# + # Instrumentation: Post-Reduce (in-braces) + if perf_should_instrument and ( + (i == end_braces - 1 + and not PerfSettings.perf_debug_profile_innermost) or + (i == len(axes) + and PerfSettings.perf_debug_profile_innermost)): + callsite_stream.write( + '__perf_%d.leaveCritical(__perf_%d_vs);\n' % + (unified_id, unified_id), sdfg, state_id, node) + ############################################################# + callsite_stream.write('}', sdfg, state_id, node) + + def _generate_AccessNode(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + state_dfg = sdfg.nodes()[state_id] + + if node not in state_dfg.sink_nodes(): + # NOTE: sink nodes are synchronized at the end of a state + presynchronize_streams(sdfg, state_dfg, state_id, node, + callsite_stream) + + sdict = state_dfg.scope_dict() + for edge in state_dfg.in_edges(node): + predecessor, _, _, _, memlet = edge + if memlet.data is None: + continue # If the edge has to be skipped + + # Determines if this path ends here or has a definite source (array) node + memlet_path = state_dfg.memlet_path(edge) + if memlet_path[-1].dst == node: + src_node = memlet_path[0].src + # Only generate code in case this is the innermost scope + # (copies are generated at the inner scope, where both arrays exist) + if (scope_contains_scope(sdict, src_node, node) + and sdict[src_node] != sdict[node]): + self._dispatcher.dispatch_copy( + src_node, node, edge, sdfg, dfg, state_id, + function_stream, 
callsite_stream) + + # Process outgoing memlets (array-to-array write should be emitted + # from the first leading edge out of the array) + self.process_out_memlets(sdfg, state_id, node, state_dfg, + self._dispatcher, callsite_stream, False, + function_stream) + + +######################################################################## +######################################################################## +######################################################################## +######################################################################## +# Helper functions and classes + + +def _reshape_strides(subset, strides, original_strides, copy_shape): + """ Helper function that reshapes a shape to the given strides. """ + # TODO(later): Address original strides in the computation of the + # result strides. + original_copy_shape = subset.size() + dims = len(copy_shape) + + reduced_tile_sizes = [ + ts for ts, s in zip(subset.tile_sizes, original_copy_shape) if s != 1 + ] + + reshaped_copy = copy_shape + [ts for ts in subset.tile_sizes if ts != 1] + reshaped_copy[:len(copy_shape)] = [ + s / ts for s, ts in zip(copy_shape, reduced_tile_sizes) + ] + + new_strides = [0] * len(reshaped_copy) + elements_remaining = functools.reduce(sp.mul.Mul, copy_shape, 1) + tiledim = 0 + for i in range(len(copy_shape)): + new_strides[i] = elements_remaining / reshaped_copy[i] + elements_remaining = new_strides[i] + if reduced_tile_sizes[i] != 1: + new_strides[dims + tiledim] = ( + elements_remaining / reshaped_copy[dims + tiledim]) + elements_remaining = new_strides[dims + tiledim] + tiledim += 1 + + return reshaped_copy, new_strides + + +def ndcopy_to_strided_copy(copy_shape, src_shape, src_strides, dst_shape, + dst_strides, subset): + """ Detects situations where an N-dimensional copy can be degenerated into + a (faster) 1D copy or 2D strided copy. Returns new copy + dimensions and offsets to emulate the requested copy. 
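+ If the copy cannot be expressed this way, None is returned instead. For
+ example, copying a full 20x30 array degenerates to a single 1D copy of
+ 600 elements with unit strides.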
+ + @return: a 3-tuple: copy_shape, src_strides, dst_strides + """ + dims = len(copy_shape) + + # Cannot degenerate tiled copies + if any(ts != 1 for ts in subset.tile_sizes): + return None + + # 1D copy of the whole array + if (tuple(copy_shape) == tuple(src_shape) + and tuple(copy_shape) == tuple(dst_shape)): + copy_shape = [functools.reduce(lambda x, y: x * y, copy_shape)] + return copy_shape, [1], [1] + # 1D strided copy + elif sum([0 if c == 1 else 1 for c in copy_shape]) == 1: + # Find the copied dimension: + # In copy shape + copydim = next(i for i, c in enumerate(copy_shape) if c != 1) + + # In source strides + if len(copy_shape) == len(src_shape): + srcdim = copydim + else: + srcdim = next(i for i, c in enumerate(src_shape) if c != 1) + + # In destination strides + if len(copy_shape) == len(dst_shape): + dstdim = copydim + else: + dstdim = next(i for i, c in enumerate(dst_shape) if c != 1) + + # Return new copy + return [copy_shape[copydim]], [src_strides[srcdim]], [ + dst_strides[dstdim] + ] + else: + return None + + +def ndslice_cpp(slice, dims, rowmajor=True): + result = StringIO() + + if len(slice) == 0: # Scalar + return '0' + + for i, d in enumerate(slice): + if isinstance(d, tuple): + raise SyntaxError( + 'CPU backend does not yet support ranges as inputs/outputs') + + # TODO(later): Use access order + + result.write(sym2cpp(d)) + + # If not last + if i < len(slice) - 1: + strdims = [str(dim) for dim in dims[i + 1:]] + result.write( + '*%s + ' % '*'.join(strdims)) # Multiply by leading dimensions + + return result.getvalue() + + +def cpp_offset_expr(d: data.Data, + subset_in: subsets.Subset, + offset=None, + packed_veclen=1): + """ Creates a C++ expression that can be added to a pointer in order + to offset it to the beginning of the given subset and offset. + @param d: The data structure to use for sizes/strides. + @param subset: The subset to offset by. + @param offset: An additional list of offsets or a Subset object + @param packed_veclen: If packed types are targeted, specifies the + vector length that the final offset should be + divided by. + @return: A string in C++ syntax with the correct offset + """ + subset = copy.deepcopy(subset_in) + + # Offset according to parameters + if offset is not None: + if isinstance(offset, subsets.Subset): + subset.offset(offset, False) + else: + subset.offset(subsets.Indices(offset), False) + + # Then, offset according to array + subset.offset(subsets.Indices(d.offset), False) + + # Obtain start range from offsetted subset + slice = [0] * len(d.strides) #subset.min_element() + + index = subset.at(slice, d.strides) + if packed_veclen > 1: + index /= packed_veclen + + return sym2cpp(index) + + +def cpp_array_expr(sdfg, + memlet, + with_brackets=True, + offset=None, + relative_offset=True, + packed_veclen=1): + """ Converts an Indices/Range object to a C++ array access string. """ + s = memlet.subset if relative_offset else subsets.Indices(offset) + o = offset if relative_offset else None + offset_cppstr = cpp_offset_expr(sdfg.arrays[memlet.data], s, o, + packed_veclen) + + if with_brackets: + return '%s[%s]' % (memlet.data, offset_cppstr) + else: + return offset_cppstr + + +def write_and_resolve_expr(memlet, nc, outname, inname, indices=None): + """ Helper function that emits a write_and_resolve call from a memlet. 
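+ For a detected sum reduction without explicit indices, for instance, the
+ emitted call has the form 'out.write_and_resolve<dace::ReductionType::Sum>(in);'.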
""" + + redtype = operations.detect_reduction_type(memlet.wcr) + + nc = '_nc' if nc else '' + indstr = (', ' + indices) if indices is not None else '' + + reduction_tmpl = '' + custom_reduction = '' + + # Special call for detected reduction types + if redtype != types.ReductionType.Custom: + credtype = ('dace::ReductionType::' + + str(redtype)[str(redtype).find('.') + 1:]) + reduction_tmpl = '<%s>' % credtype + else: + custom_reduction = ', %s' % unparse_cr(memlet.wcr) + + return '{oname}.write_and_resolve{nc}{tmpl}({iname}{wcr}{ind});'.format( + oname=outname, + nc=nc, + tmpl=reduction_tmpl, + iname=inname, + wcr=custom_reduction, + ind=indstr) + + +def is_write_conflicted(dfg, edge, datanode=None): + """ Detects whether a write-conflict-resolving edge can be emitted without + using atomics or critical sections. """ + + if edge.data.wcr_conflict is not None and not edge.data.wcr_conflict: + return False + + if edge is None: + start_node = None + memlet = None + else: + start_node = edge.dst + memlet = edge.data + + # If it's an entire SDFG, it's probably write-conflicted + if isinstance(dfg, SDFG): + if datanode is None: return True + in_edges = find_incoming_edges(datanode, dfg) + if len(in_edges) != 1: return True + if (isinstance(in_edges[0].src, nodes.ExitNode) and + in_edges[0].src.map.schedule == types.ScheduleType.Sequential): + return False + return True + + # Traverse memlet path to determine conflicts. + # If no conflicts will occur, write without atomics + # (e.g., if the array has been defined in a non-parallel schedule context) + # TODO: This is not perfect (need to take indices into consideration) + path = dfg.memlet_path(edge) + for e in path: + if (isinstance(e.dst, nodes.ExitNode) + and e.dst.map.schedule != types.ScheduleType.Sequential): + return True + # Should never happen (no such thing as write-conflicting reads) + if (isinstance(e.src, nodes.EntryNode) + and e.src.map.schedule != types.ScheduleType.Sequential): + return True + + return False + + +def unparse_cr(wcr_ast): + """ Outputs a C++ version of a conflict resolution lambda. 
""" + + if isinstance(wcr_ast, ast.Lambda): + return cppunparse.cppunparse(wcr_ast, expr_semicolon=False) + elif isinstance(wcr_ast, ast.FunctionDef): + # Construct a lambda function out of a function + return '[] (%s) { %s }' % ( + cppunparse.cppunparse(wcr_ast.args, expr_semicolon=False), + cppunparse.cppunparse(wcr_ast.body, expr_semicolon=False)) + elif isinstance(wcr_ast, ast.Module): + return unparse_cr(wcr_ast.body[0].value) + elif isinstance(wcr_ast, str): + return unparse_cr(LambdaProperty.from_string(wcr_ast)) + else: + raise NotImplementedError('INVALID TYPE OF WCR: ' + + type(wcr_ast).__name__) + + +def unparse_tasklet(sdfg, state_id, dfg, node, function_stream, + callsite_stream, locals, ldepth): + + if node.label is None or node.label == "": + return '' + + state_dfg = sdfg.nodes()[state_id] + unified_id = PerfUtils.unified_id(dfg.node_id(node), state_id) + + # Not [], "" or None + if not node.code: + return '' + + # Not [], "" or None + if node.code_global: + if node.language is not types.Language.CPP: + raise ValueError( + "Global code only supported for C++ tasklets: got {}".format( + node.language)) + function_stream.write( + type(node).__properties__["code_global"].to_string( + node.code_global), sdfg, state_id, node) + function_stream.write("\n", sdfg, state_id, node) + + # If raw C++ code, return the code directly + if node.language != types.Language.Python: + # If this code runs on the host and is associated with a CUDA stream, + # set the stream to a local variable. + max_streams = Config.get('compiler', 'cuda', 'max_concurrent_streams') + if (max_streams >= 0 and not is_devicelevel(sdfg, state_dfg, node) + and hasattr(node, '_cuda_stream')): + callsite_stream.write( + 'cudaStream_t __dace_current_stream = dace::cuda::__streams[%d];' + % node._cuda_stream, sdfg, state_id, node) + + if node.language != types.Language.CPP: + raise ValueError( + "Only Python or C++ code supported in CPU codegen, got: {}". 
+ format(node.language)) + callsite_stream.write( + type(node).__properties__["code"].to_string(node.code), sdfg, + state_id, node) + + if (hasattr(node, '_cuda_stream') + and not is_devicelevel(sdfg, state_dfg, node)): + synchronize_streams(sdfg, state_dfg, state_id, node, node, + callsite_stream) + return + + body = node.code + + # Map local names to memlets (for WCR detection) + memlets = {} + for edge in state_dfg.all_edges(node): + u, uconn, v, vconn, memlet = edge + if u == node: + memlet_nc = not is_write_conflicted(dfg, edge) + memlet_wcr = memlet.wcr + + memlets[uconn] = (memlet, memlet_nc, memlet_wcr) + elif v == node: + memlets[vconn] = (memlet, False, None) + + ############################################################# + # Instrumentation: Pre-Tasklet + if PerfSettings.perf_tasklets and PerfSettings.perf_enable_instrumentation( + ): + callsite_stream.write( + 'dace_perf::%s __perf_%s;\n' % + (PerfUtils.perf_counter_string(node), node.label), sdfg, state_id, + node) + callsite_stream.write( + 'auto& __perf_vs_%s = __perf_store.getNewValueSet(__perf_%s, %d, PAPI_thread_id(), 0);\n' + % (node.label, node.label, unified_id), sdfg, state_id, node) + + callsite_stream.write('__perf_%s.enterCritical();\n' % node.label, + sdfg, state_id, node) + + ############################################################# + + callsite_stream.write('// Tasklet code (%s)\n' % node.label, sdfg, + state_id, node) + for stmt in body: + if isinstance(stmt, ast.Expr): + rk = DaCeKeywordRemover(memlets, + sdfg.constants).visit_TopLevelExpr(stmt) + else: + rk = DaCeKeywordRemover(memlets, sdfg.constants).visit(stmt) + + if rk is not None: + # Unparse to C++ and add 'auto' declarations if locals not declared + result = StringIO() + cppunparse.CPPUnparser(rk, ldepth + 1, locals, result) + callsite_stream.write(result.getvalue(), sdfg, state_id, node) + + ############################################################# + # Instrumentation: Post-Tasklet + if PerfSettings.perf_tasklets and PerfSettings.perf_enable_instrumentation( + ): + callsite_stream.write( + '__perf_%s.leaveCritical(__perf_vs_%s);' % + (node.label, node.label), sdfg, state_id, node) + ############################################################# + + +def is_array_stream_view(sdfg, dfg, node): + """ Test whether a stream is directly connected to an array. """ + + # Test all memlet paths from the array. If the path goes directly + # to/from a stream, construct a stream array view + source_paths = [] + sink_paths = [] + for e in dfg.in_edges(node): + src_node = dfg.memlet_path(e)[0].src + if (isinstance(src_node, nodes.AccessNode) + and isinstance(src_node.desc(sdfg), data.Array)): + source_paths.append(src_node) + for e in dfg.out_edges(node): + sink_node = dfg.memlet_path(e)[-1].dst + if (isinstance(sink_node, nodes.AccessNode) + and isinstance(sink_node.desc(sdfg), data.Array)): + sink_paths.append(sink_node) + + # Special case: stream can be represented as a view of an array + if len(source_paths) == 1 or len(sink_paths) == 1: + # TODO: What about a source path? 
+ arrnode = sink_paths[0] + # Only works if the stream itself is not an array of streams + if list(node.desc(sdfg).shape) == [1]: + node.desc(sdfg).sink = arrnode.data # For memlet generation + arrnode.desc( + sdfg).src = node.data # TODO: Move src/sink to node, not array + return True + return False + + +def find_incoming_edges(node, dfg): + # If it's an entire SDFG, look in each state + if isinstance(dfg, SDFG): + result = [] + for state in dfg.nodes(): + result.extend(list(state.in_edges(node))) + return result + else: # If it's one state + return list(dfg.in_edges(node)) + + +def find_outgoing_edges(node, dfg): + # If it's an entire SDFG, look in each state + if isinstance(dfg, SDFG): + result = [] + for state in dfg.nodes(): + result.extend(list(state.out_edges(node))) + return result + else: # If it's one state + return list(dfg.out_edges(node)) + + +def sym2cpp(s): + """ Converts an array of symbolic variables (or one) to C++ strings. """ + if not isinstance(s, list): + return cppunparse.pyexpr2cpp(symbolic.symstr(s)) + return [cppunparse.pyexpr2cpp(symbolic.symstr(d)) for d in s] + + +class DaCeKeywordRemover(ExtNodeTransformer): + """ Removes memlets and other DaCe keywords from a Python AST, and + converts array accesses to C++ methods that can be generated. + + Used for unparsing Python tasklets into C++ that uses the DaCe + runtime. + + @note: Assumes that the DaCe syntax is correct (as verified by the + Python frontend). + """ + + def __init__(self, memlets, constants): + self.memlets = memlets + self.constants = constants + + def visit_TopLevelExpr(self, node): + # This is a DaCe shift, omit it + if isinstance(node.value, ast.BinOp): + if isinstance(node.value.op, ast.LShift) or isinstance( + node.value.op, ast.RShift): + return None + return self.generic_visit(node) + + def visit_AugAssign(self, node): + if not isinstance(node.target, ast.Subscript): + return self.generic_visit(node) + + target = rname(node.target) + if target not in self.memlets: + return self.generic_visit(node) + + raise SyntaxError('Augmented assignments (e.g. 
+=) not allowed on ' + + 'array memlets') + + def visit_Assign(self, node): + target = rname(node.targets[0]) + if target not in self.memlets: + return self.generic_visit(node) + + memlet, nc, wcr = self.memlets[target] + value = self.visit(node.value) + + if not isinstance(node.targets[0], ast.Subscript): + # Dynamic accesses -> every access counts + try: + if memlet is not None and memlet.num_accesses < 0: + if wcr is not None: + newnode = ast.Name( + id=write_and_resolve_expr( + memlet, nc, '__' + target, + cppunparse.cppunparse( + value, expr_semicolon=False))) + else: + newnode = ast.Name(id='__%s.write(%s);' % ( + target, + cppunparse.cppunparse(value, expr_semicolon=False)) + ) + + return ast.copy_location(newnode, node) + except TypeError: # cannot determine truth value of Relational + pass + + return self.generic_visit(node) + + slice = self.visit(node.targets[0].slice) + if not isinstance(slice, ast.Index): + raise NotImplementedError('Range subscripting not implemented') + + if isinstance(slice.value, ast.Tuple): + subscript = unparse(slice)[1:-1] + else: + subscript = unparse(slice) + + if wcr is not None: + newnode = ast.Name( + id=write_and_resolve_expr( + memlet, + nc, + '__' + target, + cppunparse.cppunparse(value, expr_semicolon=False), + indices=subscript)) + else: + newnode = ast.Name(id='__%s.write(%s, %s);' % ( + target, cppunparse.cppunparse(value, expr_semicolon=False), + subscript)) + + return ast.copy_location(newnode, node) + + def visit_Subscript(self, node): + target = rname(node) + if target not in self.memlets and target not in self.constants: + return self.generic_visit(node) + + slice = self.visit(node.slice) + if not isinstance(slice, ast.Index): + raise NotImplementedError('Range subscripting not implemented') + + if isinstance(slice.value, ast.Tuple): + subscript = unparse(slice)[1:-1] + else: + subscript = unparse(slice) + + if target in self.constants: + slice_str = ndslice_cpp( + subscript.split(', '), self.constants[target].shape) + newnode = ast.parse('%s[%s]' % (target, slice_str)).body[0].value + else: + newnode = ast.parse('__%s(%s)' % (target, subscript)).body[0].value + return ast.copy_location(newnode, node) + + def visit_Expr(self, node): + # Check for DaCe function calls + if isinstance(node.value, ast.Call): + # Some calls should not be parsed + if rname(node.value.func) == "define_local": + return None + elif rname(node.value.func) == "define_local_scalar": + return None + elif rname(node.value.func) == "define_stream": + return None + elif rname(node.value.func) == "define_streamarray": + return None + + return self.generic_visit(node) + + def visit_FunctionDef(self, node): + # Do not parse internal functions + return None + + # Replace default modules (e.g., math) with dace::math:: + def visit_Attribute(self, node): + attrname = rname(node) + module_name = attrname[:attrname.rfind('.')] + func_name = attrname[attrname.rfind('.') + 1:] + if module_name in types._ALLOWED_MODULES: + cppmodname = types._ALLOWED_MODULES[module_name] + return ast.copy_location( + ast.Name(id=(cppmodname + func_name), ctx=ast.Load), node) + return self.generic_visit(node) + + +def unique(seq): + seen = set() + return [x for x in seq if not (x in seen or seen.add(x))] + + +# TODO: This should be in the CUDA code generator. 
Add appropriate conditions to node dispatch predicate +def presynchronize_streams(sdfg, dfg, state_id, node, callsite_stream): + state_dfg = sdfg.nodes()[state_id] + if hasattr(node, '_cuda_stream') or is_devicelevel(sdfg, state_dfg, node): + return + for e in state_dfg.in_edges(node): + if hasattr(e.src, '_cuda_stream'): + cudastream = 'dace::cuda::__streams[%d]' % e.src._cuda_stream + callsite_stream.write('cudaStreamSynchronize(%s);' % cudastream, + sdfg, state_id, [e.src, e.dst]) + + +# TODO: This should be in the CUDA code generator. Add appropriate conditions to node dispatch predicate +def synchronize_streams(sdfg, dfg, state_id, node, scope_exit, + callsite_stream): + # Post-kernel stream synchronization (with host or other streams) + max_streams = Config.get('compiler', 'cuda', 'max_concurrent_streams') + if max_streams >= 0: + cudastream = 'dace::cuda::__streams[%d]' % node._cuda_stream + for edge in dfg.out_edges(scope_exit): + # Synchronize end of kernel with output data (multiple kernels + # lead to same data node) + if (isinstance(edge.dst, nodes.AccessNode) + and edge.dst._cuda_stream != node._cuda_stream): + callsite_stream.write( + '''cudaEventRecord(dace::cuda::__events[{ev}], {src_stream}); +cudaStreamWaitEvent(dace::cuda::__streams[{dst_stream}], dace::cuda::__events[{ev}], 0);''' + .format( + ev=edge._cuda_event, + src_stream=cudastream, + dst_stream=edge.dst._cuda_stream), sdfg, state_id, + [edge.src, edge.dst]) + continue + + # We need the streams leading out of the output data + for e in dfg.out_edges(edge.dst): + if isinstance(e.dst, nodes.AccessNode): + continue + # If no stream at destination: synchronize stream with host. + if not hasattr(e.dst, '_cuda_stream'): + pass + # Done at destination + + # If different stream at destination: record event and wait + # for it in target stream. 
+ elif e.dst._cuda_stream != node._cuda_stream: + callsite_stream.write( + '''cudaEventRecord(dace::cuda::__events[{ev}], {src_stream}); + cudaStreamWaitEvent(dace::cuda::__streams[{dst_stream}], dace::cuda::__events[{ev}], 0);''' + .format( + ev=e._cuda_event, + src_stream=cudastream, + dst_stream=e.dst._cuda_stream), sdfg, state_id, + [e.src, e.dst]) + # Otherwise, no synchronization necessary diff --git a/dace/codegen/targets/cuda.py b/dace/codegen/targets/cuda.py new file mode 100644 index 0000000000..b6abdd8727 --- /dev/null +++ b/dace/codegen/targets/cuda.py @@ -0,0 +1,1794 @@ +from six import StringIO +import ast +import ctypes +import functools +import os +import sympy + +import dace +from dace.frontend import operations +from dace import subsets, symbolic, types +from dace.config import Config +from dace.graph import nodes +from dace.sdfg import ScopeSubgraphView, SDFG, SDFGState, scope_contains_scope, is_devicelevel +from dace.codegen.codeobject import CodeObject +from dace.codegen.prettycode import CodeIOStream +from dace.codegen.targets.target import (TargetCodeGenerator, IllegalCopy, + make_absolute, DefinedType) +from dace.codegen.targets.cpu import (sym2cpp, unparse_cr, cpp_array_expr, + is_array_stream_view, + synchronize_streams) +from dace.codegen.targets.framecode import _set_default_schedule_and_storage_types +from dace.properties import LambdaProperty + +from dace.codegen import cppunparse + +_SPECIAL_RTYPES = { + types.ReductionType.Min_Location: 'ArgMin', + types.ReductionType.Max_Location: 'ArgMax', +} + + +def prod(iterable): + return functools.reduce(sympy.mul.Mul, iterable, 1) + + +def _expr(val): + if isinstance(val, symbolic.SymExpr): + return val.expr + return val + + +class CUDACodeGen(TargetCodeGenerator): + """ GPU (CUDA) code generator. 
""" + target_name = 'cuda' + title = 'CUDA' + language = 'cu' + + def __init__(self, frame_codegen, sdfg): + self._frame = frame_codegen + self._dispatcher = frame_codegen.dispatcher + dispatcher = self._dispatcher + + self._in_device_code = False + self._cpu_codegen = None + self._block_dims = None + self._codeobject = CodeObject(sdfg.name + '_' + 'cuda', '', 'cu', + CUDACodeGen, 'CUDA') + self._localcode = CodeIOStream() + self._globalcode = CodeIOStream() + self._initcode = CodeIOStream() + self._exitcode = CodeIOStream() + self._global_sdfg = sdfg + self._toplevel_schedule = None + + # Keep track of current "scope entry/exit" code streams for extra + # code generation + self.scope_entry_stream = self._initcode + self.scope_exit_stream = self._exitcode + + # Annotate CUDA streams and events + self._cuda_streams, self._cuda_events = self._compute_cudastreams(sdfg) + + # Register dispatchers + self._cpu_codegen = dispatcher.get_generic_node_dispatcher() + + # Register additional CUDA dispatchers + dispatcher.register_map_dispatcher(types.GPU_SCHEDULES, self) + + dispatcher.register_node_dispatcher( + self, CUDACodeGen.node_dispatch_predicate) + + dispatcher.register_state_dispatcher(self, + self.state_dispatch_predicate) + + gpu_storage = [ + types.StorageType.GPU_Global, types.StorageType.GPU_Shared, + types.StorageType.GPU_Stack + ] + dispatcher.register_array_dispatcher(gpu_storage, self) + dispatcher.register_array_dispatcher(types.StorageType.CPU_Pinned, + self) + + for storage in gpu_storage: + for other_storage in types.StorageType: + dispatcher.register_copy_dispatcher(storage, other_storage, + None, self) + dispatcher.register_copy_dispatcher(other_storage, storage, + None, self) + + # Register illegal copies + cpu_unpinned_storage = [ + types.StorageType.CPU_Heap, types.StorageType.CPU_Stack + ] + gpu_private_storage = [ + types.StorageType.GPU_Shared, types.StorageType.GPU_Stack + ] + illegal_copy = IllegalCopy() + for st in cpu_unpinned_storage: + for gst in gpu_private_storage: + dispatcher.register_copy_dispatcher(st, gst, None, + illegal_copy) + dispatcher.register_copy_dispatcher(gst, st, None, + illegal_copy) + for st in cpu_unpinned_storage: + for sched_type in [ + types.ScheduleType.GPU_Device, + types.ScheduleType.GPU_ThreadBlock + ]: + dispatcher.register_copy_dispatcher( + st, types.StorageType.Register, sched_type, illegal_copy) + dispatcher.register_copy_dispatcher( + types.StorageType.Register, st, sched_type, illegal_copy) + # End of illegal copies + # End of dispatcher registration + ###################################### + + # Generate final code + def get_generated_codeobjects(self): + fileheader = CodeIOStream() + self._frame.generate_fileheader(self._global_sdfg, fileheader) + + self._codeobject.code = """ +#include +#include + +{file_header} + +DACE_EXPORTED int __dace_init_cuda({params}); +DACE_EXPORTED void __dace_exit_cuda({params}); + +{other_globalcode} + +namespace dace {{ namespace cuda {{ + cudaStream_t __streams[{nstreams}]; + cudaEvent_t __events[{nevents}]; +}} }} + +int __dace_init_cuda({params}) {{ + int count; + + // Check that we are able to run CUDA code + if (cudaGetDeviceCount(&count) != cudaSuccess) + {{ + printf("ERROR: CUDA drivers are not configured or CUDA-capable device " + "not found\\n"); + return 1; + }} + if (count == 0) + {{ + printf("ERROR: No CUDA-capable devices found\\n"); + return 2; + }} + + // Initialize CUDA before we run the application + float *dev_X; + cudaMalloc((void **) &dev_X, 1); + + // Create CUDA streams and 
events + for(int i = 0; i < {nstreams}; ++i) {{ + cudaStreamCreateWithFlags(&dace::cuda::__streams[i], cudaStreamNonBlocking); + }} + for(int i = 0; i < {nevents}; ++i) {{ + cudaEventCreateWithFlags(&dace::cuda::__events[i], cudaEventDisableTiming); + }} + + {initcode} + + return 0; +}} + +void __dace_exit_cuda({params}) {{ + {exitcode} + + // Destroy CUDA streams and events + for(int i = 0; i < {nstreams}; ++i) {{ + cudaStreamDestroy(dace::cuda::__streams[i]); + }} + for(int i = 0; i < {nevents}; ++i) {{ + cudaEventDestroy(dace::cuda::__events[i]); + }} +}} + +{localcode} +""".format(params=self._global_sdfg.signature(), + initcode=self._initcode.getvalue(), + exitcode=self._exitcode.getvalue(), + other_globalcode=self._globalcode.getvalue(), + localcode=self._localcode.getvalue(), + file_header=fileheader.getvalue(), + nstreams=self._cuda_streams, + nevents=self._cuda_events) + + return [self._codeobject] + + @staticmethod + def node_dispatch_predicate(sdfg, node): + if (getattr(node, 'schedule', False) + and node.schedule in types.GPU_SCHEDULES): + return True + return False + + def state_dispatch_predicate(self, sdfg, state): + if self._toplevel_schedule in types.GPU_SCHEDULES: + return True + for node in state.sink_nodes(): + if hasattr(node, '_cuda_stream'): + return True + else: + for e in state.in_edges(node): + if hasattr(e.src, '_cuda_stream'): + return True + return False + + @property + def has_initializer(self): + return True + + @property + def has_finalizer(self): + return True + + @staticmethod + def cmake_options(): + + host_compiler = make_absolute( + Config.get("compiler", "cpu", "executable")) + compiler = make_absolute(Config.get("compiler", "cuda", "executable")) + flags = Config.get("compiler", "cuda", "args") + flags += Config.get("compiler", "cuda", "additional_args") + + # Get CUDA architectures from configuration + cuda_arch = Config.get('compiler', 'cuda', 'cuda_arch').split(',') + cuda_arch = [ca for ca in cuda_arch if ca is not None and len(ca) > 0] + + flags += ' ' + ' '.join( + '-gencode arch=compute_{arch},code=sm_{arch}'.format(arch=arch) + for arch in cuda_arch) + + options = [ + "-DCUDA_HOST_COMPILER=\"{}\"".format(host_compiler), + "-DCUDA_NVCC_FLAGS=\"{}\"".format(flags), + "-DCUDA_TOOLKIT_ROOT_DIR=\"{}\"".format( + os.path.dirname(os.path.dirname(compiler).replace('\\', '/'))) + ] + + return options + + def allocate_array(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + nodedesc = node.desc(sdfg) + if isinstance(nodedesc, dace.data.Stream): + return self.allocate_stream(sdfg, dfg, state_id, node, + function_stream, callsite_stream) + + result = StringIO() + arrsize = ' * '.join([ + cppunparse.pyexpr2cpp(symbolic.symstr(s)) for s in nodedesc.strides + ]) + is_dynamically_sized = any( + symbolic.issymbolic(s, sdfg.constants) for s in nodedesc.strides) + arrsize_malloc = arrsize + ' * sizeof(%s)' % nodedesc.dtype.ctype + dataname = node.data + + # Different types of GPU arrays + if nodedesc.storage == types.StorageType.GPU_Global: + result.write( + '%s *%s = nullptr;\n' % (nodedesc.dtype.ctype, dataname)) + self._dispatcher.defined_vars.add(dataname, DefinedType.Pointer) + + # Strides are left to the user's discretion + result.write('cudaMalloc(&%s, %s);\n' % (dataname, arrsize_malloc)) + if node.setzero: + result.write( + 'cudaMemset(%s, 0, %s);\n' % (dataname, arrsize_malloc)) + + elif nodedesc.storage == types.StorageType.CPU_Pinned: + result.write( + '%s *%s = nullptr;\n' % (nodedesc.dtype.ctype, dataname)) + 
self._dispatcher.defined_vars.add(dataname, DefinedType.Pointer) + + # Strides are left to the user's discretion + result.write( + 'cudaMallocHost(&%s, %s);\n' % (dataname, arrsize_malloc)) + if node.setzero: + result.write( + 'memset(%s, 0, %s);\n' % (dataname, arrsize_malloc)) + elif nodedesc.storage == types.StorageType.GPU_Shared: + if is_dynamically_sized: + raise NotImplementedError('Dynamic shared memory unsupported') + result.write("__shared__ %s %s[%s];\n" % (nodedesc.dtype.ctype, + dataname, arrsize)) + self._dispatcher.defined_vars.add(dataname, DefinedType.Pointer) + if node.setzero: + result.write( + 'dace::ResetShared<{type}, {block_size}, {elements}, ' + '1, false>::Reset({ptr});\n'.format( + type=nodedesc.dtype.ctype, + block_size=', '.join(_topy(self._block_dims)), + ptr=dataname, + elements=arrsize)) + elif nodedesc.storage == types.StorageType.GPU_Stack: + if is_dynamically_sized: + raise ValueError('Dynamic allocation of registers not allowed') + szstr = ' = {0}' if node.setzero else '' + result.write("%s %s[%s]%s;\n" % (nodedesc.dtype.ctype, dataname, + arrsize, szstr)) + self._dispatcher.defined_vars.add(dataname, DefinedType.Pointer) + else: + raise NotImplementedError("CUDA: Unimplemented storage type " + + str(nodedesc.storage)) + + callsite_stream.write(result.getvalue(), sdfg, state_id, node) + + def initialize_array(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + # No need (for now) + pass + + def allocate_stream(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + nodedesc = node.desc(sdfg) + dataname = node.data + if nodedesc.storage == types.StorageType.GPU_Global: + fmtargs = { + 'name': dataname, + 'type': nodedesc.dtype.ctype, + 'is_pow2': sym2cpp( + sympy.log(nodedesc.buffer_size, 2).is_Integer), + 'location': + '%s_%s_%s' % (sdfg.name, state_id, dfg.node_id(node)), + } + + self._dispatcher.defined_vars.add(dataname, DefinedType.Stream) + + if is_array_stream_view(sdfg, dfg, node): + fmtargs['ptr'] = nodedesc.sink + # Assuming 1D array sink/src + fmtargs['size'] = sym2cpp(sdfg.arrays[nodedesc.sink].shape[0]) + + function_stream.write( + 'DACE_EXPORTED void __dace_alloc_{location}({type} *ptr, uint32_t size, dace::GPUStream<{type}, {is_pow2}>& result);'. + format(**fmtargs), sdfg, state_id, node) + self._globalcode.write( + """ +DACE_EXPORTED void __dace_alloc_{location}({type} *ptr, uint32_t size, dace::GPUStream<{type}, {is_pow2}>& result); +void __dace_alloc_{location}({type} *ptr, uint32_t size, dace::GPUStream<{type}, {is_pow2}>& result) {{ + result = dace::AllocGPUArrayStreamView<{type}, {is_pow2}>(ptr, size); +}}""".format(**fmtargs), sdfg, state_id, node) + callsite_stream.write( + 'dace::GPUStream<{type}, {is_pow2}> {name}; __dace_alloc_{location}({ptr}, {size}, {name});'. + format(**fmtargs), sdfg, state_id, node) + else: + fmtargs['size'] = sym2cpp(nodedesc.buffer_size) + + function_stream.write( + 'DACE_EXPORTED void __dace_alloc_{location}(uint32_t size, dace::GPUStream<{type}, {is_pow2}>& result);'. 
+ format(**fmtargs), sdfg, state_id, node) + self._globalcode.write( + """ +DACE_EXPORTED void __dace_alloc_{location}(uint32_t size, dace::GPUStream<{type}, {is_pow2}>& result); +dace::GPUStream<{type}, {is_pow2}> __dace_alloc_{location}(uint32_t size, dace::GPUStream<{type}, {is_pow2}>& result) {{ + result = dace::AllocGPUStream<{type}, {is_pow2}>({size}); +}}""".format(**fmtargs), sdfg, state_id, node) + callsite_stream.write( + 'dace::GPUStream<{type}, {is_pow2}> {name}; __dace_alloc_{location}({size}, {name});'. + format(**fmtargs), sdfg, state_id, node) + + def deallocate_stream(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + nodedesc = node.desc(sdfg) + dataname = node.data + if nodedesc.storage == types.StorageType.GPU_Global: + if is_array_stream_view(sdfg, dfg, node): + callsite_stream.write( + 'dace::FreeGPUArrayStreamView(%s);' % dataname, sdfg, + state_id, node) + else: + callsite_stream.write('dace::FreeGPUStream(%s);' % dataname, + sdfg, state_id, node) + + def deallocate_array(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + nodedesc = node.desc(sdfg) + dataname = node.data + if isinstance(nodedesc, dace.data.Stream): + return self.deallocate_stream(sdfg, dfg, state_id, node, + function_stream, callsite_stream) + + if nodedesc.storage == types.StorageType.GPU_Global: + callsite_stream.write('cudaFree(%s);\n' % dataname, sdfg, state_id, + node) + elif nodedesc.storage == types.StorageType.CPU_Pinned: + callsite_stream.write('cudaFreeHost(%s);\n' % dataname, sdfg, + state_id, node) + elif nodedesc.storage == types.StorageType.GPU_Shared or \ + nodedesc.storage == types.StorageType.GPU_Stack: + pass # Do nothing + else: + raise NotImplementedError + + def _compute_cudastreams(self, + sdfg: SDFG, + default_stream=0, + default_event=0): + """ Annotates an SDFG (and all nested ones) to include a `_cuda_stream` + field. This field is applied to all GPU maps, tasklets, and copies + that can be executed in parallel. + @param sdfg: The sdfg to modify. + @param default_stream: The stream ID to start counting from (used + in recursion to nested SDFGs). + @param default_event: The event ID to start counting from (used + in recursion to nested SDFGs). + @return: 2-tuple of the number of streams, events to create. + """ + concurrent_streams = Config.get('compiler', 'cuda', + 'max_concurrent_streams') + if concurrent_streams < 0: + return 0, 0 + + def increment(streams): + if concurrent_streams > 0: + return (streams + 1) % concurrent_streams + return streams + 1 + + state_streams = [] + state_subsdfg_events = [] + + for state in sdfg.nodes(): + # Start by annotating source nodes + source_nodes = state.source_nodes() + + # Concurrency can only be found in each state + max_streams = default_stream + max_events = default_event + + for i, node in enumerate(source_nodes): + if isinstance(node, nodes.AccessNode): + continue + if isinstance(node, nodes.NestedSDFG): + if node.schedule == types.ScheduleType.GPU_Device: + continue + node._cuda_stream = max_streams + node._cs_childpath = False + max_streams = increment(max_streams) + + # Maintain the same CUDA stream in DFS order, add more when + # possible. 
+ for e in state.dfs_edges(source_nodes): + if hasattr(e.dst, '_cuda_stream'): + continue + if hasattr(e.src, '_cuda_stream'): + c = e.src._cuda_stream + if e.src._cs_childpath == True: + c = max_streams + max_streams = increment(max_streams) + e.src._cs_childpath = True + else: + c = max_streams + max_streams = increment(max_streams) + e.dst._cuda_stream = c + if not hasattr(e.dst, '_cs_childpath'): + e.dst._cs_childpath = False + if isinstance(e.dst, nodes.NestedSDFG): + if e.dst.schedule not in types.GPU_SCHEDULES: + max_streams, max_events = self._compute_cudastreams( + e.dst.sdfg, e.dst._cuda_stream, max_events + 1) + + state_streams.append(max_streams if concurrent_streams == 0 else + concurrent_streams) + state_subsdfg_events.append(max_events) + + # Remove CUDA streams from paths of non-gpu copies and CPU tasklets + for node, graph in sdfg.all_nodes_recursive(): + if isinstance(graph, SDFGState): + cur_sdfg = graph.parent + for e in graph.out_edges(node): + path = graph.memlet_path(e) + # If leading from/to a GPU memory node, keep stream + if ((isinstance(path[0].src, nodes.AccessNode) + and path[0].src.desc( + cur_sdfg).storage == types.StorageType.GPU_Global) + or (isinstance(path[-1].dst, nodes.AccessNode) + and path[-1].dst.desc(cur_sdfg).storage == + types.StorageType.GPU_Global)): + break + # If leading from/to a GPU tasklet, keep stream + if ((isinstance(path[0].src, nodes.CodeNode) + and is_devicelevel(cur_sdfg, graph, path[0].src)) or + (isinstance(path[-1].dst, nodes.CodeNode) + and is_devicelevel(cur_sdfg, graph, path[-1].dst))): + break + # If leading from/to a GPU reduction, keep stream + if ((isinstance(path[0].src, nodes.Reduce) and + path[0].src.schedule == types.ScheduleType.GPU_Device) + or + (isinstance(path[-1].dst, nodes.Reduce) and path[-1] + .dst.schedule == types.ScheduleType.GPU_Device)): + break + else: # If we did not break, we do not need a CUDA stream + if hasattr(node, '_cuda_stream'): + delattr(node, '_cuda_stream') + # In any case, remove childpath + if hasattr(node, '_cs_childpath'): + delattr(node, '_cs_childpath') + + # Compute maximal number of events by counting edges (within the same + # state) that point from one stream to another + state_events = [] + for i, state in enumerate(sdfg.nodes()): + events = state_subsdfg_events[i] + + for e in state.edges(): + if hasattr(e.src, '_cuda_stream'): + # If there are two or more CUDA streams involved in this + # edge, or the destination is unrelated to CUDA + if (not hasattr(e.dst, '_cuda_stream') + or e.src._cuda_stream != e.dst._cuda_stream): + for mpe in state.memlet_path(e): + mpe._cuda_event = events + events += 1 + + state_events.append(events) + + # Maximum over all states + max_streams = max(state_streams) + max_events = max(state_events) + + return max_streams, max_events + + def _emit_copy(self, state_id, src_node, src_storage, dst_node, + dst_storage, dst_schedule, edge, sdfg, dfg, + callsite_stream): + u, uconn, v, vconn, memlet = edge + state_dfg = sdfg.nodes()[state_id] + + cpu_storage_types = [ + types.StorageType.CPU_Heap, types.StorageType.CPU_Stack, + types.StorageType.CPU_Pinned + ] + gpu_storage_types = [ + types.StorageType.GPU_Global, types.StorageType.GPU_Shared, + types.StorageType.GPU_Stack + ] + + copy_shape = memlet.subset.bounding_box_size() + copy_shape = [symbolic.overapproximate(s) for s in copy_shape] + # Determine directionality + if (isinstance(src_node, nodes.AccessNode) + and memlet.data == src_node.data): + outgoing_memlet = True + elif (isinstance(dst_node, 
nodes.AccessNode) + and memlet.data == dst_node.data): + outgoing_memlet = False + else: + raise LookupError('Memlet does not point to any of the nodes') + + if (isinstance(src_node, nodes.AccessNode) + and isinstance(dst_node, nodes.AccessNode) + and not self._in_device_code + and (src_storage == types.StorageType.GPU_Global + or dst_storage == types.StorageType.GPU_Global)): + src_location = 'Device' if src_storage == types.StorageType.GPU_Global else 'Host' + dst_location = 'Device' if dst_storage == types.StorageType.GPU_Global else 'Host' + + syncwith = {} # Dictionary of {stream: event} + is_sync = False + max_streams = Config.get('compiler', 'cuda', + 'max_concurrent_streams') + + if hasattr(src_node, '_cuda_stream'): + cudastream = src_node._cuda_stream + if not hasattr(dst_node, '_cuda_stream'): + # Copy after which data is needed by the host + is_sync = True + elif dst_node._cuda_stream != src_node._cuda_stream: + syncwith[dst_node._cuda_stream] = edge._cuda_event + else: + pass # Otherwise, no need to synchronize + elif hasattr(dst_node, '_cuda_stream'): + cudastream = dst_node._cuda_stream + else: + if max_streams >= 0: + print('WARNING: Undefined stream, reverting to default') + if dst_location == 'Host': + is_sync = True + cudastream = 'nullptr' + + # Handle case of impending kernel/tasklet on another stream + if max_streams >= 0: + for e in state_dfg.out_edges(dst_node): + if isinstance(e.dst, nodes.AccessNode): + continue + if not hasattr(e.dst, '_cuda_stream'): + is_sync = True + elif e.dst._cuda_stream != cudastream: + syncwith[e.dst._cuda_stream] = e._cuda_event + + if cudastream != 'nullptr': + cudastream = 'dace::cuda::__streams[%d]' % cudastream + + if memlet.wcr is not None: + raise NotImplementedError('Accumulate %s to %s not implemented' + % (src_location, dst_location)) + ############################# + + # Obtain copy information + copy_shape, src_strides, dst_strides, src_expr, dst_expr = ( + self._cpu_codegen.memlet_copy_to_absolute_strides( + sdfg, memlet, src_node, dst_node)) + + dims = len(copy_shape) + + # Handle unsupported copy types + if dims == 2 and (src_strides[-1] != 1 or dst_strides[-1] != 1): + raise NotImplementedError('2D copy only supported with one ' + 'stride') + + # Currently we only support ND copies when they can be represented + # as a 1D copy or as a 2D strided copy + if dims > 2: + raise NotImplementedError('Copies between CPU and GPU are not' + ' supported for N-dimensions') + + if dims == 1: + copysize = ' * '.join([ + cppunparse.pyexpr2cpp(symbolic.symstr(s)) + for s in copy_shape + ]) + array_length = copysize + copysize += ' * sizeof(%s)' % dst_node.desc(sdfg).dtype.ctype + + callsite_stream.write( + 'cudaMemcpyAsync(%s, %s, %s, cudaMemcpy%sTo%s, %s);\n' % + (dst_expr, src_expr, copysize, src_location, dst_location, + cudastream), sdfg, state_id, [src_node, dst_node]) + node_dtype = dst_node.desc(sdfg).dtype + if issubclass(node_dtype.type, ctypes.Structure): + callsite_stream.write( + 'for (auto __idx = 0; __idx < {arrlen}; ++__idx) ' + '{{'.format(arrlen=str(array_length))) + for field_name, field_type in node_dtype._data.items(): + if isinstance(field_type, types.pointer): + tclass = field_type.type + length = node_dtype._length[field_name] + size = 'sizeof({})*{}[__idx].{}'.format( + types._CTYPES[tclass], str(src_node), length) + callsite_stream.write( + 'cudaMalloc(&{dst}[__idx].{fname}, ' + '{sz});'.format( + dst=str(dst_node), + fname=field_name, + sz=size)) + callsite_stream.write( + 'cudaMemcpyAsync({dst}[__idx].{fname}, ' 
+ '{src}[__idx].{fname}, {sz}, ' + 'cudaMemcpy{sloc}To{dloc}, {stream});'.format( + dst=str(dst_node), + src=str(src_node), + fname=field_name, + sz=size, + sloc=src_location, + dloc=dst_location, + stream=cudastream), sdfg, state_id, + [src_node, dst_node]) + callsite_stream.write('}') + elif dims == 2: + callsite_stream.write( + 'cudaMemcpy2DAsync(%s, %s, %s, %s, %s, %s, cudaMemcpy%sTo%s, %s);\n' + % (dst_expr, _topy(dst_strides[0]) + + ' * sizeof(%s)' % dst_node.desc(sdfg).dtype.ctype, + src_expr, sym2cpp(src_strides[0]) + + ' * sizeof(%s)' % src_node.desc(sdfg).dtype.ctype, + sym2cpp(copy_shape[1]) + + ' * sizeof(%s)' % dst_node.desc(sdfg).dtype.ctype, + sym2cpp(copy_shape[0]), src_location, dst_location, + cudastream), sdfg, state_id, [src_node, dst_node]) + + # Post-copy synchronization + if is_sync: + # Synchronize with host (done at destination) + pass + else: + # Synchronize with other streams as necessary + for streamid, event in syncwith.items(): + syncstream = 'dace::cuda::__streams[%d]' % streamid + callsite_stream.write( + ''' + cudaEventRecord(dace::cuda::__events[{ev}], {src_stream}); + cudaStreamWaitEvent({dst_stream}, dace::cuda::__events[{ev}], 0); + '''.format( + ev=event, + src_stream=cudastream, + dst_stream=syncstream), sdfg, state_id, + [src_node, dst_node]) + + # Copy within the GPU + elif (src_storage in gpu_storage_types + and dst_storage in gpu_storage_types): + + state_dfg = sdfg.nodes()[state_id] + sdict = state_dfg.scope_dict() + if scope_contains_scope(sdict, src_node, dst_node): + inner_schedule = dst_schedule + else: + inner_schedule = sdict[src_node] + if inner_schedule is not None: + inner_schedule = inner_schedule.map.schedule + if inner_schedule is None: # Top-level schedule + inner_schedule = self._toplevel_schedule + + # Collaborative load + if inner_schedule == types.ScheduleType.GPU_Device: + # Obtain copy information + copy_shape, src_strides, dst_strides, src_expr, dst_expr = ( + self._cpu_codegen.memlet_copy_to_absolute_strides( + sdfg, memlet, src_node, dst_node)) + + dims = len(copy_shape) + + funcname = 'dace::%sTo%s%dD' % (_get_storagename(src_storage), + _get_storagename(dst_storage), + dims) + + accum = '' + custom_reduction = [] + if memlet.wcr is not None: + redtype = operations.detect_reduction_type(memlet.wcr) + reduction_tmpl = '' + # Special call for detected reduction types + if redtype != types.ReductionType.Custom: + credtype = ('dace::ReductionType::' + + str(redtype)[str(redtype).find('.') + 1:]) + reduction_tmpl = '<%s>' % credtype + else: + custom_reduction = [unparse_cr(memlet.wcr)] + accum = '::template Accum%s' % reduction_tmpl + + if any( + symbolic.issymbolic(s, sdfg.constants) + for s in copy_shape): + callsite_stream.write(( + ' {func}Dynamic, {bdims}, ' + + '{dststrides}, {is_async}>{accum}({args});').format( + func=funcname, + type=dst_node.desc(sdfg).dtype.ctype, + veclen=memlet.veclen, + bdims=', '.join(_topy(self._block_dims)), + dststrides=', '.join(_topy(dst_strides)), + is_async='false' + if state_dfg.out_degree(dst_node) > 0 else 'true', + accum=accum, + args=', '.join([src_expr] + _topy(src_strides) + + [dst_expr] + custom_reduction + + _topy(copy_shape))), sdfg, state_id, + [src_node, dst_node]) + else: + callsite_stream.write(( + ' {func}, {bdims}, {copysize}, ' + + '{dststrides}, {is_async}>{accum}({args});').format( + func=funcname, + type=dst_node.desc(sdfg).dtype.ctype, + veclen=memlet.veclen, + bdims=', '.join(_topy(self._block_dims)), + copysize=', '.join(_topy(copy_shape)), + dststrides=', 
'.join(_topy(dst_strides)), + is_async='false' + if state_dfg.out_degree(dst_node) > 0 else 'true', + accum=accum, + args=', '.join([src_expr] + _topy(src_strides) + + [dst_expr] + custom_reduction)), + sdfg, state_id, [src_node, dst_node]) + # Per-thread load (same as CPU copies) + else: + self._cpu_codegen.copy_memory(sdfg, dfg, state_id, src_node, + dst_node, edge, None, + callsite_stream) + else: + self._cpu_codegen.copy_memory(sdfg, dfg, state_id, src_node, + dst_node, edge, None, + callsite_stream) + + def copy_memory(self, sdfg, dfg, state_id, src_node, dst_node, memlet, + function_stream, callsite_stream): + if isinstance(src_node, nodes.Tasklet): + src_storage = types.StorageType.Register + src_parent = dfg.scope_dict()[src_node] + dst_schedule = None if src_parent is None else src_parent.map.schedule + else: + src_storage = src_node.desc(sdfg).storage + + if isinstance(dst_node, nodes.Tasklet): + dst_storage = types.StorageType.Register + else: + dst_storage = dst_node.desc(sdfg).storage + + dst_parent = dfg.scope_dict()[dst_node] + dst_schedule = None if dst_parent is None else dst_parent.map.schedule + + # Emit actual copy + self._emit_copy(state_id, src_node, src_storage, dst_node, dst_storage, + dst_schedule, memlet, sdfg, dfg, callsite_stream) + + def generate_state(self, sdfg, state, function_stream, callsite_stream): + # Two modes: device-level state and if this state has active streams + if self._toplevel_schedule in types.GPU_SCHEDULES: + self.generate_devicelevel_state(sdfg, state, function_stream, + callsite_stream) + else: + # Active streams found. Generate state normally and sync with the + # streams in the end + self._frame.generate_state( + sdfg, + state, + function_stream, + callsite_stream, + generate_state_footer=False) + if state.nosync == False: + streams_to_sync = set() + for node in state.sink_nodes(): + if hasattr(node, '_cuda_stream'): + streams_to_sync.add(node._cuda_stream) + else: + # Synchronize sink-node copies at the end of the state + for e in state.in_edges(node): + if hasattr(e.src, '_cuda_stream'): + streams_to_sync.add(e.src._cuda_stream) + for stream in streams_to_sync: + callsite_stream.write( + 'cudaStreamSynchronize(dace::cuda::__streams[%d]);' % + stream, sdfg, sdfg.node_id(state)) + + # After synchronizing streams, generate state footer normally + + # Emit internal transient array deallocation + sid = sdfg.node_id(state) + data_to_allocate = (set(state.top_level_transients()) - set( + sdfg.shared_transients())) + deallocated = set() + for node in state.data_nodes(): + if node.data not in data_to_allocate or node.data in deallocated: + continue + deallocated.add(node.data) + self._frame._dispatcher.dispatch_deallocate( + sdfg, state, sid, node, function_stream, callsite_stream) + + def generate_devicelevel_state(self, sdfg, state, function_stream, + callsite_stream): + + # Special case: if this is a GPU grid state and something is reading + # from a possible result of a collaborative write, sync first + if self._toplevel_schedule == types.ScheduleType.GPU_Device: + state_id = next( + i for i, s in enumerate(sdfg.nodes()) if s == state) + for node in state.nodes(): + if (isinstance(node, nodes.AccessNode) and + node.desc(sdfg).storage == types.StorageType.GPU_Shared + and state.in_degree(node) == 0 + and state.out_degree(node) > 0): + callsite_stream.write('__syncthreads();', sdfg, state_id) + break + + self._frame.generate_state(sdfg, state, function_stream, + callsite_stream) + + # NOTE: This function is ONLY called from the CPU side. 
Therefore, any + # schedule that is out of the ordinary will raise an exception + def generate_scope(self, sdfg, dfg_scope, state_id, function_stream, + callsite_stream): + scope_entry = dfg_scope.source_nodes()[0] + scope_exit = dfg_scope.sink_nodes()[0] + + dfg = sdfg.nodes()[state_id] + + # If in device-level code, call appropriate function + if (self._toplevel_schedule == types.ScheduleType.GPU_Device or + (dfg.scope_dict()[scope_entry] is not None and dfg.scope_dict() + [scope_entry].map.schedule in types.GPU_SCHEDULES)): + self.generate_devicelevel_scope(sdfg, dfg_scope, state_id, + function_stream, callsite_stream) + return + + # If not device-level code, ensure the schedule is correct + if scope_entry.map.schedule != types.ScheduleType.GPU_Device: + raise TypeError('Cannot schedule %s directly from non-GPU code' % + str(scope_entry.map.schedule)) + + # Determine whether to create a global (grid) barrier object + create_grid_barrier = False + for node in dfg_scope.nodes(): + if scope_entry == node: continue + if (isinstance(node, nodes.EntryNode) + and node.map.schedule == types.ScheduleType.GPU_Device): + create_grid_barrier = True + + kernel_name = '%s_%d_%d' % ( + scope_entry.map.label, dfg.node_id(scope_entry), sdfg.node_id(dfg)) + + # Get parameters from input/output memlets to this map + params = set(d.data for node in dfg_scope.source_nodes() for _,_,_,_,d in dfg.in_edges(node)) | \ + set(d.data for node in dfg_scope.sink_nodes() for _,_,_,_,d in dfg.out_edges(node)) + + # Get symbolic parameters (free symbols) for kernel + syms = sdfg.symbols_defined_at(scope_entry) + freesyms = { + k: v + for k, v in syms.items() + if k not in sdfg.constants and k not in scope_entry.map.params + } + symbol_sigs = [ + v.dtype.ctype + ' ' + k for k, v in sorted(freesyms.items()) + ] + symbol_names = [k for k in sorted(freesyms.keys())] + + # Hijack symbol_sigs to create a grid barrier object + if create_grid_barrier: + symbol_sigs.append('cub::GridBarrier __gbar') + + # Comprehend grid/block dimensions from scopes + grid_dims, block_dims, tbmap = self.get_kernel_dimensions(dfg_scope) + + kernel_args = [ + sdfg.arrays[p].signature(False, name=p) for p in sorted(params) + ] + symbol_names + kernel_args_typed = [ + sdfg.arrays[p].signature(name=p) for p in sorted(params) + ] + symbol_sigs + + # Store init/exit code streams + old_entry_stream = self.scope_entry_stream + old_exit_stream = self.scope_exit_stream + self.scope_entry_stream = CodeIOStream() + self.scope_exit_stream = CodeIOStream() + + kernel_stream = CodeIOStream() + self.generate_kernel_scope(sdfg, dfg_scope, state_id, scope_entry.map, + kernel_name, grid_dims, block_dims, tbmap, + kernel_args_typed, self._globalcode, + kernel_stream) + + # Write kernel prototype + node = dfg_scope.source_nodes()[0] + self._localcode.write( + '__global__ void %s(%s) {\n' % + (kernel_name, ', '.join(kernel_args_typed)), sdfg, state_id, node) + + # Write constant expressions in GPU code + self._frame.generate_constants(sdfg, self._localcode) + + self._localcode.write(self.scope_entry_stream.getvalue()) + + # Assuming kernel can write to global scope (function_stream), we + # output the kernel last + self._localcode.write(kernel_stream.getvalue() + '\n') + + self._localcode.write(self.scope_exit_stream.getvalue()) + + # Restore init/exit code streams + self.scope_entry_stream = old_entry_stream + self.scope_exit_stream = old_exit_stream + + # Write callback function definition + self._localcode.write( + """ +DACE_EXPORTED void 
__dace_runkernel_{fname}({fargs}); +void __dace_runkernel_{fname}({fargs}) +{{ +""".format(fname=kernel_name, fargs=', '.join(kernel_args_typed)), sdfg, + state_id, node) + + if create_grid_barrier: + gbar = '__gbar_' + kernel_name + self._localcode.write(' cub::GridBarrierLifetime %s;\n' % gbar, + sdfg, state_id, node) + self._localcode.write( + ' %s.Setup(%s);\n' % (gbar, ' * '.join(_topy(grid_dims))), + sdfg, state_id, node) + symbol_names.append(gbar) + + # Compute dynamic shared memory + dynsmem_size = 0 + # For all access nodes, if array storage == GPU_Shared and size is + # symbolic, add it. If nested SDFG, check all internal arrays + for node in dfg_scope.nodes(): + if isinstance(node, nodes.AccessNode): + arr = sdfg.arrays[node.data] + if arr.storage == types.StorageType.GPU_Shared: + numel = functools.reduce(lambda a, b: a * b, arr.shape) + if symbolic.issymbolic(numel, sdfg.constants): + dynsmem_size += numel + elif isinstance(node, nodes.NestedSDFG): + for arr in node.sdfg.arrays_recursive(): + if (arr is not None + and arr.storage == types.StorageType.GPU_Shared): + numel = functools.reduce(lambda a, b: a * b, arr.shape) + if symbolic.issymbolic(numel, sdfg.constants): + dynsmem_size += numel + + max_streams = Config.get('compiler', 'cuda', 'max_concurrent_streams') + if max_streams >= 0: + cudastream = 'dace::cuda::__streams[%d]' % scope_entry._cuda_stream + else: + cudastream = 'nullptr' + + self._localcode.write( + ''' +void *{kname}_args[] = {{ {kargs} }}; +cudaLaunchKernel((void*){kname}, dim3({gdims}), dim3({bdims}), {kname}_args, {dynsmem}, {stream});''' + .format( + kname=kernel_name, + kargs=', '.join(['(void *)&' + arg for arg in kernel_args]), + gdims=','.join(_topy(grid_dims)), + bdims=','.join(_topy(block_dims)), + dynsmem=_topy(dynsmem_size), + stream=cudastream), sdfg, state_id, node) + + # Close the runkernel function + self._localcode.write('}') + ####################### + # Add invocation to calling code (in another file) + function_stream.write( + 'DACE_EXPORTED void __dace_runkernel_%s(%s);\n' % + (kernel_name, ', '.join(kernel_args_typed)), sdfg, state_id, node) + callsite_stream.write( + '__dace_runkernel_%s(%s);\n' % + (kernel_name, ', '.join(kernel_args)), sdfg, state_id, node) + + synchronize_streams(sdfg, dfg, state_id, node, scope_exit, + callsite_stream) + + def get_kernel_dimensions(self, dfg_scope): + """ Determines a CUDA kernel's grid/block dimensions from map + scopes. + + Ruleset for kernel dimensions: + 1. If only one map (device-level) exists, of an integer set S, + the block size is 32x1x1 and grid size is ceil(|S|/32) in + 1st dimension. + 2. If nested thread-block maps exist (T_1,...,T_n), grid + size is |S| and block size is max(|T_1|,...,|T_n|) with + block specialization. + 3. If block size can be overapproximated, it is (for + dynamically-sized blocks that are bounded by a + predefined size). + + @note: Kernel dimensions are separate from the map + variables, and they should be treated as such. + @note: To make use of the grid/block 3D registers, we use multi- + dimensional kernels up to 3 dimensions, and flatten the + rest into the third dimension. 
+ """ + + kernelmap_entry = dfg_scope.source_nodes()[0] + grid_size = kernelmap_entry.map.range.size(True)[::-1] + block_size = None + + # Linearize (flatten) rest of dimensions to third + if len(grid_size) > 3: + grid_size[2] = functools.reduce(sympy.mul.Mul, grid_size[2:], 1) + del grid_size[3:] + + # Extend to 3 dimensions if necessary + grid_size = grid_size + [1] * (3 - len(grid_size)) + + # Obtain thread-block maps for case (2) + tb_maps = [ + node.map for node, parent in dfg_scope.scope_dict().items() + if parent == kernelmap_entry and isinstance(node, nodes.EntryNode) + and node.schedule == types.ScheduleType.GPU_ThreadBlock + ] + # Append thread-block maps from nested SDFGs + for node in dfg_scope.scope_subgraph(kernelmap_entry).nodes(): + if isinstance(node, nodes.NestedSDFG): + _set_default_schedule_and_storage_types( + node.sdfg, node.schedule) + + tb_maps.extend([ + n.map for state in node.sdfg.nodes() + for n in state.nodes() if isinstance(n, nodes.MapEntry) + and n.schedule == types.ScheduleType.GPU_ThreadBlock + ]) + + # Case (1): no thread-block maps + if len(tb_maps) == 0: + + print('WARNING: Thread-block maps not found in kernel, assuming ' + + 'block size of (%s)' % + Config.get('compiler', 'cuda', 'default_block_size')) + block_size = [ + int(b) for b in Config.get('compiler', 'cuda', + 'default_block_size').split(',') + ] + assert (len(block_size) >= 1 and len(block_size) <= 3) + + int_ceil = sympy.Function('int_ceil') + + # Grid size = ceil(|S|/32) for first dimension, rest = |S| + grid_size = [ + int_ceil(gs, bs) for gs, bs in zip(grid_size, block_size) + ] + + return grid_size, block_size, False + + # Find all thread-block maps to determine overall block size + block_size = [1, 1, 1] + detected_block_sizes = [block_size] + for tbmap in tb_maps: + tbsize = tbmap.range.size()[::-1] + + # Over-approximate block size (e.g. min(N,(i+1)*32)-i*32 --> 32) + # The partial trailing thread-block is emitted as an if-condition + # that returns on some of the participating threads + tbsize = [symbolic.overapproximate(s) for s in tbsize] + + # Linearize (flatten) rest of dimensions to third + if len(tbsize) > 3: + tbsize[2] = functools.reduce(sympy.mul.Mul, tbsize[2:], 1) + del tbsize[3:] + + # Extend to 3 dimensions if necessary + tbsize = tbsize + [1] * (len(block_size) - len(tbsize)) + + block_size = [ + sympy.Max(sz, bbsz) for sz, bbsz in zip(block_size, tbsize) + ] + if block_size != tbsize: + detected_block_sizes.append(tbsize) + + # TODO: If grid/block sizes contain elements only defined within the + # kernel, raise an invalid SDFG exception and recommend + # overapproximation. 
+ + return grid_size, block_size, True + + def generate_kernel_scope( + self, sdfg: SDFG, dfg_scope: ScopeSubgraphView, state_id: int, + kernel_map: nodes.Map, kernel_name: str, grid_dims: list, + block_dims: list, has_tbmap: bool, kernel_params: list, + function_stream: CodeIOStream, kernel_stream: CodeIOStream): + node = dfg_scope.source_nodes()[0] + + if not node.map.flatten: + # Add more opening braces for scope exit to close + for dim in range(len(node.map.range) - 1): + kernel_stream.write('{\n', sdfg, state_id, node) + + # Generate all index arguments for kernel grid + krange = subsets.Range(kernel_map.range[::-1]) + kdims = krange.size() + dsym = [ + symbolic.symbol('__DAPB%d' % i, nonnegative=True, integer=True) + for i in range(len(krange)) + ] + bidx = krange.coord_at(dsym) + + # First three dimensions are evaluated directly + for i in range(min(len(krange), 3)): + varname = kernel_map.params[-i - 1] + + # Delinearize third dimension if necessary + if i == 2 and len(krange) > 3: + block_expr = '(blockIdx.z / (%s))' % _topy( + functools.reduce(sympy.mul.Mul, kdims[3:], 1)) + else: + block_expr = 'blockIdx.%s' % _named_idx(i) + # If we defaulted to 32 threads per block, offset by thread ID + if not has_tbmap: + block_expr = '(%s * %s + threadIdx.%s)' % ( + block_expr, _topy(block_dims[i]), _named_idx(i)) + + expr = _topy(bidx[i]).replace('__DAPB%d' % i, block_expr) + + kernel_stream.write('int %s = %s;' % (varname, expr), sdfg, + state_id, node) + self._dispatcher.defined_vars.add(varname, DefinedType.Scalar) + + # Delinearize beyond the third dimension + if len(krange) > 3: + for i in range(3, len(krange)): + varname = kernel_map.params[-i - 1] + # true dim i = z / ('*'.join(kdims[i+1:])) % kdims[i] + block_expr = '(blockIdx.z / (%s)) %% (%s)' % ( + _topy(functools.reduce(sympy.mul.Mul, kdims[i + 1:], 1)), + _topy(kdims[i]), + ) + + expr = _topy(bidx[i]).replace('__DAPB%d' % i, block_expr) + kernel_stream.write('int %s = %s;' % (varname, expr), sdfg, + state_id, node) + self._dispatcher.defined_vars.add(varname, DefinedType.Scalar) + + # Dispatch internal code + assert self._in_device_code == False + self._in_device_code = True + self._block_dims = block_dims + + # Emit internal array allocation (deallocation handled at MapExit) + scope_entry = dfg_scope.source_nodes()[0] + to_allocate = dace.sdfg.local_transients(sdfg, dfg_scope, scope_entry) + allocated = set() + for child in dfg_scope.scope_dict(node_to_children=True)[node]: + if not isinstance(child, nodes.AccessNode): + continue + if child.data not in to_allocate or child.data in allocated: + continue + allocated.add(child.data) + self._dispatcher.dispatch_allocate(sdfg, dfg_scope, state_id, + child, function_stream, + kernel_stream) + self._dispatcher.dispatch_initialize(sdfg, dfg_scope, state_id, + child, function_stream, + kernel_stream) + + # Generate conditions for this block's execution using min and max + # element, e.g., skipping out-of-bounds threads in trailing block + if has_tbmap == False: + dsym_end = [d + bs - 1 for d, bs in zip(dsym, self._block_dims)] + minels = krange.min_element() + maxels = krange.max_element() + for i, (v, minel, maxel) in enumerate( + zip(kernel_map.params[::-1], minels, maxels)): + condition = '' + + # Optimize conditions if they are always true + if i >= 3 or (dsym[i] >= minel) != True: + condition += '%s >= %s' % (v, _topy(minel)) + if i >= 3 or (dsym_end[i] < maxel) != False: + if len(condition) > 0: + condition += ' && ' + condition += '%s < %s' % (v, _topy(maxel + 1)) + if 
len(condition) > 0: + kernel_stream.write('if (%s) {' % condition, sdfg, + state_id, scope_entry) + else: + kernel_stream.write('{', sdfg, state_id, scope_entry) + + self._dispatcher.dispatch_subgraph( + sdfg, + dfg_scope, + state_id, + function_stream, + kernel_stream, + skip_entry_node=True) + + if has_tbmap == False: + for _ in kernel_map.params: + kernel_stream.write('}\n', sdfg, state_id, node) + + self._block_dims = None + self._in_device_code = False + + def get_next_scope_entries(self, dfg, scope_entry): + parent_scope_entry = dfg.scope_dict()[scope_entry] + # We're in a nested SDFG, use full graph + if parent_scope_entry is None: + parent_scope = dfg + else: + parent_scope = dfg.scope_subgraph(parent_scope_entry) + + # Get all non-sequential scopes from the same level + all_scopes = [ + node for node in parent_scope.topological_sort(scope_entry) + if isinstance(node, nodes.EntryNode) + and node.map.schedule != types.ScheduleType.Sequential + ] + + # TODO: Fix to include *next* scopes, without concurrent scopes + + return all_scopes[all_scopes.index(scope_entry) + 1:] + + def generate_devicelevel_scope(self, sdfg, dfg_scope, state_id, + function_stream, callsite_stream): + # Sanity check + assert self._in_device_code == True + + dfg = sdfg.nodes()[state_id] + sdict = dfg.scope_dict() + scope_entry = dfg_scope.source_nodes()[0] + scope_map = scope_entry.map + next_scopes = self.get_next_scope_entries(dfg, scope_entry) + + if scope_map.schedule == types.ScheduleType.GPU_ThreadBlock_Dynamic: + if len(scope_map.params) > 1: + raise ValueError('Only one-dimensional maps are supported for ' + 'dynamic block map schedule (got %d)' % len( + scope_map.params)) + total_block_size = 1 + for bdim in self._block_dims: + if symbolic.issymbolic(bdim, sdfg.constants): + raise ValueError( + 'Block size has to be constant for block-wide ' + 'dynamic map schedule (got %s)' % str(bdim)) + total_block_size *= bdim + if _expr(scope_map.range[0][2]) != 1: + raise NotImplementedError( + 'Skip not implemented for dynamic thread-block map schedule' + ) + + ##### TODO (later): Generalize + # Find thread-block param map and its name + if self._block_dims[1] != 1 or self._block_dims[2] != 1: + raise NotImplementedError( + 'Dynamic block map schedule only ' + 'implemented for 1D blocks currently') + pscope = sdict[scope_entry] + while pscope is not None and pscope.map.schedule != types.ScheduleType.GPU_ThreadBlock: + pscope = sdict[pscope] + if pscope is None: + raise NotImplementedError('Dynamic block map schedule ' + 'currently requires block map') + bname = pscope.map.params[0] + + callsite_stream.write( + 'dace::DynamicMap<{bsize}>::template ' + 'schedule({begin}, {end}, {tid}, [&](auto {param}, ' + 'auto {tid}) {{'.format( + bsize=total_block_size, + begin=scope_map.range[0][0], + end=scope_map.range[0][1] + 1, + param=scope_map.params[0], + tid=bname), sdfg, state_id, scope_entry) + else: + # If integer sets are used, only emit one opening curly brace + if scope_map.flatten: + callsite_stream.write('{', sdfg, state_id, scope_entry) + else: + for dim in range(len(scope_map.range)): + callsite_stream.write('{', sdfg, state_id, scope_entry) + + # Emit internal array allocation (deallocation handled at MapExit) + to_allocate = dace.sdfg.local_transients(sdfg, dfg_scope, scope_entry) + allocated = set() + for child in dfg_scope.scope_dict(node_to_children=True)[scope_entry]: + if not isinstance(child, nodes.AccessNode): + continue + if child.data not in to_allocate or child.data in allocated: + continue + 
allocated.add(child.data) + self._dispatcher.dispatch_allocate(sdfg, dfg_scope, state_id, + child, function_stream, + callsite_stream) + self._dispatcher.dispatch_initialize(sdfg, dfg_scope, state_id, + child, function_stream, + callsite_stream) + + # Generate all index arguments for block + if scope_map.schedule == types.ScheduleType.GPU_ThreadBlock: + brange = subsets.Range(scope_map.range[::-1]) + kdims = brange.size() + dsym = [ + symbolic.symbol( + '__DAPT%d' % i, nonnegative=True, integer=True) + for i in range(len(brange)) + ] + dsym_end = [d + bs - 1 for d, bs in zip(dsym, self._block_dims)] + tidx = brange.coord_at(dsym) + + # First three dimensions are evaluated directly + for i in range(min(len(brange), 3)): + varname = scope_map.params[-i - 1] + + # Delinearize third dimension if necessary + if i == 2 and len(brange) > 3: + block_expr = '(threadIdx.z / (%s))' % _topy( + functools.reduce(sympy.mul.Mul, kdims[3:], 1)) + else: + block_expr = 'threadIdx.%s' % _named_idx(i) + + expr = _topy(tidx[i]).replace('__DAPT%d' % i, block_expr) + callsite_stream.write('int %s = %s;' % (varname, expr), sdfg, + state_id, scope_entry) + self._dispatcher.defined_vars.add(varname, DefinedType.Scalar) + + # Delinearize beyond the third dimension + if len(brange) > 3: + for i in range(3, len(brange)): + varname = scope_map.params[-i - 1] + # true dim i = z / ('*'.join(kdims[i+1:])) % kdims[i] + block_expr = '(threadIdx.z / (%s)) %% (%s)' % ( + _topy( + functools.reduce(sympy.mul.Mul, kdims[i + 1:], 1)), + _topy(kdims[i]), + ) + + expr = _topy(tidx[i]).replace('__DAPT%d' % i, block_expr) + callsite_stream.write('int %s = %s;' % (varname, expr), + sdfg, state_id, scope_entry) + self._dispatcher.defined_vars.add(varname, + DefinedType.Scalar) + + # Generate conditions for this block's execution using min and max + # element, e.g. 
skipping out-of-bounds threads in trailing block + minels = brange.min_element() + maxels = brange.max_element() + for i, (v, minel, maxel) in enumerate( + zip(scope_map.params[::-1], minels, maxels)): + condition = '' + + # Optimize conditions if they are always true + if i >= 3 or (dsym[i] >= minel) != True: + condition += '%s >= %s' % (v, _topy(minel)) + if i >= 3 or (dsym_end[i] < maxel) != False: + if len(condition) > 0: + condition += ' && ' + condition += '%s < %s' % (v, _topy(maxel + 1)) + if len(condition) > 0: + callsite_stream.write('if (%s) {' % condition, sdfg, + state_id, scope_entry) + else: + callsite_stream.write('{', sdfg, state_id, scope_entry) + ########################################################## + + # Generate contents normally + self._dispatcher.dispatch_subgraph( + sdfg, + dfg_scope, + state_id, + function_stream, + callsite_stream, + skip_entry_node=True) + + # If there are any other threadblock maps down the road, + # synchronize the thread-block / grid + if len(next_scopes) > 0: + # Thread-block synchronization + if scope_entry.map.schedule == types.ScheduleType.GPU_ThreadBlock: + callsite_stream.write(' __syncthreads();\n', sdfg, state_id, + scope_entry) + # Grid synchronization (kernel fusion) + elif scope_entry.map.schedule == types.ScheduleType.GPU_Device: + callsite_stream.write(' __gbar.Sync();\n', sdfg, state_id, + scope_entry) + + def generate_node(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + if CUDACodeGen.node_dispatch_predicate(sdfg, node): + # Dynamically obtain node generator according to class name + gen = getattr(self, '_generate_' + type(node).__name__) + gen(sdfg, dfg, state_id, node, function_stream, callsite_stream) + return + + if not self._in_device_code: + self._cpu_codegen.generate_node(sdfg, dfg, state_id, node, + function_stream, callsite_stream) + return + + self._locals.clear_scope(self._code_state.indentation + 1) + + if self._in_device_code and isinstance(node, nodes.MapExit): + return # skip + + self._cpu_codegen.generate_node(sdfg, dfg, state_id, node, + function_stream, callsite_stream) + + def _generate_NestedSDFG(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + self._toplevel_schedule = node.schedule + self._cpu_codegen._generate_NestedSDFG( + sdfg, dfg, state_id, node, function_stream, callsite_stream) + + def _generate_MapExit(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + if node.map.schedule == types.ScheduleType.GPU_ThreadBlock: + # Close block invocation conditions + for i in range(len(node.map.params)): + callsite_stream.write('}', sdfg, state_id, node) + elif node.map.schedule == types.ScheduleType.GPU_ThreadBlock_Dynamic: + # Close lambda function + callsite_stream.write('});', sdfg, state_id, node) + return + + self._cpu_codegen._generate_MapExit(sdfg, dfg, state_id, node, + function_stream, callsite_stream) + + def _generate_Reduce(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + # Try to autodetect reduction type + redtype = operations.detect_reduction_type(node.wcr) + schedule = node.schedule + node_id = dfg.node_id(node) + idstr = '{sdfg}_{state}_{node}'.format( + sdfg=sdfg.name, state=state_id, node=node_id) + + output_edge = dfg.out_edges(node)[0] + output_memlet = output_edge.data + output_type = 'dace::vec<%s, %s>' % ( + sdfg.arrays[output_memlet.data].dtype.ctype, output_memlet.veclen) + + if node.identity is None: + raise ValueError('For GPU reduce nodes, initial value must be ' + 'defined') + + # Create a 
functor or use an existing one for reduction + if redtype == types.ReductionType.Custom: + body, arg1, arg2 = unparse_cr_split(node.wcr) + self._globalcode.write( + """ + struct __reduce_{id} {{ + template + DACE_HDFI T operator()(const T &{arg1}, const T &{arg2}) const {{ + {contents} + }} + }};""".format(id=idstr, arg1=arg1, arg2=arg2, contents=body), sdfg, + state_id, node_id) + reduce_op = ', __reduce_' + idstr + '(), ' + _topy(node.identity) + elif redtype in _SPECIAL_RTYPES: + reduce_op = '' + else: + credtype = 'dace::ReductionType::' + str( + redtype)[str(redtype).find('.') + 1:] + reduce_op = ( + (', dace::_wcr_fixed<%s, %s>()' % (credtype, output_type)) + + ', ' + _topy(node.identity)) + + # Obtain some SDFG-related information + input_data = dfg.memlet_path(dfg.in_edges(node)[0])[0].src + output_data = dfg.memlet_path(dfg.out_edges(node)[0])[-1].dst + input_memlet = dfg.in_edges(node)[0].data + reduce_shape = input_memlet.subset.bounding_box_size() + num_items = ' * '.join([_topy(s) for s in reduce_shape]) + input = (input_memlet.data + ' + ' + cpp_array_expr( + sdfg, input_memlet, with_brackets=False)) + output = (output_memlet.data + ' + ' + cpp_array_expr( + sdfg, output_memlet, with_brackets=False)) + + # Options: Device-wide reduction (even from device code), + # block-wide reduction, sequential reduction (for loop) + if node.schedule == types.ScheduleType.GPU_Device: + # Verify that data is on the GPU + if input_data.desc(sdfg).storage not in [ + types.StorageType.GPU_Global, types.StorageType.CPU_Pinned + ]: + raise ValueError('Input of GPU reduction must either reside ' + ' in global GPU memory or pinned CPU memory') + if output_data.desc(sdfg).storage not in [ + types.StorageType.GPU_Global, types.StorageType.CPU_Pinned + ]: + raise ValueError('Output of GPU reduction must either reside ' + ' in global GPU memory or pinned CPU memory') + + # TODO(later): Enable device-wide reduction from device through + # CUDA dynamic parallelism. It is disabled right now + # due to temporary memory allocation (which needs to be done + # on the host). 
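As a rough illustration of the two-phase CUB pattern emitted for device-wide reductions (a temporary-storage size query with null pointers, followed by the actual `cub::DeviceReduce` call on a stream), the hedged Python sketch below only assembles the corresponding C++ strings; identifiers such as `__cub_storage_example` and the `N * M` item count are placeholders, not the exact text produced by the generator:

```python
def cub_reduce_snippets(kname, intype, outtype, num_items,
                        redop='', stream='nullptr', uid='example'):
    """Build the setup/launch/teardown strings for a CUB device-wide
    reduction: first query the temporary-storage size, then allocate it
    and run the reduction, and finally free the storage."""
    setup = (
        f'cub::DeviceReduce::{kname}(nullptr, __cub_ssize_{uid},\n'
        f'    ({intype}*)nullptr, ({outtype}*)nullptr, {num_items}{redop});\n'
        f'cudaMalloc(&__cub_storage_{uid}, __cub_ssize_{uid});\n')
    launch = (
        f'cub::DeviceReduce::{kname}(__cub_storage_{uid}, __cub_ssize_{uid},\n'
        f'    input, output, {num_items}{redop}, {stream});\n')
    teardown = f'cudaFree(__cub_storage_{uid});\n'
    return setup, launch, teardown

for part in cub_reduce_snippets('Sum', 'float', 'float', 'N * M'):
    print(part)
```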
+ if self._in_device_code: + raise NotImplementedError('Device-wide reduction can only be' + ' run on non-GPU code.') + + # Determine reduction type + kname = (_SPECIAL_RTYPES[redtype] + if redtype in _SPECIAL_RTYPES else 'Reduce') + + # Create temp memory for this GPU + self._globalcode.write( + """ + void *__cub_storage_{sdfg}_{state}_{node} = NULL; + size_t __cub_ssize_{sdfg}_{state}_{node} = 0; + """.format(sdfg=sdfg.name, state=state_id, node=node_id), sdfg, + state_id, node) + + # Call CUB to get the storage size, allocate and free it + self.scope_entry_stream.write( + """ + cub::DeviceReduce::{kname}(nullptr, __cub_ssize_{sdfg}_{state}_{node}, + ({intype}*)nullptr, ({outtype}*)nullptr, {num_items}{redop}); + cudaMalloc(&__cub_storage_{sdfg}_{state}_{node}, __cub_ssize_{sdfg}_{state}_{node}); +""".format(sdfg=sdfg.name, + state=state_id, + node=node_id, + num_items=num_items, + redop=reduce_op, + intype=input_data.desc(sdfg).dtype.ctype, + outtype=output_data.desc(sdfg).dtype.ctype, + kname=kname), sdfg, state_id, node) + + self.scope_exit_stream.write( + 'cudaFree(__cub_storage_{sdfg}_{state}_{node});'.format( + sdfg=sdfg.name, state=state_id, node=node_id), sdfg, + state_id, node) + + max_streams = Config.get('compiler', 'cuda', + 'max_concurrent_streams') + if max_streams >= 0: + cudastream = 'dace::cuda::__streams[%d]' % node._cuda_stream + else: + cudastream = 'nullptr' + + # Write reduction function definition + self._localcode.write( + """ +DACE_EXPORTED void __dace_reduce_{id}({intype} *input, {outtype} *output, + size_t num_items); +void __dace_reduce_{id}({intype} *input, {outtype} *output, size_t num_items) +{{ + cub::DeviceReduce::{kname}(__cub_storage_{id}, __cub_ssize_{id}, + input, output, num_items{redop}, {stream}); +}} + """.format( + id=idstr, + intype=input_data.desc(sdfg).dtype.ctype, + outtype=output_data.desc(sdfg).dtype.ctype, + kname=kname, + redop=reduce_op, + stream=cudastream), sdfg, state_id, node) + + # Write reduction function definition in caller file + function_stream.write( + """ +DACE_EXPORTED void __dace_reduce_{id}({intype} *input, {outtype} *output, + size_t num_items); + """.format( + id=idstr, + intype=input_data.desc(sdfg).dtype.ctype, + outtype=output_data.desc(sdfg).dtype.ctype), sdfg, + state_id, node) + + # Call reduction function where necessary + input_dims = input_memlet.subset.dims() + output_dims = output_memlet.subset.data_dims() + if (node.axes is None or len(node.axes) == input_dims): + callsite_stream.write( + '__dace_reduce_{id}({input}, {output}, {num_items});' + .format( + id=idstr, + input=input, + output=output, + num_items=num_items), sdfg, state_id, node) + else: + raise NotImplementedError( + 'Multiple axis reductions not supported on GPUs. Please ' + 'apply ReduceExpansion') + # Generate for loops around CUB calls and properly offset input + # and output arrays + #for axis in range(output_dims): + # if axis not in node.axes: + # callsite_stream.write( + # 'for (int {var} = {begin}; {var} < {end}; {var} += {skip}) {{'. 
+ # format( + # var='__o%d' % axis, + # begin=output_subset[axis][0], + # end=output_subset[axis][1] + 1, + # skip=output_subset[axis][2]), sdfg, state_id, node) + # + ### Obtain variable names per output and reduction axis + #axis_vars = [] + #octr = 0 + #for d in range(input_dims): + # if d not in axes: + # axis_vars.append('__o%d' % octr) + # octr += 1 + # + #input = (input_memlet.data.name + ' + ' + cpp_array_expr( + # sdfg, input_memlet, with_brackets=False)) + #output = (output_memlet.data.name + ' + ' + cpp_array_expr( + # sdfg, output_memlet, with_brackets=False)) + #num_items = + # + #callsite_stream.write( + # '__dace_reduce_{id}({input}, {output}, {num_items});' + # .format( + # id=idstr, + # input=input, + # output=output, + # num_items=num_items), sdfg, state_id, node) + # + ##cpp_array_expr(sdfg, + ## output_memlet, + ## offset=['__o%d' % i for i in range(output_dims)], + ## relative_offset=False)) + ## invar = cpp_array_expr(sdfg, + ## input_memlet, offset=axis_vars, relative_offset=False) + #for axis in range(output_dims): + # callsite_stream.write('}\n', sdfg, state_id, node) + return + + # Block-wide reduction + elif node.schedule == types.ScheduleType.GPU_ThreadBlock: + # Checks + if not self._in_device_code: + raise ValueError('Block-wide GPU reduction must occur within' + ' a GPU kernel') + for bdim in self._block_dims: + if symbolic.issymbolic(bdim, sdfg.constants): + raise ValueError( + 'Block size has to be constant for block-wide ' + 'reduction (got %s)' % str(bdim)) + if (node.axes is not None and len(node.axes) < input_dims): + raise ValueError( + 'Only full reduction is supported for block-wide reduce,' + ' please use ReduceExpansion') + if (input_data.desc(sdfg).storage != types.StorageType.GPU_Stack + or output_data.desc(sdfg).storage != + types.StorageType.GPU_Stack): + raise ValueError( + 'Block-wise reduction only supports GPU register inputs ' + 'and outputs') + if redtype in _SPECIAL_RTYPES: + raise ValueError('%s block reduction not supported' % redtype) + + credtype = 'dace::ReductionType::' + str( + redtype)[str(redtype).find('.') + 1:] + if redtype == types.ReductionType.Custom: + redop = '__reduce_%s()' % idstr + else: + redop = 'dace::_wcr_fixed<%s, %s>()' % (credtype, output_type) + + # Allocate shared memory for block reduce + self.scope_entry_stream.write( + """ + typedef cub::BlockReduce<{type}, {numthreads}> BlockReduce_{id}; + __shared__ typename BlockReduce_{id}::TempStorage temp_storage_{id}; + """.format( + id=idstr, + type=output_data.desc(sdfg).dtype.ctype, + numthreads=' * '.join(str(s) for s in self._block_dims)), + sdfg, state_id, node) + + # TODO(later): If less than the whole block is participating, + # use special CUB function + output = cpp_array_expr(sdfg, output_memlet) + callsite_stream.write( + """ + {output} = BlockReduce_{id}(temp_storage_{id}).Reduce({input}, {redop}); + """.format( + id=idstr, + redop=redop, + input=input_memlet.data, + output=output), sdfg, state_id, node) + + return + # Sequential goes to CPU generator + elif node.schedule == types.ScheduleType.Sequential: + self._cpu_codegen._generate_Reduce( + sdfg, dfg, state_id, node, function_stream, callsite_stream) + return + else: + raise ValueError( + 'Unsupported reduction schedule %s' % str(node.schedule)) + + +######################################################################## +######################################################################## +######################################################################## 
+######################################################################## +# Helper functions and classes + + +def unparse_cr_split(wcr_ast): + """ Parses various types of WCR functions, returning a 3-tuple of body, + first argument name and second argument name. """ + if isinstance(wcr_ast, ast.FunctionDef): + return (cppunparse.cppunparse(wcr_ast.body, expr_semicolon=False), + wcr_ast.args.args[0].arg, wcr_ast.args.args[1].arg) + elif isinstance(wcr_ast, ast.Lambda): + return (('return (' + cppunparse.cppunparse( + wcr_ast.body, expr_semicolon=False) + ');'), + wcr_ast.args.args[0].arg, wcr_ast.args.args[1].arg) + elif isinstance(wcr_ast, ast.Module): + return unparse_cr_split(wcr_ast.body[0].value) + elif isinstance(wcr_ast, str): + return unparse_cr_split(LambdaProperty.from_string(wcr_ast)) + else: + raise NotImplementedError('INVALID TYPE OF WCR: ' + + type(wcr_ast).__name__) + + +def _topy(arr): + """ Converts an array of symbolic variables (or one) to C++ strings. """ + if not isinstance(arr, list): + return cppunparse.pyexpr2cpp(symbolic.symstr(arr)) + return [cppunparse.pyexpr2cpp(symbolic.symstr(d)) for d in arr] + + +def _named_idx(idx): + """ Converts 0 to x, 1 to y, 2 to z, or raises an exception. """ + if idx < 0 or idx > 2: + raise ValueError('idx must be between 0 and 2, got %d' % idx) + return ('x', 'y', 'z')[idx] + + +def _get_storagename(storage): + """ Returns a string containing the name of the storage location. + Example: types.StorageType.GPU_Shared will return "Shared". """ + sname = str(storage) + return sname[sname.rindex('_') + 1:] diff --git a/dace/codegen/targets/framecode.py b/dace/codegen/targets/framecode.py new file mode 100644 index 0000000000..763fe640ae --- /dev/null +++ b/dace/codegen/targets/framecode.py @@ -0,0 +1,936 @@ +from typing import Set + +import collections +import dace +import functools +from dace.codegen.prettycode import CodeIOStream +from dace.codegen.targets.target import TargetCodeGenerator, TargetDispatcher +from dace.sdfg import SDFG, SDFGState, ScopeSubgraphView +from dace.graph import nodes +from dace import types, config + +from dace.frontend.python import ndarray +from dace.codegen.instrumentation.perfsettings import PerfSettings, PerfUtils +from dace.codegen import cppunparse + +import networkx as nx +import numpy as np + + +class DaCeCodeGenerator(object): + """ DaCe code generator class that writes the generated code for SDFG + state machines, and uses a dispatcher to generate code for + individual states based on the target. 
""" + + def __init__(self, *args, **kwargs): + self._dispatcher = TargetDispatcher() + self._dispatcher.register_state_dispatcher(self) + self._initcode = CodeIOStream() + self._exitcode = CodeIOStream() + + ################################################################## + # Target registry + + @property + def dispatcher(self): + return self._dispatcher + + ################################################################## + # Code generation + + def generate_constants(self, sdfg: SDFG, callsite_stream: CodeIOStream): + # Write constants + for cstname, cstval in sdfg.constants.items(): + if isinstance(cstval, np.ndarray): + if isinstance(cstval, ndarray.ndarray): + dtype = cstval.descriptor.dtype + else: + dtype = types.typeclass(cstval.dtype.type) + const_str = "constexpr " + dtype.ctype + \ + " " + cstname + "[" + str(cstval.size) + "] = {" + it = np.nditer(cstval, order='C') + for i in range(cstval.size - 1): + const_str += str(it[0]) + ", " + it.iternext() + const_str += str(it[0]) + "};\n" + callsite_stream.write(const_str, sdfg) + else: + callsite_stream.write( + "constexpr auto %s = %s;\n" % (cstname, str(cstval)), sdfg) + + def generate_fileheader(self, sdfg: SDFG, global_stream: CodeIOStream): + """ Generate a header in every output file that includes custom types + and constants. + @param sdfg: The input SDFG. + @param global_stream: Stream to write to (global). + """ + ######################################################### + # Custom types + types = set() + # Types of this SDFG + for sdfg, arrname, arr in sdfg.arrays_recursive(): + if arr is not None: + types.add(arr.dtype) + + # Emit unique definitions + global_stream.write('\n') + for typ in types: + if hasattr(typ, 'emit_definition'): + global_stream.write(typ.emit_definition(), sdfg) + global_stream.write('\n') + + ######################################################### + # Write constants + self.generate_constants(sdfg, global_stream) + + def generate_header(self, sdfg: SDFG, global_stream: CodeIOStream, + callsite_stream: CodeIOStream): + """ Generate the header of the frame-code. Code exists in a separate + function for overriding purposes. + @param sdfg: The input SDFG. + @param global_stream: Stream to write to (global). + @param callsite_stream: Stream to write to (at call site). + """ + fname = sdfg.name + params = sdfg.signature() + + # Write frame code - header + global_stream.write( + '/* DaCe AUTO-GENERATED FILE. DO NOT MODIFY */\n' + + '#include \n', sdfg) + + # Added for instrumentation includes + if PerfSettings.perf_enable_instrumentation(): + global_stream.write( + '/* DaCe instrumentation include */\n' + + '#include \n', sdfg) + + self.generate_fileheader(sdfg, callsite_stream) + + callsite_stream.write( + 'void __program_%s_internal(%s)\n{\n' % (fname, params), sdfg) + + # Define the performance store (autocleanup on destruction) + if PerfSettings.perf_enable_instrumentation(): + callsite_stream.write( + 'dace_perf::PAPI::init();\n' + 'dace_perf::%s __perf_store;\n' + % PerfUtils.perf_counter_store_string( + PerfSettings.perf_default_papi_counters()), sdfg) + + def generate_footer(self, sdfg: SDFG, global_stream: CodeIOStream, + callsite_stream: CodeIOStream): + """ Generate the footer of the frame-code. Code exists in a separate + function for overriding purposes. + @param sdfg: The input SDFG. + @param global_stream: Stream to write to (global). + @param callsite_stream: Stream to write to (at call site). 
+ """ + fname = sdfg.name + params = sdfg.signature() + paramnames = sdfg.signature(False) + + # Write frame code - footer + callsite_stream.write('}\n', sdfg) + + # Write awkward footer to avoid 'extern "C"' issues + callsite_stream.write( + """ +void __program_%s_internal(%s); +DACE_EXPORTED void __program_%s(%s) +{ + __program_%s_internal(%s); +} +""" % (fname, params, fname, params, fname, paramnames), sdfg) + + for target in self._dispatcher.used_targets: + if target.has_initializer: + callsite_stream.write( + 'DACE_EXPORTED int __dace_init_%s(%s);\n' % + (target.target_name, params), sdfg) + if target.has_finalizer: + callsite_stream.write( + 'DACE_EXPORTED int __dace_exit_%s(%s);\n' % + (target.target_name, params), sdfg) + + callsite_stream.write( + """ +DACE_EXPORTED int __dace_init(%s) +{ + int result = 0; +""" % params, sdfg) + + for target in self._dispatcher.used_targets: + if target.has_initializer: + callsite_stream.write( + 'result |= __dace_init_%s(%s);' % (target.target_name, + paramnames), sdfg) + + callsite_stream.write(self._initcode.getvalue(), sdfg) + + callsite_stream.write( + """ + return result; +} + +DACE_EXPORTED void __dace_exit(%s) +{ +""" % params, sdfg) + + callsite_stream.write(self._exitcode.getvalue(), sdfg) + + for target in self._dispatcher.used_targets: + if target.has_finalizer: + callsite_stream.write( + '__dace_exit_%s(%s);' % (target.target_name, paramnames), + sdfg) + + callsite_stream.write('}\n', sdfg) + + def generate_state(self, + sdfg, + state, + global_stream, + callsite_stream, + generate_state_footer=True): + + sid = sdfg.node_id(state) + + # Emit internal transient array allocation + # Don't allocate transients shared with another state + data_to_allocate = ( + set(state.top_level_transients()) - set(sdfg.shared_transients())) + allocated = set() + for node in state.data_nodes(): + if node.data not in data_to_allocate or node.data in allocated: + continue + allocated.add(node.data) + self._dispatcher.dispatch_allocate(sdfg, state, sid, node, + global_stream, callsite_stream) + self._dispatcher.dispatch_initialize( + sdfg, state, sid, node, global_stream, callsite_stream) + + ##################### + # Create dataflow graph for state's children. + + # DFG to code scheme: Only generate code for nodes whose all + # dependencies have been executed (topological sort). + # For different connected components, run them concurrently. + + components = dace.sdfg.concurrent_subgraphs(state) + + if len(components) == 1: + self._dispatcher.dispatch_subgraph( + sdfg, + state, + sid, + global_stream, + callsite_stream, + skip_entry_node=False) + else: + ############################################################# + # Instrumentation: Pre-state + # We cannot have supersections starting in parallel + parent_id = PerfUtils.unified_id(-1, sid) + if PerfSettings.perf_enable_instrumentation(): + callsite_stream.write( + "__perf_store.markSuperSectionStart(%d);\n" % + PerfUtils.unified_id(-1, sid)) + ############################################################# + + callsite_stream.write("#pragma omp parallel sections\n{") + for c in components: + c.set_parallel_parent( + parent_id + ) # Keep in mind not to add supersection start markers! 
+ callsite_stream.write("#pragma omp section\n{") + self._dispatcher.dispatch_subgraph( + sdfg, + c, + sid, + global_stream, + callsite_stream, + skip_entry_node=False) + callsite_stream.write("} // End omp section") + callsite_stream.write("} // End omp sections") + + ##################### + # Write state footer + + if generate_state_footer: + # Emit internal transient array deallocation + deallocated = set() + for node in state.data_nodes(): + if node.data not in data_to_allocate or node.data in deallocated: + continue + deallocated.add(node.data) + self._dispatcher.dispatch_deallocate( + sdfg, state, sid, node, global_stream, callsite_stream) + + @staticmethod + def _generate_assignments(assignments): + return [ + "{} = {}".format(variable, value) + for variable, value in assignments.items() + ] + + @staticmethod + def _is_always_true(condition_string): + return condition_string in ["true", "1"] + + def _generate_transition(self, sdfg, sid, callsite_stream, edge, + assignments): + + condition_string = cppunparse.cppunparse(edge.data.condition, False) + always_true = self._is_always_true(condition_string) + + if not always_true: + callsite_stream.write("if ({}) {{".format(condition_string), sdfg, + sid) + + if len(assignments) > 0: + callsite_stream.write( + ";\n".join( + DaCeCodeGenerator._generate_assignments(assignments) + + [""]), sdfg, sid) + + callsite_stream.write( + "goto __state_{}_{};".format(sdfg.name, edge.dst.label), sdfg, sid) + + if not always_true: + callsite_stream.write("}") + + def generate_states(self, sdfg, scope_label, control_flow, global_stream, + callsite_stream, scope, states_generated): + + states_topological = list(sdfg.topological_sort(sdfg.start_state)) + states_to_generate = collections.deque([ + s for s in states_topological + if s in scope and s not in states_generated + ]) + if len(states_to_generate) == 0: + return + + while len(states_to_generate) > 0: + + state = states_to_generate.popleft() + # When generating control flow constructs, we will not necessarily + # move in topological order, so make sure this state has not + # already been generated. 
+ if state in states_generated or state not in scope: + continue + states_generated.add(state) + + sid = sdfg.node_id(state) + + callsite_stream.write( + "__state_{}_{}:\n".format(sdfg.name, state.label), sdfg, sid) + + # Don't generate brackets and comments for empty states + if len([ + n for n in state.nodes() + if not isinstance(n, dace.graph.nodes.EmptyTasklet) + ]) > 0: + + callsite_stream.write('{', sdfg, sid) + + self._dispatcher.dispatch_state(sdfg, state, global_stream, + callsite_stream) + + callsite_stream.write('}', sdfg, sid) + + else: + + callsite_stream.write(";") + + out_edges = sdfg.out_edges(state) + + # Write conditional branches to next states + for edge in out_edges: + + generate_assignments = True + generate_transition = True + + # Handle specialized control flow + if (dace.config.Config.get_bool('optimizer', + 'detect_control_flow')): + + for control in control_flow[edge]: + + if isinstance(control, + dace.graph.edges.LoopAssignment): + # Generate the transition, but leave the + # assignments to the loop + generate_transition = True + generate_assignments = False + + elif isinstance(control, dace.graph.edges.LoopBack): + generate_transition = False + generate_assignments = False + + elif isinstance(control, dace.graph.edges.LoopExit): + # Need to strip the condition, so generate it from + # the loop entry + generate_transition = False + generate_assignments = True + pass + + elif isinstance(control, dace.graph.edges.LoopEntry): + generate_transition = False + generate_assignments = False + + if control.scope.assignment is not None: + assignment_edge = control.scope.assignment.edge + init_assignments = ", ".join( + DaCeCodeGenerator._generate_assignments( + assignment_edge.data.assignments)) + else: + init_assignments = "" + + back_edge = control.scope.back.edge + continue_assignments = ", ".join( + DaCeCodeGenerator._generate_assignments( + back_edge.data.assignments)) + + entry_edge = control.scope.entry.edge + condition = cppunparse.cppunparse( + entry_edge.data.condition, False) + + if (len(init_assignments) > 0 + or len(continue_assignments) > 0): + callsite_stream.write( + "for ({}; {}; {}) {{".format( + init_assignments, condition, + continue_assignments), sdfg, sid) + else: + callsite_stream.write( + "while ({}) {{".format(condition), sdfg, + sid) + + # Generate loop body + self.generate_states( + sdfg, entry_edge.src.label + "_loop", + control_flow, global_stream, callsite_stream, + control.scope, states_generated) + + callsite_stream.write("}", sdfg, sid) + + exit_edge = control.scope.exit.edge + + # Update states to generate after nested call + states_to_generate = collections.deque([ + s for s in states_to_generate + if s not in states_generated + ]) + # If the next state to be generated is the exit + # state, we can omit the goto + if (len(states_to_generate) > 0 + and states_to_generate[0] == exit_edge.dst + and exit_edge.dst not in states_generated): + pass + else: + callsite_stream.write( + "goto __state_{}_{};".format( + sdfg.name, + control.scope.exit.edge.dst)) + + elif isinstance(control, dace.graph.edges.IfExit): + generate_transition = True + generate_assignments = True + + elif isinstance(control, dace.graph.edges.IfEntry): + generate_transition = False + generate_assignments = True + + if len(set(control.scope) - states_generated) == 0: + continue + + then_scope = control.scope.if_then_else.then_scope + else_scope = control.scope.if_then_else.else_scope + + then_entry = then_scope.entry.edge + + condition = cppunparse.cppunparse( + 
then_entry.data.condition, False) + + callsite_stream.write( + "if ({}) {{".format(condition), sdfg, sid) + + # Generate the then-scope + self.generate_states(sdfg, state.label + "_then", + control_flow, global_stream, + callsite_stream, then_scope, + states_generated) + + callsite_stream.write("} else {", sdfg, sid) + + # Generate the else-scope + self.generate_states(sdfg, state.label + "_else", + control_flow, global_stream, + callsite_stream, else_scope, + states_generated) + + callsite_stream.write("}", sdfg, sid) + + # Update states to generate after nested call + states_to_generate = collections.deque([ + s for s in states_to_generate + if s not in states_generated + ]) + + if_exit_state = control.scope.exit.edge.dst + + if ((if_exit_state not in states_generated) and + ((len(states_to_generate) > 0) and + (states_to_generate[0] == if_exit_state))): + pass + else: + callsite_stream.write( + "goto __state_{}_{};".format( + sdfg.name, + control.scope.exit.edge.dst)) + + else: + + raise TypeError( + "Unknown control flow \"{}\"".format( + type(control).__name__)) + + if generate_assignments and len(edge.data.assignments) > 0: + assignments_to_generate = edge.data.assignments + else: + assignments_to_generate = {} + + if generate_transition: + + if ((len(out_edges) == 1) + and (edge.dst not in states_generated) + and ((len(states_to_generate) > 0) and + (states_to_generate[0] == edge.dst))): + # If there is only one outgoing edge, the target will + # be generated next, we can omit the goto + pass + elif (len(out_edges) == 1 and len(states_to_generate) == 0 + and (edge.dst not in scope)): + # This scope has ended, and we don't need to generate + # any output edge + pass + else: + self._generate_transition(sdfg, sid, callsite_stream, + edge, + assignments_to_generate) + # Assignments will be generated in the transition + generate_assignments = False + + if generate_assignments: + + callsite_stream.write( + ";\n".join( + DaCeCodeGenerator._generate_assignments( + assignments_to_generate) + [""]), sdfg, sid) + + if (((len(out_edges) == 0) or + (not isinstance(scope, dace.graph.edges.ControlFlowScope) and + (len(states_to_generate) == 0))) + and (len(states_generated) != sdfg.number_of_nodes())): + callsite_stream.write( + "goto __state_exit_{}_{};".format(sdfg.name, scope_label), + sdfg, sid) + + # Write exit state + callsite_stream.write( + "__state_exit_{}_{}:;".format(sdfg.name, scope_label), sdfg) + + @staticmethod + def all_nodes_between(graph, begin, end): + """Finds all nodes between begin and end. Returns None if there is any + path starting at begin that does not reach end.""" + to_visit = [begin] + seen = set() + while len(to_visit) > 0: + n = to_visit.pop() + if n == end: + continue # We've reached the end node + if n in seen: + continue # We've already visited this node + seen.add(n) + # Keep chasing all paths to reach the end node + node_out_edges = graph.out_edges(n) + if len(node_out_edges) == 0: + # We traversed to the end without finding the end + return None + for e in node_out_edges: + next_node = e.dst + if next_node != end and next_node not in seen: + to_visit.append(next_node) + return seen + + def generate_code(self, + sdfg: SDFG, + schedule: types.ScheduleType, + sdfg_id: str = "" + ) -> (str, str, Set[TargetCodeGenerator]): + """ Generate frame code for a given SDFG, calling registered targets' + code generation callbacks for them to generate their own code. + @param sdfg: The SDFG to generate code for. 
+ @param schedule: The schedule the SDFG is currently located, or + None if the SDFG is top-level. + @param sdfg_id: An optional string id given to the SDFG label + @return: A tuple of the generated global frame code, local frame + code, and a set of targets that have been used in the + generation of this SDFG. + """ + + sdfg_label = sdfg.name + sdfg_id + + global_stream = CodeIOStream() + callsite_stream = CodeIOStream() + + # Set default storage/schedule types in SDFG + _set_default_schedule_and_storage_types(sdfg, schedule) + + # Generate preamble (if top-level) + if schedule is None: + self.generate_header(sdfg, global_stream, callsite_stream) + + # Generate code + ########################### + + if sdfg.parent is not None: + # Nested SDFG + symbols_available = sdfg.parent.symbols_defined_at(sdfg) + else: + symbols_available = sdfg.constants + + # Allocate outer-level transients + shared_transients = sdfg.shared_transients() + allocated = set() + for state in sdfg.nodes(): + for node in state.data_nodes(): + if (node.data in shared_transients + and node.data not in allocated): + self._dispatcher.dispatch_allocate(sdfg, state, None, node, + global_stream, + callsite_stream) + self._dispatcher.dispatch_initialize( + sdfg, state, None, node, global_stream, + callsite_stream) + allocated.add(node.data) + + # Allocate inter-state variables + assigned, _ = sdfg.interstate_symbols() + for isvarName, isvarType in assigned.items(): + # Skip symbols that have been declared as outer-level transients + if isvarName in allocated: + continue + callsite_stream.write( + '%s;\n' % (isvarType.signature( + with_types=True, name=isvarName)), sdfg) + + # Initialize parameter arrays + for argnode in types.deduplicate(sdfg.input_arrays() + + sdfg.output_arrays()): + # Ignore transient arrays + if argnode.desc(sdfg).transient: continue + self._dispatcher.dispatch_initialize( + sdfg, sdfg, None, argnode, global_stream, callsite_stream) + + callsite_stream.write('\n', sdfg) + + states_topological = list(sdfg.topological_sort(sdfg.start_state)) + + # {edge: [dace.edges.ControlFlow]} + control_flow = {e: [] for e in sdfg.edges()} + + if dace.config.Config.get_bool('optimizer', 'detect_control_flow'): + + #################################################################### + # Loop detection procedure + + all_cycles = list(sdfg.find_cycles()) # Returns a list of lists + # Order according to topological sort + all_cycles = [ + sorted(c, key=lambda x: states_topological.index(x)) + for c in all_cycles + ] + # Group in terms of starting node + starting_nodes = [c[0] for c in all_cycles] + cycles_by_node = [[c for c in all_cycles if c[0] == n] + for n in starting_nodes] + for cycles in cycles_by_node: + + # Use arbitrary cycle to find the first and last nodes + first_node = cycles[0][0] + last_node = cycles[0][-1] + + if not first_node.is_empty(): + # The entry node should not contain any computations + continue + + if not all([c[-1] == last_node for c in cycles]): + # There are multiple back edges: not a for or while loop + continue + + previous_edge = [ + e for e in sdfg.in_edges(first_node) if e.src != last_node + ] + if len(previous_edge) != 1: + # No single starting point: not a for or while + continue + previous_edge = previous_edge[0] + + back_edge = sdfg.edges_between(last_node, first_node) + if len(back_edge) != 1: + raise RuntimeError("Expected exactly one edge in cycle") + back_edge = back_edge[0] + + # Build a set of all nodes in all cycles associated with this + # set of start and end node + 
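The loop-detection pass above operates purely on the state-machine graph: every cycle is sorted topologically, grouped by its entry state, and accepted as a for/while loop only if there is a single entry edge, a single back edge, and a single exit edge. The following is a minimal, hedged sketch of that structural test on a plain networkx digraph; the state names (`init`, `guard`, `body`, `end`) are made up for illustration:

```python
import networkx as nx

def looks_like_simple_loop(g, cycle):
    """Check the shape conditions used above for one candidate cycle:
    exactly one incoming edge from outside the cycle, one back edge,
    and one exit edge leaving the entry node."""
    first, last = cycle[0], cycle[-1]
    members = set(cycle)
    entry_edges = [e for e in g.in_edges(first) if e[0] != last]
    back_edges = [e for e in g.edges() if e == (last, first)]
    exit_edges = [e for e in g.out_edges(first) if e[1] not in members]
    return len(entry_edges) == 1 and len(back_edges) == 1 and len(exit_edges) == 1

g = nx.DiGraph([('init', 'guard'), ('guard', 'body'),
                ('body', 'guard'), ('guard', 'end')])
print(looks_like_simple_loop(g, ['guard', 'body']))  # True: a simple while loop
```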
internal_nodes = functools.reduce( + lambda a, b: a | b, [set(c) + for c in cycles]) - {first_node} + + exit_edge = [ + e for e in sdfg.out_edges(first_node) + if e.dst not in internal_nodes | {first_node} + ] + if len(exit_edge) != 1: + # No single stopping condition: not a for or while + # (we don't support continue or break) + continue + exit_edge = exit_edge[0] + + entry_edge = [ + e for e in sdfg.out_edges(first_node) if e != exit_edge + ] + if len(entry_edge) != 1: + # No single starting condition: not a for or while + continue + entry_edge = entry_edge[0] + + # Make sure this is not already annotated to be another construct + if (len(control_flow[entry_edge]) != 0 + or len(control_flow[back_edge]) != 0 + or len(control_flow[exit_edge]) != 0): + continue + + if entry_edge == back_edge: + # No entry check (we don't support do-loops) + # TODO: do we want to add some support for self-loops? + continue + + # Now we make sure that there is no other way to exit this + # cycle, by checking that there's no reachable node *not* + # included in any cycle between the first and last node. + if any([len(set(c) - internal_nodes) > 1 for c in cycles]): + continue + + # This is a loop! Generate the necessary annotation objects. + loop_scope = dace.graph.edges.LoopScope(internal_nodes) + + if ((len(previous_edge.data.assignments) > 0 + or len(back_edge.data.assignments) > 0) + and len(control_flow[previous_edge]) == 0): + # Generate assignment edge, if available + control_flow[previous_edge].append( + dace.graph.edges.LoopAssignment( + loop_scope, previous_edge)) + # Assign remaining control flow constructs + control_flow[entry_edge].append( + dace.graph.edges.LoopEntry(loop_scope, entry_edge)) + control_flow[exit_edge].append( + dace.graph.edges.LoopExit(loop_scope, exit_edge)) + control_flow[back_edge].append( + dace.graph.edges.LoopBack(loop_scope, back_edge)) + + ################################################################### + # If/then/else detection procedure + + candidates = [ + n for n in states_topological if sdfg.out_degree(n) == 2 + ] + for candidate in candidates: + + # A valid if occurs when then are no reachable nodes for either + # path that does not pass through a common dominator. + dominators = nx.dominance.dominance_frontiers( + sdfg.nx, candidate) + + left_entry, right_entry = sdfg.out_edges(candidate) + if (len(control_flow[left_entry]) > 0 + or len(control_flow[right_entry]) > 0): + # Already assigned to a control flow construct + # TODO: carefully allow this in some cases + continue + + left, right = left_entry.dst, right_entry.dst + dominator = dominators[left] & dominators[right] + if len(dominator) != 1: + # There must be a single dominator across both branches, + # unless one of the nodes _is_ the next dominator + # if (len(dominator) == 0 and dominators[left] == {right} + # or dominators[right] == {left}): + # dominator = dominators[left] | dominators[right] + # else: + # continue + continue + dominator = next(iter(dominator)) # Exactly one dominator + + exit_edges = sdfg.in_edges(dominator) + if len(exit_edges) != 2: + # There must be a single entry and a single exit. This + # could be relaxed in the future. 
+ continue + + left_exit, right_exit = exit_edges + if (len(control_flow[left_exit]) > 0 + or len(control_flow[right_exit]) > 0): + # Already assigned to a control flow construct + # TODO: carefully allow this in some cases + continue + + # Now traverse from the source and verify that all possible paths + # pass through the dominator + left_nodes = DaCeCodeGenerator.all_nodes_between( + sdfg, left, dominator) + if left_nodes is None: + # Not all paths lead to the next dominator + continue + right_nodes = DaCeCodeGenerator.all_nodes_between( + sdfg, right, dominator) + if right_nodes is None: + # Not all paths lead to the next dominator + continue + all_nodes = left_nodes | right_nodes + + # Make sure there is no overlap between left and right nodes + if len(left_nodes & right_nodes) > 0: + continue + + # This is a valid if/then/else construct. Generate annotations + if_then_else = dace.graph.edges.IfThenElse( + candidate, dominator) + + # Arbitrarily assign then/else to the two branches. If one edge + # has no dominator but leads to the dominator, it means there's + # only a then clause (and no else). + has_else = False + if len(dominators[left]) == 1: + then_scope = dace.graph.edges.IfThenScope( + if_then_else, left_nodes) + else_scope = dace.graph.edges.IfElseScope( + if_then_else, right_nodes) + control_flow[left_entry].append( + dace.graph.edges.IfEntry(then_scope, left_entry)) + control_flow[left_exit].append( + dace.graph.edges.IfExit(then_scope, left_exit)) + control_flow[right_exit].append( + dace.graph.edges.IfExit(else_scope, right_exit)) + if len(dominators[right]) == 1: + control_flow[right_entry].append( + dace.graph.edges.IfEntry(else_scope, right_entry)) + has_else = True + else: + then_scope = dace.graph.edges.IfThenScope( + if_then_else, right_nodes) + else_scope = dace.graph.edges.IfElseScope( + if_then_else, left_nodes) + control_flow[right_entry].append( + dace.graph.edges.IfEntry(then_scope, right_entry)) + control_flow[right_exit].append( + dace.graph.edges.IfExit(then_scope, right_exit)) + control_flow[left_exit].append( + dace.graph.edges.IfExit(else_scope, left_exit)) + + ####################################################################### + # State transition generation + + states_generated = set() # For sanity check + self.generate_states(sdfg, "sdfg", control_flow, + global_stream, callsite_stream, + set(states_topological), states_generated) + + ############################# + # End of code generation + + if len(states_generated) != len(sdfg.nodes()): + raise RuntimeError( + "Not all states were generated in SDFG {}!" 
+ "\n Generated: {}\n Missing: {}".format( + sdfg.label, [s.label for s in states_generated], + [s.label for s in (set(sdfg.nodes()) - states_generated)])) + + # Deallocate transients + shared_transients = sdfg.shared_transients() + deallocated = set() + for state in sdfg.nodes(): + for node in state.data_nodes(): + if (node.data in shared_transients + and node.data not in deallocated): + self._dispatcher.dispatch_deallocate( + sdfg, sdfg, None, node, global_stream, callsite_stream) + deallocated.add(node.data) + + ########################### + + # Generate footer (if top-level) + if schedule is None: + self.generate_footer(sdfg, global_stream, callsite_stream) + + # Clear out all the annotated control flow + + # Return the generated global and local code strings + return (global_stream.getvalue(), callsite_stream.getvalue(), + self._dispatcher.used_targets) + + +def _set_default_schedule_and_storage_types(sdfg, toplevel_schedule): + """ Sets default storage and schedule types throughout SDFG. + Replaces `ScheduleType.Default` and `StorageType.Default` + with the corresponding types according to the parent scope's + schedule. """ + for state in sdfg.nodes(): + scope_dict = state.scope_dict() + reverse_scope_dict = state.scope_dict(node_to_children=True) + + def set_default_in_scope(parent_node): + if parent_node is None: + parent_schedule = toplevel_schedule + else: + parent_schedule = parent_node.map.schedule + + for node in reverse_scope_dict[parent_node]: + # Set default schedule type + if isinstance(node, nodes.MapEntry): + if node.map.schedule == types.ScheduleType.Default: + node.map._schedule = \ + types.SCOPEDEFAULT_SCHEDULE[parent_schedule] + # Also traverse children (recursively) + set_default_in_scope(node) + elif isinstance(node, nodes.ConsumeEntry): + if node.consume.schedule == types.ScheduleType.Default: + node.consume._schedule = \ + types.SCOPEDEFAULT_SCHEDULE[parent_schedule] + # Also traverse children (recursively) + set_default_in_scope(node) + elif getattr(node, 'schedule', False): + if node.schedule == types.ScheduleType.Default: + node._schedule = \ + types.SCOPEDEFAULT_SCHEDULE[parent_schedule] + + ## End of recursive function + + # Start with top-level nodes + set_default_in_scope(None) + + # Set default storage type + for node in state.nodes(): + if isinstance(node, nodes.AccessNode): + if node.desc(sdfg).storage == types.StorageType.Default: + if scope_dict[node] is None: + parent_schedule = toplevel_schedule + else: + parent_schedule = scope_dict[node].map.schedule + + node.desc(sdfg).storage = ( + types.SCOPEDEFAULT_STORAGE[parent_schedule]) + ### End of storage type loop diff --git a/dace/codegen/targets/immaterial.py b/dace/codegen/targets/immaterial.py new file mode 100644 index 0000000000..8290b4badc --- /dev/null +++ b/dace/codegen/targets/immaterial.py @@ -0,0 +1,238 @@ +from dace import data, subsets, symbolic, types +from dace.codegen.codeobject import CodeObject +from dace.codegen.targets.target import TargetCodeGenerator +from dace.codegen.targets.cpu import cpp_array_expr, sym2cpp +from dace.graph import nodes + +from dace.codegen import cppunparse + + +class ImmaterialCodeGen(TargetCodeGenerator): + """ Code generator for data nodes with immaterial (i.e., generated + from a function) storage. 
""" + + target_name = 'Immaterial' + language = 'cpp' + + def __init__(self, frame_codegen, sdfg): + self._frame = frame_codegen + self._dispatcher = frame_codegen.dispatcher + dispatcher = self._dispatcher + + self.emitted_materialize_funcs = set() + + # Register dispatchers + dispatcher.register_array_dispatcher(types.StorageType.Immaterial, + self) + + cpu_storage = [ + types.StorageType.CPU_Heap, types.StorageType.CPU_Pinned, + types.StorageType.CPU_Stack, types.StorageType.Register + ] + for storage_type in cpu_storage: + dispatcher.register_copy_dispatcher(types.StorageType.Immaterial, + storage_type, None, self) + dispatcher.register_copy_dispatcher( + storage_type, types.StorageType.Immaterial, None, self) + + def get_generated_codeobjects(self): + return [] # Immaterial storage generates inline code + + @property + def has_initializer(self): + return False + + @property + def has_finalizer(self): + return False + + def allocate_array(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + callsite_stream.write("// allocate array\n", sdfg, state_id, node) + + def initialize_array(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + callsite_stream.write("// initialize_array " + node.data + "\n", sdfg, + state_id, node) + + def deallocate_array(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + callsite_stream.write("// deallocate_array", sdfg, state_id, node) + + def copy_memory(self, sdfg, dfg, state_id, src_node, dst_node, edge, + function_stream, callsite_stream): + memlet = edge.data + if (isinstance(src_node, nodes.AccessNode) + and (src_node.desc(sdfg).materialize_func is not None)): + function_stream.write(src_node.desc(sdfg).materialize_func) + + if edge.dst_conn is not None: + arrayname = str(edge.dst_conn) + else: + arrayname = str(dst_node.desc) + + if isinstance(dst_node, nodes.Tasklet) or \ + (dst_node.desc(sdfg).storage == types.StorageType.Register): + callsite_stream.write( + self.memlet_definition( + sdfg, memlet, arrayname, direction="in"), sdfg, + state_id, [src_node, dst_node]) + else: + callsite_stream.write("__dace_materialize(\"" + \ + sym2cpp(src_node) + "\", " + \ + sym2cpp(memlet.subset.min_element()[0]) + + ", " + \ + sym2cpp(memlet.subset.min_element()[0] + + memlet.subset.num_elements()) + + ", " + sym2cpp(dst_node.data) + ");\n", + sdfg, state_id, [src_node, dst_node]) + + if (isinstance(dst_node, nodes.AccessNode) + and (dst_node.desc(sdfg).materialize_func is not None)): + # This case is pretty complicated due to how the rest of the + # codegen works: This is not the place to actually copy code. In + # the place where data is ready to be written there will be a call + # __foo.write(foo) where foo is the local_name of the memlet that + # "causes" the write. But this function is actually called when + # we should set up everything for this call to work. 
+ # The above mentioned code is generated by process_out_memlets + + function_stream.write(dst_node.desc(sdfg).materialize_func) + if isinstance(src_node, nodes.Tasklet) or \ + (src_node.desc(sdfg).storage == types.StorageType.Register): + callsite_stream.write( + self.memlet_definition( + sdfg, memlet, edge.src_conn, direction="out"), sdfg, + state_id, [src_node, dst_node]) + else: + callsite_stream.write("__dace_serialize(\"" + \ + sym2cpp(dst_node) + "\", " + \ + sym2cpp(memlet.subset.min_element()[0]) + + ", " + \ + sym2cpp(memlet.subset.min_element()[0] + + memlet.subset.num_elements()) + + ", " + sym2cpp(src_node.data) + ");\n", + sdfg, state_id, [src_node, dst_node]) + + def memlet_definition(self, sdfg, memlet, local_name, direction="in"): + if isinstance(memlet.data, data.Stream): + return 'auto& %s = %s;\n' % (local_name, memlet.data) + + result = ('auto __%s = ' % local_name + self.memlet_view_ctor( + sdfg, memlet, direction) + ';\n') + + # Allocate variable type + memlet_type = ' dace::vec<%s, %s>' % ( + sdfg.arrays[memlet.data].dtype.ctype, sym2cpp(memlet.veclen)) + if memlet.subset.data_dims() == 0 and memlet.num_accesses >= 0: + result += memlet_type + ' ' + local_name + if direction == "in": + result += ' = __%s;\n' % local_name + else: + result += ';\n' + + return result + + def memlet_view_ctor(self, sdfg, memlet, direction): + useskip = False + memlet_params = [] + + memlet_name = memlet.data + if isinstance(sdfg.arrays[memlet.data], data.Scalar): + raise ValueError("This should never have happened") + + if isinstance(memlet.subset, subsets.Indices): + # Compute address + memlet_params.append(cpp_array_expr(sdfg, memlet, False)) + dims = 0 + + elif isinstance(memlet.subset, subsets.Range): + dims = len(memlet.subset.ranges) + #memlet_params.append("") + + # Dimensions to remove from view (due to having one value) + indexdims = [] + nonIndexDims = [] + + for dim, (rb, re, rs) in enumerate(memlet.subset.ranges): + if rs != 1: + useskip = True + try: + if (re - rb) == 0: + indexdims.append(dim) + else: + nonIndexDims.append(dim) + except TypeError: # cannot determine truth value of Relational + nonIndexDims.append(dim) + + if len(nonIndexDims) > 1 and len(indexdims) > 0: + raise NotImplementedError( + 'subviews of more than one dimension ' + 'not implemented') + elif len( + nonIndexDims) == 1 and len(indexdims) > 0: # One dimension + indexdim = nonIndexDims[0] + + # Contiguous dimension + if indexdim == dims - 1: + memlet_params[-1] += ' + %s' % cpp_array_expr( + sdfg, memlet, False) + memlet_params.append( + '0, %s' % (sym2cpp(memlet.subset.ranges[-1][1] - + memlet.subset.ranges[-1][0]))) + else: # Non-contiguous dimension + useskip = True + memlet_params[-1] += ' + %s' % cpp_array_expr( + sdfg, memlet, False) + memlet_range = memlet.subset.ranges[indexdim] + + # TODO(later): Access order + memlet_stride = functools.reduce( + lambda x, y: x * y, + sdfg.arrays[memlet.data].shape[indexdim + 1:]) + memlet_stride = sym2cpp(memlet_stride) + + memlet_params.append( + '0, %s, %s' % + (sym2cpp(memlet_range[1] - memlet_range[0]), + sym2cpp(memlet_stride))) + + # Subtract index dimensions from array dimensions + dims -= len(indexdims) + + elif len(indexdims) == 0: + for (rb, re, rs), s in zip(memlet.subset.ranges, + sdfg.arrays[memlet.data].shape): + if useskip: + memlet_params.append( + '%s, %s, %s' % + (cppunparse.pyexpr2cpp(symbolic.symstr(rb)), + cppunparse.pyexpr2cpp(symbolic.symstr(s)), + cppunparse.pyexpr2cpp(symbolic.symstr(rs)))) + else: + memlet_params.append( + '%s, %s' % 
+ (cppunparse.pyexpr2cpp(symbolic.symstr(rb)), + cppunparse.pyexpr2cpp(symbolic.symstr(s)))) + elif len(nonIndexDims) == 0: # Scalar view + # Compute address + memlet_params[-1] += ' + ' + cpp_array_expr( + sdfg, memlet, False) + dims = 0 + + else: + raise RuntimeError( + 'Memlet type "%s" not implemented' % memlet.subset) + + if dims == 0: + return 'dace::ArrayViewImmaterial%s%s<%s, %s, int32_t> ("%s", %s)' % ( + 'In' if direction == "in" else "Out", 'Skip' + if useskip else '', sdfg.arrays[memlet.data].dtype.ctype, + symbolic.symstr( + memlet.veclen), memlet.data, ', '.join(memlet_params)) + else: + return 'dace::ArrayViewImmaterial%s%s<%s, %s, int32_t, %s> ("%s", %s)' % ( + 'In' if direction == "in" else "Out", 'Skip' + if useskip else '', sdfg.arrays[memlet.data].dtype.ctype, + symbolic.symstr(memlet.veclen), ', '.join([ + str(s) for s in memlet.subset.bounding_box_size() + ]), memlet.data, ', '.join(memlet_params)) diff --git a/dace/codegen/targets/mpi.py b/dace/codegen/targets/mpi.py new file mode 100644 index 0000000000..72356a7463 --- /dev/null +++ b/dace/codegen/targets/mpi.py @@ -0,0 +1,129 @@ +import dace +from dace import symbolic, types +from dace.codegen.prettycode import CodeIOStream +from dace.codegen.codeobject import CodeObject +from dace.codegen.targets.target import TargetCodeGenerator, make_absolute +from dace.graph import nodes +from dace.config import Config + +from dace.codegen import cppunparse + + +class MPICodeGen(TargetCodeGenerator): + """ An MPI code generator. """ + target_name = 'mpi' + title = 'MPI' + language = 'cpp' + + def __init__(self, frame_codegen, sdfg): + self._frame = frame_codegen + self._dispatcher = frame_codegen.dispatcher + dispatcher = self._dispatcher + + fileheader = CodeIOStream() + self._frame.generate_fileheader(sdfg, fileheader) + + self._codeobj = CodeObject( + sdfg.name + '_mpi', """ +#include +#include + +MPI_Comm __dace_mpi_comm; +int __dace_comm_size = 1; +int __dace_comm_rank = 0; + +{file_header} + +DACE_EXPORTED int __dace_init_mpi({params}); +DACE_EXPORTED void __dace_exit_mpi({params}); + +int __dace_init_mpi({params}) {{ + if (MPI_Init(NULL, NULL) != MPI_SUCCESS) + return 1; + + MPI_Comm_dup(MPI_COMM_WORLD, &__dace_mpi_comm); + MPI_Comm_rank(__dace_mpi_comm, &__dace_comm_rank); + MPI_Comm_size(__dace_mpi_comm, &__dace_comm_size); + + printf(\"MPI was initialized on proc %i of %i\\n\", __dace_comm_rank, + __dace_comm_size); + return 0; +}} + +void __dace_exit_mpi({params}) {{ + MPI_Comm_free(&__dace_mpi_comm); + MPI_Finalize(); + + printf(\"MPI was finalized on proc %i of %i\\n\", __dace_comm_rank, + __dace_comm_size); +}} +""".format(params=sdfg.signature(), file_header=fileheader.getvalue()), 'cpp', + MPICodeGen, 'MPI') + + # Register dispatchers + dispatcher.register_map_dispatcher(types.ScheduleType.MPI, self) + + def get_generated_codeobjects(self): + return [self._codeobj] + + @staticmethod + def cmake_options(): + compiler = make_absolute(Config.get("compiler", "mpi", "executable")) + return [ + "-DMPI_CXX_COMPILER=\"{}\"".format(compiler), + "-DDACE_ENABLE_MPI=ON", + ] + + @property + def has_initializer(self): + return True + + @property + def has_finalizer(self): + return True + + def generate_scope(self, sdfg, dfg_scope, state_id, function_stream, + callsite_stream): + # Take care of map header + assert len(dfg_scope.source_nodes()) == 1 + map_header = dfg_scope.source_nodes()[0] + + function_stream.write('extern int __dace_comm_size, __dace_comm_rank;', + sdfg, state_id, map_header) + + if 
len(map_header.map.params) > 1: + raise NotImplementedError( + 'Multi-dimensional MPI maps are not supported') + + for var, r in zip(map_header.map.params, map_header.map.range): + begin, end, skip = r + + callsite_stream.write('{\n', sdfg, state_id, map_header) + callsite_stream.write( + 'auto %s = %s + __dace_comm_rank * (%s);\n' % + (var, cppunparse.pyexpr2cpp(symbolic.symstr(begin)), + cppunparse.pyexpr2cpp(symbolic.symstr(skip))), sdfg, state_id, + map_header) + + to_allocate = dace.sdfg.local_transients(sdfg, dfg_scope, map_header) + allocated = set() + for child in dfg_scope.scope_dict(node_to_children=True)[map_header]: + if not isinstance(child, nodes.AccessNode): + continue + if child.data not in to_allocate or child.data in allocated: + continue + allocated.add(child.data) + self._dispatcher.dispatch_allocate(sdfg, dfg_scope, state_id, + child, function_stream, + callsite_stream) + self._dispatcher.dispatch_initialize(sdfg, dfg_scope, state_id, + child, function_stream, + callsite_stream) + + self._dispatcher.dispatch_subgraph( + sdfg, + dfg_scope, + state_id, + function_stream, + callsite_stream, + skip_entry_node=True) diff --git a/dace/codegen/targets/target.py b/dace/codegen/targets/target.py new file mode 100644 index 0000000000..73e10ba6cc --- /dev/null +++ b/dace/codegen/targets/target.py @@ -0,0 +1,570 @@ +import os +import shutil # which +import dace +from dace import types +from dace.graph import nodes, nxutil + + +class TargetCodeGenerator(object): + """ Interface dictating functions that generate code for: + * Array allocation/deallocation/initialization/copying + * Scope (map, consume) code generation + """ + + def get_generated_codeobjects(self): + """ Returns a list of generated `CodeObject` classes corresponding + to files with generated code. + @see: CodeObject + """ + raise NotImplementedError('Abstract class') + + @property + def has_initializer(self): + """ Returns True if the target generates a `__dace_init_` + function that should be called on initialization. """ + raise NotImplementedError('Abstract class') + + @property + def has_finalizer(self): + """ Returns True if the target generates a `__dace_exit_` + function that should be called on finalization. """ + raise NotImplementedError('Abstract class') + + def generate_state(self, sdfg, state, function_stream, callsite_stream): + """ Generates code for an SDFG state, outputting it to the given + code streams. + @param sdfg: The SDFG to generate code from. + @param state: The SDFGState to generate code from. + @param function_stream: A `CodeIOStream` object that will be + generated outside the calling code, for + use when generating global functions. + @param callsite_stream: A `CodeIOStream` object that points + to the current location (call-site) + in the code. + """ + raise NotImplementedError('Abstract class') + + def generate_scope(self, sdfg, dfg_scope, state_id, function_stream, + callsite_stream): + """ Generates code for an SDFG state scope (from a scope-entry node + to its corresponding scope-exit node), outputting it to the given + code streams. + @param sdfg: The SDFG to generate code from. + @param dfg_scope: The `ScopeSubgraphView` to generate code from. + @param state_id: The node ID of the state in the given SDFG. + @param function_stream: A `CodeIOStream` object that will be + generated outside the calling code, for + use when generating global functions. + @param callsite_stream: A `CodeIOStream` object that points + to the current location (call-site) + in the code. 
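+ Implementations typically emit the scope header (e.g., a loop or a + kernel launch) and then dispatch the scope's contents through the + dispatcher.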
+ """ + raise NotImplementedError('Abstract class') + + def generate_node(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + """ Generates code for a single node, outputting it to the given + code streams. + @param sdfg: The SDFG to generate code from. + @param dfg: The SDFG state to generate code from. + @param state_id: The node ID of the state in the given SDFG. + @param node: The node to generate code from. + @param function_stream: A `CodeIOStream` object that will be + generated outside the calling code, for + use when generating global functions. + @param callsite_stream: A `CodeIOStream` object that points + to the current location (call-site) + in the code. + """ + raise NotImplementedError('Abstract class') + + def allocate_array(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + """ Generates code for allocating an array, outputting to the given + code streams. + @param sdfg: The SDFG to generate code from. + @param dfg: The SDFG state to generate code from. + @param state_id: The node ID of the state in the given SDFG. + @param node: The data node to generate allocation for. + @param function_stream: A `CodeIOStream` object that will be + generated outside the calling code, for + use when generating global functions. + @param callsite_stream: A `CodeIOStream` object that points + to the current location (call-site) + in the code. + """ + raise NotImplementedError('Abstract class') + + def initialize_array(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + """ Generates code for initializing an array, outputting to the given + code streams. + @param sdfg: The SDFG to generate code from. + @param dfg: The SDFG state to generate code from. + @param state_id: The node ID of the state in the given SDFG. + @param node: The data node to generate initialization for. + @param function_stream: A `CodeIOStream` object that will be + generated outside the calling code, for + use when generating global functions. + @param callsite_stream: A `CodeIOStream` object that points + to the current location (call-site) + in the code. + """ + raise NotImplementedError('Abstract class') + + def deallocate_array(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + """ Generates code for deallocating an array, outputting to the given + code streams. + @param sdfg: The SDFG to generate code from. + @param dfg: The SDFG state to generate code from. + @param state_id: The node ID of the state in the given SDFG. + @param node: The data node to generate deallocation for. + @param function_stream: A `CodeIOStream` object that will be + generated outside the calling code, for + use when generating global functions. + @param callsite_stream: A `CodeIOStream` object that points + to the current location (call-site) + in the code. + """ + raise NotImplementedError('Abstract class') + + def copy_memory(self, sdfg, dfg, state_id, src_node, dst_node, edge, + function_stream, callsite_stream): + """ Generates code for copying memory, either from a data access + node (array/stream) to another, a code node (tasklet/nested + SDFG) to another, or a combination of the two. + @param sdfg: The SDFG to generate code from. + @param dfg: The SDFG state to generate code from. + @param state_id: The node ID of the state in the given SDFG. + @param src_node: The source node to generate copy code for. + @param dst_node: The destination node to generate copy code for. 
+ @param edge: The edge representing the copy (in the innermost + scope, adjacent to either the source or destination + node). + @param function_stream: A `CodeIOStream` object that will be + generated outside the calling code, for + use when generating global functions. + @param callsite_stream: A `CodeIOStream` object that points + to the current location (call-site) + in the code. + """ + raise NotImplementedError('Abstract class') + + +class IllegalCopy(TargetCodeGenerator): + """ A code generator that is triggered when invalid copies are specified + by the SDFG. Only raises an exception on failure. """ + + def copy_memory(self, sdfg, dfg, state_id, src_node, dst_node, edge, + function_stream, callsite_stream): + raise TypeError('Illegal copy! (from ' + str(src_node) + ' to ' + + str(dst_node) + ')') + + +class DefinedType(dace.types.AutoNumber): + """ Data types for `DefinedMemlets`. + @see: DefinedMemlets + """ + Pointer = () + ArrayView = () + Scalar = () + ScalarView = () + Stream = () + StreamArray = () + + +class DefinedMemlets: + """ Keeps track of the type of defined memlets to ensure that they are + referenced correctly in nested scopes and SDFGs. """ + + def __init__(self): + self._scopes = [(None, {})] + + def enter_scope(self, parent): + self._scopes.append((parent, {})) + + def exit_scope(self, parent): + expected, _ = self._scopes.pop() + if expected != parent: + raise ValueError( + "Exited scope {} mismatched current scope {}".format( + parent.name, expected.name)) + + def get(self, name): + for _, scope in reversed(self._scopes): + if name in scope: + return scope[name] + raise KeyError("Variable {} has not been defined".format(name)) + + def add(self, name, connector_type): + if not isinstance(name, str): + raise TypeError( + 'Variable name type cannot be %s' % type(name).__name__) + + for _, scope in reversed(self._scopes): + if name in scope: + err_str = "Shadowing variable {} from type {} to {}".format( + name, scope[name], connector_type) + if dace.config.Config.get_bool("compiler", "allow_shadowing"): + print("WARNING: " + err_str) + else: + raise dace.codegen.codegen.CodegenError(err_str) + self._scopes[-1][1][name] = connector_type + + +############################################################################# + + +class TargetDispatcher(object): + """ Dispatches sub-SDFG generation (according to scope), + storage<->storage copies, and storage<->tasklet copies to targets. """ + + def __init__(self): + self._used_targets = set() + + self._array_dispatchers = { + } # Type: types.StorageType -> TargetCodeGenerator + self._map_dispatchers = { + } # Type: types.ScheduleType -> TargetCodeGenerator + self._copy_dispatchers = {} # Type: (types.StorageType src, + # types.StorageType dst, + # types.ScheduleType dst_schedule) + # -> TargetCodeGenerator + self._node_dispatchers = [] # [(predicate, dispatcher)] + self._generic_node_dispatcher = None # Type: TargetCodeGenerator + self._state_dispatchers = [] # [(predicate, dispatcher)] + self._generic_state_dispatcher = None # Type: TargetCodeGenerator + + self._defined_vars = DefinedMemlets() + + @property + def defined_vars(self): + """ Returns a list of defined variables. + @rtype: DefinedMemlets + """ + return self._defined_vars + + @property + def used_targets(self): + """ Returns a list of targets (code generators) that were triggered + during generation. 
""" + return self._used_targets + + def register_state_dispatcher(self, dispatcher, predicate=None): + """ Registers a code generator that processes a single state, calling + `generate_state`. + @param dispatcher: The code generator to use. + @param predicate: A lambda function that accepts the SDFG and + state, and triggers the code generator when True + is returned. If None, registers `dispatcher` + as the default state dispatcher. + @see: TargetCodeGenerator + """ + + if not hasattr(dispatcher, "generate_state"): + raise TypeError("State dispatcher \"{}\" does not " + "implement \"generate_state\"".format(dispatcher)) + if predicate is None: + self._generic_state_dispatcher = dispatcher + else: + self._state_dispatchers.append((predicate, dispatcher)) + + def get_generic_state_dispatcher(self): + """ Returns the default state dispatcher. """ + return self._generic_state_dispatcher + + def get_predicated_state_dispatchers(self): + """ Returns a list of state dispatchers with predicates. """ + return list(self._state_dispatchers) + + def register_node_dispatcher(self, dispatcher, predicate=None): + """ Registers a code generator that processes a single node, calling + `generate_node`. + @param dispatcher: The code generator to use. + @param predicate: A lambda function that accepts the SDFG, state, + and node, and triggers the code generator when + True is returned. If None, registers `dispatcher` + as the default node dispatcher. + @see: TargetCodeGenerator + """ + if not hasattr(dispatcher, "generate_node"): + raise TypeError("Node dispatcher must " + "implement \"generate_node\"") + if predicate is None: + self._generic_node_dispatcher = dispatcher + else: + self._node_dispatchers.append((predicate, dispatcher)) + + def get_generic_node_dispatcher(self): + """ Returns the default node dispatcher. """ + return self._generic_node_dispatcher + + def get_predicated_node_dispatchers(self): + """ Returns a list of node dispatchers with predicates. """ + return list(self._node_dispatchers) + + def register_map_dispatcher(self, schedule_type, func): + """ Registers a function that processes a scope, used when calling + `dispatch_subgraph` and `dispatch_scope`. + @param schedule_type: The scope schedule that triggers `func`. + @param func: A TargetCodeGenerator object that contains an + implementation of `generate_scope`. + @see: TargetCodeGenerator + """ + if isinstance(schedule_type, list): + for stype in schedule_type: + self.register_map_dispatcher(stype, func) + return + + if not isinstance(schedule_type, types.ScheduleType): raise TypeError + if not isinstance(func, TargetCodeGenerator): raise TypeError + if schedule_type in self._map_dispatchers: + raise ValueError('Schedule already mapped to ' + + str(self._map_dispatchers[schedule_type])) + self._map_dispatchers[schedule_type] = func + + def register_array_dispatcher(self, storage_type, func): + """ Registers a function that processes data allocation, + initialization, and deinitialization. Used when calling + `dispatch_allocate/deallocate/initialize`. + @param storage_type: The data storage type that triggers `func`. + @param func: A TargetCodeGenerator object that contains an + implementation of data memory management functions. 
+ @see: TargetCodeGenerator + """ + if isinstance(storage_type, list): + for stype in storage_type: + self.register_array_dispatcher(stype, func) + return + + if not isinstance(storage_type, types.StorageType): raise TypeError + if not isinstance(func, TargetCodeGenerator): raise TypeError + self._array_dispatchers[storage_type] = func + + def register_copy_dispatcher(self, src_storage, dst_storage, dst_schedule, + func): + """ Registers code generation of data-to-data (or data from/to + tasklet, if src/dst storage is StorageType.Register) copy + functions. Can also be target-schedule specific, or + dst_schedule=None if the function will be invoked on any schedule. + @param src_storage: The source data storage type that triggers + `func`. + @param dst_storage: The destination data storage type that + triggers `func`. + @param dst_schedule: An optional destination scope schedule type + that triggers `func`. + @param func: A TargetCodeGenerator object that contains an + implementation of `copy_memory`. + @see: TargetCodeGenerator + """ + + if not isinstance(src_storage, types.StorageType): raise TypeError + if not isinstance(dst_storage, types.StorageType): raise TypeError + if (dst_schedule is not None + and not isinstance(dst_schedule, types.ScheduleType)): + raise TypeError + if not isinstance(func, TargetCodeGenerator): raise TypeError + + self._copy_dispatchers[(src_storage, dst_storage, dst_schedule)] = \ + func + + def dispatch_state(self, sdfg, state, function_stream, callsite_stream): + """ Dispatches a code generator for an SDFG state. """ + + self.defined_vars.enter_scope(state) + # Check if the state satisfies any predicates that delegate to a + # specific code generator + satisfied_dispatchers = [ + dispatcher for pred, dispatcher in self._state_dispatchers + if pred(sdfg, state) is True + ] + num_satisfied = len(satisfied_dispatchers) + if num_satisfied > 1: + raise RuntimeError( + "Multiple predicates satisfied for {}: {}".format( + state, ", ".join( + [type(x).__name__ for x in satisfied_dispatchers]))) + elif num_satisfied == 1: + satisfied_dispatchers[0].generate_state( + sdfg, state, function_stream, callsite_stream) + else: # num_satisfied == 0 + # Otherwise use the generic code generator (CPU) + self._generic_state_dispatcher.generate_state( + sdfg, state, function_stream, callsite_stream) + self.defined_vars.exit_scope(state) + + def dispatch_subgraph(self, + sdfg, + dfg, + state_id, + function_stream, + callsite_stream, + skip_entry_node=False): + """ Dispatches a code generator for a scope subgraph of an + `SDFGState`. 
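+ Nodes are visited in topological order; when a scope entry (map) + node is encountered, the whole scope is handed to dispatch_scope + and its nodes are skipped.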
""" + + start_nodes = list( + v for v in dfg.nodes() if len(list(dfg.predecessors(v))) == 0) + + # Mark nodes to skip in order to be able to skip + nodes_to_skip = set() + + if skip_entry_node: + assert len(start_nodes) == 1 + nodes_to_skip.add(start_nodes[0]) + + for v in nxutil.dfs_topological_sort(dfg, start_nodes): + if v in nodes_to_skip: + continue + + if isinstance(v, nodes.MapEntry): + scope_subgraph = sdfg.find_state(state_id).scope_subgraph(v) + + # Propagate parallelism + if dfg.is_parallel(): + scope_subgraph.set_parallel_parent(dfg.get_parallel_parent) + + assert not dfg.is_parallel() or scope_subgraph.is_parallel() + self.dispatch_scope(v.map.schedule, sdfg, scope_subgraph, + state_id, function_stream, callsite_stream) + + # Skip scope subgraph nodes + #print(scope_subgraph.nodes()) + nodes_to_skip.update(scope_subgraph.nodes()) + else: + self.dispatch_node(sdfg, dfg, state_id, v, function_stream, + callsite_stream) + + def dispatch_node(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + """ Dispatches a code generator for a single node. """ + + # Check if the node satisfies any predicates that delegate to a + # specific code generator + satisfied_dispatchers = [ + dispatcher for pred, dispatcher in self._node_dispatchers + if pred(sdfg, node) + ] + num_satisfied = len(satisfied_dispatchers) + if num_satisfied > 1: + raise RuntimeError( + "Multiple predicates satisfied for {}: {}".format( + node, ", ".join( + [type(x).__name__ for x in satisfied_dispatchers]))) + elif num_satisfied == 1: + self._used_targets.add(satisfied_dispatchers[0]) + satisfied_dispatchers[0].generate_node( + sdfg, dfg, state_id, node, function_stream, callsite_stream) + else: # num_satisfied == 0 + # Otherwise use the generic code generator (CPU) + self._used_targets.add(self._generic_node_dispatcher) + self._generic_node_dispatcher.generate_node( + sdfg, dfg, state_id, node, function_stream, callsite_stream) + + def dispatch_scope(self, map_schedule, sdfg, sub_dfg, state_id, + function_stream, callsite_stream): + """ Dispatches a code generator function for a scope in an SDFG + state. """ + entry_node = sub_dfg.source_nodes()[0] + self.defined_vars.enter_scope(entry_node) + self._used_targets.add(self._map_dispatchers[map_schedule]) + self._map_dispatchers[map_schedule].generate_scope( + sdfg, sub_dfg, state_id, function_stream, callsite_stream) + self.defined_vars.exit_scope(entry_node) + + def dispatch_allocate(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + """ Dispatches a code generator for data allocation. """ + + nodedesc = node.desc(sdfg) + storage = (nodedesc.storage if not isinstance(node, nodes.Tasklet) else + types.StorageType.Register) + self._used_targets.add(self._array_dispatchers[storage]) + + self._array_dispatchers[storage].allocate_array( + sdfg, dfg, state_id, node, function_stream, callsite_stream) + + def dispatch_initialize(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + """ Dispatches a code generator for a data initialization. """ + + nodedesc = node.desc(sdfg) + storage = (nodedesc.storage if not isinstance(node, nodes.Tasklet) else + types.StorageType.Register) + self._used_targets.add(self._array_dispatchers[storage]) + self._array_dispatchers[storage].initialize_array( + sdfg, dfg, state_id, node, function_stream, callsite_stream) + + def dispatch_deallocate(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + """ Dispatches a code generator for a data deallocation. 
""" + + nodedesc = node.desc(sdfg) + storage = (nodedesc.storage if not isinstance(node, nodes.Tasklet) else + types.StorageType.Register) + self._used_targets.add(self._array_dispatchers[storage]) + + self._array_dispatchers[storage].deallocate_array( + sdfg, dfg, state_id, node, function_stream, callsite_stream) + + # Dispatches copy code for a memlet + def dispatch_copy(self, src_node, dst_node, edge, sdfg, dfg, state_id, + function_stream, output_stream): + """ Dispatches a code generator for a memory copy operation. """ + + if isinstance(src_node, nodes.CodeNode): + src_storage = types.StorageType.Register + else: + src_storage = src_node.desc(sdfg).storage + + if isinstance(dst_node, nodes.CodeNode): + dst_storage = types.StorageType.Register + else: + dst_storage = dst_node.desc(sdfg).storage + + if (isinstance(src_node, nodes.Tasklet) + and not isinstance(dst_node, nodes.Tasklet)): + # Special case: Copying from a tasklet to an array, schedule of + # the copy is in the copying tasklet + dst_schedule_node = dfg.scope_dict()[src_node] + else: + dst_schedule_node = dfg.scope_dict()[dst_node] + + if dst_schedule_node is not None: + dst_schedule = dst_schedule_node.map.schedule + else: + dst_schedule = None + + if (src_storage, dst_storage, dst_schedule) in self._copy_dispatchers: + target = self._copy_dispatchers[(src_storage, dst_storage, + dst_schedule)] + self._used_targets.add(target) + target.copy_memory(sdfg, dfg, state_id, src_node, dst_node, edge, + function_stream, output_stream) + elif (src_storage, dst_storage, None) in self._copy_dispatchers: + target = self._copy_dispatchers[(src_storage, dst_storage, None)] + self._used_targets.add(target) + target.copy_memory(sdfg, dfg, state_id, src_node, dst_node, edge, + function_stream, output_stream) + else: + raise RuntimeError('Copy dispatcher for %s->%s with schedule %s' % + (str(src_storage), str(dst_storage), + str(dst_schedule)) + ' not found') + + +def make_absolute(path): + if os.path.isfile(path): + if os.path.isabs(path): + # Path is abolute, we're happy + return path + else: + # Path is relative: make it absolute + return os.path.abspath(path) + else: + # This is not a path, probably just an executable name, such + # as "g++". 
Try to find it on the PATH + executable = shutil.which(path) + if not executable: + raise ValueError("Could not find executable \"{}\"".format(path)) + return executable diff --git a/dace/codegen/targets/xilinx.py b/dace/codegen/targets/xilinx.py new file mode 100644 index 0000000000..a836d07387 --- /dev/null +++ b/dace/codegen/targets/xilinx.py @@ -0,0 +1,1683 @@ +from six import StringIO +import collections +import functools +import os +import itertools +import re +import sympy as sp + +import dace +from dace import subsets +from dace.config import Config +from dace.frontend import operations +from dace.graph import nodes +from dace.sdfg import ScopeSubgraphView, find_input_arraynode, find_output_arraynode +from dace.codegen.codeobject import CodeObject +from dace.codegen.prettycode import CodeIOStream +from dace.codegen.targets.target import (TargetCodeGenerator, IllegalCopy, + make_absolute, DefinedType) +from dace.codegen.targets.cpu import cpp_offset_expr, cpp_array_expr +from dace.codegen.targets import cpu, cuda + +from dace.codegen import cppunparse + +REDUCTION_TYPE_TO_HLSLIB = { + dace.types.ReductionType.Min: "hlslib::op::Min", + dace.types.ReductionType.Max: "hlslib::op::Max", + dace.types.ReductionType.Sum: "hlslib::op::Sum", + dace.types.ReductionType.Product: "hlslib::op::Product", + dace.types.ReductionType.Logical_And: "hlslib::op::And", +} + + +class XilinxCodeGen(TargetCodeGenerator): + """ Xilinx FPGA code generator. """ + target_name = 'xilinx' + title = 'Xilinx' + language = 'hls' + + def __init__(self, frame_codegen, sdfg): + self._in_device_code = False + self._cpu_codegen = None + self._frame = frame_codegen + self._dispatcher = frame_codegen.dispatcher + + self._global_sdfg = sdfg + self._program_name = sdfg.name + + # Verify that we did not miss the allocation of any global arrays, even + # if they're nested deep in the SDFG + self._allocated_global_arrays = set() + self._unrolled_pes = set() + + # Register dispatchers + self._cpu_codegen = self._dispatcher.get_generic_node_dispatcher() + + self._host_codes = [] + self._kernel_codes = [] + + # Register additional Xilinx dispatchers + self._dispatcher.register_map_dispatcher( + [dace.types.ScheduleType.FPGA_Device], self) + + self._dispatcher.register_state_dispatcher( + self, + predicate=lambda sdfg, state: len(state.data_nodes()) > 0 and all([ + n.desc(sdfg).storage in [ + dace.types.StorageType.FPGA_Global, + dace.types.StorageType.FPGA_Local, + dace.types.StorageType.FPGA_Registers] + for n in state.data_nodes()])) + + self._dispatcher.register_node_dispatcher( + self, predicate=lambda *_: self._in_device_code) + + xilinx_storage = [ + dace.types.StorageType.FPGA_Global, + dace.types.StorageType.FPGA_Local, + dace.types.StorageType.FPGA_Registers, + ] + self._dispatcher.register_array_dispatcher(xilinx_storage, self) + + # Register permitted copies + for storage_from in itertools.chain(xilinx_storage, + [dace.types.StorageType.Register]): + for storage_to in itertools.chain( + xilinx_storage, [dace.types.StorageType.Register]): + if (storage_from == dace.types.StorageType.Register + and storage_to == dace.types.StorageType.Register): + continue + self._dispatcher.register_copy_dispatcher( + storage_from, storage_to, None, self) + self._dispatcher.register_copy_dispatcher( + dace.types.StorageType.FPGA_Global, + dace.types.StorageType.CPU_Heap, None, self) + self._dispatcher.register_copy_dispatcher( + dace.types.StorageType.FPGA_Global, + dace.types.StorageType.CPU_Stack, None, self) + 
self._dispatcher.register_copy_dispatcher( + dace.types.StorageType.CPU_Heap, + dace.types.StorageType.FPGA_Global, None, self) + self._dispatcher.register_copy_dispatcher( + dace.types.StorageType.CPU_Stack, + dace.types.StorageType.FPGA_Global, None, self) + + @property + def has_initializer(self): + return True + + @property + def has_finalizer(self): + return False + + @staticmethod + def cmake_options(): + compiler = make_absolute( + Config.get("compiler", "xilinx", "executable")) + host_flags = Config.get("compiler", "xilinx", "host_flags") + synthesis_flags = Config.get("compiler", "xilinx", "synthesis_flags") + build_flags = Config.get("compiler", "xilinx", "build_flags") + mode = Config.get("compiler", "xilinx", "mode") + target_platform = Config.get("compiler", "xilinx", "platform") + enable_debugging = ("ON" + if Config.get_bool("compiler", "xilinx", + "enable_debugging") else "OFF") + options = [ + "-DSDACCEL_ROOT_DIR={}".format( + os.path.dirname(os.path.dirname(compiler))), + "-DDACE_XILINX_HOST_FLAGS=\"{}\"".format(host_flags), + "-DDACE_XILINX_SYNTHESIS_FLAGS=\"{}\"".format(synthesis_flags), + "-DDACE_XILINX_BUILD_FLAGS=\"{}\"".format(build_flags), + "-DDACE_XILINX_MODE={}".format(mode), + "-DDACE_XILINX_TARGET_PLATFORM=\"{}\"".format(target_platform), + "-DDACE_XILINX_ENABLE_DEBUGGING={}".format(enable_debugging), + ] + return options + + def generate_state(self, sdfg, state, function_stream, callsite_stream): + """ Generate a kernel that runs all connected components within a state + as concurrent dataflow modules. """ + + state_id = sdfg.node_id(state) + + # Determine independent components + subgraphs = dace.sdfg.concurrent_subgraphs(state) + + # Generate kernel code + shared_transients = set(sdfg.shared_transients()) + if not self._in_device_code: + # Allocate global memory transients, unless they are shared with + # other states + all_transients = set(state.all_transients()) + allocated = set(shared_transients) + for node in state.data_nodes(): + data = node.desc(sdfg) + if node.data not in all_transients or node.data in allocated: + continue + if data.storage != dace.types.StorageType.FPGA_Global: + continue + allocated.add(node.data) + self._dispatcher.dispatch_allocate(sdfg, state, state_id, node, + function_stream, + callsite_stream) + self._dispatcher.dispatch_initialize(sdfg, state, state_id, + node, function_stream, + callsite_stream) + # Generate kernel code + self.generate_kernel(sdfg, state, state.label, subgraphs, + function_stream, callsite_stream) + else: # self._in_device_code == True + to_allocate = dace.sdfg.local_transients(sdfg, state, None) + allocated = set() + for node in state.data_nodes(): + data = node.desc(sdfg) + if node.data not in to_allocate or node.data in allocated: + continue + # Make sure there are no global transients in the nested state + # that are thus not gonna be allocated + if data.storage == dace.types.StorageType.FPGA_Global: + raise dace.codegen.codegen.CodegenError( + "Cannot allocate global memory from device code.") + allocated.add(data) + # Allocate transients + self._dispatcher.dispatch_allocate(sdfg, state, state_id, node, + function_stream, + callsite_stream) + self._dispatcher.dispatch_initialize(sdfg, state, state_id, + node, function_stream, + callsite_stream) + self.generate_nested_state(sdfg, state, state.label, subgraphs, + function_stream, callsite_stream) + + @staticmethod + def shared_data(subgraphs): + """ Returns a set of data objects that are shared between two or more + of the specified subgraphs. 
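+ For example, a transient that is written by one connected component + and read by another is returned here, so that it can be allocated at + the kernel level instead of inside a single module.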
""" + shared = set() + if len(subgraphs) >= 2: + seen = {} + for sg in subgraphs: + for node in sg: + if isinstance(node, dace.graph.nodes.AccessNode): + if node.data in seen: + if seen[node.data] != sg: + shared.add(node.data) + else: + seen[node.data] = sg + return shared + + @staticmethod + def global_transient_nodes(subgraphs): + """ Generator that returns all transient global arrays nested in the + passed subgraphs on the form (is_output, AccessNode). """ + seen = set() + for subgraph in subgraphs: + for n, scope in subgraph.all_nodes_recursive(): + if (isinstance(n, dace.graph.nodes.AccessNode) + and n.desc(sdfg).transient and n.desc(sdfg).storage == + dace.types.StorageType.FPGA_Global): + if n.data in seen: + continue + seen.add(n.data) + if scope.out_degree(n) > 0: + yield (False, n) + if scope.in_degree(n) > 0: + yield (True, n) + + @staticmethod + def make_parameters(sdfg, state, subgraphs): + """ Determines the parameters that must be passed to the passed list of + subgraphs, as well as to the global kernel. """ + + # Get a set of data nodes that are shared across subgraphs + shared_data = XilinxCodeGen.shared_data(subgraphs) + + # For some reason the array allocation dispatcher takes nodes, not + # arrays. Build a dictionary of arrays to arbitrary data nodes + # referring to them. + data_to_node = {} + + global_data_params = [] + top_level_local_data = [] + subgraph_params = collections.OrderedDict() # {subgraph: [params]} + nested_global_transients = [] + nested_global_transients_seen = set() + for subgraph in subgraphs: + data_to_node.update({ + node.data: node + for node in subgraph.nodes() + if isinstance(node, dace.graph.nodes.AccessNode) + }) + subsdfg = subgraph.parent + candidates = [] # type: List[Tuple[bool,str,Data]] + # [(is an output, dataname string, data object)] + for n in subgraph.source_nodes(): + candidates += [(False, e.data.data, + subsdfg.arrays[e.data.data]) + for e in state.in_edges(n)] + for n in subgraph.sink_nodes(): + candidates += [(True, e.data.data, subsdfg.arrays[e.data.data]) + for e in state.out_edges(n)] + # Find other data nodes that are used internally + for n, scope in subgraph.all_nodes_recursive(): + if isinstance(n, dace.graph.nodes.AccessNode): + # Add nodes if they are outer-level, or an inner-level + # transient (inner-level inputs/outputs are just connected + # to data in the outer layers, whereas transients can be + # independent). + if scope == subgraph or n.desc(scope).transient: + if scope.out_degree(n) > 0: + candidates.append((False, n.data, n.desc(scope))) + if scope.in_degree(n) > 0: + candidates.append((True, n.data, n.desc(scope))) + if scope != subgraph: + if (isinstance(n.desc(scope), dace.data.Array) + and n.desc(scope).storage == + dace.types.StorageType.FPGA_Global and + n.data not in nested_global_transients_seen + ): + nested_global_transients.append(n) + nested_global_transients_seen.add(n.data) + subgraph_params[subgraph] = [] + # Differentiate global and local arrays. The former are allocated + # from the host and passed to the device code, while the latter are + # (statically) allocated on the device side. 
+ for is_output, dataname, data in candidates: + if (isinstance(data, dace.data.Array) + or isinstance(data, dace.data.Scalar) + or isinstance(data, dace.data.Stream)): + if data.storage == dace.types.StorageType.FPGA_Global: + subgraph_params[subgraph].append((is_output, dataname, + data)) + if is_output: + global_data_params.append((is_output, dataname, + data)) + else: + global_data_params.append((is_output, dataname, + data)) + elif (data.storage == dace.types.StorageType.FPGA_Local or + data.storage == dace.types.StorageType.FPGA_Registers + ): + if dataname in shared_data: + # Only transients shared across multiple components + # need to be allocated outside and passed as + # parameters + subgraph_params[subgraph].append((is_output, + dataname, data)) + # Resolve the data to some corresponding node to be + # passed to the allocator + top_level_local_data.append(dataname) + else: + raise ValueError("Unsupported storage type: {}".format( + data.storage)) + else: + raise TypeError("Unsupported data type: {}".format( + type(data).__name__)) + subgraph_params[subgraph] = dace.types.deduplicate( + subgraph_params[subgraph]) + + # Deduplicate + global_data_params = dace.types.deduplicate(global_data_params) + top_level_local_data = dace.types.deduplicate(top_level_local_data) + top_level_local_data = [data_to_node[n] for n in top_level_local_data] + + # Get scalar parameters + scalar_parameters = sdfg.scalar_parameters(False) + symbol_parameters = sdfg.undefined_symbols(False) + + return (global_data_params, top_level_local_data, subgraph_params, + scalar_parameters, symbol_parameters, nested_global_transients) + + def generate_nested_state(self, sdfg, state, nest_name, subgraphs, + function_stream, callsite_stream): + + for sg in subgraphs: + + self._dispatcher.dispatch_subgraph( + sdfg, + sg, + sdfg.node_id(state), + function_stream, + callsite_stream, + skip_entry_node=False) + + @staticmethod + def detect_memory_widths(subgraphs): + stack = [] + for sg in subgraphs: + stack += [(n, sg) for n in sg.nodes()] + memory_widths = {} + seen = set() + while len(stack) > 0: + node, graph = stack.pop() + if isinstance(node, dace.graph.nodes.NestedSDFG): + for state in node.sdfg.states(): + stack += [(n, state) for n in state.nodes()] + elif isinstance(node, dace.graph.nodes.AccessNode): + if node in seen: + continue + seen.add(node) + nodedesc = node.desc(graph) + for edge in graph.all_edges(node): + if (isinstance(edge.data, dace.memlet.EmptyMemlet) + or edge.data.data is None): + continue + if node.data not in memory_widths: + if (isinstance(nodedesc, dace.data.Stream) + and nodedesc.veclen != edge.data.veclen): + raise ValueError( + "Vector length on memlet {} ({}) doesn't " + "match vector length of {} ({})".format( + edge.data, edge.data.veclen, node.data, + nodedesc.veclen)) + memory_widths[node.data] = edge.data.veclen + else: + if memory_widths[node.data] != edge.data.veclen: + raise dace.codegen.codegen.CodegenError( + "Inconsistent vector length " + "on FPGA for \"{}\": got {}, had {}".format( + node.data, edge.data.veclen, + memory_widths[node.data])) + return memory_widths + + def generate_kernel(self, sdfg, state, kernel_name, subgraphs, + function_stream, callsite_stream): + + state_id = sdfg.node_id(state) + + (global_data_params, top_level_local_data, subgraph_params, + scalar_parameters, symbol_parameters, + nested_global_transients) = type(self).make_parameters( + sdfg, state, subgraphs) + + # Scalar parameters are never output + sc_parameters = [(False, pname, param) + for 
pname, param in scalar_parameters] + + symbol_params = [ + v.signature(with_types=True, name=k) + for k, v in symbol_parameters.items() + ] + + # Inspect the vector length of all memlets leading to each memory, to + # make sure that they're consistent, and to allow us to instantiate the + # memories as vector types to enable HLS to generate wider data paths. + # Since we cannot pass this auxiliary data structure to the allocator, + # which is called by the dispatcher, we temporarily store it in the + # codegen object. + self._memory_widths = XilinxCodeGen.detect_memory_widths(subgraphs) + + # Write host code + self.generate_host_code(sdfg, state, kernel_name, + global_data_params + sc_parameters, + symbol_parameters, nested_global_transients, + function_stream, callsite_stream) + if self._in_device_code: + raise CodegenError("Tried to generate kernel from device code") + self._in_device_code = True + self._cpu_codegen._packed_types = True + + # Now we write the device code + module_stream = CodeIOStream() + kernel_stream = CodeIOStream() + + # Write header + module_stream.write("#include \n\n", sdfg) + self._frame.generate_fileheader(sdfg, module_stream) + module_stream.write("\n", sdfg) + + # Build kernel signature + kernel_args = [] + for is_output, dataname, data in global_data_params: + if isinstance(data, dace.data.Array): + kernel_args.append("dace::vec<{}, {}> *{}_{}".format( + data.dtype.ctype, self._memory_widths[dataname], dataname, + "out" if is_output else "in")) + else: + kernel_args.append( + data.signature(with_types=True, name=dataname)) + kernel_args += ([ + arg.signature(with_types=True, name=argname) + for _, argname, arg in scalar_parameters + ] + symbol_params) + + # Write kernel signature + kernel_stream.write( + "DACE_EXPORTED void {}({}) {{\n".format( + kernel_name, ', '.join(kernel_args)), sdfg, state_id) + + # Insert interface pragmas + mapped_args = 0 + for arg in kernel_args: + var_name = re.findall("\w+", arg)[-1] + if "*" in arg: + kernel_stream.write( + "#pragma HLS INTERFACE m_axi port={} " + "offset=slave bundle=gmem{}".format(var_name, mapped_args), + sdfg, state_id) + mapped_args += 1 + + for arg in kernel_args + ["return"]: + var_name = re.findall("\w+", arg)[-1] + kernel_stream.write( + "#pragma HLS INTERFACE s_axilite port={} bundle=control". 
+ format(var_name)) + + # TODO: add special case if there's only one module for niceness + kernel_stream.write("\n#pragma HLS DATAFLOW") + kernel_stream.write("\nHLSLIB_DATAFLOW_INIT();") + + # Actual kernel code generation + self.generate_modules(sdfg, state, kernel_name, subgraphs, + subgraph_params, sc_parameters, + symbol_parameters, top_level_local_data, + function_stream, module_stream, kernel_stream) + + kernel_stream.write("HLSLIB_DATAFLOW_FINALIZE();\n}\n") + self._in_device_code = False + self._cpu_codegen._packed_types = False + + concatenated_code = ( + module_stream.getvalue() + kernel_stream.getvalue()) + + # Store code strings to be passed to compilation phase + self._kernel_codes.append((kernel_name, concatenated_code)) + + # Delete the field we've used to pass this dictionary to the memory + # allocator + del self._memory_widths + self._allocated_global_arrays = set() + + def generate_modules(self, sdfg, state, kernel_name, subgraphs, params, + scalar_parameters, symbol_parameters, + top_level_local_data, function_stream, module_stream, + kernel_stream): + + # Emit allocations + state_id = sdfg.node_id(state) + for node in top_level_local_data: + self._dispatcher.dispatch_allocate(sdfg, state, state_id, node, + module_stream, kernel_stream) + self._dispatcher.dispatch_initialize(sdfg, state, state_id, node, + module_stream, kernel_stream) + + # Module generation + for subgraph in subgraphs: + # Traverse to find first tasklets reachable in topological order + to_traverse = subgraph.source_nodes() + seen = set() + while len(to_traverse) > 0: + n = to_traverse.pop() + if n in seen: + continue + seen.add(n) + if (not isinstance(n, dace.graph.nodes.Tasklet) + and not isinstance(n, dace.graph.nodes.NestedSDFG)): + for e in subgraph.out_edges(n): + if e.dst not in seen: + to_traverse.append(e.dst) + # Name module according to all reached tasklets (can be just one) + labels = [ + n.label.replace(" ", "_") for n in seen + if isinstance(n, dace.graph.nodes.Tasklet) + or isinstance(n, dace.graph.nodes.NestedSDFG) + ] + if len(labels) == 0: + labels = [ + n.label.replace(" ", "_") for n in seen + if isinstance(n, dace.graph.nodes.AccessNode) + ] + if len(labels) == 0: + raise RuntimeError( + "Expected at least one tasklet or data node") + module_name = "_".join(labels) + self.generate_module(sdfg, state, module_name, subgraph, + params[subgraph] + scalar_parameters, + symbol_parameters, function_stream, + module_stream, kernel_stream) + + def generate_scope(self, sdfg, dfg_scope, state_id, function_stream, + callsite_stream): + + if not self._in_device_code: + # If we're not already generating kernel code we need to set up the + # kernel launch + subgraphs = [dfg_scope] + return self.generate_kernel( + sdfg, sdfg.find_state(state_id), + dfg_scope.source_nodes()[0].map.label.replace(" ", "_"), + subgraphs, function_stream, callsite_stream) + + self.generate_node(sdfg, dfg_scope, state_id, + dfg_scope.source_nodes()[0], function_stream, + callsite_stream) + + self._dispatcher.dispatch_subgraph( + sdfg, + dfg_scope, + state_id, + function_stream, + callsite_stream, + skip_entry_node=True) + + def generate_host_code(self, sdfg, state, kernel_name, params, + symbol_parameters, nested_global_transients, + function_stream, callsite_stream): + + state_id = sdfg.node_id(state) + + # We exclude nested transients from the CPU code function call, as they + # have not yet been allocated at this point + nested_transient_set = {n.data for n in nested_global_transients} + + symbol_sigs = [ + 
v.signature(with_types=True, name=k) + for k, v in symbol_parameters.items() + ] + symbol_names = list(symbol_parameters.keys()) + seen = set(nested_transient_set) + kernel_args_call_wrapper = [] + kernel_args_call_host = [] + for is_output, pname, p in params: + kernel_args_call_wrapper.append(p.signature(False, name=pname)) + # Only pass each array once from the host code + if p in seen: + continue + seen.add(p) + kernel_args_call_host.append(p.signature(False, name=pname)) + kernel_args_call_wrapper += symbol_names + kernel_args_call_host += symbol_names + kernel_args_opencl = (XilinxCodeGen.sdaccel_params( + sdfg, [p for p in params + if p[1] not in nested_transient_set]) + symbol_sigs) + kernel_args_hls = [] + kernel_args_hls_without_vectorization = [] + for is_output, argname, arg in params: + if isinstance(arg, dace.data.Array): + kernel_args_hls.append("dace::vec<{}, {}> *{}_{}".format( + arg.dtype.ctype, self._memory_widths[argname], argname, + "out" if is_output else "in")) + kernel_args_hls_without_vectorization.append( + "{} *{}_{}".format(arg.dtype.ctype, argname, "out" + if is_output else "in")) + else: + kernel_args_hls.append( + arg.signature(with_types=True, name=argname)) + kernel_args_hls_without_vectorization.append( + arg.signature(with_types=True, name=argname)) + kernel_args_hls += symbol_sigs + kernel_args_hls_without_vectorization += symbol_sigs + + kernel_function_name = kernel_name + + #---------------------------------------------------------------------- + # Generate OpenCL host-code + #---------------------------------------------------------------------- + + kernel_file_name = "{}.xclbin".format(kernel_name) + host_function_name = "__dace_runkernel_{}".format(kernel_name) + + # Write OpenCL host function + code = CodeIOStream() + code.write("""\ +// Signature of kernel function (with raw pointers) for argument matching +DACE_EXPORTED void {kernel_function_name}({kernel_args_hls_novec}); + +DACE_EXPORTED void {host_function_name}({kernel_args_opencl}) {{""".format( + kernel_function_name=kernel_function_name, + kernel_args_hls_novec=", ".join( + kernel_args_hls_without_vectorization), + host_function_name=host_function_name, + kernel_args_opencl=", ".join(kernel_args_opencl))) + + # Any extra transients stored in global memory on the FPGA must now be + # allocated and passed to the kernel + for arr_node in nested_global_transients: + self._dispatcher.dispatch_allocate(sdfg, state, None, arr_node, + None, code) + self._dispatcher.dispatch_initialize(sdfg, state, None, arr_node, + None, code) + + code.write("""\ + hlslib::ocl::Program program = + hlslib::ocl::GlobalContext().CurrentlyLoadedProgram(); + auto kernel = program.MakeKernel({kernel_function_name}, "{kernel_function_name}", {kernel_args}); + const std::pair elapsed = kernel.ExecuteTask(); + std::cout << "Kernel executed in " << elapsed.second << " seconds.\\n" << std::flush; +}}""".format( + kernel_function_name=kernel_function_name, + kernel_args=", ".join(kernel_args_call_wrapper))) + + # Store code to be passed to compilation phase + self._host_codes.append((kernel_name, code.getvalue())) + + #---------------------------------------------------------------------- + # Inject header for OpenCL host code in the calling code file + #---------------------------------------------------------------------- + + host_declaration = "\n\nDACE_EXPORTED void {}({});\n\n".format( + host_function_name, ", ".join(kernel_args_opencl)) + function_stream.write(host_declaration, sdfg, state_id, None) + + 
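# Note: the call emitted below has the form + # __dace_runkernel_<kernel_name>(arg0, ..., symbols...); each global + # array is passed only once even if it is both read and written. +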
#---------------------------------------------------------------------- + # Call the OpenCL host function from the callsite + #---------------------------------------------------------------------- + + callsite_stream.write( + "{}({});".format(host_function_name, + ", ".join(kernel_args_call_host)), sdfg, state_id, + None) + + +# Unused? +# def generate_caller_code(self, sdfg, state, kernel_name, params, +# symbol_parameters, function_stream, +# callsite_stream): +# +# state_id = sdfg.node_id(state) +# +# symbol_sigs = [v.ctype + ' ' + k for k, v in symbol_parameters.items()] +# symbol_names = symbol_parameters.keys() +# kernel_args_call = [p.signature(False) for p in params] + symbol_names +# kernel_args_plain = [i.signature() for i in params] + symbol_sigs +# +# kernel_function_name = kernel_name +# +# callsite_stream.write( +# "{}({});".format(kernel_function_name, +# ", ".join(kernel_args_call)), sdfg, state_id, +# None) + + def generate_module(self, sdfg, state, name, subgraph, params, + symbol_parameters, function_stream, module_stream, + kernel_stream): + """Generates a module that will run as a dataflow function in the FPGA + kernel.""" + + state_id = sdfg.node_id(state) + dfg = sdfg.nodes()[state_id] + + symbol_sigs = [ + v.signature(with_types=True, name=k) + for k, v in symbol_parameters.items() + ] + symbol_names = list(symbol_parameters.keys()) + kernel_args_call = [] + kernel_args_module = [] + added = set() + for is_output, pname, p in params: + if isinstance(p, dace.data.Array): + arr_name = "{}_{}".format(pname, "out" if is_output else "in") + kernel_args_call.append(arr_name) + kernel_args_module.append("dace::vec<{}, {}> {}*{}".format( + p.dtype.ctype, self._memory_widths[pname], "const " + if not is_output else "", arr_name)) + else: + # Don't make duplicate arguments for other types than arrays + if pname in added: + continue + added.add(pname) + if isinstance(p, dace.data.Stream): + kernel_args_call.append( + p.signature(with_types=False, name=pname)) + if p.is_stream_array(): + kernel_args_module.append( + "dace::FIFO<{}, {}, {}> {}[{}]".format( + p.dtype.ctype, p.veclen, p.buffer_size, pname, + p.size_string())) + else: + kernel_args_module.append( + "dace::FIFO<{}, {}, {}> &{}".format( + p.dtype.ctype, p.veclen, p.buffer_size, pname)) + else: + kernel_args_call.append( + p.signature(with_types=False, name=pname)) + kernel_args_module.append( + p.signature(with_types=True, name=pname)) + kernel_args_call += symbol_names + kernel_args_module += symbol_sigs + + module_function_name = "module_" + name + + # Unrolling processing elements: if the first scope of the subgraph + # is an unrolled map, generate a processing element for each iteration + scope_dict = subgraph.scope_dict(node_to_children=True) + top_scopes = [ + n for n in scope_dict[None] + if isinstance(n, dace.graph.nodes.EntryNode) + ] + unrolled_loops = 0 + if len(top_scopes) == 1: + scope = top_scopes[0] + if scope.unroll: + self._unrolled_pes.add(scope.map) + kernel_args_call += ", ".join(scope.map.params) + kernel_args_module += ["int " + p for p in scope.params] + for p, r in zip(scope.map.params, scope.map.range): + if len(r) > 3: + raise dace.codegen.codegen.CodegenError( + "Strided unroll not supported") + kernel_stream.write( + "for (int {param} = {begin}; {param} < {end}; " + "{param} += {increment}) {{\n#pragma HLS UNROLL". 
+ format( + param=p, begin=r[0], end=r[1] + 1, increment=r[2])) + unrolled_loops += 1 + + # Generate caller code in top-level function + kernel_stream.write( + "HLSLIB_DATAFLOW_FUNCTION({}, {});".format( + module_function_name, ", ".join(kernel_args_call)), sdfg, + state_id) + + for _ in range(unrolled_loops): + kernel_stream.write("}") + + #---------------------------------------------------------------------- + # Generate kernel code + #---------------------------------------------------------------------- + + self._dispatcher.defined_vars.enter_scope(subgraph) + + module_body_stream = CodeIOStream() + + module_body_stream.write( + "void {}({}) {{".format(module_function_name, + ", ".join(kernel_args_module)), sdfg, + state_id) + + # Construct ArrayInterface wrappers to pack input and output pointers + # to the same global array + in_args = { + argname + for out, argname, arg in params + if isinstance(arg, dace.data.Array) + and arg.storage == dace.types.StorageType.FPGA_Global and not out + } + out_args = { + argname + for out, argname, arg in params + if isinstance(arg, dace.data.Array) + and arg.storage == dace.types.StorageType.FPGA_Global and out + } + if len(in_args) > 0 or len(out_args) > 0: + # Add ArrayInterface objects to wrap input and output pointers to + # the same array + module_body_stream.write("\n") + interfaces_added = set() + for _, argname, arg in params: + if argname in interfaces_added: + continue + interfaces_added.add(argname) + has_in_ptr = argname in in_args + has_out_ptr = argname in out_args + if not has_in_ptr and not has_out_ptr: + continue + in_ptr = ("{}_in".format(argname) if has_in_ptr else "nullptr") + out_ptr = ("{}_out".format(argname) + if has_out_ptr else "nullptr") + module_body_stream.write( + "dace::ArrayInterface<{}, {}> {}({}, {});".format( + arg.dtype.ctype, self._memory_widths[argname], argname, + in_ptr, out_ptr)) + module_body_stream.write("\n") + + # Allocate local transients + data_to_allocate = (set(subgraph.top_level_transients()) - set( + sdfg.shared_transients()) - set([p[1] for p in params])) + allocated = set() + for node in subgraph.nodes(): + if not isinstance(node, nodes.AccessNode): + continue + if node.data not in data_to_allocate or node.data in allocated: + continue + allocated.add(node.data) + self._dispatcher.dispatch_allocate(sdfg, state, state_id, node, + function_stream, + module_body_stream) + self._dispatcher.dispatch_initialize(sdfg, state, state_id, node, + function_stream, + module_body_stream) + + self._dispatcher.dispatch_subgraph( + sdfg, + subgraph, + state_id, + module_stream, + module_body_stream, + skip_entry_node=False) + + module_stream.write(module_body_stream.getvalue(), sdfg, state_id) + module_stream.write("}\n\n") + + self._dispatcher.defined_vars.exit_scope(subgraph) + + def get_generated_codeobjects(self): + + execution_mode = Config.get("compiler", "xilinx", "mode") + sdaccel_dir = os.path.dirname( + os.path.dirname( + make_absolute(Config.get("compiler", "xilinx", "executable")))) + sdaccel_platform = Config.get("compiler", "xilinx", "platform") + + kernel_file_name = "DACE_BINARY_DIR \"{}".format(self._program_name) + if execution_mode == "software_emulation": + kernel_file_name += "_sw_emu.xclbin\"" + xcl_emulation_mode = "sw_emu" + xilinx_sdx = sdaccel_dir + elif execution_mode == "hardware_emulation": + kernel_file_name += "_hw_emu.xclbin\"" + xcl_emulation_mode = "sw_emu" + xilinx_sdx = sdaccel_dir + elif execution_mode == "hardware" or execution_mode == "simulation": + kernel_file_name += 
"_hw.xclbin\"" + xcl_emulation_mode = None + xilinx_sdx = None + else: + raise dace.codegen.codegen.CodegenError( + "Unknown Xilinx execution mode: {}".format(execution_mode)) + + set_env_vars = "" + set_str = "dace::set_environment_variable(\"{}\", \"{}\");\n" + unset_str = "dace::unset_environment_variable(\"{}\");\n" + set_env_vars += (set_str.format("XCL_EMULATION_MODE", + xcl_emulation_mode) + if xcl_emulation_mode is not None else + unset_str.format("XCL_EMULATION_MODE")) + set_env_vars += (set_str.format("XILINX_SDX", xilinx_sdx) + if xilinx_sdx is not None else + unset_str.format("XILINX_SDX")) + + host_code = CodeIOStream() + host_code.write("""\ +#include "dace/xilinx/host.h" +#include "dace/dace.h" +#include \n\n""") + + self._frame.generate_fileheader(self._global_sdfg, host_code) + + host_code.write(""" +DACE_EXPORTED int __dace_init_xilinx({signature}) {{ + {environment_variables} + hlslib::ocl::GlobalContext().MakeProgram({kernel_file_name}); + return 0; +}} + +{host_code}""".format( + signature=self._global_sdfg.signature(), + environment_variables=set_env_vars, + kernel_file_name=kernel_file_name, + host_code="".join([ + "{separator}\n// Kernel: {kernel_name}" + "\n{separator}\n\n{code}\n\n".format( + separator="/" * 79, kernel_name=name, code=code) + for (name, code) in self._host_codes + ]))) + + host_code_obj = CodeObject(self._program_name + "_host", + host_code.getvalue(), "cpp", XilinxCodeGen, + "Xilinx") + + kernel_code_objs = [ + CodeObject("kernel_" + kernel_name, code, "cpp", XilinxCodeGen, + "Xilinx") for (kernel_name, code) in self._kernel_codes + ] + + return [host_code_obj] + kernel_code_objs + + def allocate_array(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + result = StringIO() + nodedesc = node.desc(sdfg) + arrsize = " * ".join([ + cppunparse.pyexpr2cpp(dace.symbolic.symstr(s)) + for s in nodedesc.strides + ]) + is_dynamically_sized = any( + dace.symbolic.issymbolic(s, sdfg.constants) + for s in nodedesc.strides) + + dataname = node.data + + if isinstance(nodedesc, dace.data.Stream): + + if not self._in_device_code: + raise dace.codegen.codegen.CodegenError( + "Cannot allocate FIFO from CPU code: {}".format(node.data)) + + if is_dynamically_sized: + raise dace.codegen.codegen.CodegenError( + "Arrays of streams cannot have dynamic size on FPGA") + + if nodedesc.buffer_size < 1: + raise dace.codegen.codegen.CodegenError( + "Streams cannot be unbounded on FPGA") + + buffer_length_dynamically_sized = ( + isinstance(nodedesc.buffer_size, sp.Expr) + and len(nodedesc.free_symbols) > 0) + + if buffer_length_dynamically_sized: + raise dace.codegen.codegen.CodegenError( + "Buffer length of stream cannot have dynamic size on FPGA") + + if arrsize != "1": + is_stream_array = True + else: + is_stream_array = False + + if is_stream_array: + result.write("dace::FIFO<{}, {}, {}> {}[{}];\n".format( + nodedesc.dtype.ctype, nodedesc.veclen, + nodedesc.buffer_size, dataname, arrsize)) + result.write("dace::SetNames({}, \"{}\", {});".format( + dataname, dataname, arrsize)) + self._dispatcher.defined_vars.add(dataname, + DefinedType.StreamArray) + else: + result.write("dace::FIFO<{}, {}, {}> {}(\"{}\");".format( + nodedesc.dtype.ctype, nodedesc.veclen, + nodedesc.buffer_size, dataname, dataname)) + self._dispatcher.defined_vars.add(dataname, DefinedType.Stream) + + elif isinstance(nodedesc, dace.data.Array): + + if nodedesc.storage == dace.types.StorageType.FPGA_Global: + + if self._in_device_code: + + if nodedesc not in self._allocated_global_arrays: 
+ raise RuntimeError("Cannot allocate global array " + "from device code: {} in {}".format( + node.label, sdfg.name)) + + else: + + devptr_name = dataname + if isinstance(nodedesc, dace.data.Array): + # TODO: Distinguish between read, write, and + # read+write + # TODO: Handle memory banks (location?) + self._allocated_global_arrays.add(node.data) + result.write( + "auto {} = hlslib::ocl::GlobalContext()." + "MakeBuffer<{}, hlslib::ocl::Access::readWrite>" + "({});".format(dataname, nodedesc.dtype.ctype, + arrsize)) + self._dispatcher.defined_vars.add( + dataname, DefinedType.Pointer) + + elif (nodedesc.storage == dace.types.StorageType.FPGA_Local or + nodedesc.storage == dace.types.StorageType.FPGA_Registers): + + if not self._in_device_code: + raise dace.codegen.codegen.CodegenError( + "Tried to allocate local FPGA memory " + "outside device code: {}".format(dataname)) + if is_dynamically_sized: + raise ValueError( + "Dynamic allocation of FPGA fast memory not allowed") + + # Absorb vector size into type and adjust array size + # accordingly + veclen = self._memory_widths[node.data] + generate_scalar = False + if veclen > 1: + arrsize_symbolic = functools.reduce( + sp.mul.Mul, nodedesc.strides) + arrsize_eval = dace.symbolic.eval( + arrsize_symbolic / veclen) + if cpu.sym2cpp(arrsize_eval) == "1": + generate_scalar = True + arrsize_vec = "({}) / {}".format(arrsize, veclen) + else: + arrsize_vec = arrsize + + # If the array degenerates to a single element because of + # vectorization, generate the variable as a scalar instead of + # an array of size 1 + if generate_scalar: + result.write("dace::vec<{}, {}> {};\n".format( + nodedesc.dtype.ctype, veclen, dataname)) + self._dispatcher.defined_vars.add(dataname, + DefinedType.Scalar) + else: + result.write("dace::vec<{}, {}> {}[{}];\n".format( + nodedesc.dtype.ctype, veclen, dataname, arrsize_vec)) + self._dispatcher.defined_vars.add(dataname, + DefinedType.Pointer) + if nodedesc.storage == dace.types.StorageType.FPGA_Registers: + result.write("#pragma HLS ARRAY_PARTITION variable={} " + "complete\n".format(dataname)) + elif len(nodedesc.shape) > 1: + result.write("#pragma HLS ARRAY_PARTITION variable={} " + "block factor={}\n".format( + dataname, nodedesc.shape[-2])) + # result.write( + # "#pragma HLS DEPENDENCE variable={} false".format( + # dataname)) + + else: + raise NotImplementedError("Xilinx: Unimplemented storage type " + + str(nodedesc.storage)) + + elif isinstance(nodedesc, dace.data.Scalar): + + result.write("{} {};\n".format(nodedesc.dtype.ctype, dataname)) + self._dispatcher.defined_vars.add(dataname, DefinedType.Scalar) + + else: + raise TypeError("Unhandled data type: {}".format( + type(nodedesc).__name__)) + + callsite_stream.write(result.getvalue(), sdfg, state_id, node) + + def deallocate_array(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + pass # Handled by destructor + + def _emit_copy(self, sdfg, state_id, src_node, src_storage, dst_node, + dst_storage, dst_schedule, edge, dfg, callsite_stream): + + u, v, memlet = edge.src, edge.dst, edge.data + + cpu_storage_types = [ + dace.types.StorageType.CPU_Heap, dace.types.StorageType.CPU_Stack, + dace.types.StorageType.CPU_Pinned + ] + fpga_storage_types = [ + dace.types.StorageType.FPGA_Global, + dace.types.StorageType.FPGA_Local, + dace.types.StorageType.FPGA_Registers, + ] + + # Determine directionality + if isinstance(src_node, + nodes.AccessNode) and memlet.data == src_node.data: + outgoing_memlet = True + elif isinstance(dst_node, + 
nodes.AccessNode) and memlet.data == dst_node.data: + outgoing_memlet = False + else: + raise LookupError("Memlet does not point to any of the nodes") + + data_to_data = (isinstance(src_node, nodes.AccessNode) + and isinstance(dst_node, nodes.AccessNode)) + + host_to_device = (data_to_data and src_storage in cpu_storage_types and + dst_storage == dace.types.StorageType.FPGA_Global) + device_to_host = (data_to_data + and src_storage == dace.types.StorageType.FPGA_Global + and dst_storage in cpu_storage_types) + device_to_device = ( + data_to_data and src_storage == dace.types.StorageType.FPGA_Global + and dst_storage == dace.types.StorageType.FPGA_Global) + + if (host_to_device or device_to_host) and self._in_device_code: + raise RuntimeError( + "Cannot copy between host and device from device") + + if (host_to_device or device_to_host + or (device_to_device and not self._in_device_code)): + + dims = memlet.subset.dims() + copy_shape = memlet.subset.bounding_box_size() + copysize = ' * '.join([ + cppunparse.pyexpr2cpp(dace.symbolic.symstr(s)) + for s in copy_shape + ]) + offset = cpp_array_expr(sdfg, memlet, with_brackets=False) + + if (not sum(copy_shape) == 1 + and (not isinstance(memlet.subset, subsets.Range) + or any([step != 1 for _, _, step in memlet.subset]))): + raise NotImplementedError("Only contiguous copies currently " + "supported for Xilinx FPGA.") + + if host_to_device: + + callsite_stream.write( + "{}.CopyFromHost({}, {}, {});".format( + dst_node.data, (offset if not outgoing_memlet else 0), + copysize, + src_node.data + (" + {}".format(offset) + if outgoing_memlet else "")), sdfg, + state_id, [src_node, dst_node]) + + elif device_to_host: + + callsite_stream.write( + "{}.CopyToHost({}, {}, {});".format( + src_node.data, (offset + if outgoing_memlet else 0), copysize, + dst_node.data + (" + {}".format(offset) + if not outgoing_memlet else "")), + sdfg, state_id, [src_node, dst_node]) + + elif device_to_device: + + callsite_stream.write( + "{}.CopyToDevice({}, {}, {}, {});".format( + src_node.data, (offset + if outgoing_memlet else 0), copysize, + dst_node.data, (offset if not outgoing_memlet else 0)), + sdfg, state_id, [src_node, dst_node]) + + # Reject copying to/from local memory from/to outside the FPGA + elif (data_to_data + and (((src_storage == dace.types.StorageType.FPGA_Local + or src_storage == dace.types.StorageType.FPGA_Registers) + and dst_storage not in fpga_storage_types) or + ((dst_storage == dace.types.StorageType.FPGA_Local + or dst_storage == dace.types.StorageType.FPGA_Registers) + and src_storage not in fpga_storage_types))): + raise NotImplementedError( + "Copies between host memory and FPGA " + "local memory not supported: from {} to {}".format( + src_node, dst_node)) + + elif data_to_data: + + if memlet.wcr is not None: + raise NotImplementedError("WCR not implemented for copy edges") + + # Try to turn into degenerate/strided ND copies + copy_shape, src_strides, dst_strides, src_expr, dst_expr = ( + self._cpu_codegen.memlet_copy_to_absolute_strides( + sdfg, memlet, src_node, dst_node, packed_types=True)) + + ctype = src_node.desc(sdfg).dtype.ctype + + # TODO: detect in which cases we shouldn't unroll + register_to_register = (src_node.desc( + sdfg).storage == dace.types.StorageType.FPGA_Registers + or dst_node.desc(sdfg).storage == + dace.types.StorageType.FPGA_Registers) + + # Loop intro + num_loops = 0 + for i, copy_dim in enumerate(copy_shape): + if copy_dim != 1: + callsite_stream.write( + "for (auto __dace_copy{} = 0; __dace_copy{} < {}; " + 
"++__dace_copy{}) {{".format(i, i, copy_dim, i)) + if register_to_register: + callsite_stream.write("#pragma HLS UNROLL") + num_loops += 1 + + # Pragmas + if num_loops > 0: + if not register_to_register: + callsite_stream.write("#pragma HLS PIPELINE II=1") + if len(copy_shape) > 1: + callsite_stream.write("#pragma HLS LOOP_FLATTEN") + + # Construct indices (if the length of the stride array is zero, + # resolves to an empty string) + src_index = " + ".join(([""] if len(dst_strides) > 0 else []) + [ + "__dace_copy{} * {}".format(i, cpu.sym2cpp(stride)) + for i, stride in enumerate(src_strides) if copy_shape[i] != 1 + ]) + dst_index = " + ".join(([""] if len(dst_strides) > 0 else []) + [ + "__dace_copy{} * {}".format(i, cpu.sym2cpp(stride)) + for i, stride in enumerate(dst_strides) if copy_shape[i] != 1 + ]) + + src_def_type = self._dispatcher.defined_vars.get(src_node.data) + dst_def_type = self._dispatcher.defined_vars.get(dst_node.data) + + if src_def_type == DefinedType.Stream: + read_expr = src_expr + elif src_def_type == DefinedType.Scalar: + read_expr = src_node.label + else: + read_expr = "dace::Read<{}, {}>({}{})".format( + ctype, memlet.veclen, src_expr, src_index) + + if dst_def_type == DefinedType.Stream: + callsite_stream.write("{}.push({});".format( + dst_expr, read_expr)) + else: + if dst_def_type == DefinedType.Scalar: + write_expr = dst_node.label + callsite_stream.write("dace::Write<{}, {}>({}{}, {});".format( + ctype, memlet.veclen, dst_expr, dst_index, read_expr)) + + # Inject dependence pragmas (DaCe semantics implies no conflict) + for node in [src_node, dst_node]: + if (isinstance(node.desc(sdfg), dace.data.Array) + and node.desc(sdfg).storage in [ + dace.types.StorageType.FPGA_Local, + dace.StorageType.FPGA_Registers + ]): + callsite_stream.write( + "#pragma HLS DEPENDENCE variable={} false".format( + node.data)) + + # Loop outtro + for _ in range(num_loops): + callsite_stream.write("}") + + else: + + self._cpu_codegen.copy_memory(sdfg, dfg, state_id, src_node, + dst_node, edge, None, + callsite_stream) + + @staticmethod + def sdaccel_params(sdfg, kernel_params): + seen = set() + out_params = [] + for is_output, pname, param in kernel_params: + # Since we can have both input and output versions of the same + # array, make sure we only pass it once from the host code + if param in seen: + continue + seen.add(param) + if isinstance(param, dace.data.Array): + out_params.append("hlslib::ocl::Buffer<{}, " + "hlslib::ocl::Access::readWrite> &{}".format( + param.dtype.ctype, pname)) + else: + out_params.append(param.signature(with_types=True, name=pname)) + return out_params + + def get_next_scope_entries(self, sdfg, dfg, scope_entry): + parent_scope_entry = dfg.scope_dict()[scope_entry] + parent_scope = dfg.scope_subgraph(parent_scope_entry) + + # Get all scopes from the same level + all_scopes = [ + node for node in parent_scope.topological_sort() + if isinstance(node, nodes.EntryNode) + ] + + return all_scopes[all_scopes.index(scope_entry) + 1:] + + def generate_node(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + method_name = "_generate_" + type(node).__name__ + # Fake inheritance... use this class' method if it exists, + # otherwise fall back on CPU codegen + if hasattr(self, method_name): + + if hasattr(node, "schedule") and node.schedule not in [ + dace.types.ScheduleType.Default, + dace.types.ScheduleType.FPGA_Device + ]: + # raise dace.codegen.codegen.CodegenError( + # "Cannot produce FPGA code for {} node with schedule {}: ". 
+ # format(type(node).__name__, node.schedule, node)) + print("WARNING: found schedule {} on {} node in FPGA code. " + "Ignoring.".format(node.schedule, + type(node).__name__)) + + getattr(self, method_name)(sdfg, dfg, state_id, node, + function_stream, callsite_stream) + else: + self._cpu_codegen.generate_node(sdfg, dfg, state_id, node, + function_stream, callsite_stream) + + def initialize_array(self, *args, **kwargs): + pass + + def copy_memory(self, sdfg, dfg, state_id, src_node, dst_node, edge, + function_stream, callsite_stream): + + if isinstance(src_node, nodes.Tasklet): + src_storage = dace.types.StorageType.Register + try: + src_parent = dfg.scope_dict()[src_node] + except KeyError: + src_parent = None + dst_schedule = (None + if src_parent is None else src_parent.map.schedule) + else: + src_storage = src_node.desc(sdfg).storage + + if isinstance(dst_node, nodes.Tasklet): + dst_storage = dace.types.StorageType.Register + else: + dst_storage = dst_node.desc(sdfg).storage + + try: + dst_parent = dfg.scope_dict()[dst_node] + except KeyError: + dst_parent = None + dst_schedule = None if dst_parent is None else dst_parent.map.schedule + + state_dfg = sdfg.nodes()[state_id] + + # Emit actual copy + self._emit_copy(sdfg, state_id, src_node, src_storage, dst_node, + dst_storage, dst_schedule, edge, state_dfg, + callsite_stream) + + def _generate_MapEntry(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + + result = callsite_stream + + scope_dict = dfg.scope_dict() + + if node.map in self._unrolled_pes: + + # This is a top-level unrolled map, meaning it has been used to + # replicate processing elements. Don't generate anything here. + pass + + else: + + # Generate nested loops + for i, r in enumerate(node.map.range): + var = node.map.params[i] + begin, end, skip = r + result.write( + "for (auto {} = {}; {} < {}; {} += {}) {{\n".format( + var, cpu.sym2cpp(begin), var, cpu.sym2cpp(end + 1), + var, cpu.sym2cpp(skip)), sdfg, state_id, node) + + # Pipeline innermost loops + scope = dfg.scope_dict(True)[node] + + if node.map.unroll: + result.write("#pragma HLS UNROLL\n", sdfg, state_id, node) + else: + is_innermost = not any( + [isinstance(x, nodes.EntryNode) for x in scope]) + if is_innermost: + result.write( + "#pragma HLS PIPELINE II=1\n#pragma HLS LOOP_FLATTEN", + sdfg, state_id, node) + + if node.map.flatten: + result.write("#pragma HLS LOOP_FLATTEN\n", sdfg, state_id, + node) + + # Emit internal transient array allocation + to_allocate = dace.sdfg.local_transients( + sdfg, sdfg.find_state(state_id), node) + allocated = set() + for child in dfg.scope_dict(node_to_children=True)[node]: + if not isinstance(child, nodes.AccessNode): + continue + if child.data not in to_allocate or child.data in allocated: + continue + allocated.add(child.data) + self._dispatcher.dispatch_allocate(sdfg, dfg, state_id, child, + None, result) + self._dispatcher.dispatch_initialize(sdfg, dfg, state_id, child, + None, result) + + def _generate_MapExit(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + scope_dict = dfg.scope_dict() + entry_node = scope_dict[node] + if entry_node.map in self._unrolled_pes: + # This was generated as unrolled processing elements, no need to + # generate anything here + return + self._cpu_codegen._generate_MapExit(sdfg, dfg, state_id, node, + function_stream, callsite_stream) + + def _generate_Reduce(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + + end_braces = 0 + + axes = node.axes + input_memlet = 
dfg.in_edges(node)[0].data + src_data = sdfg.arrays[input_memlet.data] + output_edge = dfg.out_edges(node)[0] + output_memlet = output_edge.data + dst_data = sdfg.arrays[output_memlet.data] + + output_type = 'dace::vec<%s, %s>' % (dst_data.dtype.ctype, + output_memlet.veclen) + + # If axes were not defined, use all input dimensions + input_dims = input_memlet.subset.dims() + output_dims = output_memlet.subset.data_dims() + if axes is None: + axes = tuple(range(input_dims)) + output_axes = [a for a in range(input_dims) if a not in axes] + + # Obtain variable names per output and reduction axis + axis_vars = [] + unroll_dim = [] + octr = 0 + for d in range(input_dims): + if d in axes: + axis_vars.append('__i%d' % d) + else: + axis_vars.append('__o%d' % octr) + octr += 1 + if ((isinstance(src_data, dace.data.Stream) + and src_data.is_stream_array()) or + (isinstance(src_data, dace.data.Array) and + src_data.storage == dace.types.StorageType.FPGA_Registers)): + # Unroll reads from registers and stream arrays + unroll_dim.append(True) + else: + unroll_dim.append(False) + + # We want to pipeline the last non-unrolled dimension + pipeline_dim = -1 + for i in itertools.chain(axes, output_axes): + if not unroll_dim[i]: + pipeline_dim = i + + if node.identity is not None: + identity = cpu.sym2cpp(node.identity) + else: + identity = None + + # Initialize accumulator variable if we're collapsing to a single value + all_axes_collapsed = (len(axes) == input_dims) + if all_axes_collapsed: + accumulator = "_{}_accumulator".format(output_memlet.data) + callsite_stream.write("{} {};".format(output_type, accumulator), + sdfg, state_id, node) + + # Generate inner loops (for each collapsed dimension) + input_subset = input_memlet.subset + iterators_inner = ["__i{}".format(axis) for axis in axes] + for i, axis in enumerate(axes): + callsite_stream.write( + 'for (int {var} = {begin}; {var} < {end}; {var} += {skip}) {{'. + format( + var=iterators_inner[i], + begin=input_subset[axis][0], + end=input_subset[axis][1] + 1, + skip=input_subset[axis][2]), sdfg, state_id, node) + if unroll_dim[axis]: + callsite_stream.write("#pragma HLS UNROLL\n") + if axis == pipeline_dim: + callsite_stream.write( + "#pragma HLS PIPELINE II=1\n#pragma HLS LOOP_FLATTEN") + end_braces += 1 + + # Generate outer loops (over different output locations) + output_subset = output_memlet.subset + iterators_outer = ["__o{}".format(axis) for axis in range(output_dims)] + for i, axis in enumerate(output_axes): + callsite_stream.write( + 'for (int {var} = {begin}; {var} < {end}; {var} += {skip}) {{'. 
+ format( + var=iterators_outer[i], + begin=output_subset[i][0], + end=output_subset[i][1] + 1, + skip=output_subset[i][2]), sdfg, state_id, node) + if unroll_dim[axis]: + callsite_stream.write("#pragma HLS UNROLL\n") + if axis == pipeline_dim: + callsite_stream.write( + "#pragma HLS PIPELINE II=1\n#pragma HLS LOOP_FLATTEN") + end_braces += 1 + + # Determine reduction type + reduction_type = operations.detect_reduction_type(node.wcr) + if reduction_type == dace.types.ReductionType.Custom: + raise NotImplementedError("Custom reduction for FPGA is NYI") + + # Input and output variables + out_var = (accumulator + if all_axes_collapsed else cpp_array_expr( + sdfg, + output_memlet, + offset=iterators_outer, + relative_offset=False)) + in_var = cpp_array_expr( + sdfg, input_memlet, offset=axis_vars, relative_offset=False) + + # Call library function to perform reduction + reduction_cpp = "dace::Reduce<{}, {}, {}, {}<{}>>".format( + dst_data.dtype.ctype, input_memlet.veclen, output_memlet.veclen, + REDUCTION_TYPE_TO_HLSLIB[reduction_type], dst_data.dtype.ctype) + + # Check if this is the first iteration of accumulating into this + # location + is_first_iteration = " && ".join([ + "{} == {}".format(iterators_inner[i], input_subset[axis][0]) + for i, axis in enumerate(axes) + ]) + if identity is not None: + # If this is the first iteration, set the previous value to be + # identity, otherwise read the value from the output location + prev_var = "{}_prev".format(output_memlet.data) + callsite_stream.write( + "{} {} = ({}) ? ({}) : ({});".format( + output_type, prev_var, is_first_iteration, identity, + out_var), sdfg, state_id, node) + callsite_stream.write( + "{} = {}({}, {});".format(out_var, reduction_cpp, prev_var, + in_var), sdfg, state_id, node) + else: + # If this is the first iteration, assign the value read from the + # input directly to the output + callsite_stream.write( + "{} = ({}) ? ({}) : {}({}, {});".format( + out_var, is_first_iteration, in_var, reduction_cpp, + out_var, in_var), sdfg, state_id, node) + + # Generate closing braces + for i in range(end_braces): + callsite_stream.write('}', sdfg, state_id, node) + if i == end_braces - 1 and all_axes_collapsed: + dst_expr = output_memlet.data + offset = cpp_offset_expr( + dst_data, + output_memlet.subset, + packed_veclen=output_memlet.veclen) + if offset: + dst_expr += " + " + offset + callsite_stream.write( + "dace::Write({}, {});".format(dst_expr, out_var), sdfg, + state_id, node) + + def _generate_Tasklet(self, sdfg, dfg, state_id, node, function_stream, + callsite_stream): + + # TODO: this is copied from the CPU-codegen, necessary to inject + # pragmas at the output memlets! Should consolidate. 
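+        # [Editor's note] Not part of the original patch: relative to the CPU
+        # code path mentioned in the TODO above, the FPGA-specific addition in
+        # this method is the "#pragma HLS DEPENDENCE variable=__<connector> false"
+        # emitted for FPGA-local/register output arrays near the end below.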
+
+        callsite_stream.write('{\n', sdfg, state_id, node)
+
+        state_dfg = sdfg.nodes()[state_id]
+
+        self._dispatcher.defined_vars.enter_scope(node)
+
+        arrays = set()
+        for edge in dfg.in_edges(node):
+            u = edge.src
+            memlet = edge.data
+
+            if edge.dst_conn:  # Not (None or "")
+                if edge.dst_conn in arrays:  # Disallow duplicates
+                    raise SyntaxError('Duplicates found in memlets')
+                # Special case: code->code
+                if isinstance(edge.src, nodes.CodeNode):
+                    shared_data_name = 's%d_n%d%s_n%d%s' % (
+                        state_id, dfg.node_id(edge.src), edge.src_conn,
+                        dfg.node_id(edge.dst), edge.dst_conn)
+
+                    # Read variable from shared storage
+                    callsite_stream.write(
+                        'const dace::vec<%s, %s>& %s = __%s;' %
+                        (edge.data.data.dtype.ctype, sym2cpp(edge.data.veclen),
+                         edge.dst_conn, shared_data_name), sdfg, state_id,
+                        [edge.src, edge.dst])
+                    self._dispatcher.defined_vars.add(edge.dst_conn,
+                                                      DefinedType.Scalar)
+
+                else:
+                    src_node = find_input_arraynode(state_dfg, edge)
+
+                    self._dispatcher.dispatch_copy(
+                        src_node, node, edge, sdfg, state_dfg, state_id,
+                        function_stream, callsite_stream)
+
+                # Also define variables in the C++ unparser scope
+                self._cpu_codegen._locals.define(edge.dst_conn, -1,
+                                                 self._cpu_codegen._ldepth + 1)
+                arrays.add(edge.dst_conn)
+
+        callsite_stream.write('\n', sdfg, state_id, node)
+
+        # Use outgoing edges to preallocate output local vars
+        for edge in dfg.out_edges(node):
+            v = edge.dst
+            memlet = edge.data
+
+            if edge.src_conn:
+                if edge.src_conn in arrays:  # Disallow duplicates
+                    continue
+                # Special case: code->code
+                if isinstance(edge.dst, nodes.CodeNode):
+                    callsite_stream.write(
+                        'dace::vec<%s, %s> %s;' %
+                        (sdfg.arrays[memlet.data].dtype.ctype,
+                         sym2cpp(memlet.veclen), edge.src_conn), sdfg,
+                        state_id, [edge.src, edge.dst])
+                    self._dispatcher.defined_vars.add(edge.src_conn,
+                                                      DefinedType.Scalar)
+                else:
+                    dst_node = find_output_arraynode(state_dfg, edge)
+
+                    self._dispatcher.dispatch_copy(
+                        node, dst_node, edge, sdfg, state_dfg, state_id,
+                        function_stream, callsite_stream)
+
+                # Also define variables in the C++ unparser scope
+                self._cpu_codegen._locals.define(edge.src_conn, -1,
+                                                 self._cpu_codegen._ldepth + 1)
+                arrays.add(edge.src_conn)
+
+        callsite_stream.write('\n ///////////////////\n', sdfg, state_id,
+                              node)
+
+        cpu.unparse_tasklet(sdfg, state_id, dfg, node, function_stream,
+                            callsite_stream, self._cpu_codegen._locals,
+                            self._cpu_codegen._ldepth)
+
+        callsite_stream.write(' ///////////////////\n\n', sdfg, state_id,
+                              node)
+
+        # Process outgoing memlets
+        self._cpu_codegen.process_out_memlets(
+            sdfg, state_id, node, state_dfg, self._dispatcher, callsite_stream,
+            True, function_stream)
+
+        for edge in state_dfg.out_edges(node):
+            datadesc = sdfg.arrays[edge.data.data]
+            if (isinstance(datadesc, dace.data.Array) and
+                    (datadesc.storage == dace.types.StorageType.FPGA_Local
+                     or datadesc.storage == dace.types.StorageType.FPGA_Registers)
+                    and edge.data.wcr is None):
+                callsite_stream.write(
+                    "#pragma HLS DEPENDENCE variable=__{} false".format(
+                        edge.src_conn))
+
+        callsite_stream.write('}\n', sdfg, state_id, node)
+
+        self._dispatcher.defined_vars.exit_scope(node)
diff --git a/dace/codegen/tools/dacestub.cpp b/dace/codegen/tools/dacestub.cpp
new file mode 100644
index 0000000000..e2a76b54ed
--- /dev/null
+++ b/dace/codegen/tools/dacestub.cpp
@@ -0,0 +1,85 @@
+/**
+ * Stub library that can load other libraries for use as DaCe programs
+**/
+
+#ifdef _WIN32
+    #include <windows.h>
+    #define DACE_EXPORTED extern "C" __declspec(dllexport)
+#else
+    #include <dlfcn.h>
+    #define DACE_EXPORTED extern "C"
+#endif
+
+// Workaround (see unload_library)
+#include <omp.h>
+
+// Loads a library and returns a handle to it, or NULL if there was an error
+// NOTE: On Windows, path must be given as a Unicode string (UTF-16, or
+// ctypes.c_wchar_p)
+DACE_EXPORTED void *load_library(const char *filename) {
+    if (!filename)
+        return nullptr;
+
+    void *hLibrary = nullptr;
+
+#ifdef _WIN32
+    hLibrary = (void *)LoadLibraryW((const wchar_t*)filename);
+#else
+    hLibrary = dlopen(filename, RTLD_LOCAL | RTLD_NOW);
+#endif
+
+    return hLibrary;
+}
+
+// Returns 1 if the library is already loaded, 0 if not, or -1 on error
+DACE_EXPORTED int is_library_loaded(const char *filename) {
+    if (!filename)
+        return -1;
+
+    void *hLibrary = nullptr;
+
+#ifdef _WIN32
+    hLibrary = (void *)GetModuleHandleW((const wchar_t*)filename);
+#else
+    hLibrary = dlopen(filename, RTLD_LOCAL | RTLD_NOW | RTLD_NOLOAD);
+#endif
+
+    if (hLibrary)
+        return 1;
+    return 0;
+}
+
+// Loads a library function and returns a pointer, or NULL if it was not found
+DACE_EXPORTED void *get_symbol(void *hLibrary, const char *symbol) {
+    if (!hLibrary || !symbol)
+        return nullptr;
+
+    void *sym = nullptr;
+
+#ifdef _WIN32
+    sym = GetProcAddress((HMODULE)hLibrary, symbol);
+#else
+    sym = dlsym(hLibrary, symbol);
+#endif
+
+    return sym;
+}
+
+// Unloads a previously loaded library, given the handle returned by
+// load_library
+DACE_EXPORTED void unload_library(void *hLibrary) {
+    if (!hLibrary)
+        return;
+
+    // Workaround so that OpenMP does not go ballistic when calling dlclose()
+    omp_get_max_threads();
+
+#ifdef _WIN32
+    FreeLibrary((HMODULE)hLibrary);
+#else
+    dlclose(hLibrary);
+#endif
+}
+
+
diff --git a/dace/codegen/tools/get_cuda_arch.cpp b/dace/codegen/tools/get_cuda_arch.cpp
new file mode 100644
index 0000000000..88442f0729
--- /dev/null
+++ b/dace/codegen/tools/get_cuda_arch.cpp
@@ -0,0 +1,33 @@
+#include <cuda_runtime.h>
+#include <iostream>
+#include <sstream>
+#include <set>
+#include <string>
+
+
+int main(int argc, char **argv)
+{
+    int count;
+    if (cudaGetDeviceCount(&count) != cudaSuccess)
+        return 1;
+
+    std::set<std::string> architectures;
+    // Loop over all GPU architectures
+    for (int i = 0; i < count; ++i)
+    {
+        cudaDeviceProp prop;
+        if (cudaGetDeviceProperties(&prop, i) != cudaSuccess ||
+            (prop.major == 99 && prop.minor == 99))
+            continue;
+        std::stringstream ss;
+        ss << prop.major << prop.minor;
+        architectures.insert(ss.str());
+    }
+
+    // Print out architectures
+    for (std::set<std::string>::iterator iter = architectures.begin();
+         iter != architectures.end(); ++iter)
+        std::cout << *iter << " ";
+
+    return 0;
+}
diff --git a/dace/config.py b/dace/config.py
new file mode 100644
index 0000000000..e6415a4a7b
--- /dev/null
+++ b/dace/config.py
@@ -0,0 +1,265 @@
+import os
+import platform
+import yaml
+
+
+def _env2bool(envval):
+    """ Converts an arbitrary value to boolean.
+        @param envval: Arbitrary value.
+        @return: True if the input value matches a valid TRUE
+                 value, or False otherwise.
+    """
+    return str(envval).lower() in ['true', '1', 'y', 'yes', 'on']
+
+
+def _add_defaults(config, metadata):
+    """ Add defaults to configuration from metadata.
+        @return: True if configuration was modified, False otherwise.
+ """ + osname = platform.system() + modified = False + for k, v in metadata.items(): + # Recursive call for fields inside the dictionary + if v['type'] == 'dict': + if k not in config: + modified = True + config[k] = {} + modified |= _add_defaults(config[k], v['required']) + continue + # Empty list initialization (if no default is specified) + elif v['type'] == 'list': + if k not in config and 'default' not in v: + modified = True + config[k] = [] + continue + # Key does not exist in configuration, add default value + if k not in config: + modified = True + # Per-OS default + if 'default_' + osname in v: + config[k] = v['default_' + osname] + else: + config[k] = v['default'] + return modified + + +class Config(object): + """ Interface to the DaCe hierarchical configuration file. """ + + _config = {} + _config_metadata = {} + _cfg_filename = None + _metadata_filename = None + + @staticmethod + def cfg_filename(): + """ Returns the current configuration file path. """ + + return Config._cfg_filename + + @staticmethod + def initialize(): + """ Initializes configuration. + + B{Note:} This function runs automatically when the module + is loaded. + """ + + # If already initialized, skip + if Config._cfg_filename is not None: + return + + # Override default configuration file path + if 'DACE_CONFIG' in os.environ: + cfg_filename = os.environ['DACE_CONFIG'] + else: + home = os.path.expanduser("~") + cfg_filename = os.path.join(home, ".dace.conf") + + Config._cfg_filename = cfg_filename + + dace_path = os.path.dirname(os.path.abspath(__file__)) + Config._metadata_filename = os.path.join(dace_path, + 'config_schema.yml') + + # Load configuration schema (for validation and defaults) + Config.load_schema() + + if os.path.isfile(cfg_filename): + Config.load() + else: + # Load the defaults from metadata and save new conf file + Config._config = {} + _add_defaults(Config._config, Config._config_metadata['required']) + Config.save() + + @staticmethod + def load(filename=None): + """ Loads a configuration from an existing file. + @param filename: The file to load. If unspecified, + uses default configuration file. + """ + if filename is None: + filename = Config._cfg_filename + + # Read configuration file + with open(filename, 'r') as f: + Config._config = yaml.load(f.read()) + + # Add defaults from metadata + modified = _add_defaults(Config._config, + Config._config_metadata['required']) + if modified: # Update file if changed + Config.save() + + @staticmethod + def load_schema(filename=None): + """ Loads a configuration schema from an existing file. + @param filename: The file to load. If unspecified, + uses default schema file. + """ + if filename is None: + filename = Config._metadata_filename + with open(filename, 'r') as f: + Config._config_metadata = yaml.load(f.read()) + + @staticmethod + def save(path=None): + """ Saves the current configuration to a file. + @param path: The file to save to. If unspecified, + uses default configuration file. + """ + if path is None: + path = Config._cfg_filename + # Write configuration file + with open(path, 'w') as f: + yaml.dump(Config._config, f, default_flow_style=False) + + @staticmethod + def get_metadata(*key_hierarchy): + """ Returns the configuration specification of a given entry + from the schema. + @param key_hierarchy: A tuple of strings leading to the + configuration entry. + For example: ('a', 'b', 'c') would be + configuration entry c which is in the + path a->b. + @return: Configuration specification as a dictionary. 
+ """ + # Traverse the key hierarchy + current_conf = Config._config_metadata + for key in key_hierarchy: + current_conf = current_conf['required'][key] + return current_conf + + @staticmethod + def get_default(*key_hierarchy): + """ Returns the default value of a given configuration entry. + Takes into accound current operating system. + @param key_hierarchy: A tuple of strings leading to the + configuration entry. + For example: ('a', 'b', 'c') would be + configuration entry c which is in the + path a->b. + @return: Default configuration value. + """ + # Traverse the key hierarchy + current_conf = Config._config_metadata + for key in key_hierarchy: + current_conf = current_conf['required'][key] + if 'default_' + platform.system() in current_conf: + return current_conf['default_' + platform.system()] + return current_conf['default'] + + @staticmethod + def get(*key_hierarchy): + """ Returns the current value of a given configuration entry. + @param key_hierarchy: A tuple of strings leading to the + configuration entry. + For example: ('a', 'b', 'c') would be + configuration entry c which is in the + path a->b. + @return: Configuration entry value. + """ + # Environment variable override + # NOTE: will only work if a specific key is accessed! + envvar = 'DACE_' + '_'.join(key_hierarchy) + if envvar in os.environ: + return os.environ[envvar] + + # Traverse the key hierarchy + current_conf = Config._config + for key in key_hierarchy: + current_conf = current_conf[key] + + return current_conf + + @staticmethod + def get_bool(*key_hierarchy): + """ Returns the current value of a given boolean configuration entry. + This specialization allows more string types to be converted to + boolean, e.g., due to environment variable overrides. + @param key_hierarchy: A tuple of strings leading to the + configuration entry. + For example: ('a', 'b', 'c') would be + configuration entry c which is in the + path a->b. + @return: Configuration entry value (as a boolean). + """ + res = Config.get(*key_hierarchy) + if isinstance(res, bool): + return res + return _env2bool(str(res)) + + @staticmethod + def append(*key_hierarchy, value=None, autosave=False): + """ Appends to the current value of a given configuration entry + and sets it. Example usage: + `Config.append('compiler', 'cpu', 'args', value='-fPIC')` + @param key_hierarchy: A tuple of strings leading to the + configuration entry. + For example: ('a', 'b', 'c') would be + configuration entry c which is in the + path a->b. + @param value: The value to append. + @param autosave: If True, saves the configuration to the file + after modification. + @return: Current configuration entry value. + """ + # Traverse the key hierarchy up until the next to last element + current_conf = Config._config + for key in key_hierarchy[:-1]: + current_conf = current_conf[key] + + current_conf[key_hierarchy[-1]] += value + if autosave: + Config.save() + + return current_conf[key_hierarchy[-1]] + + @staticmethod + def set(*key_hierarchy, value=None, autosave=False): + """ Sets the current value of a given configuration entry. + Example usage: + `Config.set('profiling', value=True)` + @param key_hierarchy: A tuple of strings leading to the + configuration entry. + For example: ('a', 'b', 'c') would be + configuration entry c which is in the + path a->b. + @param value: The value to set. + @param autosave: If True, saves the configuration to the file + after modification. 
+ """ + # Traverse the key hierarchy up until the next to last element + current_conf = Config._config + for key in key_hierarchy[:-1]: + current_conf = current_conf[key] + + current_conf[key_hierarchy[-1]] = value + if autosave: + Config.save() + + +# Code that runs when the module is loaded +Config.initialize() diff --git a/dace/config_schema.yml b/dace/config_schema.yml new file mode 100644 index 0000000000..d0cdb5c552 --- /dev/null +++ b/dace/config_schema.yml @@ -0,0 +1,602 @@ +# Schema file for DaCe Preferences + +# Metadata fields for elements: +# type: any python type (dict, list, int, bool, float, str) +# title: short name to show in GUI +# description: tooltip to show in GUI +# required: required sub-fields (for dict fields) +# default: default value. Can be platform-specific (see below) +# default_: default value for platform (overrides default) +# template_vars: template variables to include when processing (str fields only) + +# Top-level element is a dictionary (record) +type: dict +title: General +description: DaCe Preferences +required: + ############################################# + # Categories + optimizer: + type: dict + title: Optimizer + description: Preferences of the SDFG Optimizer + required: + autospecialize: + type: bool + default: false + title: Auto-specialize symbols + description: > + Automatically specialize every SDFG to the symbol values + at call-time. Requires all symbols to be set. + + interface: + type: bool + default: dace.transformation.optimizer.SDFGOptimizer + title: SDFG Optimizer + description: > + SDFG optimization class to import and call automatically + on compilation. Defaults to the transformation CLI, empty + string or an invalid class name skips the process. + + visualize: + type: bool + default: false + title: Visualize SDFG + description: Open a GraphViz window after every transformation. + + savedots: + type: bool + default: false + title: Save dot files + description: Save GraphViz .dot files after every transformation. + + automatic_state_fusion: + type: bool + default: true + title: Automatic strict transformations + description: > + Automatically performs strict transformations + that are considered to be safe. + + detect_control_flow: + type: bool + default: true + title: Detect control flow from state transitions + description: > + Attempts to infer control flow constructs "if", + "for" and "while" from state transitions, allowing + code generators to generate appropriate code. + + renderer: + type: dict + title: Renderer + description: Preferences of the SDFG Renderer + required: + fulledges: + type: bool + default: false + title: Show full edges + description: > + If enabled, prints out the full edge labels (which may be + long due to complex indexing). + html5renderer: + type: bool + default: false + title: (EXPERIMENTAL) HTML5 Rendering Engine + description: > + If enabled, uses an HTML5-based renderer to display SDFGs. + This allows to visualize performance data, but is still experimental. + + compiler: + type: dict + title: Compiler + description: Preferences of the compiler + required: + use_cache: + type: bool + default: false + title: Use cache + description: > + If enabled, does not recompile code generated from SDFGs + if shared library (.so/.dll) file is present. + + library_extension: + type: str + default: so + default_Linux: so + default_Windows: dll + default_Darwin: dylib + title: Library extension + description: File extension of shared libraries. 
+ + indentation_spaces: + type: int + default: 4 + title: Indentation width + description: > + Number of spaces used when indenting generated code. + + build_type: + type: str + default: Release + title: Build configuration + description: > + Configuration type for CMake build (can be Debug, Release, + RelWithDebInfo, or MinSizeRel). + + allow_shadowing: + type: str + default: false + title: Allow variable shadowing + description: > + Allowing shadowing of variables in the code (reduces + exceptions to warnings when shadowing is encountered). + + ############################################# + # CPU compiler + cpu: + type: dict + title: CPU + description: CPU compiler preferences + required: + executable: + type: str + default: g++ + default_Windows: cl + title: Compiler executable name + description: File path or name of compiler executable + + args: + type: str + title: Arguments + description: Compiler argument flags + default: '-std=c++14 -fPIC -Wall -Wextra -O3 -march=native -ffast-math -Wno-unused-parameter -Wno-unused-label' + default_Windows: '/O2 /fp:fast /arch:AVX2 /D_USRDLL /D_WINDLL /D__restrict__=__restrict' + + additional_args: + type: str + title: Extra Arguments + description: Additional arguments provided by users + default: '' + + libs: + type: str + title: Additional libraries + description: Additional linked libraries required by target + default: '' + + ############################################# + # GPU (CUDA) compiler + cuda: + type: dict + title: GPU + description: GPU (CUDA) compiler preferences + required: + executable: + type: str + default: nvcc + title: Compiler executable name + description: File path or name of compiler executable + + args: + type: str + title: Arguments + description: Compiler argument flags + default: '-std=c++14 -Xcompiler -fPIC -O3 -Xcompiler -march=native --use_fast_math -Xcompiler -Wno-unused-parameter' + + cuda_arch: + type: str + title: Additional CUDA architectures + description: > + Additional CUDA architectures (separated by commas) + to compile GPU code for, excluding the current + architecture on the compiling machine. + default: '35' + + default_block_size: + type: str + title: Default thread-block size + description: > + Default thread-block size for CUDA kernels when + explicit GPU block maps are not defined. + default: '32,1,1' + + max_concurrent_streams: + type: int + title: Concurrent CUDA streams + description: > + Maximum number of concurrent CUDA streams to + generate. Special values: -1 only uses the + default stream, 0 uses infinite concurrent streams. + default: 0 + + additional_args: + type: str + title: Extra Arguments + description: Additional arguments provided by users + default: '' + + libs: + type: str + title: Additional libraries + description: Additional linked libraries required by target + default: '' + + ############################################# + # FPGA (Xilinx) compiler flags + xilinx: + type: dict + title: Xilinx + description: FPGA (Xilinx) compiler preferences + required: + + mode: + type: str + default: simulation + title: Compilation mode + description: Target of FPGA kernel build (simulation/software_emulation/hardware_emulation/hardware) + + executable: + type: str + default: xocc + title: SDAccel compiler executable path + description: File path or name of SDAccel binary (xocc) + + platform: + type: str + default: xilinx_vcu1525_dynamic_5_1 + title: Target platform for xocc + description: Platform name of SDAccel target. 
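+      # [Editor's note] Hedged example, not part of the schema itself: using the
+      # DACE_ environment-variable convention from dace/config.py, a single run
+      # could select a different build mode for this section, e.g.
+      #   DACE_compiler_xilinx_mode=hardware_emulation python my_program.py
+      # where my_program.py is a hypothetical user script.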
+ + enable_debugging: + type: bool + default: false + title: Enable debugging for hardware kernels + description: > + Injects debugging cores on the interfaces of the + kernel, allowing fine-grained debugging of hardware + runs at the cost of additional resources. This is + always enabled for emulation runs. + + host_flags: + type: str + title: Host arguments + description: Extra host compiler argument flags + default: "-Wno-unknown-pragmas -Wno-unused-label" + + synthesis_flags: + type: str + title: Synthesis arguments + description: High-level synthesis C++ flags + default: "-std=c++11" + + build_flags: + type: str + title: Arguments + description: Kernel build (xocc) C++ flags + default: "" + + ############################################# + # MPI compiler + mpi: + type: dict + title: MPI + description: MPI compiler preferences + required: + executable: + type: str + default: mpicxx + title: Compiler executable name + description: File path or name of compiler executable + + ############################################# + # Linker + linker: + type: dict + title: Linker + description: Linker preferences + required: + executable: + type: str + default: g++ + default_Windows: cl + title: Linker executable name + description: File path or name of linker executable + + args: + type: str + title: Arguments + description: Linker argument flags + default: '' + + additional_args: + type: str + title: Extra Arguments + description: Additional arguments provided by users + default: '' + template_envvars: + - CUDA_PATH + + library_prefix: + type: str + title: Library argument prefix + description: > + Argument prefix to add before each added library. + default: '-l' + default_Windows: '' + + library_suffix: + type: str + title: Library argument suffix + description: > + Argument suffix to add after each added library. + default: '' + default_Windows: '.lib' + + execution: + type: dict + title: Execution + description: Binary execution preferences + required: + general: + type: dict + title: General + description: General execution preferences + required: + host: + type: str + default: localhost + title: Host + description: Hostname to use for execution + + workdir: + type: str + default: '/tmp/' + title: Working directory + description: Working directory on the remote host + + check_args: + type: bool + default: true + title: Check arguments + description: > + Do strict verification that arguments passed when + calling a DaCe program match the expected types. + + execcmd: + type: str + title: Command + description: > + Command to use to execute ${command} on ${host} + default: 'ssh ${host} ${command}' + template_vars: + - host + - command + + copycmd_r2l: + type: str + default: 'scp ${host}:${srcfile} ${dstfile}' + title: "Remote->Local copy command" + description: > + Command to use to copy ${srcfile} on ${host} to + the local ${dstfile}. + template_vars: + - host + - srcfile + - dstfile + + copycmd_l2r: + type: str + default: "scp ${srcfile} ${host}:${dstfile}" + title: "Local->Remote copy command" + description: > + Command to use to copy the local ${srcfile} to the + remote ${dstfile}. + template_vars: + - host + - srcfile + - dstfile + + repetitions: + type: int + default: 5 + title: "Repetitions per Run" + description: > + Number of repetitions to run for each click of the + Run button (median value will be reported in the + performance chart). 
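+          # [Editor's note] Hedged sketch, not part of the schema itself: the
+          # ${...} placeholders in execcmd and the copy commands above stand for
+          # the listed template_vars; with the assumed values host=node01 and
+          # command=./my_prog, 'ssh ${host} ${command}' would expand to
+          # 'ssh node01 ./my_prog'. The substitution mechanism itself is not
+          # defined in this file.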
+ mpi: + type: dict + title: MPI + description: MPI execution preferences + required: + mpiexec: + type: str + default: 'mpirun -n ${num_procs} ${command}' + title: mpirun command + description: > + Command to use to execute MPI job ${command} with + ${num_procs} processes. + template_vars: + - num_procs + - command + + num_procs: + type: int + default: 4 + title: Number of processes + description: Number of MPI processes to use + diode: + type: dict + title: DIODE + description: DIODE GUI preferences + required: + layout: + type: dict + title: Layout + description: Window layout preferences + required: + window_width: + default: 800 + title: Window Width + type: float + description: Window width (in pixels) + + window_height: + default: 600 + title: Window Height + type: float + description: Window height (in pixels) + + window_maximized: + default: True + title: Window Maximized + type: bool + description: > + If True, DIODE starts with a maximized window + + toppane_height: + default: 20 + type: float + title: Top-Pane Height + description: > + Height of top pane in Optimizer view (in percentage) + + pypane_width: + default: 30 + title: Python Pane Width + type: float + description: > + Width of the Python Editor pane (in percentage) + + optpane_width: + default: 30 + title: Transformation Pane Width + type: float + description: > + Width of the Transformation pane (in percentage) + + codepane_width: + default: 30 + title: Generated Code Pane Width + type: float + description: > + Width of the Generated Code pane (in percentage) + + perfpane_width: + default: 30 + title: Performance Pane Width + type: float + description: > + Width of the Performance graph pane (in percentage) + + general: + type: dict + title: General + description: General DIODE Preferences + required: + + show_transfed: + type: bool + default: False + title: (EXPERIMENTAL) Show Transformation Editor + description: > + Show (or hide) the experimental transformation + editor. + + show_sdfged: + type: bool + default: False + title: (EXPERIMENTAL) Show SDFG Editor + description: > + Show (or hide) the experimental SDFG Editor. + + show_optgraph: + type: bool + default: False + title: Show Optimization Graph + description: > + Show available transformations as a graph. This is + discouraged as the optimization graph may be too + large to be useful. + + fonts: + type: dict + title: Fonts + description: Fonts used in editors + required: + python: + default: '' + title: Python + type: font + description: Font used to render Python code. + + codegen: + default: '' + title: Generated Code + type: font + description: Font used to render generated code. + + pated: + default: '' + title: Transformation Editor + type: font + description: Font used to render pattern match code. + + instrumentation: + type: dict + title: Instrumentation + description: Instrumentation preferences + required: + enable_papi: + type: bool + title: Enable PAPI + default: false + description: Enable instrumentation using PAPI + enable_vectorization_analysis: + type: bool + title: Enable vectorization check + default: false + description: > + Enables analysis of gcc vectorization information. Only gcc/g++ is supported. 
+ enable_papi_counter_sanity_check: + type: bool + title: Counter sanity check + default: false + description: > + Enables a pre-run sanity check to minimize runtime failures + default_papi_counters: + type: str + title: Default PAPI counters + default: "['PAPI_TOT_INS', 'PAPI_TOT_CYC', 'PAPI_L2_TCM', 'PAPI_L3_TCM']" + description: > + Sets the default PAPI counter list, formatted as + a Python list of strings. + max_scope_depth: + type: int + title: Max scope depth + default: 5 + description: > + Sets the maximum depth of instrumentations in + map/consume scopes. Scopes that are deeper will not + be instrumented. + + ############################################# + # General settings + debugprint: + type: bool + default: true + title: Debug printing + description: Enable verbose printouts. + + profiling: + type: bool + default: false + title: Profiling + description: Enable profiling support. + + treps: + type: int + default: 100 + title: Profiling Repetitions + description: Number of times to run program for profiling. diff --git a/dace/data.py b/dace/data.py new file mode 100644 index 0000000000..2c0e6a47ac --- /dev/null +++ b/dace/data.py @@ -0,0 +1,496 @@ +import functools +import operator +import re +import copy as cp +import sympy as sp + +import dace +from dace.codegen import cppunparse +from dace import symbolic +from dace.properties import (Property, make_properties, ReferenceProperty, + ShapeProperty, SubsetProperty, SymbolicProperty, + TypeClassProperty, DebugInfoProperty, + CodeProperty) + + +def validate_name(name): + if not isinstance(name, str): + return False + if re.match(r'^[a-zA-Z_][a-zA-Z_0-9]*$', name) is None: + return False + return True + + +@make_properties +class Data(object): + """ Data type descriptors that can be used as references to memory. + Examples: Arrays, Streams, custom arrays (e.g., sparse matrices). + """ + + dtype = TypeClassProperty() + shape = ShapeProperty() + transient = Property(dtype=bool) + storage = Property( + dtype=dace.types.StorageType, + desc="Storage location", + enum=dace.types.StorageType, + default=dace.types.StorageType.Default, + from_string=lambda x: types.StorageType[x]) + location = Property( + dtype=str, # Dict[str, symbolic] + desc='Full storage location identifier (e.g., rank, GPU ID)', + default='') + toplevel = Property( + dtype=bool, desc="Allocate array outside of state", default=False) + debuginfo = DebugInfoProperty() + + def __init__(self, dtype, shape, transient, storage, location, toplevel, + debuginfo): + self.dtype = dtype + self.shape = shape + self.transient = transient + self.storage = storage + self.location = location + self.toplevel = toplevel + self.debuginfo = debuginfo + self._validate() + + def validate(self): + """ Validate the correctness of this object. + Raises an exception on error. """ + self._validate() + + # Validation of this class is in a separate function, so that this + # class can call `_validate()` without calling the subclasses' + # `validate` function. + def _validate(self): + if any(not isinstance(s, (int, symbolic.SymExpr, symbolic.symbol, + symbolic.sympy.Basic)) for s in self.shape): + raise TypeError('Shape must be a list or tuple of integer values ' + 'or symbols') + return True + + def copy(self): + raise RuntimeError( + 'Data descriptors are unique and should not be copied') + + def is_equivalent(self, other): + """ Check for equivalence (shape and type) of two data descriptors. 
""" + raise NotImplementedError + + def signature(self, with_types=True, name=None): + """Returns a string for a C++ function signature (e.g., `int *A`). """ + raise NotImplementedError + + def __repr__(self): + return 'Abstract Data Container, DO NOT USE' + + +@make_properties +class Scalar(Data): + """ Data descriptor of a scalar value. """ + + allow_conflicts = Property(dtype=bool) + + def __init__(self, + dtype, + transient=False, + storage=dace.types.StorageType.Default, + allow_conflicts=False, + location='', + toplevel=False, + debuginfo=None): + self.allow_conflicts = allow_conflicts + shape = [1] + super(Scalar, self).__init__(dtype, shape, transient, storage, + location, toplevel, debuginfo) + + def __repr__(self): + return 'Scalar (dtype=%s)' % self.dtype + + def clone(self): + return Scalar(self.dtype, self.transient, self.storage, + self.allow_conflicts, self.location, self.toplevel, + self.debuginfo) + + @property + def strides(self): + return self.shape + + @property + def offset(self): + return [0] + + def is_equivalent(self, other): + if not isinstance(other, Scalar): + return False + if self.dtype != other.type: + return False + return True + + def signature(self, with_types=True, name=None): + if not with_types: return name + return str(self.dtype.ctype) + ' ' + name + + def sizes(self): + return None + + def covers_range(self, rng): + if len(rng) != 1: + return False + + rng = rng[0] + + try: + if (rng[1] - rng[0]) > rng[2]: + return False + except TypeError: # cannot determine truth value of Relational + pass + #print('WARNING: Cannot evaluate relational expression %s, assuming true.' % ((rng[1] - rng[0]) > rng[2]), + # 'If this expression is false, please refine symbol definitions in the program.') + + return True + + +def set_materialize_func(obj, val): + """ Change the storage type of an array with a materialize function to + immaterial. + """ + if val is not None: + if (obj.storage != dace.types.StorageType.Default + and obj.storage != dace.types.StorageType.Immaterial): + raise ValueError("Immaterial array must have immaterial storage, " + "but has: {}".format(storage)) + obj.storage = dace.types.StorageType.Immaterial + obj._materialize_func = val + + +@make_properties +class Array(Data): + """ Array/constant descriptor (dimensions, type and other properties). """ + + # Properties + allow_conflicts = Property(dtype=bool) + # TODO: Should we use a Code property here? 
+ materialize_func = Property( + dtype=str, allow_none=True, setter=set_materialize_func) + access_order = Property(dtype=tuple) + strides = Property(dtype=list) + offset = Property(dtype=list) + may_alias = Property( + dtype=bool, + default=False, + desc='This pointer may alias with other pointers in ' + 'the same function') + + def __init__(self, + dtype, + shape, + materialize_func=None, + transient=False, + allow_conflicts=False, + storage=dace.types.StorageType.Default, + location='', + access_order=None, + strides=None, + offset=None, + may_alias=False, + toplevel=False, + debuginfo=None): + + super(Array, self).__init__(dtype, shape, transient, storage, location, + toplevel, debuginfo) + + if shape is None: + raise IndexError('Shape must not be None') + + self.allow_conflicts = allow_conflicts + self.materialize_func = materialize_func + self.may_alias = may_alias + + if access_order is not None: + self.access_order = cp.copy(access_order) + else: + self.access_order = tuple(i for i in range(len(shape))) + + if strides is not None: + self.strides = cp.copy(strides) + else: + self.strides = cp.copy(list(shape)) + + if offset is not None: + self.offset = cp.copy(offset) + else: + self.offset = [0] * len(shape) + + self.validate() + + def __repr__(self): + return 'Array (dtype=%s, shape=%s)' % (self.dtype, self.shape) + + def clone(self): + return Array(self.dtype, self.shape, self.materialize_func, + self.transient, self.allow_conflicts, self.storage, + self.location, self.access_order, self.strides, + self.offset, self.may_alias, self.toplevel, + self.debuginfo) + + def validate(self): + super(Array, self).validate() + if len(self.access_order) != len(self.shape): + raise TypeError('Access order must be the same size as shape') + + if len(self.strides) != len(self.shape): + raise TypeError('Strides must be the same size as shape') + + if any(not isinstance(s, (int, symbolic.SymExpr, symbolic.symbol, + symbolic.sympy.Basic)) + for s in self.strides): + raise TypeError('Strides must be a list or tuple of integer ' + 'values or symbols') + + if len(self.offset) != len(self.shape): + raise TypeError('Offset must be the same size as shape') + + def covers_range(self, rng): + if len(rng) != len(self.shape): + return False + + for s, (rb, re, rs) in zip(self.shape, rng): + # Shape has to be positive + if isinstance(s, sympy.Basic): + olds = s + if 'positive' in s.assumptions0: + s = sympy.Symbol(str(s), **s.assumptions0) + else: + s = sympy.Symbol(str(s), positive=True, **s.assumptions0) + if isinstance(rb, sympy.Basic): + rb = rb.subs({olds: s}) + if isinstance(re, sympy.Basic): + re = re.subs({olds: s}) + if isinstance(rs, sympy.Basic): + rs = rs.subs({olds: s}) + + try: + if rb < 0: # Negative offset + return False + except TypeError: # cannot determine truth value of Relational + pass + #print('WARNING: Cannot evaluate relational expression %s, assuming true.' % (rb > 0), + # 'If this expression is false, please refine symbol definitions in the program.') + try: + if re > s: # Beyond shape + return False + except TypeError: # cannot determine truth value of Relational + pass + #print('WARNING: Cannot evaluate relational expression %s, assuming true.' 
% (re < s), + # 'If this expression is false, please refine symbol definitions in the program.') + + return True + + # Checks for equivalent shape and type + def is_equivalent(self, other): + if not isinstance(other, Array): + return False + + # Test type + if self.dtype != other.type: + return False + + # Test dimensionality + if len(self.shape) != len(other.shape): + return False + + # Test shape + for dim, otherdim in zip(self.shape, other.shape): + # If both are symbols, ensure equality + if symbolic.issymbolic(dim) and symbolic.issymbolic(otherdim): + if dim != otherdim: + return False + + # If one is a symbol and the other is a constant + # make sure they are equivalent + elif symbolic.issymbolic(otherdim): + if symbolic.eval(otherdim) != dim: + return False + elif symbolic.issymbolic(dim): + if symbolic.eval(dim) != otherdim: + return False + else: + # Any other case (constant vs. constant), check for equality + if otherdim != dim: + return False + return True + + def signature(self, with_types=True, name=None): + arrname = name + if self.materialize_func is not None: + arrname = '/* ' + arrname + ' (immaterial) */' + if not with_types: + return 'nullptr' + + if not with_types: + return arrname + if self.may_alias: + return str(self.dtype.ctype) + ' *' + arrname + return str(self.dtype.ctype) + ' * __restrict__ ' + arrname + + def sizes(self): + return [ + d.name if isinstance(d, symbolic.symbol) else str(d) + for d in self.shape + ] + + +@make_properties +class Stream(Data): + """ Stream (or stream array) data descriptor. """ + + # Properties + strides = Property(dtype=list) + offset = Property(dtype=list) + buffer_size = Property(dtype=int, desc="Size of internal buffer.") + veclen = Property( + dtype=int, desc="Vector length. Memlets must adhere to this.") + + def __init__(self, + dtype, + veclen, + buffer_size, + shape=None, + transient=False, + storage=dace.types.StorageType.Default, + location='', + strides=None, + offset=None, + toplevel=False, + debuginfo=None): + + if shape is None: + shape = (1, ) + + self.veclen = veclen + self.buffer_size = buffer_size + + if strides is not None: + if len(strides) != len(shape): + raise TypeError('Strides must be the same size as shape') + self.strides = cp.copy(strides) + else: + self.strides = cp.copy(list(shape)) + + if offset is not None: + if len(offset) != len(shape): + raise TypeError('Offset must be the same size as shape') + self.offset = cp.copy(offset) + else: + self.offset = [0] * len(shape) + + super(Stream, self).__init__(dtype, shape, transient, storage, + location, toplevel, debuginfo) + + def __repr__(self): + return 'Stream (dtype=%s, shape=%s)' % (self.dtype, self.shape) + + def clone(self): + return Stream(self.dtype, self.veclen, self.buffer_size, self.shape, + self.transient, self.storage, self.location, + self.strides, self.offset, self.toplevel, self.debuginfo) + + # Checks for equivalent shape and type + def is_equivalent(self, other): + if not isinstance(other, Stream): + return False + + # Test type + if self.dtype != other.dtype: + return False + + # Test dimensionality + if len(self.shape) != len(other.shape): + return False + + # Test shape + for dim, otherdim in zip(self.shape, other.shape): + # If both are symbols, ensure equality + if symbolic.issymbolic(dim) and symbolic.issymbolic(otherdim): + if dim != otherdim: + return False + + # If one is a symbol and the other is a constant + # make sure they are equivalent + elif symbolic.issymbolic(otherdim): + if symbolic.eval(otherdim) != dim: + return False + 
elif symbolic.issymbolic(dim): + if symbolic.eval(dim) != otherdim: + return False + else: + # Any other case (constant vs. constant), check for equality + if otherdim != dim: + return False + return True + + def signature(self, with_types=True, name=None): + if not with_types: return name + if self.storage in [ + dace.types.StorageType.GPU_Global, + dace.types.StorageType.GPU_Shared, + dace.types.StorageType.GPU_Stack + ]: + return 'dace::GPUStream<%s, %s> %s' % ( + str(self.dtype.ctype), 'true' + if sp.log(self.buffer_size, 2).is_Integer else 'false', name) + + return 'dace::Stream<%s> %s' % (str(self.dtype.ctype), name) + + def sizes(self): + return [ + d.name if isinstance(d, symbolic.symbol) else str(d) + for d in self.shape + ] + + def size_string(self): + return (" * ".join([ + cppunparse.pyexpr2cpp(dace.symbolic.symstr(s)) + for s in self.strides + ])) + + def is_stream_array(self): + return functools.reduce(lambda a, b: a * b, self.strides) != 1 + + def covers_range(self, rng): + if len(rng) != len(self.shape): + return False + + for s, (rb, re, rs) in zip(self.shape, rng): + # Shape has to be positive + if isinstance(s, sympy.Basic): + olds = s + if 'positive' in s.assumptions0: + s = sympy.Symbol(str(s), **s.assumptions0) + else: + s = sympy.Symbol(str(s), positive=True, **s.assumptions0) + if isinstance(rb, sympy.Basic): + rb = rb.subs({olds: s}) + if isinstance(re, sympy.Basic): + re = re.subs({olds: s}) + if isinstance(rs, sympy.Basic): + rs = rs.subs({olds: s}) + + try: + if rb < 0: # Negative offset + return False + except TypeError: # cannot determine truth value of Relational + pass + #print('WARNING: Cannot evaluate relational expression %s, assuming true.' % (rb > 0), + # 'If this expression is false, please refine symbol definitions in the program.') + try: + if re > s: # Beyond shape + return False + except TypeError: # cannot determine truth value of Relational + pass + #print('WARNING: Cannot evaluate relational expression %s, assuming true.' 
% (re < s), + # 'If this expression is false, please refine symbol definitions in the program.') + + return True diff --git a/dace/external/cub b/dace/external/cub new file mode 160000 index 0000000000..c3cceac115 --- /dev/null +++ b/dace/external/cub @@ -0,0 +1 @@ +Subproject commit c3cceac115c072fb63df1836ff46d8c60d9eb304 diff --git a/dace/external/hlslib b/dace/external/hlslib new file mode 160000 index 0000000000..628cd40a4a --- /dev/null +++ b/dace/external/hlslib @@ -0,0 +1 @@ +Subproject commit 628cd40a4ac5fe5dd2799030398fcb7a8072252c diff --git a/dace/external/moodycamel b/dace/external/moodycamel new file mode 160000 index 0000000000..dea078cf5b --- /dev/null +++ b/dace/external/moodycamel @@ -0,0 +1 @@ +Subproject commit dea078cf5b6e742cd67a0d725e36f872feca4de4 diff --git a/dace/frontend/__init__.py b/dace/frontend/__init__.py new file mode 100644 index 0000000000..e69de29bb2 diff --git a/dace/frontend/common/__init__.py b/dace/frontend/common/__init__.py new file mode 100644 index 0000000000..343f8cadd9 --- /dev/null +++ b/dace/frontend/common/__init__.py @@ -0,0 +1,5 @@ +from .op_impl import matrix_multiplication, matrix_multiplication_s +from .op_impl import scalar_array_multiplication, scalar_array_multiplication_s +from .op_impl import constant_array_multiplication +from .op_impl import matrix_transpose, matrix_transpose_s +from .op_impl import matrix_pointwise_op diff --git a/dace/frontend/common/op_impl.py b/dace/frontend/common/op_impl.py new file mode 100644 index 0000000000..bf7e617470 --- /dev/null +++ b/dace/frontend/common/op_impl.py @@ -0,0 +1,1731 @@ +''' DaCe SDFG linear algebra operation library. ''' + +import copy +import dace +import dace.sdfg as sd +import dace.subsets as sbs +from dace import symbolic +import typing + +State = dace.sdfg.SDFGState +Shape = typing.List[typing.Union[int, dace.symbol]] +Index = typing.List[typing.Union[int, str, dace.symbol]] +Node = dace.graph.nodes.Node +DNode = dace.graph.nodes.AccessNode + +# TODO: Most of the external operations here emit Z (complex double) ops, fix + + +# TODO: Refactor to use GPUTransformLocalStorage? +def gpu_transform_tasklet(sdfg, graph, tasklet_node): + """ Transforms a tasklet to run on the GPU. Adapted from + `GPUTransformLocalStorage`. 
+ @see: dace.transformation.dataflow.GPUTransformLocalStorage + """ + cnode = tasklet_node + exit_nodes = [tasklet_node] + + gpu_storage_types = [ + dace.types.StorageType.GPU_Global, dace.types.StorageType.GPU_Shared, + dace.types.StorageType.GPU_Stack + ] + + ####################################################### + # Add GPU copies of CPU arrays (i.e., not already on GPU) + + # First, understand which arrays to clone + all_out_edges = [] + for enode in exit_nodes: + all_out_edges.extend(list(graph.out_edges(enode))) + in_arrays_to_clone = set() + out_arrays_to_clone = set() + for e in graph.in_edges(cnode): + data_node = sd.find_input_arraynode(graph, e) + if data_node.desc(sdfg).storage not in gpu_storage_types: + in_arrays_to_clone.add((data_node, e.data)) + for e in all_out_edges: + data_node = sd.find_output_arraynode(graph, e) + if data_node.desc(sdfg).storage not in gpu_storage_types: + out_arrays_to_clone.add((data_node, e.data)) + + # Second, create a GPU clone of each array + # TODO: Overapproximate union of memlets + cloned_arrays = {} + in_cloned_arraynodes = {} + out_cloned_arraynodes = {} + for array_node, memlet in in_arrays_to_clone: + array = array_node.desc(sdfg) + cloned_name = 'gpu_' + array_node.data + for i, r in enumerate(memlet.bounding_box_size()): + size = symbolic.overapproximate(r) + try: + if int(size) == 1: + suffix = [] + for c in str(memlet.subset[i][0]): + if c.isalpha() or c.isdigit() or c == '_': + suffix.append(c) + elif c == '+': + suffix.append('p') + elif c == '-': + suffix.append('m') + elif c == '*': + suffix.append('t') + elif c == '/': + suffix.append('d') + cloned_name += '_' + ''.join(suffix) + except: + continue + if cloned_name in sdfg.arrays.keys(): + cloned_array = sdfg.arrays[cloned_name] + elif array_node.data in cloned_arrays: + cloned_array = cloned_arrays[array_node.data] + else: + full_shape = [] + for r in memlet.bounding_box_size(): + size = symbolic.overapproximate(r) + try: + full_shape.append(int(size)) + except: + full_shape.append(size) + actual_dims = [ + idx for idx, r in enumerate(full_shape) + if not (isinstance(r, int) and r == 1) + ] + if len(actual_dims) == 0: # abort + actual_dims = [len(full_shape) - 1] + if isinstance(array, dace.data.Scalar): + cloned_array = sdfg.add_array( + name=cloned_name, + shape=[1], + dtype=array.dtype, + transient=True, + storage=dace.types.StorageType.GPU_Global) + else: + cloned_array = sdfg.add_array( + name=cloned_name, + shape=[full_shape[d] for d in actual_dims], + dtype=array.dtype, + materialize_func=array.materialize_func, + transient=True, + storage=dace.types.StorageType.GPU_Global, + allow_conflicts=array.allow_conflicts, + access_order=tuple( + [array.access_order[d] for d in actual_dims]), + strides=[array.strides[d] for d in actual_dims], + offset=[array.offset[d] for d in actual_dims]) + cloned_arrays[array_node.data] = cloned_name + cloned_node = type(array_node)(cloned_name) + + in_cloned_arraynodes[array_node.data] = cloned_node + for array_node, memlet in out_arrays_to_clone: + array = array_node.desc(sdfg) + cloned_name = 'gpu_' + array_node.data + for i, r in enumerate(memlet.bounding_box_size()): + size = symbolic.overapproximate(r) + try: + if int(size) == 1: + suffix = [] + for c in str(memlet.subset[i][0]): + if c.isalpha() or c.isdigit() or c == '_': + suffix.append(c) + elif c == '+': + suffix.append('p') + elif c == '-': + suffix.append('m') + elif c == '*': + suffix.append('t') + elif c == '/': + suffix.append('d') + cloned_name += '_' + ''.join(suffix) + 
except: + continue + if cloned_name in sdfg.arrays.keys(): + cloned_array = sdfg.arrays[cloned_name] + elif array_node.data in cloned_arrays: + cloned_array = cloned_arrays[array_node.data] + else: + full_shape = [] + for r in memlet.bounding_box_size(): + size = symbolic.overapproximate(r) + try: + full_shape.append(int(size)) + except: + full_shape.append(size) + actual_dims = [ + idx for idx, r in enumerate(full_shape) + if not (isinstance(r, int) and r == 1) + ] + if len(actual_dims) == 0: # abort + actual_dims = [len(full_shape) - 1] + if isinstance(array, dace.data.Scalar): + cloned_array = sdfg.add_array( + name=cloned_name, + shape=[1], + dtype=array.dtype, + transient=True, + storage=dace.types.StorageType.GPU_Global) + else: + cloned_array = sdfg.add_array( + name=cloned_name, + shape=[full_shape[d] for d in actual_dims], + dtype=array.dtype, + materialize_func=array.materialize_func, + transient=True, + storage=dace.types.StorageType.GPU_Global, + allow_conflicts=array.allow_conflicts, + access_order=tuple( + [array.access_order[d] for d in actual_dims]), + strides=[array.strides[d] for d in actual_dims], + offset=[array.offset[d] for d in actual_dims]) + cloned_arrays[array_node.data] = cloned_name + cloned_node = type(array_node)(cloned_name) + cloned_node.setzero = True + + out_cloned_arraynodes[array_node.data] = cloned_node + + # Third, connect the cloned arrays to the originals + for array_name, node in in_cloned_arraynodes.items(): + graph.add_node(node) + is_scalar = isinstance(sdfg.arrays[array_name], dace.data.Scalar) + for edge in graph.in_edges(cnode): + if edge.data.data == array_name: + graph.remove_edge(edge) + newmemlet = copy.deepcopy(edge.data) + newmemlet.data = node.data + + if is_scalar: + newmemlet.subset = sbs.Indices([0]) + else: + offset = [] + lost_dims = [] + lost_ranges = [] + newsubset = [None] * len(edge.data.subset) + for ind, r in enumerate(edge.data.subset): + offset.append(r[0]) + if isinstance(edge.data.subset[ind], tuple): + begin = edge.data.subset[ind][0] - r[0] + end = edge.data.subset[ind][1] - r[0] + step = edge.data.subset[ind][2] + if begin == end: + lost_dims.append(ind) + lost_ranges.append((begin, end, step)) + else: + newsubset[ind] = (begin, end, step) + else: + newsubset[ind] -= r[0] + if len(lost_dims) == len(edge.data.subset): + newmemlet.subset = type( + edge.data.subset)([lost_ranges[-1]]) + else: + newmemlet.subset = type(edge.data.subset)( + [r for r in newsubset if r is not None]) + + graph.add_edge(node, edge.src_conn, edge.dst, edge.dst_conn, + newmemlet) + + edge.data.other_subset = newmemlet.subset + graph.add_edge(edge.src, None, node, None, edge.data) + for array_name, node in out_cloned_arraynodes.items(): + graph.add_node(node) + is_scalar = isinstance(sdfg.arrays[array_name], dace.data.Scalar) + for edge in all_out_edges: + if edge.data.data == array_name: + graph.remove_edge(edge) + newmemlet = copy.deepcopy(edge.data) + newmemlet.data = node.data + + if is_scalar: + newmemlet.subset = sbs.Indices([0]) + else: + offset = [] + lost_dims = [] + lost_ranges = [] + newsubset = [None] * len(edge.data.subset) + for ind, r in enumerate(edge.data.subset): + offset.append(r[0]) + if isinstance(edge.data.subset[ind], tuple): + begin = edge.data.subset[ind][0] - r[0] + end = edge.data.subset[ind][1] - r[0] + step = edge.data.subset[ind][2] + if begin == end: + lost_dims.append(ind) + lost_ranges.append((begin, end, step)) + else: + newsubset[ind] = (begin, end, step) + else: + newsubset[ind] -= r[0] + if len(lost_dims) == 
len(edge.data.subset): + newmemlet.subset = type( + edge.data.subset)([lost_ranges[-1]]) + else: + newmemlet.subset = type(edge.data.subset)( + [r for r in newsubset if r is not None]) + + graph.add_edge(edge.src, edge.src_conn, node, edge.dst_conn, + newmemlet) + + edge.data.data = node.data + edge.data.other_subset = edge.data.subset + edge.data.subset = newmemlet.subset + graph.add_edge(node, None, edge.dst, None, edge.data) + + +class ValidationError(Exception): + """ An exception raised when inputs are not validated in SDFG library + calls. """ + + def __init__(self, message): + super().__init__(message) + + +def validate_matrix_multiplication( + A_shape: Shape, + B_shape: Shape, + C_shape: Shape, + A_index: Index = None, + B_index: Index = None, + C_index: Index = None +) -> ((str, str, str), (str, str, str), (str, str, str), (str, str, str)): + """ Validates a matrix multiplication operation, based on the shapes and + indices of the arrays involved. Returns the ranges of the maps and + memlets at all levels as strings. + """ + + # Validate input + if len(A_shape) < 2: + raise ValidationError( + 'Array A has less than 2 dimensions: {}'.format(A_shape)) + A_mm_shape = A_shape[-2:] + if len(B_shape) < 2: + raise ValidationError( + 'Array B has less than 2 dimensions: {}'.format(B_shape)) + B_mm_shape = B_shape[-2:] + if A_mm_shape[-1] != B_mm_shape[0]: + raise ValidationError( + 'N-dimension mismatch between arrays A and B: {} != {}'.format( + A_mm_shape[-1], B_mm_shape[0])) + + # Dimension sizes and ranges + M = A_mm_shape[0] + N = A_mm_shape[-1] + K = B_mm_shape[-1] + M_range = '0:{}'.format(M) + N_range = '0:{}'.format(N) + K_range = '0:{}'.format(K) + + # Validate slices and set input array access ranges + A_outer_range = '{}, {}'.format(M_range, N_range) + A_middle_range = '{}, ik'.format(M_range) + A_inner_range = 'ii, ik' + if len(A_shape) > 2: + if A_index is None or len(A_index) != len(A_shape) - 2: + raise ValidationError( + 'Invalid slice {} for array A with dimensions {}'.format( + A_index, A_shape)) + A_index = [str(idx) for idx in A_index] + A_outer_range = '{}, {}'.format(', '.join(A_index), A_outer_range) + A_middle_range = '{}, {}'.format(', '.join(A_index), A_middle_range) + A_inner_range = '{}, {}'.format(', '.join(A_index), A_inner_range) + B_outer_range = '{}, {}'.format(N_range, K_range) + B_middle_range = 'ik, {}'.format(K_range) + B_inner_range = 'ik, ij' + if len(B_shape) > 2: + if B_index is None or len(B_index) != len(B_shape) - 2: + raise ValidationError( + 'Invalid slice {} for array B with dimensions {}'.format( + B_index, B_shape)) + B_index = [str(idx) for idx in B_index] + B_outer_range = '{}, {}'.format(', '.join(B_index), B_outer_range) + B_middle_range = '{}, {}'.format(', '.join(B_index), B_middle_range) + B_inner_range = '{}, {}'.format(', '.join(B_index), B_inner_range) + + # Validate output + C_mm_shape = [M, K] + if len(C_shape) < 2: + raise ValidationError( + 'Array C has less than 2 dimensions: {}'.format(C_shape)) + if list(C_shape[-2:]) != C_mm_shape: + raise ValidationError( + 'Shape mismatch in array C: expected {}, but got {}'.format( + C_mm_shape, C_shape[-2:])) + C_outer_range = '{}, {}'.format(M_range, K_range) + C_middle_range = '{}, {}'.format(M_range, K_range) + C_inner_range = 'ii, ij' + if len(C_shape) > 2: + if C_index is None or len(C_index) != len(C_shape) - 2: + raise ValidationError( + 'Invalid slice {} for array C with dimensions {}'.format( + C_index, C_shape)) + C_index = [str(idx) for idx in C_index] + C_outer_range = 
'{}, {}'.format(', '.join(C_index), C_outer_range) + C_middle_range = '{}, {}'.format(', '.join(C_index), C_middle_range) + C_inner_range = '{}, {}'.format(', '.join(C_index), C_inner_range) + + return ((M_range, N_range, K_range), (A_outer_range, A_middle_range, + A_inner_range), + (B_outer_range, B_middle_range, + B_inner_range), (C_outer_range, C_middle_range, C_inner_range)) + + +def matrix_multiplication(state: State, + A_src: Node, + A_node: DNode, + B_src: Node, + B_node: DNode, + C_dst: Node, + C_node: DNode, + accumulate: bool = False, + interchange: bool = True, + A_index: Index = None, + B_index: Index = None, + C_index: Index = None, + label: str = None): + """ Adds a matrix multiplication operation to an existing SDFG state. + @param A_src: The source node from which the memlet of matrix A is + connected. + @param A_node: The Access Node for matrix A. + @param B_src: The source node from which the memlet of matrix B is + connected. + @param B_node: The Access Node for matrix B. + @param C_dst: The destination node to which the memlet of matrix C is + connected. + @param C_node: The Access Node for matrix C. + @param accumulate: Whether to accumulate to C or store to it. + @param interchange: If True, interchanges the multiplication maps for + performance (in some cases). + @param A_index: Slice of matrix A to use for multiplication. + @param B_index: Slice of matrix B to use for multiplication. + @param C_index: Slice of matrix C to use for multiplication. + @param label: Optional label for the maps and tasklet. + """ + + # Validate input + sdfg = state.parent + map_ranges, A_ranges, B_ranges, C_ranges = validate_matrix_multiplication( + A_node.desc(sdfg).shape, + B_node.desc(sdfg).shape, + C_node.desc(sdfg).shape, A_index, B_index, C_index) + + # Extract ranges + M_range, N_range, K_range = map_ranges + A_outer_range, A_middle_range, A_inner_range = A_ranges + B_outer_range, B_middle_range, B_inner_range = B_ranges + C_outer_range, C_middle_range, C_inner_range = C_ranges + + # Set label + if label is None: + label = state.label + + # Create maps/tasklet + k_entry, k_exit = state.add_map( + name=label + '_' + 'k_map', + ndrange=dict(ik=N_range), + schedule=dace.types.ScheduleType.Sequential) + k_entry.in_connectors = {'IN_1', 'IN_2'} + k_entry.out_connectors = {'OUT_1', 'OUT_2'} + k_exit.in_connectors = {'IN_1'} + k_exit.out_connectors = {'OUT_1'} + ij_entry, ij_exit = state.add_map( + name=label + '_' + 'ij_map', ndrange=dict(ii=M_range, ij=K_range)) + tasklet = state.add_tasklet( + name=label + '_' + 'tasklet', + inputs={'a', 'b'}, + outputs={'c'}, + code='c = a * b') + ij_entry.in_connectors = {'IN_1', 'IN_2'} + ij_entry.out_connectors = {'OUT_1', 'OUT_2'} + ij_exit.in_connectors = {'IN_1'} + ij_exit.out_connectors = {'OUT_1'} + + # Add edges + if interchange: + state.add_edge(A_src, None, k_entry, 'IN_1', + dace.Memlet.simple(A_node, A_outer_range)) + state.add_edge(B_src, None, k_entry, 'IN_2', + dace.Memlet.simple(B_node, B_outer_range)) + state.add_edge(k_entry, 'OUT_1', ij_entry, 'IN_1', + dace.Memlet.simple(A_node, A_middle_range)) + state.add_edge(k_entry, 'OUT_2', ij_entry, 'IN_2', + dace.Memlet.simple(B_node, B_middle_range)) + state.add_edge(ij_entry, 'OUT_1', tasklet, 'a', + dace.Memlet.simple(A_node, A_inner_range)) + state.add_edge(ij_entry, 'OUT_2', tasklet, 'b', + dace.Memlet.simple(B_node, B_inner_range)) + wcr = 0 + if accumulate: + wcr = None + state.add_edge( + tasklet, 'c', ij_exit, 'IN_1', + dace.Memlet.simple( + C_node, + C_inner_range, + 
wcr_str='lambda x, y: x + y', + wcr_identity=wcr, + wcr_conflict=False)) + state.add_edge(ij_exit, 'OUT_1', k_exit, 'IN_1', + dace.Memlet.simple(C_node, C_middle_range)) + state.add_edge(k_exit, 'OUT_1', C_dst, None, + dace.Memlet.simple(C_node, C_outer_range)) + else: + state.add_edge(A_src, None, ij_entry, 'IN_1', + dace.Memlet.simple(A_node, A_outer_range)) + state.add_edge(B_src, None, ij_entry, 'IN_2', + dace.Memlet.simple(B_node, B_outer_range)) + state.add_edge(ij_entry, 'OUT_1', k_entry, 'IN_1', + dace.Memlet.simple(A_node, A_middle_range)) + state.add_edge(ij_entry, 'OUT_2', k_entry, 'IN_2', + dace.Memlet.simple(B_node, B_middle_range)) + state.add_edge(k_entry, 'OUT_1', tasklet, 'a', + dace.Memlet.simple(A_node, A_inner_range)) + state.add_edge(k_entry, 'OUT_2', tasklet, 'b', + dace.Memlet.simple(B_node, B_inner_range)) + wcr = 0 + if accumulate: + wcr = None + state.add_edge( + tasklet, 'c', k_exit, 'IN_1', + dace.Memlet.simple( + C_node, + C_inner_range, + wcr_str='lambda x, y: x + y', + wcr_identity=wcr, + wcr_conflict=False)) + state.add_edge(k_exit, 'OUT_1', ij_exit, 'IN_1', + dace.Memlet.simple(C_node, C_middle_range)) + state.add_edge(ij_exit, 'OUT_1', C_dst, None, + dace.Memlet.simple(C_node, C_outer_range)) + + +def matrix_multiplication_cublas(state: State, + A_src: Node, + A_node: DNode, + B_src: Node, + B_node: DNode, + C_dst: Node, + C_node: DNode, + accumulate: bool = False, + interchange: bool = True, + alpha: str = 'const_pone', + beta: str = 'const_zero', + A_index: Index = None, + B_index: Index = None, + C_index: Index = None, + label: str = None): + """ Adds a matrix multiplication operation to an existing SDFG state, + using CUBLAS as the implementation. + @param A_src: The source node from which the memlet of matrix A is + connected. + @param A_node: The Access Node for matrix A. + @param B_src: The source node from which the memlet of matrix B is + connected. + @param B_node: The Access Node for matrix B. + @param C_dst: The destination node to which the memlet of matrix C is + connected. + @param C_node: The Access Node for matrix C. + @param accumulate: Whether to accumulate to C or store to it. + @param interchange: If True, interchanges the multiplication maps for + performance (in some cases). + @param alpha: Alpha value for GEMM. + @param beta: Beta value for GEMM. + @param A_index: Slice of matrix A to use for multiplication. + @param B_index: Slice of matrix B to use for multiplication. + @param C_index: Slice of matrix C to use for multiplication. + @param label: Optional label for the maps and tasklet. 
+ """ + + # Validate input + sdfg = state.parent + map_ranges, A_ranges, B_ranges, C_ranges = validate_matrix_multiplication( + A_node.desc(sdfg).shape, + B_node.desc(sdfg).shape, + C_node.desc(sdfg).shape, A_index, B_index, C_index) + + # Extract ranges + M_range, N_range, K_range = map_ranges + A_outer_range, A_middle_range, A_inner_range = A_ranges + B_outer_range, B_middle_range, B_inner_range = B_ranges + C_outer_range, C_middle_range, C_inner_range = C_ranges + + # Set label + if label is None: + label = state.label + + # Create tasklet + tasklet = state.add_tasklet( + name=label + '_' + 'tasklet', + inputs={'a', 'b'}, + outputs={'c'}, + code=''' + //cuDoubleComplex alpha = make_cuDoubleComplex(1, 0); + //cuDoubleComplex beta = make_cuDoubleComplex(0, 0); + cublasSetStream(handle, __dace_current_stream); + cublasStatus_t status = cublasZgemm( + handle, + CUBLAS_OP_N, CUBLAS_OP_N, + bsize, bsize, bsize, + const_pone, + (cuDoubleComplex*)b, bsize, + (cuDoubleComplex*)a, bsize, + const_zero, + (cuDoubleComplex*)c, bsize + ); + ''', # cuBLAS is column-major, so we switch the arguments + language=dace.types.Language.CPP) + + state.add_edge(A_src, None, tasklet, 'a', + dace.Memlet.simple(A_node, A_outer_range)) + state.add_edge(B_src, None, tasklet, 'b', + dace.Memlet.simple(B_node, B_outer_range)) + state.add_edge(tasklet, 'c', C_dst, None, + dace.Memlet.simple(C_node, C_outer_range)) + + gpu_transform_tasklet(sdfg, state, tasklet) + + +def matrix_multiplication_cublas_v2(state: State, + A_src: Node, + A_node: DNode, + B_src: Node, + B_node: DNode, + C_src: Node, + C_src_node: DNode, + C_dst: Node, + C_dst_node: DNode, + accumulate: bool = False, + interchange: bool = True, + alpha: str = 'const_pone', + beta: str = 'const_zero', + A_index: Index = None, + B_index: Index = None, + C_index: Index = None, + label: str = None): + """ Adds a matrix multiplication operation to an existing SDFG state, + using CUBLAS as the implementation, and providing a separate source + and destination nodes for the output matrix. + @param A_src: The source node from which the memlet of matrix A is + connected. + @param A_node: The Access Node for matrix A. + @param B_src: The source node from which the memlet of matrix B is + connected. + @param B_node: The Access Node for matrix B. + @param C_src: The node from which the memlet of matrix C is + connected into the multiplication. + @param C_src_node: The input Access Node for matrix C. + @param C_dst: The node to which the memlet of matrix C is + connected out of the multiplication. + @param C_dst_node: The output Access Node for matrix C. + @param accumulate: Whether to accumulate to C or store to it. + @param interchange: If True, interchanges the multiplication maps for + performance (in some cases). + @param alpha: Alpha value for GEMM. + @param beta: Beta value for GEMM. + @param A_index: Slice of matrix A to use for multiplication. + @param B_index: Slice of matrix B to use for multiplication. + @param C_index: Slice of matrix C to use for multiplication. + @param label: Optional label for the maps and tasklet. 
+ """ + + # Validate input + sdfg = state.parent + map_ranges, A_ranges, B_ranges, C_ranges = validate_matrix_multiplication( + A_node.desc(sdfg).shape, + B_node.desc(sdfg).shape, + C_src_node.desc(sdfg).shape, A_index, B_index, C_index) + + # Extract ranges + M_range, N_range, K_range = map_ranges + A_outer_range, A_middle_range, A_inner_range = A_ranges + B_outer_range, B_middle_range, B_inner_range = B_ranges + C_outer_range, C_middle_range, C_inner_range = C_ranges + + # Set label + if label is None: + label = state.label + + # Create tasklet + tasklet = state.add_tasklet( + name=label + '_' + 'tasklet', + inputs={'a', 'b', 'cin'}, + outputs={'c'}, + code=''' + //cuDoubleComplex alpha = make_cuDoubleComplex(1, 0); + //cuDoubleComplex beta = make_cuDoubleComplex(0, 0); + cublasSetStream(handle, __dace_current_stream); + cublasStatus_t status = cublasZgemm( + handle, + CUBLAS_OP_N, CUBLAS_OP_N, + bsize, bsize, bsize, + {alpha}, + (cuDoubleComplex*)b, bsize, + (cuDoubleComplex*)a, bsize, + {beta}, + (cuDoubleComplex*)c, bsize + ); + '''.format( + alpha=alpha, + beta=beta), # cuBLAS is column-major, so we switch the arguments + language=dace.types.Language.CPP) + + state.add_edge(A_src, None, tasklet, 'a', + dace.Memlet.simple(A_node, A_outer_range)) + state.add_edge(B_src, None, tasklet, 'b', + dace.Memlet.simple(B_node, B_outer_range)) + state.add_edge(C_src, None, tasklet, 'cin', + dace.Memlet.simple(C_src_node, C_outer_range)) + state.add_edge(tasklet, 'c', C_dst, None, + dace.Memlet.simple(C_dst_node, C_outer_range)) + + gpu_transform_tasklet(sdfg, state, tasklet) + + +def matrix_multiplication_mkl(state: State, + A_src: Node, + A_node: DNode, + B_src: Node, + B_node: DNode, + C_dst: Node, + C_node: DNode, + accumulate: bool = False, + interchange: bool = True, + A_index: Index = None, + B_index: Index = None, + C_index: Index = None, + label: str = None): + """ Adds a matrix multiplication operation to an existing SDFG state, + using MKL as the implementation. + @param A_src: The source node from which the memlet of matrix A is + connected. + @param A_node: The Access Node for matrix A. + @param B_src: The source node from which the memlet of matrix B is + connected. + @param B_node: The Access Node for matrix B. + @param C_dst: The destination node to which the memlet of matrix C is + connected. + @param C_node: The Access Node for matrix C. + @param accumulate: Whether to accumulate to C or store to it. + @param interchange: If True, interchanges the multiplication maps for + performance (in some cases). + @param A_index: Slice of matrix A to use for multiplication. + @param B_index: Slice of matrix B to use for multiplication. + @param C_index: Slice of matrix C to use for multiplication. + @param label: Optional label for the maps and tasklet. 
+ """ + + # Validate input + sdfg = state.parent + map_ranges, A_ranges, B_ranges, C_ranges = validate_matrix_multiplication( + A_node.desc(sdfg).shape, + B_node.desc(sdfg).shape, + C_node.desc(sdfg).shape, A_index, B_index, C_index) + + # Extract ranges + M = A_node.desc(sdfg).shape[-2] + N = A_node.desc(sdfg).shape[-1] + K = B_node.desc(sdfg).shape[-1] + M_range, N_range, K_range = map_ranges + A_outer_range, A_middle_range, A_inner_range = A_ranges + B_outer_range, B_middle_range, B_inner_range = B_ranges + C_outer_range, C_middle_range, C_inner_range = C_ranges + + # Set label + if label is None: + label = state.label + + # Create tasklet + tasklet = state.add_tasklet( + name=label + '_' + 'tasklet', + inputs={'a', 'b'}, + outputs={'c'}, + code=''' + std::complex alpha(1, 0); + std::complex beta(0, 0); + char opa = 'N'; + char opb = 'N'; + zgemm( + &opa, &opb, + &{m}, &{n}, &{k}, + (MKL_Complex16*)&alpha, + (MKL_Complex16*)a, &{m}, + (MKL_Complex16*)b, &{n}, + (MKL_Complex16*)&beta, + (MKL_Complex16*)c, &{m} + ); + '''.format(m=M, n=N, k=K), + language=dace.types.Language.CPP) + + state.add_edge(A_src, None, tasklet, 'a', + dace.Memlet.simple(A_node, A_outer_range)) + state.add_edge(B_src, None, tasklet, 'b', + dace.Memlet.simple(B_node, B_outer_range)) + state.add_edge(tasklet, 'c', C_dst, None, + dace.Memlet.simple(C_node, C_outer_range)) + + +def matrix_multiplication_s(A_label: str, + A_shape: Shape, + A_type: dace.types.typeclass, + B_label: str, + B_shape: Shape, + B_type: dace.types.typeclass, + create_C: bool = True, + C_label: str = None, + C_shape: Shape = None, + C_type: dace.types.typeclass = None, + is_A_transient: bool = False, + is_B_transient: bool = False, + is_C_transient: bool = False, + accumulate: bool = False, + interchange: bool = True, + A_index: Index = None, + B_index: Index = None, + C_index: Index = None, + label: str = None) -> State: + """ Creates a new state with a matrix multiplication operation. 
""" + + # Set output attributes + if create_C: + if C_label is None: + C_label = A_label + B_label + if C_type is None: + C_type = A_type + C_shape = [A_shape[-2], B_shape[-1]] + else: + if C_shape is None: + raise ValidationError( + 'Array C is not transient, but its shape is not set') + + # Validate input + map_ranges, A_ranges, B_ranges, C_ranges = validate_matrix_multiplication( + A_shape, B_shape, C_shape, A_index, B_index, C_index) + + # Extract ranges + M_range, N_range, K_range = map_ranges + A_outer_range, A_middle_range, A_inner_range = A_ranges + B_outer_range, B_middle_range, B_inner_range = B_ranges + C_outer_range, C_middle_range, C_inner_range = C_ranges + + # Set label + if label is None: + label = A_label + B_label + + # Create state + state = State(label=label) + + # Create data nodes + A_node = state.add_array( + A_label, A_shape, A_type, transient=is_A_transient) + B_node = state.add_array( + B_label, B_shape, B_type, transient=is_B_transient) + C_node = state.add_array( + C_label, C_shape, C_type, transient=is_C_transient or create_C) + + # Create maps/tasklet + k_entry, k_exit = state.add_map( + name=label + '_' + 'k_map', + ndrange=dict(ik=N_range), + schedule=dace.types.ScheduleType.Sequential) + k_entry.in_connectors = {'IN_1', 'IN_2'} + k_entry.out_connectors = {'OUT_1', 'OUT_2'} + k_exit.in_connectors = {'IN_1'} + k_exit.out_connectors = {'OUT_1'} + ij_entry, ij_exit = state.add_map( + name=label + '_' + 'ij_map', ndrange=dict(ii=M_range, ij=K_range)) + tasklet = state.add_tasklet( + name=label + '_' + 'tasklet', + inputs={'a', 'b'}, + outputs={'c'}, + code='c = a * b') + ij_entry.in_connectors = {'IN_1', 'IN_2'} + ij_entry.out_connectors = {'OUT_1', 'OUT_2'} + ij_exit.in_connectors = {'IN_1'} + ij_exit.out_connectors = {'OUT_1'} + + # Add edges + if interchange: + state.add_edge(A_node, None, k_entry, 'IN_1', + dace.Memlet.simple(A_node, A_outer_range)) + state.add_edge(B_node, None, k_entry, 'IN_2', + dace.Memlet.simple(B_node, B_outer_range)) + state.add_edge(k_entry, 'OUT_1', ij_entry, 'IN_1', + dace.Memlet.simple(A_node, A_middle_range)) + state.add_edge(k_entry, 'OUT_2', ij_entry, 'IN_2', + dace.Memlet.simple(B_node, B_middle_range)) + state.add_edge(ij_entry, 'OUT_1', tasklet, 'a', + dace.Memlet.simple(A_node, A_inner_range)) + state.add_edge(ij_entry, 'OUT_2', tasklet, 'b', + dace.Memlet.simple(B_node, B_inner_range)) + wcr = 0 + if accumulate: + wcr = None + state.add_edge( + tasklet, 'c', ij_exit, 'IN_1', + dace.Memlet.simple( + C_node, + C_inner_range, + wcr_str='lambda x, y: x + y', + wcr_identity=wcr, + wcr_conflict=False)) + state.add_edge(ij_exit, 'OUT_1', k_exit, 'IN_1', + dace.Memlet.simple(C_node, C_middle_range)) + state.add_edge(k_exit, 'OUT_1', C_node, None, + dace.Memlet.simple(C_node, C_outer_range)) + else: + state.add_edge(A_node, None, ij_entry, 'IN_1', + dace.Memlet.simple(A_node, A_outer_range)) + state.add_edge(B_node, None, ij_entry, 'IN_2', + dace.Memlet.simple(B_node, B_outer_range)) + state.add_edge(ij_entry, 'OUT_1', k_entry, 'IN_1', + dace.Memlet.simple(A_node, A_middle_range)) + state.add_edge(ij_entry, 'OUT_2', k_entry, 'IN_2', + dace.Memlet.simple(B_node, B_middle_range)) + state.add_edge(k_entry, 'OUT_1', tasklet, 'a', + dace.Memlet.simple(A_node, A_inner_range)) + state.add_edge(k_entry, 'OUT_2', tasklet, 'b', + dace.Memlet.simple(B_node, B_inner_range)) + wcr = 0 + if accumulate: + wcr = None + state.add_edge( + tasklet, 'c', k_exit, 'IN_1', + dace.Memlet.simple( + C_node, + C_inner_range, + wcr_str='lambda x, y: x + y', + 
wcr_identity=wcr, + wcr_conflict=False)) + state.add_edge(k_exit, 'OUT_1', ij_exit, 'IN_1', + dace.Memlet.simple(C_node, C_middle_range)) + state.add_edge(ij_exit, 'OUT_1', C_node, None, + dace.Memlet.simple(C_node, C_outer_range)) + + return state + + +def validate_scalar_array_multiplication( + alpha_shape: Shape, + A_shape: Shape, + B_shape: Shape, + alpha_index: Index = None, + A_index: Index = None, + B_index: Index = None +) -> (typing.Dict[str, str], (str, str), (str, str), (str, str)): + """ Validates a scalar-array multiplication operation, based on the shapes + and indices of the arrays involved. Returns the ranges of the maps and + memlets at all levels as strings. """ + + # Validate data + if alpha_shape != [1]: + if alpha_index is None or len(alpha_shape) != len(alpha_index): + raise ValidationError( + 'Slice of alpha is not a scalar: {}, {}'.format( + alpha_shape, alpha_index)) + if A_index is not None: + true_A_shape = A_shape[len(A_index):] + else: + true_A_shape = A_shape + if B_index is not None: + true_B_shape = B_shape[len(B_index):] + else: + true_B_shape = B_shape + if true_A_shape != true_B_shape: + raise ValidationError('Dimension mismatch between arrays A and B: ' + '{}({}) != {}({})'.format( + true_A_shape, A_shape, true_B_shape, + B_shape)) + + # Map ranges + map_ranges = dict() + for i, dim in enumerate(true_A_shape): + map_ranges['i{}'.format(i)] = '0:{}'.format(dim) + + # Memlet ranges + alpha_outer_range = '0' + alpha_inner_range = '0' + if alpha_index is not None: + alpha_index = [str(idx) for idx in alpha_index] + alpha_outer_range = ', '.join(alpha_index) + alpha_inner_range = ', '.join(alpha_index) + A_outer_range = ', '.join(map_ranges.values()) + A_inner_range = ', '.join(map_ranges.keys()) + if A_index is not None: + A_index = [str(idx) for idx in A_index] + A_outer_range = '{}, {}'.format(', '.join(A_index), A_outer_range) + A_inner_range = '{}, {}'.format(', '.join(A_index), A_inner_range) + B_outer_range = ', '.join(map_ranges.values()) + B_inner_range = ', '.join(map_ranges.keys()) + if B_index is not None: + B_index = [str(idx) for idx in B_index] + B_outer_range = '{}, {}'.format(', '.join(B_index), B_outer_range) + B_inner_range = '{}, {}'.format(', '.join(B_index), B_inner_range) + + return (map_ranges, (alpha_outer_range, alpha_inner_range), + (A_outer_range, A_inner_range), (B_outer_range, B_inner_range)) + + +def scalar_array_multiplication(state: State, + alpha_src: Node, + alpha_node: DNode, + A_src: Node, + A_node: DNode, + B_dst: Node, + B_node: DNode, + accumulate: bool = False, + wcr_conflict: bool = False, + alpha_index: Index = None, + A_index: Index = None, + B_index: Index = None, + label: str = None): + """ Adds a scalar-array multiplication operation to an exisiting state. 
""" + + # Validate data + sdfg = state.parent + alpha_shape = [1] + if hasattr(alpha_node, 'shape'): + alpha_shape = alpha_node.shape + ranges = validate_scalar_array_multiplication( + alpha_shape, + A_node.desc(sdfg).shape, + B_node.desc(sdfg).shape, alpha_index, A_index, B_index) + map_ranges, alpha_ranges, A_ranges, B_ranges = ranges + alpha_outer_range, alpha_inner_range = alpha_ranges + A_outer_range, A_inner_range = A_ranges + A_outer_range, A_inner_range = A_ranges + B_outer_range, B_inner_range = B_ranges + + # Set label + if label is None: + label = state.label + + # Create map/tasklet + map_entry, map_exit = state.add_map( + name=label + '_map', ndrange=map_ranges) + map_entry.in_connectors = {'IN_1', 'IN_2'} + map_entry.out_connectors = {'OUT_1', 'OUT_2'} + map_exit.in_connectors = {'IN_1'} + map_exit.out_connectors = {'OUT_1'} + tasklet = state.add_tasklet( + name=label + '_tasklet', + inputs={'scalar', 'a'}, + outputs={'b'}, + code='b = scalar * a') + + # Add edges + state.add_edge(alpha_src, None, map_entry, 'IN_1', + dace.Memlet.simple(alpha_node, alpha_outer_range)) + state.add_edge(A_src, None, map_entry, 'IN_2', + dace.Memlet.simple(A_node, A_outer_range)) + state.add_edge(map_exit, 'OUT_1', B_dst, None, + dace.Memlet.simple(B_node, B_outer_range)) + state.add_edge(map_entry, 'OUT_1', tasklet, 'scalar', + dace.Memlet.simple(alpha_node, alpha_inner_range)) + state.add_edge(map_entry, 'OUT_2', tasklet, 'a', + dace.Memlet.simple(A_node, A_inner_range)) + if accumulate: + state.add_edge( + tasklet, 'b', map_exit, 'IN_1', + dace.Memlet.simple( + B_node, + B_inner_range, + wcr_str='lambda x, y: x + y', + wcr_identity=None, + wcr_conflict=wcr_conflict)) + else: + state.add_edge(tasklet, 'b', map_exit, 'IN_1', + dace.Memlet.simple(B_node, B_inner_range)) + + +def scalar_array_multiplication_s(alpha_label: str, + alpha_shape: Shape, + alpha_type: dace.types.typeclass, + A_label: str, + A_shape: Shape, + A_type: dace.types.typeclass, + create_B: bool = True, + B_label: str = None, + B_shape: Shape = None, + B_type: dace.types.typeclass = None, + is_alpha_transient: bool = False, + is_A_transient: bool = False, + is_B_transient: bool = False, + accumulate: bool = False, + wcr_conflict: bool = False, + alpha_index: Index = None, + A_index: Index = None, + B_index: Index = None, + label: str = None) -> State: + """ Creates a new state with a scalar-array multiplication operation. 
""" + + # Set output attributes + if create_B: + if B_label is None: + B_label = alpha_label + A_label + if B_type is None: + B_type = A_type + B_shape = A_shape + else: + if B_shape is None: + raise ValidationError( + 'Array B is not transient, but its shape is not set') + + # Validate data + ranges = validate_scalar_array_multiplication( + alpha_shape, A_shape, B_shape, alpha_index, A_index, B_index) + map_ranges, alpha_ranges, A_ranges, B_ranges = ranges + alpha_outer_range, alpha_inner_range = alpha_ranges + A_outer_range, A_inner_range = A_ranges + A_outer_range, A_inner_range = A_ranges + B_outer_range, B_inner_range = B_ranges + + # Set label + if label is None: + label = alpha_label + A_label + + # Create state + state = State(label=label) + + # Create data nodes + alpha_node = state.add_array( + alpha_label, alpha_shape, alpha_type, transient=is_alpha_transient) + A_node = state.add_array( + A_label, A_shape, A_type, transient=is_A_transient) + B_node = state.add_array( + B_label, B_shape, B_type, transient=is_B_transient or create_B) + + # Create map/tasklet + map_entry, map_exit = state.add_map( + name=label + '_map', ndrange=map_ranges) + map_entry.in_connectors = {'IN_1', 'IN_2'} + map_entry.out_connectors = {'OUT_1', 'OUT_2'} + map_exit.in_connectors = {'IN_1'} + map_exit.out_connectors = {'OUT_1'} + tasklet = state.add_tasklet( + name=label + '_tasklet', + inputs={'scalar', 'a'}, + outputs={'b'}, + code='b = scalar * a') + + # Add edges + state.add_edge(alpha_node, None, map_entry, 'IN_1', + dace.Memlet.simple(alpha_node, alpha_outer_range)) + state.add_edge(A_node, None, map_entry, 'IN_2', + dace.Memlet.simple(A_node, A_outer_range)) + state.add_edge(map_exit, 'OUT_1', B_node, None, + dace.Memlet.simple(B_node, B_outer_range)) + state.add_edge(map_entry, 'OUT_1', tasklet, 'scalar', + dace.Memlet.simple(alpha_node, alpha_inner_range)) + state.add_edge(map_entry, 'OUT_2', tasklet, 'a', + dace.Memlet.simple(A_node, A_inner_range)) + if accumulate: + state.add_edge( + tasklet, 'b', map_exit, 'IN_1', + dace.Memlet.simple( + B_node, + B_inner_range, + wcr_str='lambda x, y: x + y', + wcr_identity=None, + wcr_conflict=wcr_conflict)) + else: + state.add_edge(tasklet, 'b', map_exit, 'IN_1', + dace.Memlet.simple(B_node, B_inner_range)) + + return state + + +def constant_array_multiplication(state: State, + constant, + A_src: Node, + A_node: DNode, + B_dst: Node, + B_node: DNode, + accumulate: bool = False, + A_index: Index = None, + B_index: Index = None, + label: str = None): + """ Adds a scalar-array multiplication operation to an exisiting state. 
""" + + # Validate data + # ranges = validate_scalar_array_multiplication( + # [1], A_node.shape, B_node.shape, + # None, A_index, B_index + # ) + sdfg = state.parent + ranges = validate_scalar_array_multiplication([1], + A_node.desc(sdfg).shape, + B_node.desc(sdfg).shape, + None, A_index, B_index) + map_ranges, _, A_ranges, B_ranges = ranges + A_outer_range, A_inner_range = A_ranges + B_outer_range, B_inner_range = B_ranges + + # Set label + if label is None: + label = state.label + + # Create map/tasklet + map_entry, map_exit = state.add_map( + name=label + '_map', ndrange=map_ranges) + map_entry.in_connectors = {'IN_1'} + map_entry.out_connectors = {'OUT_1'} + map_exit.in_connectors = {'IN_1'} + map_exit.out_connectors = {'OUT_1'} + tasklet = state.add_tasklet( + name=label + '_tasklet', + inputs={'a'}, + outputs={'b'}, + code='b = {} * a'.format(constant)) + + # Add edges + state.add_edge(A_src, None, map_entry, 'IN_1', + dace.Memlet.simple(A_node, A_outer_range)) + state.add_edge(map_exit, 'OUT_1', B_dst, None, + dace.Memlet.simple(B_node, B_outer_range)) + state.add_edge(map_entry, 'OUT_1', tasklet, 'a', + dace.Memlet.simple(A_node, A_inner_range)) + if accumulate: + state.add_edge( + tasklet, 'b', map_exit, 'IN_1', + dace.Memlet.simple( + B_node, + B_inner_range, + wcr_str='lambda x, y: x + y', + wcr_identity=None, + wcr_conflict=False)) + else: + state.add_edge(tasklet, 'b', map_exit, 'IN_1', + dace.Memlet.simple(B_node, B_inner_range)) + + +def unary_array_op(state: State, + A_src: Node, + A_node: DNode, + B_dst: Node, + B_node: DNode, + code: str, + lang=dace.types.Language.Python, + accumulate: bool = False, + A_index: Index = None, + B_index: Index = None, + label: str = None): + """ Adds a unary array operation to an exisiting state. """ + + # Validate data + sdfg = state.parent + ranges = validate_scalar_array_multiplication([1], + A_node.desc(sdfg).shape, + B_node.desc(sdfg).shape, + None, A_index, B_index) + map_ranges, _, A_ranges, B_ranges = ranges + A_outer_range, A_inner_range = A_ranges + B_outer_range, B_inner_range = B_ranges + + # Set label + if label is None: + label = state.label + + # Create map/tasklet + map_entry, map_exit = state.add_map( + name=label + '_map', ndrange=map_ranges) + map_entry.in_connectors = {'IN_1'} + map_entry.out_connectors = {'OUT_1'} + map_exit.in_connectors = {'IN_1'} + map_exit.out_connectors = {'OUT_1'} + tasklet = state.add_tasklet( + name=label + '_tasklet', + inputs={'a'}, + outputs={'b'}, + code=code, + language=lang) + + # Add edges + state.add_edge(A_src, None, map_entry, 'IN_1', + dace.Memlet.simple(A_node, A_outer_range)) + state.add_edge(map_exit, 'OUT_1', B_dst, None, + dace.Memlet.simple(B_node, B_outer_range)) + state.add_edge(map_entry, 'OUT_1', tasklet, 'a', + dace.Memlet.simple(A_node, A_inner_range)) + if accumulate: + state.add_edge( + tasklet, 'b', map_exit, 'IN_1', + dace.Memlet.simple( + B_node, + B_inner_range, + wcr_str='lambda x, y: x + y', + wcr_identity=None, + wcr_conflict=False)) + else: + state.add_edge(tasklet, 'b', map_exit, 'IN_1', + dace.Memlet.simple(B_node, B_inner_range)) + + +def validate_matrix_transpose( + A_shape: Shape, + B_shape: Shape, + A_index: Index = None, + B_index: Index = None +) -> (typing.Dict[str, str], (str, str), (str, str)): + """ Validates a matrix transpose operation, based on the shapes and indices + of the arrays involved. Returns the ranges of the maps and memlets at + all levels as strings. 
""" + + # Validate data + if len(A_shape) < 2: + raise ValidationError( + 'Array A has less than 2 dimensions: {}'.format(A_shape)) + A_tr_shape = A_shape[-2:] + if len(B_shape) < 2: + raise ValidationError( + 'Array B has less than 2 dimensions: {}'.format(B_shape)) + B_tr_shape = B_shape[-2:] + if A_tr_shape[0] != B_tr_shape[-1] or A_tr_shape[-1] != B_tr_shape[0]: + raise ValidationError( + 'Dimension mismatch between arrays A and B: {} != {}'.format( + A_tr_shape, B_tr_shape)) + + # Map ranges + map_ranges = dict( + ii='0:{}'.format(A_tr_shape[0]), ij='0:{}'.format(A_tr_shape[-1])) + + # Validate slices and set array access ranges + A_outer_range = '0:{}, 0:{}'.format(A_tr_shape[0], A_tr_shape[-1]) + A_inner_range = 'ii, ij' + if len(A_shape) > 2: + if A_index is None or len(A_index) != len(A_shape) - 2: + raise ValidationError( + 'Invalid slice {} for array A with dimensions {}'.format( + A_index, A_shape)) + A_index = [str(idx) for idx in A_index] + A_outer_range = '{}, {}'.format(', '.join(A_index), A_outer_range) + A_inner_range = '{}, {}'.format(', '.join(A_index), A_inner_range) + B_outer_range = '0:{}, 0:{}'.format(A_tr_shape[-1], A_tr_shape[0]) + B_inner_range = 'ij, ii' + if len(B_shape) > 2: + if B_index is None or len(B_index) != len(B_shape) - 2: + raise ValidationError( + 'Invalid slice {} for array B with dimensions {}'.format( + B_index, B_shape)) + B_index = [str(idx) for idx in B_index] + B_outer_range = '{}, {}'.format(', '.join(B_index), B_outer_range) + B_inner_range = '{}, {}'.format(', '.join(B_index), B_inner_range) + + return (map_ranges, (A_outer_range, A_inner_range), (B_outer_range, + B_inner_range)) + + +def matrix_transpose(state: State, + A_src: Node, + A_node: DNode, + B_dst: Node, + B_node: DNode, + A_index: Index = None, + B_index: Index = None, + code: str = None, + lang=dace.types.Language.Python, + label: str = None): + """ Adds a matrix transpose operation to an existing state. """ + + # Validate data + sdfg = state.parent + map_ranges, A_ranges, B_ranges = validate_matrix_transpose( + A_node.desc(sdfg).shape, + B_node.desc(sdfg).shape, A_index, B_index) + A_outer_range, A_inner_range = A_ranges + B_outer_range, B_inner_range = B_ranges + + # Set label + if label is None: + label = state.label + + # Create map/tasklet + if code is None: + code = 'b = a' + _, map_entry, map_exit = state.add_mapped_tasklet( + name=label, + map_ranges=map_ranges, + inputs=dict(a=dace.Memlet.simple(A_node, A_inner_range)), + outputs=dict(b=dace.Memlet.simple(B_node, B_inner_range)), + code=code, + language=lang) + + # Add edges + state.add_nedge(A_src, map_entry, dace.Memlet.simple( + A_node, A_outer_range)) + state.add_nedge(map_exit, B_dst, dace.Memlet.simple(B_node, B_outer_range)) + + return state + + +def matrix_transpose_double(state: State, + A_src: Node, + A_node: DNode, + B_dst: Node, + B_node: DNode, + C_dst: Node, + C_node: DNode, + A_index: Index = None, + B_index: Index = None, + C_index: Index = None, + code: str = None, + lang=dace.types.Language.Python, + label: str = None): + """ Adds a matrix transpose operation, which transposes to two different + matrices, to an existing state. 
""" + + # Validate data + sdfg = state.parent + map_ranges, A_ranges, B_ranges = validate_matrix_transpose( + A_node.desc(sdfg).shape, + B_node.desc(sdfg).shape, A_index, B_index) + A_outer_range, A_inner_range = A_ranges + B_outer_range, B_inner_range = B_ranges + _, _, C_ranges = validate_matrix_transpose( + A_node.desc(sdfg).shape, + C_node.desc(sdfg).shape, A_index, C_index) + C_outer_range, C_inner_range = C_ranges + + # Set label + if label is None: + label = state.label + + # Create map/tasklet + if code is None: + code = ''' +b = a +c = a + ''' + _, map_entry, map_exit = state.add_mapped_tasklet( + name=label, + map_ranges=map_ranges, + inputs=dict(a=dace.Memlet.simple(A_node, A_inner_range)), + outputs=dict( + b=dace.Memlet.simple(B_node, B_inner_range), + c=dace.Memlet.simple(C_node, C_inner_range), + ), + code=code, + language=lang) + + # Add edges + state.add_nedge(A_src, map_entry, dace.Memlet.simple( + A_node, A_outer_range)) + state.add_nedge(map_exit, B_dst, dace.Memlet.simple(B_node, B_outer_range)) + state.add_nedge(map_exit, C_dst, dace.Memlet.simple(C_node, C_outer_range)) + + return state + + +def matrix_transpose_s(A_label: str, + A_shape: Shape, + A_type: dace.types.typeclass, + create_B: bool = True, + B_label: str = None, + B_shape: Shape = None, + B_type: dace.types.typeclass = None, + is_alpha_transient: bool = False, + is_A_transient: bool = False, + is_B_transient: bool = False, + A_index: Index = None, + B_index: Index = None, + label: str = None) -> State: + """ Creates a new state with a matrix transpose operation. """ + + # Set output attributes + if create_B: + if B_label is None: + B_label = A_label + '^T' + if B_type is None: + B_type = A_type + B_shape = list(A_shape).reverse() + else: + if B_shape is None: + raise ValidationError( + 'Array B is not transient, but its shape is not set') + + # Validate data + map_ranges, A_ranges, B_ranges = validate_matrix_transpose( + A_shape, B_shape, A_index, B_index) + A_outer_range, A_inner_range = A_ranges + B_outer_range, B_inner_range = B_ranges + + # Set label + if label is None: + label = A_label + '^T' + + # Create state + state = State(label=label) + + # Create datanodes + A_node = state.add_array( + A_label, A_shape, A_type, transient=is_A_transient) + B_node = state.add_array( + B_label, B_shape, B_type, transient=is_B_transient or create_B) + + # Create map/tasklet + _, map_entry, map_exit = state.add_mapped_tasklet( + name=label, + map_ranges=map_ranges, + inputs=dict(a=dace.Memlet.simple(A_node, A_inner_range)), + outputs=dict(b=dace.Memlet.simple(B_node, B_inner_range)), + code='b = a') + + # Add edges + state.add_nedge(A_node, map_entry, dace.Memlet.simple( + A_node, A_outer_range)) + state.add_nedge(map_exit, B_node, dace.Memlet.simple( + B_node, B_outer_range)) + + return state + + +def validate_matrix_pointwise_op( + A_shape: Shape, + B_shape: Shape, + C_shape: Shape, + reduce: bool = False, + A_index: Index = None, + B_index: Index = None, + C_index: Index = None +) -> (typing.Dict[str, str], (str, str), (str, str), (str, str)): + """ Validates a point-wise matrix operation. 
""" + + # Validate data + if A_index is not None: + true_A_shape = A_shape[len(A_index):] + else: + true_A_shape = A_shape + if B_index is not None: + true_B_shape = B_shape[len(B_index):] + else: + true_B_shape = B_shape + if true_A_shape != true_B_shape: + raise ValidationError('Dimension mismatch between arrays A and B: ' + '{}({}) != {}({})'.format( + true_A_shape, A_shape, true_B_shape, + B_shape)) + if reduce: + if C_index is None or len(C_shape) != len(C_index): + raise ValidationError( + 'Point-wise matrix operation result cannot be reduced: ' + '{}({})'.format(C_shape, C_index)) + else: + if C_index is not None: + true_C_shape = C_shape[len(C_index):] + else: + true_C_shape = C_shape + if true_A_shape != true_B_shape: + raise ValidationError('Dimension mismatch between arrays A and C: ' + '{}({}) != {}({})'.format( + true_A_shape, A_shape, true_C_shape, + C_shape)) + + # Map ranges + map_ranges = dict() + for i, dim in enumerate(true_A_shape): + map_ranges['i{}'.format(i)] = '0:{}'.format(dim) + + # Memlet ranges + A_outer_range = ', '.join(map_ranges.values()) + A_inner_range = ', '.join(map_ranges.keys()) + if A_index is not None: + A_index = [str(idx) for idx in A_index] + A_outer_range = '{}, {}'.format(', '.join(A_index), A_outer_range) + A_inner_range = '{}, {}'.format(', '.join(A_index), A_inner_range) + B_outer_range = ', '.join(map_ranges.values()) + B_inner_range = ', '.join(map_ranges.keys()) + if B_index is not None: + B_index = [str(idx) for idx in B_index] + B_outer_range = '{}, {}'.format(', '.join(B_index), B_outer_range) + B_inner_range = '{}, {}'.format(', '.join(B_index), B_inner_range) + if reduce: + C_index = [str(idx) for idx in C_index] + C_outer_range = ', '.join(C_index) + C_inner_range = ', '.join(C_index) + else: + C_outer_range = ', '.join(map_ranges.values()) + C_inner_range = ', '.join(map_ranges.keys()) + if C_index is not None: + C_index = [str(idx) for idx in C_index] + C_outer_range = '{}, {}'.format(', '.join(C_index), C_outer_range) + C_inner_range = '{}, {}'.format(', '.join(C_index), C_inner_range) + + return (map_ranges, (A_outer_range, A_inner_range), + (B_outer_range, B_inner_range), (C_outer_range, C_inner_range)) + + +def matrix_pointwise_op(state: State, + A_src: Node, + A_node: DNode, + B_src: Node, + B_node: DNode, + C_dst: Node, + C_node: DNode, + op: str, + reduce: bool = False, + reduce_op: str = None, + accumulate: bool = False, + A_index: Index = None, + B_index: Index = None, + C_index: Index = None, + label: str = None): + """ Adds a matrix point-wise operation to an existing state. 
""" + + # Validate data + sdfg = state.parent + C_shape = None + if reduce and not hasattr(C_node.desc(sdfg), 'shape'): + C_shape = [1] + else: + C_shape = C_node.desc(sdfg).shape + map_ranges, A_ranges, B_ranges, C_ranges = validate_matrix_pointwise_op( + A_node.desc(sdfg).shape, + B_node.desc(sdfg).shape, C_shape, reduce, A_index, B_index, C_index) + A_outer_range, A_inner_range = A_ranges + B_outer_range, B_inner_range = B_ranges + C_outer_range, C_inner_range = C_ranges + + # Set label + if label is None: + label = state.label + + # Create map/tasklet + if reduce: + schedule = dace.types.ScheduleType.Sequential + else: + schedule = dace.types.ScheduleType.Default + map_entry, map_exit = state.add_map( + name=label + '_map', ndrange=map_ranges, schedule=schedule) + map_entry.in_connectors = {'IN_1', 'IN_2'} + map_entry.out_connectors = {'OUT_1', 'OUT_2'} + map_exit.in_connectors = {'IN_1'} + map_exit.out_connectors = {'OUT_1'} + tasklet = state.add_tasklet( + name=label + '_tasklet', + inputs={'a', 'b'}, + outputs={'c'}, + code='c = a ' + op + ' b') + + # Add edges + state.add_edge(A_src, None, map_entry, 'IN_1', + dace.Memlet.simple(A_node, A_outer_range)) + state.add_edge(B_src, None, map_entry, 'IN_2', + dace.Memlet.simple(B_node, B_outer_range)) + state.add_edge(map_exit, 'OUT_1', C_dst, None, + dace.Memlet.simple(C_node, C_outer_range)) + state.add_edge(map_entry, 'OUT_1', tasklet, 'a', + dace.Memlet.simple(A_node, A_inner_range)) + state.add_edge(map_entry, 'OUT_2', tasklet, 'b', + dace.Memlet.simple(B_node, B_inner_range)) + if reduce: + wcr = 0 + if accumulate: + wcr = None + state.add_edge( + tasklet, 'c', map_exit, 'IN_1', + dace.Memlet.simple( + C_node, + C_inner_range, + wcr_str='lambda x, y: x ' + reduce_op + ' y', + wcr_identity=wcr, + wcr_conflict=False)) + else: + state.add_edge(tasklet, 'c', map_exit, 'IN_1', + dace.Memlet.simple(C_node, C_inner_range)) + + +def csr2dense_cusparse(state: State, val: DNode, rowptr: DNode, colind: DNode, + dense: DNode): + """ Adds a CSR->Dense data layout transformation to a state, using + CUSPARSE for the implementation. """ + sdfg = state.parent + dense_array = dense.desc(sdfg) + d_shape = dense_array.shape + d_dtype = dense_array.dtype + T = state.add_transient(dense.data + 'T', d_shape, d_dtype) + + tasklet = state.add_tasklet( + name=dense.data + '_csr2dense', + inputs={'val', 'rowptr', 'colind'}, + outputs={'dense'}, + code=''' + cusparseSetStream(sparse_handle, __dace_current_stream); + cusparseZcsr2dense( + sparse_handle, + {m}, {n}, + sparse_mat_descr, + (cuDoubleComplex*)val, + rowptr, + colind, + (cuDoubleComplex*)dense, + {m} + ); + '''.format(m=str(d_shape[0]), n=str(d_shape[1])), + language=dace.types.Language.CPP) + state.add_edge(val, None, tasklet, 'val', + dace.Memlet.from_array(val.data, val.desc(sdfg))) + state.add_edge(rowptr, None, tasklet, 'rowptr', + dace.Memlet.from_array(rowptr.data, rowptr.desc(sdfg))) + state.add_edge(colind, None, tasklet, 'colind', + dace.Memlet.from_array(colind.data, colind.desc(sdfg))) + state.add_edge(tasklet, 'dense', T, None, + dace.Memlet.from_array(T.data, T.desc(sdfg))) + gpu_transform_tasklet(sdfg, state, tasklet) + matrix_transpose(state, T, T, dense, dense, label=T.data) + + +def matrix_inversion_cusolver(state, arg, mat_inv, mat_index, label): + """ Adds a matrix inverse operation to a state, using CUSOLVER + for the implementation. 
""" + + sdfg = state.parent + m_shape = mat_inv.desc(sdfg).shape + inv_range = '0 : {sz}, 0 : {sz}'.format(sz=m_shape[-1]) + if mat_index is not None: + index = [str(idx) for idx in mat_index] + inv_range = '{}, {}'.format(', '.join(index), inv_range) + inv_task = state.add_tasklet( + name=label, + inputs={'a'}, + outputs={'b'}, + code=''' + cusolverDnSetStream(solver_handle, __dace_current_stream); + int new_lwork = 0; + cusolverDnZgetrf_bufferSize( + solver_handle, + {n}, {n}, + (cuDoubleComplex*)a, + {n}, + &new_lwork + ); + //cudaDeviceSynchronize(); + if (new_lwork > lwork) {{ + lwork = new_lwork; + cudaFree(dwork); + cudaMalloc(&dwork, sizeof(cuDoubleComplex) * lwork); + }} + cusolverDnZgetrf( + solver_handle, + {n}, {n}, + (cuDoubleComplex*)a, + {n}, + dwork, ipiv, info + ); + //cudaDeviceSynchronize(); + cudaMemcpyAsync(b, dev_I, sizeof(cuDoubleComplex) * {n} * {n}, cudaMemcpyDeviceToDevice, __dace_current_stream); + cusolverDnZgetrs( + solver_handle, + CUBLAS_OP_N, + {n}, + {n}, /* nrhs */ + (cuDoubleComplex*)a, + {n}, + ipiv, + (cuDoubleComplex*)b, + {n}, + info + ); + //cudaDeviceSynchronize(); + '''.format(n=m_shape[-1]), + language=dace.types.Language.CPP) + state.add_edge(arg, None, inv_task, 'a', + dace.Memlet.from_array(arg.data, arg.desc(sdfg))) + state.add_edge(inv_task, 'b', mat_inv, None, + dace.Memlet.simple(mat_inv, inv_range)) + gpu_transform_tasklet(sdfg, state, inv_task) diff --git a/dace/frontend/octave/__init__.py b/dace/frontend/octave/__init__.py new file mode 100644 index 0000000000..362d8c7d52 --- /dev/null +++ b/dace/frontend/octave/__init__.py @@ -0,0 +1 @@ +from .ast_node import AST_Node, AST_Statements \ No newline at end of file diff --git a/dace/frontend/octave/ast_arrayaccess.py b/dace/frontend/octave/ast_arrayaccess.py new file mode 100644 index 0000000000..c78a98920c --- /dev/null +++ b/dace/frontend/octave/ast_arrayaccess.py @@ -0,0 +1,217 @@ +import dace + +from .ast_node import AST_Node + + +class AST_ArrayAccess(AST_Node): + def __init__(self, context, arrayname, accdims): + AST_Node.__init__(self, context) + self.arrayname = arrayname + self.accdims = accdims + + def __repr__(self): + return "AST_ArrayAccess(" + str(self.arrayname) + ", " + str( + self.accdims) + ")" + + def get_children(self): + ret = [self.arrayname] + ret += self.accdims + return ret + + def replace_child(self, old, new): + if old == self.arrayname: + self.arrayname = new + return + if old in self.accdims: + newaccdims = [new if x == old else x for x in self.accdims] + self.accdims = newaccdims + + def get_basetype(self): + # The basetype of an array access is the same as the basetype as the + # array that is acccessed. 
+ vardef = self.search_vardef_in_scope(self.arrayname.get_name()) + return (vardef.get_basetype()) + + def get_dims(self): + from .ast_matrix import AST_Matrix + from .ast_loop import AST_ForLoop + from .ast_values import AST_Constant, AST_Ident + from .ast_range import AST_RangeExpression + # array indexing has many forms/cases in matlab and does not seem to + # be fully documented, the idea is to implement the simple things + # we are sure about and bail out on anything that looks different + dims = [] + if isinstance(self.accdims, list): + for acc in self.accdims: + if isinstance(acc, AST_Constant): + dims.append(1) + elif isinstance(acc, AST_Matrix): + dims.append(len(acc.get_values_row_major())) + elif isinstance(acc, AST_RangeExpression): + if isinstance(acc.lhs, AST_Constant) and isinstance( + acc.rhs, AST_Constant): + l = acc.lhs.get_value() + r = acc.rhs.get_value() + dims.append(r - l + 1) + elif (acc.lhs is None) and (acc.rhs is None): + # Get the dims of the array itself + vardef = self.search_vardef_in_scope( + self.arrayname.get_name()) + if vardef is None: + raise ValueError("No definition found for Array " + + self.arrayname.get_name()) + d = vardef.get_dims() + dims.append(d[len(dims)]) + else: + raise NotImplementedError( + "range with non-constant bounds not supported") + elif isinstance(acc, AST_Ident): + vardef = self.search_vardef_in_scope(acc.get_name()) + if vardef is None: + raise ValueError( + "No definition found for " + acc.get_name() + + " which is used in Array Access: " + str(self)) + if isinstance(vardef, AST_ForLoop) and acc.get_name( + ) == vardef.var.get_name(): + d = vardef.initializer.get_dims()[:-1] + if d != [1]: + raise NotImplementedError( + "Complicated slicing not implemented yet.") + else: + dims.append(d[0]) + else: + raise NotImplementedError( + "unimplemented method of array access (" + str(acc) + + ")") + else: + raise NotImplementedError("unimplemented method of array access") + + # simplify [1,1] to [1] + if dims == [1, 1]: + dims = [1] + return dims + + def make_range_from_accdims(self): + from .ast_range import AST_RangeExpression + from .ast_values import AST_Constant + + rangelist = [] + for acc in self.accdims: + if isinstance(acc, AST_Constant): + rangelist.append((acc.get_value() - 1, acc.get_value() - 1, 1)) + elif isinstance(acc, AST_RangeExpression): + if isinstance(acc.lhs, AST_Constant) and isinstance( + acc.rhs, AST_Constant): + l = acc.lhs.get_value() + r = acc.rhs.get_value() + rangelist.append((l, r, 1)) + else: + raise NotImplementedError( + "range with non-constant bounds not supported: " + + str(self)) + else: + raise NotImplementedError( + "Non-constant array indexing not implemented: " + + str(self)) + ret = dace.subsets.Range(rangelist) + return ret + + def is_data_dependent_access(self): + from .ast_values import AST_Constant + res = False + for a in self.accdims: + if not isinstance(a, AST_Constant): + return True + + def generate_code(self, sdfg, state): + from .ast_values import AST_Ident + from .ast_loop import AST_ForLoop + from .ast_range import AST_RangeExpression + # add a new variable to hold the result of this expression + dims = self.get_dims() + basetype = self.get_basetype() + name = self.get_name_in_sdfg(sdfg) + if name not in sdfg.arrays: + sdfg.add_transient(name, dims, basetype, debuginfo=self.context) + # add a memlet from the original array to the transient + resnode = self.get_datanode(sdfg, state) + arrnode = self.arrayname.get_datanode(sdfg, state) + arrdesc = arrnode.desc(sdfg) + + if 
self.is_data_dependent_access() == False: + msubset = self.make_range_from_accdims() + memlet = dace.memlet.Memlet( + arrnode, + msubset.num_elements(), + msubset, + 1, + None, + None, + debuginfo=self.context) + sdfg.nodes()[state].add_edge(arrnode, None, resnode, None, memlet) + else: + # add a map around the access and feed the access dims that are + # runtime dependent into a connector which is _not_ named IN + access_data_nodes = set() + access_dims = [] + for idx, acc in enumerate(self.accdims): + if isinstance(acc, AST_Ident): + vardef = self.search_vardef_in_scope(acc.get_name()) + if vardef is None: + raise ValueError('No definition found for ' + + str(acc.get_name())) + elif isinstance(vardef, AST_ForLoop): + access_data_nodes.add(vardef.var) + access_dims.append(vardef.var.get_name()) + elif isinstance(acc, AST_RangeExpression): + # if the bounds are identifiers, we need them on the map + # otherwise we do not need to do anything here + if isinstance(acc.lhs, AST_Ident): + access_data_nodes.add(acc.lhs) + if isinstance(acc.rhs, AST_Ident): + access_data_nodes.add(acc.rhs) + if (acc.lhs is None) and (acc.rhs is None): + d = arrdesc.shape + access_dims.append('0:' + str(d[idx])) + else: + acc.generate_code(sdfg, state) + access_data_nodes.add(acc) + access_dims.append(acc.get_name_in_sdfg(sdfg)) + # now construct the dictionary for the map range + s = sdfg.nodes()[state] + mdict = {} + for aa in access_data_nodes: + a = aa.get_name_in_sdfg(sdfg) + mdict[a] = a + if len(mdict) == 0: + mdict = {'__DAPUNUSED_i': '0:1'} + men, mex = s.add_map('datadepacc', mdict) + men._in_connectors.add('IN_1') + men._out_connectors.add('OUT_1') + s.add_edge(arrnode, None, men, 'IN_1', + dace.memlet.Memlet.from_array(arrnode.data, arrdesc)) + for a in access_data_nodes: + aname = a.get_name_in_sdfg(sdfg) + men._in_connectors.add(aname) + datanode = a.get_datanode(sdfg, state) + s.add_edge( + datanode, None, men, aname, + dace.memlet.Memlet.from_array(datanode.data, + datanode.desc(sdfg))) + tasklet = s.add_tasklet('ident', {'in'}, {'out'}, 'in=out;', + dace.Language.CPP) + s.add_edge( + men, 'OUT_1', tasklet, 'in', + dace.memlet.Memlet.simple(arrnode, ','.join(access_dims))) + s.add_edge( + tasklet, 'out', mex, None, + dace.memlet.Memlet.from_array(resnode.data, + resnode.desc(sdfg))) + s.add_edge( + mex, None, resnode, None, + dace.memlet.Memlet.from_array(resnode.data, + resnode.desc(sdfg))) + + print("The result of " + str(self) + " will be stored in " + str(name)) + + __str__ = __repr__ diff --git a/dace/frontend/octave/ast_assign.py b/dace/frontend/octave/ast_assign.py new file mode 100644 index 0000000000..25a52db783 --- /dev/null +++ b/dace/frontend/octave/ast_assign.py @@ -0,0 +1,174 @@ +from .ast_node import AST_Node +from .ast_values import AST_Ident + +import dace + + +class AST_Assign(AST_Node): + def __init__(self, context, lhs, rhs, op): + # for a normal assignment op is "=", but there is also + # in place modification, i.e., "+=" + AST_Node.__init__(self, context) + self.lhs = lhs + self.rhs = rhs + self.op = op + self.children = [self.lhs, self.rhs] + + def get_children(self): + retval = [self.lhs, self.rhs] + return retval + + def replace_child(self, old, new): + if old == self.lhs: + self.lhs = new + if old == self.rhs: + self.rhs = new + + def defined_variables(self): + # check if this adds something to the scope, if yes add it. + # assume A is undefined before this node, then: + # A = expr defines A, A(5) = expr defines A, but + # A += expr or A(5) += expr is illegal. 
+ if self.op == "=": + if isinstance(self.lhs, AST_Ident): + return [self.lhs.get_name()] + else: + return [] + + def provide_parents(self, parent): + self.parent = parent + self.lhs.provide_parents(self) + self.rhs.provide_parents(self) + + def __repr__(self): + return "AST_Assign(" + str(self.lhs) + ", " + str( + self.op) + ", " + str(self.rhs) + ")" + + def print_nodes(self, state): + for n in state.nodes(): + print(str(n)) + print("---") + + def generate_code(self, sdfg, state): + from .ast_arrayaccess import AST_ArrayAccess + from .ast_values import AST_Constant + from .ast_loop import AST_ForLoop + + self.rhs.generate_code(sdfg, state) + s = sdfg.nodes()[state] + if self.op == "=": + # We assign to an entire array + if isinstance(self.lhs, AST_Ident): + dims = self.rhs.get_dims() + basetype = self.rhs.get_basetype() + name = self.lhs.get_name() + + if name not in sdfg.arrays: + sdfg.add_array( + name, dims, basetype, debuginfo=self.context) + rhs_datanode = self.rhs.get_datanode(sdfg, state) + lhs_datanode = self.lhs.get_datanode(sdfg, state) + + s.add_edge( + rhs_datanode, None, lhs_datanode, None, + dace.memlet.Memlet.from_array(lhs_datanode.data, + lhs_datanode.desc(sdfg))) + + # We assign only to a part of an (existing) array, in order to not + # create cycles we need to add a new data-node, the add_array() + # interface will make sure it is connected to the same memory than + # the existing array node. + elif isinstance(self.lhs, AST_ArrayAccess): + # get the definition of the array we are assigning to + lhs_data = self.lhs.arrayname.get_datanode(sdfg, state) + vardef = self.search_vardef_in_scope( + self.lhs.arrayname.get_name()) + if vardef == None: + raise ValueError("No definition found for " + + self.lhs.arrayname.get_name() + + " searching from " + str(self)) + dims = vardef.get_dims() + basetype = vardef.get_basetype() + if self.lhs.arrayname.get_name() not in sdfg.arrays: + sdfg.add_array( + self.lhs.arrayname.get_name(), + dims, + basetype, + debuginfo=self.context) + dn = sdfg.nodes()[state].add_access( + self.lhs.arrayname.get_name()) + + # check if the write is "out of bounds": this _is_ allowed in + # matlab, but not in SDFGs, since it would require to + # dynamically reallocate the array + + # create a memlet which connects the rhs of the assignment to dn + rhs_datanode = self.rhs.get_datanode(sdfg, state) + + if self.lhs.is_data_dependent_access() == False: + msubset = self.lhs.make_range_from_accdims() + writem = dace.memlet.Memlet( + self.lhs.arrayname.get_name(), + msubset.num_elements(), + msubset, + 1, + None, + None, + debuginfo=self.context) + + sdfg.nodes()[state].add_edge(rhs_datanode, None, dn, None, + writem) + else: + s = sdfg.nodes()[state] + acc_data_nodes = set() + acc_dims = [] + for a in self.lhs.accdims: + if isinstance(a, AST_Constant): + acc_dims.append(a.get_value()) + elif isinstance(a, AST_Ident): + vardef = self.search_vardef_in_scope(a.get_name()) + if vardef is None: + raise ValueError('No definition found for ' + + str(acc.get_name())) + elif isinstance(vardef, AST_ForLoop): + acc_data_nodes.add(vardef.var) + acc_dims.append(vardef.var.get_name()) + else: + raise ValueError( + str(type(a)) + + " in data dependent write not allowed.") + mapdict = {} + for a in acc_dims: + mapdict[a] = str(a) + men, mex = s.add_map('datedepwrite', mapdict) + men._in_connectors.add( + 'IN_1') # the data to write goes here + men._out_connectors.add('OUT_1') # and comes out here + for d in acc_data_nodes: + dname = d.get_name_in_sdfg(sdfg) + 
men._in_connectors.add(dname) + datanode = d.get_datanode(sdfg, state) + s.add_edge( + datanode, None, men, dname, + dace.memlet.Memlet.from_array( + datanode.data, datanode.desc(sdfg))) + s.add_edge( + rhs_datanode, None, men, 'IN_1', + dace.memlet.Memlet.from_array(rhs_datanode.data, + rhs_datanode.desc(sdfg))) + s.add_edge( + men, 'OUT_1', dn, None, + dace.memlet.Memlet.simple( + self.lhs.arrayname.get_name(), + ','.join([str(d) for d in acc_dims]))) + s.add_edge(dn, None, mex, None, dace.memlet.EmptyMemlet()) + + else: + raise NotImplementedError("Assignment with lhs of type " + + str(type(self.lhs)) + + " has not been implemented yet.") + else: + raise NotImplementedError("Assignment operator " + self.op + + " has not been implemented yet.") + + __str__ = __repr__ diff --git a/dace/frontend/octave/ast_expression.py b/dace/frontend/octave/ast_expression.py new file mode 100644 index 0000000000..c35b5f0263 --- /dev/null +++ b/dace/frontend/octave/ast_expression.py @@ -0,0 +1,304 @@ +import dace + +from .ast_node import AST_Node + + +class AST_UnaryExpression(AST_Node): + def __init__(self, context, arg, op, order): + AST_Node.__init__(self, context) + self.arg = arg + self.op = op + self.order = order # can be "pre" or "post" (++A vs A++) + self.children = [self.arg] + + def __repr__(self): + return "AST_UnaryExpression(" + str(self.arg) + ", " + str(self.op) + \ + ", " + str(self.order) + ")" + + def get_children(self): + return [self.arg] + + def replace_child(self, old, new): + if self.arg == old: + self.arg = new + else: + raise ValueError(str(old) + " is not a child of " + str(self)) + + def specialize(self): + from .ast_values import AST_Constant + # -A is syntactic sugar for -1*A + if (self.op == "-") and isinstance(self.arg, AST_Constant): + new = AST_Constant(self.context, -self.arg.get_value()) + new.next = self.next + new.prev = self.prev + new.parent = self.parent + return new + elif (self.op == "-"): + new = AST_BinExpression(self.context, self.arg, + AST_Constant(None, -1), "*") + new.next = self.next + new.prev = self.prev + new.parent = self.parent + return new + + __str__ = __repr__ + + +class AST_BinExpression(AST_Node): + def __init__(self, context, lhs, rhs, op): + AST_Node.__init__(self, context) + self.lhs = lhs + self.rhs = rhs + self.op = op + self.children = [self.lhs, self.rhs] + + def provide_parents(self, parent): + self.parent = parent + self.lhs.provide_parents(self) + self.rhs.provide_parents(self) + + def get_children(self): + return [self.lhs, self.rhs] + + def replace_child(self, old, new): + if self.lhs == old: + self.lhs = new + if self.rhs == old: + self.rhs = new + + def __repr__(self): + return "AST_BinExpression(" + str(self.lhs) + ", " + str( + self.op) + ", " + str(self.rhs) + ")" + + def get_dims(self): + left_dims = self.lhs.get_dims() + right_dims = self.rhs.get_dims() + if len(left_dims) > 2 or len(right_dims) > 2: + raise ValueError("Only 2D matrices can be multiplied") + outdims = None + if self.op == "*": + # if lhs is a scalar, outdims = rhs + if left_dims == [1]: + outdims = right_dims + # elif rhs is a scalar, outdims = lhs + elif right_dims == [1]: + outdims = left_dims + # elif lhs is a matrix, check if dims match, compute new outdims + elif left_dims[1] != right_dims[0]: + print(str(left_dims) + "type: " + str(type(left_dims[1]))) + print(str(right_dims) + "type: " + str(type(right_dims[0]))) + raise ValueError("Dims do not match!") + else: + outdims = [left_dims[0], right_dims[1]] + elif self.op == "+" or self.op == "-" or self.op 
== "/": + # if lhs is a scalar, outdims = rhs + if left_dims == [1]: + outdims = right_dims + # elif rhs is a scalar, outdims = lhs + elif right_dims == [1]: + outdims = left_dims + # elif lhs is a matrix, check if dims match, compute new outdims + elif left_dims != right_dims: + raise ValueError("Dimensions do not match") + else: + outdims = left_dims + else: + raise NotImplementedError("Unhandled binary operator: " + + str(self.op)) + if outdims == [1, 1]: + outdims = [1] + return outdims + + def get_basetype(self): + # The basetype of a binary expression should be the more accurate + # type of lhs and rhs + return dace.types.float64 + + def matrix2d_scalar(self, sdfg, state, op): + lhs_dims = self.lhs.get_dims() + rhs_dims = self.rhs.get_dims() + M = str(lhs_dims[-2]) + N = str(lhs_dims[-1]) + A = self.lhs.get_datanode(sdfg, state) + B = self.rhs.get_datanode(sdfg, state) + C = self.get_datanode(sdfg, state) + + s = sdfg.nodes()[state] + map_entry, map_exit = s.add_map('M' + op + 'M', + dict(i='0:' + M, j='0:' + N)) + map_entry._in_connectors.add('IN_1') + map_entry._in_connectors.add('IN_2') + map_entry._out_connectors.add('OUT_1') + map_entry._out_connectors.add('OUT_2') + s.add_edge(A, None, map_entry, 'IN_1', + dace.memlet.Memlet.simple(A, '0:' + N + ',0:' + M)) + s.add_edge(B, None, map_entry, 'IN_2', dace.memlet.Memlet.simple( + B, '0')) + tasklet = s.add_tasklet(op, {'a', 'b'}, {'c'}, 'c = a' + op + 'b') + s.add_edge(map_entry, "OUT_1", tasklet, "a", + dace.memlet.Memlet.simple(A, 'i,j')) + s.add_edge(map_entry, "OUT_2", tasklet, "b", + dace.memlet.Memlet.simple(B, '0')) + s.add_edge(tasklet, "c", map_exit, None, + dace.memlet.Memlet.simple(C, 'i,j')) + s.add_edge(map_exit, None, C, None, + dace.memlet.Memlet.simple(C, '0:' + N + ', 0:' + M)) + + def matrix2d_matrix2d_mult(self, sdfg, state): + lhs_dims = self.lhs.get_dims() + rhs_dims = self.rhs.get_dims() + A = self.lhs.get_datanode(sdfg, state) + B = self.rhs.get_datanode(sdfg, state) + C = self.get_datanode(sdfg, state) + + M = str(lhs_dims[-1]) + N = str(lhs_dims[-1]) + K = str(rhs_dims[-1]) + + s = sdfg.nodes()[state] + map_entry, map_exit = s.add_map( + 'MMM', dict(i='0:' + M, j='0:' + N, k='0:' + K)) + map_entry._in_connectors.add('IN_1') + map_entry._in_connectors.add('IN_2') + map_entry._out_connectors.add('OUT_1') + map_entry._out_connectors.add('OUT_2') + s.add_edge(A, None, map_entry, 'IN_1', + dace.memlet.Memlet.simple(A, '0:' + M + ',0:' + K)) + s.add_edge(B, None, map_entry, 'IN_2', + dace.memlet.Memlet.simple(B, '0:' + K + ', 0:' + N)) + tasklet = s.add_tasklet('mult', {'a', 'b'}, {'c'}, 'c = a*b') + s.add_edge(map_entry, "OUT_1", tasklet, "a", + dace.memlet.Memlet.simple(A, 'i,k')) + s.add_edge(map_entry, "OUT_2", tasklet, "b", + dace.memlet.Memlet.simple(B, 'k,j')) + tmpname = self.get_new_tmpvar(sdfg) + sdfg.add_transient(tmpname, [M, N, K], self.get_basetype()) + tmp = s.add_access(tmpname) + s.add_edge(tasklet, "c", map_exit, None, + dace.memlet.Memlet.simple(tmp, 'i,j,k')) + rednode = s.add_reduce('lambda a,b: a+b', (2, ), 0) + s.add_edge( + map_exit, None, tmp, None, + dace.memlet.Memlet.simple(tmp, '0:' + M + ',0:' + N + ',0:' + K)) + s.add_edge( + tmp, None, rednode, None, + dace.memlet.Memlet.simple(tmp, '0:' + M + ',0:' + N + ',0:' + K)) + s.add_edge(rednode, None, C, None, + dace.memlet.Memlet.simple(C, '0:' + M + ',0:' + N)) + + def vec_mult_vect(self, sdfg, state, op): + lhs_dims = self.lhs.get_dims() + rhs_dims = self.rhs.get_dims() + A = self.lhs.get_datanode(sdfg, state) + B = 
self.rhs.get_datanode(sdfg, state) + C = self.get_datanode(sdfg, state) + + N = str(lhs_dims[-1]) + + s = sdfg.nodes()[state] + map_entry, map_exit = s.add_map('VVM', dict(i='0:' + N)) + map_entry._in_connectors.add('IN_1') + map_entry._in_connectors.add('IN_2') + map_entry._out_connectors.add('OUT_1') + map_entry._out_connectors.add('OUT_2') + s.add_edge(A, None, map_entry, 'IN_1', + dace.memlet.Memlet.simple(A, '0:' + N)) + s.add_edge(B, None, map_entry, 'IN_2', + dace.memlet.Memlet.simple(B, '0:' + N)) + tasklet = s.add_tasklet('mult', {'a', 'b'}, {'c'}, 'c = a*b') + s.add_edge(map_entry, "OUT_1", tasklet, "a", + dace.memlet.Memlet.simple(A, '0,i')) + s.add_edge(map_entry, "OUT_2", tasklet, "b", + dace.memlet.Memlet.simple(B, 'i,0')) + tmpname = self.get_new_tmpvar(sdfg) + sdfg.add_transient(tmpname, [N], self.get_basetype()) + tmp = s.add_access(tmpname) + s.add_edge(tasklet, "c", map_exit, None, + dace.memlet.Memlet.simple(tmp, 'i')) + rednode = s.add_reduce('lambda a,b: a+b', (0, ), 0) + s.add_edge(map_exit, None, tmp, None, + dace.memlet.Memlet.simple(tmp, '0:' + N)) + s.add_edge(tmp, None, rednode, None, + dace.memlet.Memlet.simple(tmp, '0:' + N)) + s.add_edge(rednode, None, C, None, dace.memlet.Memlet.simple(C, '0')) + + def matrix2d_matrix2d_plus_or_minus(self, sdfg, state, op): + lhs_dims = self.lhs.get_dims() + rhs_dims = self.rhs.get_dims() + M = str(lhs_dims[-2]) + N = str(lhs_dims[-1]) + A = self.lhs.get_datanode(sdfg, state) + B = self.rhs.get_datanode(sdfg, state) + C = self.get_datanode(sdfg, state) + + s = sdfg.nodes()[state] + map_entry, map_exit = s.add_map('M' + op + 'M', + dict(i='0:' + M, j='0:' + N)) + map_entry._in_connectors.add('IN_1') + map_entry._in_connectors.add('IN_2') + map_entry._out_connectors.add('OUT_1') + map_entry._out_connectors.add('OUT_2') + s.add_edge(A, None, map_entry, 'IN_1', + dace.memlet.Memlet.simple(A, '0:' + N + ',0:' + M)) + s.add_edge(B, None, map_entry, 'IN_2', + dace.memlet.Memlet.simple(B, '0:' + N + ', 0:' + M)) + tasklet = s.add_tasklet(op, {'a', 'b'}, {'c'}, 'c = a' + op + 'b') + s.add_edge(map_entry, "OUT_1", tasklet, "a", + dace.memlet.Memlet.simple(A, 'i,j')) + s.add_edge(map_entry, "OUT_2", tasklet, "b", + dace.memlet.Memlet.simple(B, 'i,j')) + s.add_edge(tasklet, "c", map_exit, None, + dace.memlet.Memlet.simple(C, 'i,j')) + s.add_edge(map_exit, None, C, None, + dace.memlet.Memlet.simple(C, '0:' + N + ', 0:' + M)) + + def scalar_scalar(self, sdfg, state, op): + A = self.lhs.get_datanode(sdfg, state) + B = self.rhs.get_datanode(sdfg, state) + C = self.get_datanode(sdfg, state) + + s = sdfg.nodes()[state] + tasklet = s.add_tasklet(op, {'a', 'b'}, {'c'}, 'c = a' + op + 'b') + s.add_edge(A, None, tasklet, 'a', dace.memlet.Memlet.simple(A, '0')) + s.add_edge(B, None, tasklet, 'b', dace.memlet.Memlet.simple(B, '0')) + s.add_edge(tasklet, "c", C, None, dace.memlet.Memlet.simple(C, '0')) + + def generate_code(self, sdfg, state): + # Generate code for the lhs and rhs + self.lhs.generate_code(sdfg, state) + self.rhs.generate_code(sdfg, state) + + # Add a new variable to hold the result of this expression + dims = self.get_dims() + basetype = self.get_basetype() + name = self.get_name_in_sdfg(sdfg) + sdfg.add_transient(name, dims, basetype, debuginfo=self.context) + print("The result of " + str(self) + " will be stored in " + str(name)) + + lhs_dims = self.lhs.get_dims() + rhs_dims = self.rhs.get_dims() + + if rhs_dims == [1, 1] or rhs_dims == [1]: + if lhs_dims == [1, 1] or lhs_dims == [1]: + self.scalar_scalar(sdfg, state, self.op) + 
else: + self.matrix2d_scalar(sdfg, state, self.op) + return + if lhs_dims[0] == 1 and rhs_dims[1] == 1 and self.op == "*": + self.vec_mult_vect(sdfg, state, self.op) + elif lhs_dims == [1, 1] or lhs_dims == [1]: + raise NotImplementedError( + "Binary expression with scalar on lhs not implemented: " + + str(self) + ", lhs dims: " + str(lhs_dims) + ", rhs dims: " + + str(rhs_dims)) + else: + if self.op == "*": + self.matrix2d_matrix2d_mult(sdfg, state) + elif self.op == "-" or self.op == "+": + self.matrix2d_matrix2d_plus_or_minus(sdfg, state, self.op) + else: + raise NotImplementedError("Binary expression with two " + + "matrices and op=" + str(self.op) + + " not implemented") + + __str__ = __repr__ diff --git a/dace/frontend/octave/ast_function.py b/dace/frontend/octave/ast_function.py new file mode 100644 index 0000000000..aaa7f5055c --- /dev/null +++ b/dace/frontend/octave/ast_function.py @@ -0,0 +1,371 @@ +import dace +import copy + +from .ast_node import AST_Node + + +class AST_EndFunc(AST_Node): + def __init__(self, context): + AST_Node.__init__(self, context) + + def get_children(self): + return [] + + def replace_child(self, old, new): + raise ValueError("AST_EndFunc has no children") + + def generate_code(self, sdfg, state): + pass + + def __repr__(self): + return "AST_EndFunc()" + + +class AST_Function(AST_Node): + def __init__(self, context, name, args, retvals): + AST_Node.__init__(self, context) + self.name = name + self.args = args + self.retvals = retvals + self.statements = None + + def __repr__(self): + return "AST_Function(" + self.name.get_name() + ", args=[" + ", ".join( + [str(x) for x in self.args]) + "], retvals=[" + ", ".join( + [str(x) for x in self.retvals]) + "])" + + def set_statements(self, stmtlist): + self.statements = AST_Statements(None, stmtlist) + self.statements.provide_parents(self) + + def get_children(self): + ret = [] + ret.append(self.name) + ret += self.args + ret += self.retvals + return ret + + def replace_child(self, old, new): + if self.name == old: + self.name = new + elif old in self.args: + newargs = [new if x == old else x for x in self.args] + self.args = newargs + elif old in self.retvals: + newret = [new if x == old else x for x in self.retvals] + self.retvals = newret + + def generate_code(self, sdfg, state): + # This does not do anything, since we inline functions at the call site, + # so the code generation happens there. 
+ pass + + __str__ = __repr__ + + +class AST_Argument(AST_Node): + def __init__(self, context, name, default=None): + AST_Node.__init__(self, context) + self.name = name + self.default = default + + def get_children(self): + ret = [self.name] + if self.default is not None: + ret += [self.default] + return ret + + def __repr__(self): + return "AST_Argument(" + self.name.get_name() + ", default=" + str( + self.default) + ")" + + __str__ = __repr__ + + +class AST_BuiltInFunCall(AST_Node): + def __init__(self, context, funname, args): + AST_Node.__init__(self, context) + self.funname = funname + self.args = args + + def __repr__(self): + return "AST_BuiltInFunCall(" + str(self.funname) + ", " + str( + self.args) + ")" + + def get_children(self): + retval = self.args[:] + retval.append(self.funname) + return retval + + def replace_child(self, old, new): + if old == self.funname: + self.funname = new + return + if old in self.args: + newargs = [new if x == old else x for x in self.args] + self.args = newargs + + def get_basetype(self): + # For now assume it is always double + return dace.types.float64 + + def get_dims(self): + from .ast_matrix import AST_Matrix + dims = None + if self.funname.get_name() in ["zeros", "ones", "rand", "eye"]: + # The dimensions for these functions are the arguments, but we + # need to convert them to values, if we cannot they are symbolic + for arg in self.args: + if not arg.is_constant(): + + return self.args + if isinstance(self.args[0], AST_Matrix): + dims = self.args[0].get_values_row_major() + else: + dims = [self.args[0].get_value(), self.args[1].get_value()] + elif self.funname.get_name() in ["sqrt"]: + return self.args[0].get_dims() + elif self.funname.get_name() in ["length"]: + dims = [1] + if dims is None: + raise NotImplementedError("Cannot infer dimensions for " + + str(self)) + return dims + + def generate_code(self, sdfg, state): + + # TODO: rand has options for setting seed/state and controlling + # accuracy. We only deal with the simple use-case for now. 
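+        # Only a small set of built-ins is handled here: sqrt maps an
+        # element-wise tasklet over its argument (1-D or 2-D), while zeros
+        # and rand allocate the output transient and fill it inside a map.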
+ + if self.funname.get_name() in ["sqrt"]: + dims = self.get_dims() + name = self.get_name_in_sdfg(sdfg) + basetype = dace.types.float64 + sdfg.add_transient(name, dims, basetype, debuginfo=self.context) + print("The result of expr " + str(self) + " will be stored in " + + str(name)) + + self.args[0].generate_code(sdfg, state) + + resnode = self.get_datanode(sdfg, state) + if len(dims) == 1: + s = sdfg.nodes()[state] + A = self.args[0].get_datanode(sdfg, state) + tasklet = sdfg.nodes()[state].add_tasklet( + 'sqrt', {'in'}, {'out'}, "out=sqrt(in);", + dace.Language.CPP) + s.add_edge(A, None, tasklet, "in", + dace.memlet.Memlet.from_array(A.data, A.desc(sdfg))) + s.add_edge( + tasklet, "out", resnode, None, + dace.memlet.Memlet.from_array(resnode.data, + resnode.desc(sdfg))) + elif len(dims) == 2: + M = str(dims[0]) + N = str(dims[1]) + + men, mex = sdfg.nodes()[state].add_map( + self.funname.get_name() + 'map', + dict(i="0:" + N, j="0:" + M)) + tasklet = None + s = sdfg.nodes()[state] + A = self.args[0].get_datanode(sdfg, state) + s.add_edge(A, None, men, None, + dace.memlet.Memlet.from_array(A.data, A.desc(sdfg))) + tasklet = sdfg.nodes()[state].add_tasklet( + 'sqrt', {'in'}, {'out'}, "out=sqrt(in);", + dace.Language.CPP) + s.add_edge(men, None, tasklet, "in", + dace.memlet.Memlet.simple(A, 'i,j')) + s.add_edge(tasklet, "out", mex, None, + dace.memlet.Memlet.simple(resnode, 'i,j')) + s.add_edge( + mex, None, resnode, None, + dace.memlet.Memlet.simple(resnode, '0:' + N + ',0:' + M)) + else: + raise ValueError( + "sqrt of tensors with more than 2 dims not supported") + + if self.funname.get_name() in ["zeros", "rand"]: + dims = self.get_dims() + name = self.get_name_in_sdfg(sdfg) + basetype = dace.types.float64 + sdfg.add_transient(name, dims, basetype, debuginfo=self.context) + print("The result of expr " + str(self) + " will be stored in " + + str(name)) + + # Add a map over all dimensions with a tasklet that will initialize + # the array to random values (0,1). 
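+            # (For zeros the tasklet writes 0; for rand it calls drand48().)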
+ + if len(dims) > 2: + raise NotImplementedError( + "Code generation only implemented for 2 arguments") + + resnode = self.get_datanode(sdfg, state) + M = str(dims[0]) + N = str(dims[1]) + + s = sdfg.nodes()[state] + men, mex = s.add_map(self.funname.get_name() + 'map', + dict(i="0:" + N, j="0:" + M)) + tasklet = None + if self.funname.get_name() == "zeros": + tasklet = sdfg.nodes()[state].add_tasklet( + 'zero', {}, {'out'}, "out=0") + s.add_edge(men, None, tasklet, None, dace.memlet.EmptyMemlet()) + elif self.funname.get_name() == "rand": + tasklet = sdfg.nodes()[state].add_tasklet( + 'rand', {}, {'out'}, "out=drand48()") + s.add_edge(men, None, tasklet, None, dace.memlet.EmptyMemlet()) + elif self.funname.get_name() == "sqrt": + A = self.args[0].get_datanode(sdfg, state) + tasklet = sdfg.nodes()[state].add_tasklet( + 'sqrt', {'in'}, {'out'}, "out=sqrt(in)") + s.add_edge(men, None, tasklet, "in", + dace.memlet.Memlet.simple(A, 'i,j')) + else: + raise NotImplementedError("Code generation for " + + str(self.funname.get_name()) + + " is not implemented.") + s = sdfg.nodes()[state] + s.add_edge(tasklet, "out", mex, None, + dace.memlet.Memlet.simple(resnode, 'i,j')) + s.add_edge( + mex, None, resnode, None, + dace.memlet.Memlet.simple(resnode, '0:' + N + ',0:' + M)) + + def specialize(self): + from .ast_matrix import AST_Matrix, AST_Matrix_Row + from .ast_values import AST_Constant, AST_Ident + + # First try to specialize the arguments (for constant propagation) + for c in self.get_children(): + n = c.specialize() + while n is not None: + n.replace_parent(c.get_parent()) + self.replace_child(old=c, new=n) + c = n + n = n.specialize() + for c in self.get_children(): + if isinstance(c, AST_Ident): + if isinstance(c.get_propagated_value(), AST_Constant): + n = copy.deepcopy(c.get_propagated_value()) + self.replace_child(old=c, new=n) + + # If this is a call to zeros, ones, or eye, and the arguments are + # constants, we can generate a constant expression. `length` is a + # special case, since for now we require that all dimensions are + # compile time constants. + + if self.funname.get_name() == "length": + vardef = self.search_vardef_in_scope(self.args[0].get_name()) + if vardef is None: + raise ValueError("No definition found for " + + self.args[0].get_name()) + dims = vardef.get_dims() + length = max(dims) + return AST_Constant(None, length) + + if not self.funname.get_name() in ["zeros", "ones", "eye"]: + return None + + for arg in self.args: + if not arg.is_constant(): + return None + + # The args to those functions can be supplied as a 1x2 matrix or + # two seperate values, the semantics are the same. 
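+        # For example (illustrative): zeros(2, 3) specializes to a constant
+        # 2x3 AST_Matrix of zero constants built from AST_Matrix_Row nodes
+        # below, while eye places ones on the diagonal and zeros elsewhere.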
+ dims = [] + if isinstance(self.args, AST_Matrix): + dims = self.args.get_values_row_major() + else: + dims = [x.get_value() for x in self.args] + + rows = [] + for r in range(0, dims[0]): + rowelems = [] + for c in range(0, dims[1]): + zero = AST_Constant(self.context, 0) + one = AST_Constant(self.context, 1) + if self.funname.get_name() == "zeros": + rowelems.append(zero) + if self.funname.get_name() == "ones": + rowelems.append(one) + if self.funname.get_name() == "eye": + if r == c: + rowelems.append(one) + else: + rowelems.append(zero) + rows.append(AST_Matrix_Row(self.context, rowelems)) + res = AST_Matrix(self.context, rows) + res.provide_parents(self.get_parent()) + res.next = self.next + res.prev = self.prev + return res + + __str__ = __repr__ + + +class AST_FunCall(AST_Node): + # NOTE: When parsing, array references, i.e., A(1,2) is the same as + # function calls, so after parsing this node will be used for both, + # and we resolve this later. + def __init__(self, context, funname, args): + AST_Node.__init__(self, context) + self.funname = funname + self.args = args + + def get_children(self): + retval = self.args[:] + retval.append(self.funname) + return retval + + def replace_child(self, old, new): + if old == self.funname: + self.funname = new + return + if old in self.args: + newargs = [new if x == old else x for x in self.args] + self.args = newargs + + def __repr__(self): + return "AST_FunCall(" + str(self.funname) + ", " + str(self.args) + ")" + + def specialize(self): + # This function will be called after we have the complete AST. + # Thus we know if this is a real function call or an array access. + # If it is a function call, differentiate between built-in functions + # and user-defined ones. + from .ast_arrayaccess import AST_ArrayAccess + + if self.funname.get_name() in [ + "zeros", "eye", "rand", "ones", "length", "sqrt" + ]: + new = AST_BuiltInFunCall(self.context, self.funname, self.args) + new.next = self.next + new.prev = self.prev + new.parent = self.parent + for c in new.get_children(): + c.provide_parents(new) + return new + else: + # find the definition of self.funname, if it is anything else + # than an AST_Function this is an array subaccess + vardef = self.search_vardef_in_scope(self.funname.get_name()) + if vardef == None: + raise ValueError("No definition found for " + + self.funname.get_name() + " searching from " + + str(self)) + if isinstance(vardef, AST_Function): + return None + else: + new = AST_ArrayAccess(self.context, self.funname, self.args) + new.next = self.next + new.prev = self.prev + new.parent = self.parent + for c in new.get_children(): + c.provide_parents(new) + return new + return None + + __str__ = __repr__ diff --git a/dace/frontend/octave/ast_loop.py b/dace/frontend/octave/ast_loop.py new file mode 100644 index 0000000000..f5be68b47d --- /dev/null +++ b/dace/frontend/octave/ast_loop.py @@ -0,0 +1,202 @@ +import dace + +from .ast_node import AST_Node + + +class AST_ForLoop(AST_Node): + def __init__(self, context, var, initializer, stmts): + AST_Node.__init__(self, context) + self.var = var + self.initializer = initializer + self.stmts = stmts + + def __repr__(self): + return "AST_ForLoop(" + str(self.var) + " = " + str( + self.initializer) + ", stmts: {\n" + str(self.stmts) + "\n})" + + def get_children(self): + return [self.var, self.initializer, self.stmts] + + def replace_child(self, old, new): + if old == self.var: + self.var = new + return + if old == self.initializer: + self.initializer = new + return + if old == self.stmts: 
+ self.stmts = new + return + raise ValueError("The child " + str(old) + " is not a child of " + + str(self)) + + def generate_code(self, sdfg, state): + from .ast_range import AST_RangeExpression + # This ignores matlab semantics and only works for loops of the form + # for var = start:end where start and end are expressions which + # evaluate to scalars. + if isinstance(self.initializer, AST_RangeExpression): + # Generate the initializer: + # lhs and rhs of the iteration range as two transients, and a + # transient for i (we also have a symbol for i which states will + # use) + initializer_state_num = state + s = sdfg.nodes()[state] + self.initializer.lhs.generate_code(sdfg, state) + lhs_node = self.initializer.lhs.get_datanode(sdfg, state) + self.initializer.rhs.generate_code(sdfg, state) + rhs_node = self.initializer.rhs.get_datanode(sdfg, state) + sdfg.add_transient( + self.var.get_name_in_sdfg(sdfg), [1], + self.initializer.lhs.get_basetype()) + var_node = s.add_access(self.var.get_name_in_sdfg(sdfg)) + s.add_edge( + lhs_node, None, var_node, None, + dace.memlet.Memlet.from_array(var_node.data, + var_node.desc(sdfg))) + loop_guard_var = '_loopiter_' + str(state) + loop_end_var = '_loopend_' + str(state) + + # Generate guard state, write loop iter symbol into loop iter + # datanode + guard_state_num = initializer_state_num + 1 + s_guard = sdfg.add_state('s' + str(guard_state_num)) + task = s_guard.add_tasklet('reinitloopiter', {}, {'out'}, + "out=" + loop_guard_var) + + if self.var.get_name_in_sdfg(sdfg) not in sdfg.arrays: + sdfg.add_transient( + self.var.get_name_in_sdfg(sdfg), [1], + self.initializer.lhs.get_basetype()) + trans = s_guard.add_access(self.var.get_name_in_sdfg(sdfg)) + # Workaround until "condition for putting a variable as top-level + # doesn't take inter-state edges into account" is solved. + # When fixed, the line below can be removed. 
+ self.initializer.rhs.generate_code(sdfg, guard_state_num) + + s_guard.add_edge( + task, 'out', trans, None, + dace.memlet.Memlet.from_array(trans.data, trans.desc(sdfg))) + lg_init = dace.graph.edges.InterstateEdge( + assignments={ + loop_guard_var: + self.var.get_name_in_sdfg(sdfg) + '[0]', + loop_end_var: + self.initializer.rhs.get_name_in_sdfg(sdfg) + '[0]' + }) + sdfg.add_edge(sdfg.nodes()[state], s_guard, lg_init) + + # Add state for each statement within the for loop + prev = s_guard + for s in self.stmts.statements: + state = len(sdfg.nodes()) + newstate = dace.SDFGState( + "s" + str(state), sdfg, debuginfo=s.context) + sdfg.add_node(newstate) + last_state = s.generate_code(sdfg, state) + if last_state is None: last_state = state + if prev != s_guard: + edge = dace.graph.edges.InterstateEdge() + sdfg.add_edge(prev, newstate, edge) + else: + edge = dace.graph.edges.InterstateEdge( + condition=dace.properties.CodeProperty.from_string( + loop_guard_var + " <= " + loop_end_var, + language=dace.types.Language.Python)) + sdfg.add_edge(prev, newstate, edge) + prev = sdfg.nodes()[last_state] + + # Create inter-state back-edge + edge = dace.graph.edges.InterstateEdge( + assignments={loop_guard_var: loop_guard_var + '+1'}) + sdfg.add_edge(prev, s_guard, edge) + + # Create the loop exit state + state = len(sdfg.nodes()) + s_lexit = dace.SDFGState( + "s" + str(state), sdfg, debuginfo=s.context) + lend_val = str(self.initializer.get_dims()[-1]) + for_exit = dace.graph.edges.InterstateEdge( + condition=dace.properties.CodeProperty.from_string( + loop_guard_var + " > " + loop_end_var, + language=dace.types.Language.Python)) + sdfg.add_edge(s_guard, s_lexit, for_exit) + + return state + + else: + raise NotImplementedError( + "Loops over anything but ranges are not implemented.") + + def generate_code_proper(self, sdfg, state): + # This follows matlab semantics, i.e., a loop iterates over the columns + # of a matrix. This does not work well for sdfgs for all but the + # simplest case (a matrix which is a compile time constant, ie. 1:10). + # To support programs like Cholesky, we try to transform the matlab for + # loop into a C-style loop, this is implemented in generate_code(). 
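+        # Sketch of the generated state machine: initializer state -> guard
+        # state -> a state that reads the current column -> one state per
+        # loop-body statement -> back edge to the guard (incrementing the
+        # counter) -> exit state, with the loop conditions encoded on the
+        # inter-state edges.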
+ + # Generate the initializer: + # Each iteration of the for loop will use one column + initializer_state_num = state + self.initializer.generate_code(sdfg, state) + loop_guard_var = '_lg_' + str(state) + # Generate an (empty) guard state + guard_state_num = initializer_state_num + 1 + s_guard = sdfg.add_state('s' + str(guard_state_num)) + lg_init = dace.graph.edges.InterstateEdge( + assignments={loop_guard_var: '0'}) + sdfg.add_edge(sdfg.nodes()[state], s_guard, lg_init) + + # Read a column of the initializer + get_initializer_state_num = guard_state_num + 1 + s_getinit = sdfg.add_state('s' + str(get_initializer_state_num)) + initializer_name = self.initializer.get_name_in_sdfg(sdfg) + loopvar_name = self.var.get_name_in_sdfg(sdfg) + dims = self.initializer.get_dims()[:1] + sdfg.add_transient(loopvar_name, dims, self.initializer.get_basetype()) + part = s_getinit.add_access(loopvar_name) + sdfg.add_transient(initializer_name, self.initializer.get_dims(), + self.initializer.get_basetype()) + full = s_getinit.add_read(initializer_name) + s_getinit.add_edge(full, None, part, None, + dace.memlet.Memlet.simple(initializer_name, 'i,0')) + + # Add edge from guard to getinit + lend_val = str(self.initializer.get_dims()[-1]) + for_entry = dace.graph.edges.InterstateEdge( + condition=dace.properties.CodeProperty.from_string( + loop_guard_var + " < " + lend_val, + language=dace.types.Language.Python)) + sdfg.add_edge(s_guard, s_getinit, for_entry) + + # Add state for each statement within the for loop + prev = s_getinit + for s in self.stmts.statements: + state = len(sdfg.nodes()) + newstate = dace.SDFGState( + "s" + str(state), sdfg, debuginfo=s.context) + sdfg.add_node(newstate) + last_state = s.generate_code(sdfg, state) + if last_state is None: last_state = state + edge = dace.graph.edges.InterstateEdge() + sdfg.add_edge(prev, newstate, edge) + prev = sdfg.nodes()[last_state] + + # Create inter-state back-edge + edge = dace.graph.edges.InterstateEdge( + assignments={loop_guard_var: loop_guard_var + '+1'}) + sdfg.add_edge(prev, s_guard, edge) + + # Create the loop exit state + state = len(sdfg.nodes()) + s_lexit = dace.SDFGState("s" + str(state), sdfg, debuginfo=s.context) + lend_val = str(self.initializer.get_dims()[-1]) + for_exit = dace.graph.edges.InterstateEdge( + condition=dace.properties.CodeProperty.from_string( + loop_guard_var + " >= " + lend_val, + language=dace.types.Language.Python)) + sdfg.add_edge(s_guard, s_lexit, for_exit) + + return state + + __str__ = __repr__ diff --git a/dace/frontend/octave/ast_matrix.py b/dace/frontend/octave/ast_matrix.py new file mode 100644 index 0000000000..1fc3c656d5 --- /dev/null +++ b/dace/frontend/octave/ast_matrix.py @@ -0,0 +1,214 @@ +from .ast_node import AST_Node +from .ast_values import AST_Constant + +import dace + + +class AST_Matrix_Row(AST_Node): + def __init__(self, context, elements): + AST_Node.__init__(self, context) + self.elements = elements + if not isinstance(self.elements, list): + raise ValueError( + "AST_Matrix_Row() expects a list of elements, got " + + str(type(self.elements))) + + def provide_parents(self, parent): + self.parent = parent + for e in self.elements: + e.provide_parents(self) + + def __repr__(self): + return "AST_MatrixRow(" + ", ".join([str(i) + for i in self.elements]) + ")" + + def get_dims(self): + return len(self.elements) + + def get_children(self): + return self.elements[:] + + def replace_child(self, old, new): + newelems = [new if x == old else x for x in self.elements] + self.elements = newelems + + def 
is_constant(self): + for r in self.elements: + if not isinstance(r, AST_Constant): + return False + return True + + def __getitem__(self, item): + if item >= len(self): + raise IndexError("AST_Matrix_Row index out of range") + return self.elements[item] + + def __len__(self): + return len(self.elements) + + __str__ = __repr__ + + +class AST_Matrix(AST_Node): + def __init__(self, context, rows): + AST_Node.__init__(self, context) + self.rows = rows + self.children = self.rows + if not isinstance(self.rows, list): + raise ValueError("AST_Matrix() expects a list of rows, got " + + str(type(self.rows))) + for r in self.rows: + if not isinstance(r, AST_Matrix_Row): + raise ValueError("AST_Matrix() expects a list of rows, got " + + str(r) + " of type " + str(type(r))) + + def __repr__(self): + return "AST_Matrix(" + ", ".join([str(i) for i in self.rows]) + ")" + + def provide_parents(self, parent): + self.parent = parent + for e in self.rows: + e.provide_parents(self) + + def get_dims(self): + dims = -1 + for r in self.rows: + if (dims > 0) and (r.get_dims() != dims): + raise ValueError( + "Matrices with unequal row lengths are currently not " + "supported.") + else: + dims = r.get_dims() + return [len(self.rows), dims] + + def get_basetype(self): + # This should be double, unless we have a complex inside, for now just + # return double. + return dace.types.float64 + + def is_constant(self): + for r in self.rows: + if not r.is_constant(): + return False + return True + + def get_values_row_major(self): + values = [] + for r in self.rows: + for c in r: + if isinstance(c, AST_Constant): + values.append(c.get_value()) + else: + values.append(0) + return values + + def generate_code(self, sdfg, state): + if self.is_constant(): + name = self.get_name_in_sdfg(sdfg) + dims = self.get_dims() + basetype = self.get_basetype() + sdfg.add_transient(name, dims, basetype) + trans = sdfg.nodes()[state].add_access(name) + # Add map over dims, and a taklet which puts the values into the + # transient. 
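+            # The tasklet stores the row-major values in a constexpr C array
+            # and copies element i to the output on every map iteration.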
+ arrlen = 1 + for d in dims: + arrlen *= d + vals = self.get_values_row_major() + code = "constexpr double VALUES[" + str(arrlen) + "] = {" + code += ", ".join(str(i) for i in vals) + "};\n" + code += "out[i] = VALUES[i];" + + tasklet = sdfg.nodes()[state].add_tasklet('init', {}, {'out'}, + code, dace.Language.CPP) + me, mx = sdfg.nodes()[state].add_map( + 'init', dict(i='0:' + str(arrlen))) + sdfg.nodes()[state].add_edge(me, None, tasklet, None, + dace.memlet.EmptyMemlet()) + sdfg.nodes()[state].add_edge( + tasklet, "out", mx, None, + dace.memlet.Memlet.from_array(trans.data, trans.desc(sdfg))) + sdfg.nodes()[state].add_edge( + mx, None, trans, None, + dace.memlet.Memlet.from_array(trans.data, trans.desc(sdfg))) + + print("The const expr " + str(self) + " will be stored in " + + str(name) + ", values are: " + + str(self.get_values_row_major())) + else: + raise ValueError( + "Non-constant matrices are currently not supported") + + def get_children(self): + return self.rows[:] + + def replace_child(self, old, new): + newrows = [new if x == old else x for x in self.rows] + self.rows = newrows + + __str__ = __repr__ + + +class AST_Transpose(AST_Node): + def __init__(self, context, arg, op): + AST_Node.__init__(self, context) + self.arg = arg + self.op = op + + def __repr__(self): + return "AST_Transpose(" + str(self.arg) + ", " + str(self.op) + ")" + + def get_children(self): + return [self.arg] + + def get_dims(self): + dims = self.arg.get_dims() + return dims[::-1] + + def get_basetype(self): + return self.arg.get_basetype() + + def generate_code(self, sdfg, state): + dims = self.get_dims() + name = self.get_name_in_sdfg(sdfg) + basetype = self.get_basetype() + if basetype.is_complex(): + raise NotImplementedError( + "Transpose of complex matrices not implemented (we might need " + "to conjugate)") + if len(dims) != 2: + raise NotImplementedError( + "Transpose only implemented for 2D matrices") + sdfg.add_transient(name, dims, basetype, debuginfo=self.context) + + resnode = self.get_datanode(sdfg, state) + self.arg.generate_code(sdfg, state) + A = self.arg.get_datanode(sdfg, state) + + N = str(dims[0]) + M = str(dims[1]) + s = sdfg.nodes()[state] + map_entry, map_exit = s.add_map('transpose', + dict(i='0:' + N, j='0:' + M)) + map_entry._in_connectors.add('IN_1') + map_entry._out_connectors.add('OUT_1') + s.add_edge(A, None, map_entry, 'IN_1', + dace.memlet.Memlet.simple(A, '0:' + N + ',0:' + M)) + tasklet = s.add_tasklet('identity', {'a'}, {'out'}, 'out = a') + s.add_edge(map_entry, "OUT_1", tasklet, "a", + dace.memlet.Memlet.simple(A, 'i,j')) + s.add_edge(tasklet, "out", map_exit, None, + dace.memlet.Memlet.simple(resnode, 'j,i')) + s.add_edge(map_exit, None, resnode, None, + dace.memlet.Memlet.simple(resnode, '0:' + M + ', 0:' + N)) + print("The result of expr " + str(self) + " will be stored in " + + str(name)) + + def replace_child(self, old, new): + if old == self.arg: + self.arg = new + return + raise ValueError("The child " + str(old) + " is not a child of " + + str(self)) + + __str__ = __repr__ diff --git a/dace/frontend/octave/ast_node.py b/dace/frontend/octave/ast_node.py new file mode 100644 index 0000000000..62ba8d69e9 --- /dev/null +++ b/dace/frontend/octave/ast_node.py @@ -0,0 +1,307 @@ +import re +import dace +from collections import OrderedDict + + +class AST_Node(): + def __init__(self, context): + self.context = context + self.name = None # Name of the variable holding the result in the SDFG + self.parent = None + self.next = None + self.prev = None + self.initializers = 
{} + + def get_parent(self): + return self.parent + + def replace_parent(self, newparent): + self.parent = newparent + + def get_children(self): + raise NotImplementedError( + str(type(self)) + " does not implement get_children()") + + def replace_child(self, old, new): + raise NotImplementedError( + str(type(self)) + " does not implement replace_child()") + + def specialize(self): + """ Some nodes can be simplified after parsing the complete AST and + before actually generating code, i.e., AST_FunCall nodes could be + function calls or array accesses, and we don't really know unless + we know the context of the call. + + This function traverses the AST + and tries to specialize nodes after completing the AST. It should + be called on the top-level AST_Statements node, and a node that + wants to be specialized should return its new instance. If no + specialzation should take place, it should return None. + """ + for c in self.get_children(): + n = c.specialize() + while n is not None: + n.replace_parent(c.get_parent()) + self.replace_child(old=c, new=n) + c = n + n = n.specialize() + + def find_data_node_in_sdfg_state(self, sdfg, state, nodename=None): + if nodename is None: + nodename = self.get_name_in_sdfg(sdfg) + sdfg_state = sdfg.nodes()[state] + for node in sdfg_state.nodes(): + if isinstance(node, dace.graph.nodes.AccessNode): + if node.label == nodename: + return node + + raise ValueError("No AccessNode with name " + nodename + " found.") + + def get_initializers(self, sdfg): + initializers = self.initializers + for c in self.get_children(): + initializers.update(c.get_initializers(sdfg)) + return initializers + + def provide_parents(self, parent): + self.parent = parent + for c in self.get_children(): + c.provide_parents(self) + + def search_vardef_in_scope(self, name): + from .ast_assign import AST_Assign + from .ast_values import AST_Ident + from .ast_loop import AST_ForLoop + current_node = self + + # check if we found the definition: + # * current_node is an AST_Assign with name as lhs or + # * a loop with name as the iterator + if isinstance(current_node, AST_Assign) and \ + isinstance(current_node.lhs, AST_Ident) and \ + (current_node.lhs.get_name() == name): + return current_node.rhs + elif isinstance(current_node, AST_ForLoop) and \ + current_node.var.get_name() == name: + return current_node + + # if current node is inside list of stmts, traverse this list using + # prev, but first find the enclosing AST_Statements + while current_node.get_parent() is not None: + old_current_node = current_node + if isinstance(current_node.get_parent(), AST_Statements): + while current_node.prev is not None: + res = current_node.prev.search_vardef_in_scope(name) + if res is not None: + return res + current_node = current_node.prev + current_node = current_node.get_parent() + res = current_node.search_vardef_in_scope(name) + if res is not None: + return res + + return None + + def defined_variables(self): + # Override this to return the string names of variables defined by an + # AST_Node + return [] + + def get_datanode(self, sdfg, state): + try: + result = self.find_data_node_in_sdfg_state( + sdfg=sdfg, + state=state, + nodename=self.get_name_in_sdfg(sdfg=sdfg)) + except ValueError: + result = sdfg.nodes()[state].add_access( + self.get_name_in_sdfg(sdfg=sdfg)) + return result + + def get_new_tmpvar(self, sdfg): + TEMPVARS_PREFIX = "__tmp_" + maxvar = 0 + for state in range(0, len(sdfg.nodes())): + sdfg_state = sdfg.nodes()[state] + for node in sdfg_state.nodes(): + if isinstance(node, 
dace.graph.nodes.AccessNode): + m = re.match(TEMPVARS_PREFIX + "(\d+)", node.label) + if m is not None: + if maxvar < int(m.group(1)): + maxvar = int(m.group(1)) + newvar = maxvar + 1 + new_name = TEMPVARS_PREFIX + str(newvar) + return new_name + + def get_name_in_sdfg(self, sdfg): + """ If this node has no name assigned yet, create a new one of the form + `__tmp_X` where `X` is an integer, such that this node does not yet + exist in the given SDFG. + @note: We assume that we create exactly one SDFG from each AST, + otherwise we need to store the hash of the SDFG the name was + created for (would be easy but seems useless at this point). + """ + if self.name is not None: + return self.name + self.name = self.get_new_tmpvar(sdfg) + return self.name + + def generate_code(self, *args): + raise NotImplementedError("Class " + type( + self).__name__ + " does not implement the generate_code method.") + + def shortdesc(self): + ret = str(self) + ret = re.sub(r"\n", " ; ", ret) + return "\"" + ret[0:70] + "\"" + + def print_as_tree(self): + ret = "" + ret += self.shortdesc() + ";\n" + for c in self.get_children(): + ret += self.shortdesc() + "->" + c.shortdesc( + ) + "[label=\"child\", color=\"red\"] ;\n" + ret += c.print_as_tree() + + if self.get_parent() is None: + ret += self.shortdesc( + ) + " -> \"None\" [label=\"parent\", color=\"blue\"];\n" + else: + ret += self.shortdesc() + " -> " + self.get_parent().shortdesc( + ) + "[label=\"parent\", color=\"blue\"];\n" + + if isinstance(self, AST_Statements): + ret += "{ rank=same; " + for c in self.get_children(): + ret += c.shortdesc() + "; " + ret += "}\n" + for c in self.get_children(): + if c.next is not None: + ret += c.shortdesc() + " -> " + c.next.shortdesc( + ) + "[label=\"next\", color=\"green\"]" + if c.prev is not None: + ret += c.shortdesc() + " -> " + c.prev.shortdesc( + ) + "[label=\"prev\", color=\"yellow\"]" + + return ret + + +class AST_Statements(AST_Node): + def __init__(self, context, stmts): + AST_Node.__init__(self, context) + self.statements = stmts + + # we expect stmts to be a list of AST_Node objects + for s in stmts: + if not isinstance(s, AST_Node): + raise ValueError( + "Expected a list of AST_Nodes, but one of the members is: " + + str(s) + " type " + str(type(s))) + + def __repr__(self): + res = ["Statements:"] + for s in self.statements: + res.append(" " + str(s)) + return "\n".join(res) + + def get_children(self): + return self.statements[:] + + def replace_child(self, old, new): + newstmts = [new if x == old else x for x in self.statements] + self.provide_parents(self.get_parent()) + + def append_statement(self, stmt): + if isinstance(stmt, list): + self.statements += stmt + else: + self.statements.append(stmt) + + def provide_parents(self, parent=None): + # Overwrite the AST_Node provide_parents() function + # because we also set next and prev for statements, which + # should be null for most / all AST_Nodes + self.parent = parent + + # fix prev + prev = None + for s in self.statements: + s.prev = prev + prev = s + + # fix next + next = None + for s in reversed(self.statements): + s.next = next + next = s + + for s in self.statements: + s.provide_parents(parent=self) + + def specialize(self): + # If we have an AST_Function() node, pull all statements between that + # and the next AST_EndFunction() into the function. Do that until there + # are no more changes. 
+ rerun = True + while rerun: + rerun = False + stmts = None + func = None + for c in self.get_children(): + from .ast_function import AST_Function, AST_EndFunc + if isinstance(c, AST_Function): + func = c + stmts = [] + elif isinstance(c, AST_EndFunc): + func.set_statements(stmts) + self.statements = [ + x for x in self.statements if x not in stmts + [c] + ] + rerun = True + elif func is not None: + stmts.append(c) + + # Remove NullStatements, they are only useful during parsing + from .ast_nullstmt import AST_NullStmt + self.statements = [ + x for x in self.statements if not isinstance(x, AST_NullStmt) + ] + self.provide_parents(self.parent) + + # Lastly, specialize all children + for c in self.get_children(): + n = c.specialize() + while n is not None: + n.replace_parent(c.get_parent()) + self.replace_child(old=c, new=n) + c = n + n = n.specialize() + + self.provide_parents(self.parent) + + return None + + def generate_code(self, sdfg=None, state=None): + if sdfg is None: + sdfg = dace.SDFG("dacelab", OrderedDict(), {}) + prevstate = None + for s in self.statements: + state = len(sdfg.nodes()) + newstate = dace.SDFGState( + "s" + str(state), sdfg, debuginfo=s.context) + sdfg.add_node(newstate) + last_state = s.generate_code(sdfg, state) + if prevstate is not None: + edge = dace.graph.edges.InterstateEdge() + sdfg.add_edge(prevstate, newstate, edge) + if last_state is None: + prevstate = newstate + else: + prevstate = sdfg.nodes()[last_state] + + return sdfg + else: + raise ValueError( + "Appending statements to an SDFG is not supported.") + + __str__ = __repr__ diff --git a/dace/frontend/octave/ast_nullstmt.py b/dace/frontend/octave/ast_nullstmt.py new file mode 100644 index 0000000000..2b94d0d9ed --- /dev/null +++ b/dace/frontend/octave/ast_nullstmt.py @@ -0,0 +1,52 @@ +from .ast_node import AST_Node + + +class AST_NullStmt(AST_Node): + def __init__(self, context): + AST_Node.__init__(self, context) + + def get_children(self): + return [] + + def replace_child(self, old, new): + raise ValueError("AST_NullStmt has no children") + + def generate_code(self, sdfg, state): + pass + + def __repr__(self): + return "AST_NullStmt()" + + +class AST_EndStmt(AST_Node): + def __init__(self, context): + AST_Node.__init__(self, context) + + def __repr__(self): + return "AST_End()" + + def get_children(self): + return [] + + def replace_child(self, old, new): + raise ValueError("This class does not have children") + + +class AST_Comment(AST_Node): + def __init__(self, context, text): + AST_Node.__init__(self, context) + self.text = text + + def get_children(self): + return [] + + def replace_child(self, old, new): + raise ValueError("AST_Comment has no children") + + def generate_code(self, sdfg, state): + pass + + def __repr__(self): + text = self.text + text = text.encode("unicode_escape").decode("utf-8") + return "AST_Comment(\"" + text + "\")" diff --git a/dace/frontend/octave/ast_range.py b/dace/frontend/octave/ast_range.py new file mode 100644 index 0000000000..00b7c85c04 --- /dev/null +++ b/dace/frontend/octave/ast_range.py @@ -0,0 +1,69 @@ +import dace + +from .ast_node import AST_Node + + +class AST_RangeExpression(AST_Node): + def __init__(self, context, lhs, rhs): + AST_Node.__init__(self, context) + self.lhs = lhs + self.rhs = rhs + + def __repr__(self): + return "AST_RangeExpression(" + str(self.lhs) + ", " + str( + self.rhs) + ")" + + def get_children(self): + L = [self.lhs, self.rhs] + return [x for x in L if x is not None] + + def get_dims(self): + from .ast_values import AST_Constant 
+ if isinstance(self.lhs, AST_Constant) and isinstance( + self.rhs, AST_Constant): + l = self.rhs.get_value() - self.lhs.get_value() + 1 + return [1, l] + else: + print("Dimensionality of " + str(self) + " cannot be inferred") + return [1, 1] + + def get_basetype(self): + return dace.types.float64 + + def replace_child(self, old, new): + if old == self.lhs: + self.lhs = new + return + if old == self.rhs: + self.rhs = new + return + raise ValueError("The child " + str(old) + " is not a child of " + + str(self)) + + def specialize(self): + return None + + def generate_code(self, sdfg, state): + # If lhs and rhs are constant, generate a matrix + from .ast_values import AST_Constant + from .ast_matrix import AST_Matrix_Row, AST_Matrix + if isinstance(self.lhs, AST_Constant) and isinstance( + self.rhs, AST_Constant): + lval = self.lhs.get_value() + rval = self.rhs.get_value() + vals = [ + AST_Constant(self.context, v) + for v in list(range(lval, rval + 1)) + ] + new = AST_Matrix(self.context, + [AST_Matrix_Row(self.context, vals)]) + new.parent = self.parent + new.prev = self.prev + new.next = self.next + new.generate_code(sdfg, state) + else: + raise NotImplementedError( + "Code generation for Range with non-constant bounds not " + "implemented") + + __str__ = __repr__ diff --git a/dace/frontend/octave/ast_values.py b/dace/frontend/octave/ast_values.py new file mode 100644 index 0000000000..0831409eb4 --- /dev/null +++ b/dace/frontend/octave/ast_values.py @@ -0,0 +1,115 @@ +import dace + +from .ast_node import AST_Node + + +class AST_Ident(AST_Node): + def __init__(self, context, value): + AST_Node.__init__(self, context) + if isinstance(value, str): + self.value = value + else: + raise ValueError("Expected str, got " + str(type(value))) + + def __repr__(self): + return "AST_Ident(" + str(self.value) + ")" + + def get_name(self): + return self.value + + def is_constant(self): + return False + + def get_name_in_sdfg(self, sdfg): + return self.value + + def get_children(self): + return [] + + def replace_child(self, old, new): + raise ValueError("This node does not have children!") + + def generate_code(self, sdfg, state): + # An identifier never generates code + pass + + def get_dims(self): + from .ast_loop import AST_ForLoop + """ Check in the scope if this is defined and return the dims of the + corresponding SDFG access node it currently maps to. """ + vardef = self.search_vardef_in_scope(self.value) + if vardef is None: + raise ValueError("Request for dims of identifier " + self.value + + " which is not defined in the current scope") + elif isinstance(vardef, AST_ForLoop): + dims = vardef.initializer.get_dims()[:1] + return dims + else: + return vardef.get_dims() + + def specialize(self): + pass + + def get_propagated_value(self): + vardef = self.search_vardef_in_scope(self.get_name()) + if isinstance(vardef, AST_Constant): + return vardef + return None + + def get_basetype(self): + """ Check in the scope if this is defined and return the basetype of the + corresponding SDFG access node this currently maps to. 
""" + bt = self.search_vardef_in_scope(self.value).get_basetype() + if bt is None: + raise ValueError("Request for basetype of identifier " + + self.value + + " which is not defined in the current scope") + else: + return bt + + __str__ = __repr__ + + +class AST_Constant(AST_Node): + def __init__(self, context, value): + AST_Node.__init__(self, context) + self.value = value + + def __repr__(self): + return "AST_Constant(" + str(self.value) + ")" + + def get_value(self): + return self.value + + def get_dims(self): + return [1] + + def get_basetype(self): + return dace.types.float64 + + def generate_code(self, sdfg, state): + dims = self.get_dims() + name = self.get_name_in_sdfg(sdfg) + basetype = dace.types.float64 + if name not in sdfg.arrays: + sdfg.add_transient(name, dims, basetype, debuginfo=self.context) + trans = sdfg.nodes()[state].add_access(name) + code = "out = " + str(self.get_value()) + ";" + tasklet = sdfg.nodes()[state].add_tasklet('init', {}, {'out'}, code, + dace.Language.CPP) + sdfg.nodes()[state].add_edge( + tasklet, 'out', trans, None, + dace.memlet.Memlet.from_array(trans.data, trans.desc(sdfg))) + print("The result of expr " + str(self) + " will be stored in " + + str(name)) + + def get_children(self): + return [] + + def is_constant(self): + return True + + def replace_child(self, old, new): + raise ValueError("This node does not have children!") + + __str__ = __repr__ diff --git a/dace/frontend/octave/lexer.py b/dace/frontend/octave/lexer.py new file mode 100644 index 0000000000..6c9297cfd8 --- /dev/null +++ b/dace/frontend/octave/lexer.py @@ -0,0 +1,353 @@ +import sys +import re +import ply.lex as lex +from ply.lex import TOKEN + +tokens = [ + "AND", "ANDAND", "ANDEQ", "BACKSLASH", "COLON", "COMMA", "DIV", "DIVEQ", + "DOT", "DOTDIV", "DOTDIVEQ", "DOTEXP", "DOTMUL", "DOTMULEQ", "END_EXPR", + "END_STMT", "EQ", "EQEQ", "EXP", "EXPEQ", "FIELD", "GE", "GT", "HANDLE", + "IDENT", "LBRACE", "LBRACKET", "LE", "LPAREN", "LT", "MINUS", "MINUSMINUS", + "MINUSEQ", "MUL", "MULEQ", "NE", "NEG", "NUMBER", "OR", "OREQ", "OROR", + "PLUS", "PLUSEQ", "PLUSPLUS", "RBRACE", "RBRACKET", "RPAREN", "SEMI", + "STRING", "TRANSPOSE", "ERROR_STMT", "COMMENT", "END_FUNCTION", + "END_UNEXPECTED", "POW", "CLASSDEF" +] + +reserved = { + "break": "BREAK", + "case": "CASE", + "catch": "CATCH", + "continue": "CONTINUE", + "else": "ELSE", + "elseif": "ELSEIF", + "end_unwind_protect": "END_UNWIND_PROTECT", + "for": "FOR", + "function": "FUNCTION", + "global": "GLOBAL", + "if": "IF", + "otherwise": "OTHERWISE", + "persistent": "PERSISTENT", + "return": "RETURN", + "switch": "SWITCH", + "try": "TRY", + "unwind_protect": "UNWIND_PROTECT", + "unwind_protect_cleanup": "UNWIND_PROTECT_CLEANUP", + "while": "WHILE", +} +tokens += list(reserved.values()) + + +def new(): + t_AND = r"\&" + t_ANDAND = r"\&\&" + t_ANDEQ = r"\&=" + t_BACKSLASH = r"\\" + t_COLON = r":" + t_DIV = r"\/" + t_DIVEQ = r"\/=" + t_DOT = r"\." + t_DOTDIV = r"\./" + t_DOTDIVEQ = r"\./=" + t_DOTEXP = r"\.\^" + t_DOTMUL = r"\.\*" + t_DOTMULEQ = r"\.\*=" + t_EQ = r"=" + t_EQEQ = r"==" + t_EXP = r"\^" + t_EXPEQ = r"\^=" + t_GE = r">=" + t_GT = r"\>" + t_HANDLE = r"\@" + t_LE = r"<=" + t_LT = r"\<" + t_MINUS = r"\-" + t_MINUSEQ = r"\-=" + t_MINUSMINUS = r"\--" + t_MUL = r"\*" + t_POW = r"\*\*" + t_MULEQ = r"\*=" + t_NE = r"(~=)|(!=)" + t_NEG = r"\~|\!" 
+ t_OR = r"\|" + t_OREQ = r"\|=" + t_OROR = r"\|\|" + t_PLUS = r"\+" + t_PLUSEQ = r"\+=" + t_PLUSPLUS = r"\+\+" + + states = (("matrix", "inclusive"), ("afterkeyword", "exclusive")) + + states = (("matrix", "inclusive"), ("afterkeyword", "exclusive")) + + ws = r"(\s|\.\.\..*\n|\\\n)" + #ws = r"(\s|(\#|(%[^!])).*\n|\.\.\..*\n|\\\n)" + ws1 = ws + "+" + ws0 = ws + "*" + ms = r"'([^']|(''))*'" + os = r'"([^"\a\b\f\r\t\0\v\n\\]|(\\[abfn0vtr\"\n\\])|(""))*"' + mos = "(%s)|(%s)" % (os, ms) + id = r"[a-zA-Z_][a-zA-Z_0-9]*" + + def unescape(s): + if s[0] == "'": + return s[1:-1].replace("''", "'") + else: + try: + return s[1:-1].decode("string_escape") + except: + return s[1:-1] + + @TOKEN(mos) + def t_afterkeyword_STRING(t): + t.value = unescape(t.value) + t.lexer.begin("INITIAL") + return t + + def t_afterkeyword_error(t): + t_error(t) + + # A quote, immediately following any of: (1) an alphanumeric + # charater, (2) right bracket, parenthesis or brace, + # or (3) another TRANSPOSE, is a TRANSPOSE. Otherwise, it starts a + # string. The order of the rules for TRANSPOSE (first) and STRING + # (second) is important. Luckily, if the quote is separated from + # the term by line continuation (...), matlab starts a string, so + # the above rule still holds. + + def t_TRANSPOSE(t): + r"(?<=\w|\]|\)|\})((\.')|')+" + # <---context ---><-quotes-> + # Let the parser figure out what that mix of quotes and + # dot-quotes, which is kept in t.value, really means. + return t + + @TOKEN(mos) + def t_STRING(t): + t.value = unescape(t.value) + return t + + @TOKEN(r"(\.%s)?%s" % (ws0, id)) + def t_IDENT(t): + if t.value == "parfor": + t.value = "for" + if t.value == "classdef": + raise_exception(SyntaxError, "Not implemented: %s" % t.value, + t.lexer) + t.lexer.lineno += t.value.count("\n") + if t.value[0] == ".": + # Reserved words are not reserved + # when used as fields. So return=1 + # is illegal, but foo.return=1 is fine. + t.type = "FIELD" + return t + if (t.value == "end" and (t.lexer.parens > 0 or t.lexer.brackets > 0 + or t.lexer.braces > 0)): + t.type = "END_EXPR" + return t + if t.value in ("end", "endif", "endfunction", "endwhile", "endfor", + "endswitch", "end_try_catch"): + keyword = t.lexer.stack.pop() # if,while,etc. 
+ if keyword == "function": + t.type = "END_FUNCTION" + else: + t.type = "END_STMT" + return t + else: + t.type = reserved.get(t.value, "IDENT") + if t.value in ("if", "function", "while", "for", "switch", "try"): + # Lexer stack may contain only these + # six words, ever, because there is + # one place to push -- here + t.lexer.stack.append(t.value) + if (t.type != "IDENT" and t.lexer.lexdata[t.lexer.lexpos] == "'"): + t.lexer.begin("afterkeyword") + return t + + def t_LPAREN(t): + r"\(" + t.lexer.parens += 1 + return t + + def t_RPAREN(t): + r"\)" + t.lexer.parens -= 1 + return t + + @TOKEN(ws0 + r"\]") + def t_RBRACKET(t): # compare w t_LBRACKET + t.lexer.lineno += t.value.count("\n") + t.lexer.brackets -= 1 + if t.lexer.brackets + t.lexer.braces == 0: + t.lexer.begin("INITIAL") + return t + + @TOKEN(r"\[" + ws0) + def t_LBRACKET(t): # compare w t_SEMI + t.lexer.lineno += t.value.count("\n") + t.lexer.brackets += 1 + if t.lexer.brackets + t.lexer.braces == 1: + t.lexer.begin("matrix") + return t + + # maybe we need a dedicated CELLARRAY state + @TOKEN(ws0 + r"\}") + def t_RBRACE(t): + t.lexer.lineno += t.value.count("\n") + t.lexer.braces -= 1 + if t.lexer.braces + t.lexer.brackets == 0: + t.lexer.begin("INITIAL") + return t + + @TOKEN(r"\{" + ws0) + def t_LBRACE(t): + t.lexer.lineno += t.value.count("\n") + t.lexer.braces += 1 + if t.lexer.brackets + t.lexer.braces == 1: + t.lexer.begin("matrix") + return t + + @TOKEN(r"," + ws0) + def t_COMMA(t): # eating spaces is important inside brackets + t.lexer.lineno += t.value.count("\n") + if (t.lexer.brackets == 0 and t.lexer.parens == 0 + and t.lexer.braces == 0): + t.type = "SEMI" + return t + return t + + @TOKEN(r"\;" + ws0) + def t_SEMI(t): + t.lexer.lineno += t.value.count("\n") + # if t.lexer.brackets or t.lexer.braces > 0: + # t.type = "CONCAT" + return t + + def t_NUMBER(t): + r"(0x[0-9A-Fa-f]+)|((\d+(\.\d*)?|\.\d+)([eE][-+]?\d+)?[ij]?)" + # <-------------> <------------------><-------------> + # int,oct,hex float exp + if t.value[-1] == 'i': + t.value = t.value[:-1] + 'j' + t.value = eval(t.value) + return t + + def t_NEWLINE(t): + r'\n+' + t.lexer.lineno += len(t.value) + if not t.lexer.parens and not t.lexer.braces: + t.value = ";" + t.type = "SEMI" + return t + + def t_ERROR_STMT(t): + r"%!(error|warning|test).*\n" + t.lexer.lineno += 1 + + # Keep multiline comments + def t_COMMENT(t): + r"(^[ \t]*[%#][^!\n].*\n)+" + t.lexer.lineno += t.value.count("\n") + t.type = "COMMENT" + return t + + # Drop end-of-line comments + def t_comment(t): + r"(%|\#)!?" + if t.value[-1] != "!": + t.lexer.lexpos = t.lexer.lexdata.find("\n", t.lexer.lexpos) + + @TOKEN(r"(?<=\w)" + ws1 + r"(?=\()") + def t_matrix_BAR(t): + # Consume whitespace that follows end of name + # and is followed a left parenthesis. This properly handles + # a space between a func name and the arguments. + pass + + tend = r"(?<=[])}'\".]|\w)" + tbeg = r"(?=[-+]?([[({'\"]|\w|\.\d))" + + @TOKEN(tend + ws1 + tbeg) + def t_matrix_FOO(t): + # In matrix state, consume whitespace separating two + # terms and return a fake COMMA token. This allows + # parsing [1 2 3] as if it was [1,2,3]. Handle + # with care: [x + y] vs [x +y] + # + # A term T is + # (a) a name or a number + # (b) literal string using single or doble quote + # (c) (T) or [T] or {T} or T' or +T or -T + # + # Terms end with + # (1) an alphanumeric charater \w + # (2) single quote (in octave also double-quote) + # (3) right parenthesis, bracket, or brace + # (4) a dot (after a number, such as 3. 
+ # + # The pattern for whitespace accounts for ellipsis as a + # whitespace, and for the trailing whitespace. + # + # Terms start with + # (1) an alphanumeric character + # (2) a single or double quote, + # (3) left parenthesis, bracket, or brace and finally + # (4) a dot before a digit, such as .3 . + + # TODO: What about curly brackets? + # TODO: What about dot followed by a letter, as in field? + # [foo .bar] + + t.lexer.lineno += t.value.count("\n") + t.type = "COMMA" + return t + + def t_ELLIPSIS(t): + r"\.\.\..*\n" + t.lexer.lineno += 1 + pass + + def t_SPACES(t): + r"(\\\n|[ \t\r])+" + pass + + def t_error(t): + raise_exception(SyntaxError, ('Unexpected "%s" (lexer)' % t.value), + t.lexer) + + lexer = lex.lex(reflags=re.MULTILINE) + lexer.brackets = 0 # count open square brackets + lexer.parens = 0 # count open parentheses + lexer.braces = 0 # count open curly braces + lexer.stack = [] + return lexer + + +def raise_exception(error_type, message, my_lexer): + startpos = 1 + my_lexer.lexdata.rfind("\n", 0, my_lexer.lexpos) + endpos = my_lexer.lexdata.find("\n", startpos) + raise error_type( + message, ("inputfile", my_lexer.lineno, 1 + my_lexer.lexpos - startpos, + my_lexer.lexdata[startpos:endpos])) + + +def main(): + lexer = new() + line = "" + while 1: + try: + line += raw_input("=>> ").decode("string_escape") + print(len(line), [c for c in line]) + except EOFError: + reload(sys.modules["lexer.py"]) + lexer.input(line) + print(list(tok for tok in lexer)) + line = "" + + +if __name__ == "__main__": + lexer = new() + buf = open(sys.argv[1]).read() + lexer.input(buf) + for tok in lexer: + print(tok) diff --git a/dace/frontend/octave/parse.py b/dace/frontend/octave/parse.py new file mode 100644 index 0000000000..a5a1ccbaba --- /dev/null +++ b/dace/frontend/octave/parse.py @@ -0,0 +1,689 @@ +import sys +from ply import yacc +from . 
import lexer +import copy +import dace + +from .ast_node import AST_Node, AST_Statements +from .ast_values import AST_Ident, AST_Constant +from .ast_expression import AST_BinExpression, AST_UnaryExpression +from .ast_matrix import AST_Matrix_Row, AST_Matrix, AST_Transpose +from .ast_assign import AST_Assign +from .ast_function import AST_Argument, AST_BuiltInFunCall, AST_FunCall, AST_Function, AST_EndFunc +from .ast_range import AST_RangeExpression +from .ast_loop import AST_ForLoop +from .ast_nullstmt import AST_NullStmt, AST_Comment, AST_EndStmt + +tokens = lexer.tokens + +precedence = ( + ("right", "COMMA"), + ("right", "DOTDIVEQ", "DOTMULEQ", "EQ", "EXPEQ", "MULEQ", "MINUSEQ", + "DIVEQ", "PLUSEQ", "OREQ", "ANDEQ"), + ("nonassoc", "HANDLE"), + ("left", "COLON"), + ("left", "ANDAND", "OROR"), + ("left", "EQEQ", "NE", "GE", "LE", "GT", "LT"), + ("left", "OR", "AND"), + ("left", "PLUS", "MINUS"), + ("left", "MUL", "DIV", "DOTMUL", "DOTDIV", "BACKSLASH"), + ("right", "UMINUS", "NEG"), + ("right", "TRANSPOSE"), + ("right", "EXP", "DOTEXP", "POW"), + ("nonassoc", "LPAREN", "RPAREN", "RBRACE", "LBRACE"), + ("left", "FIELD", "DOT", "PLUSPLUS", "MINUSMINUS"), +) + + +def p_top(p): + """ + top : + | top stmt + """ + + if len(p) == 1: + retval = AST_Statements(None, []) + p[0] = retval + else: + retval = copy.deepcopy(p[1]) + retval.append_statement(p[2]) + p[0] = retval + + +def p_end(p): + """ + top : top END_STMT + """ + retval = copy.deepcopy(p[1]) + retval.append_statement(AST_EndStmt(None)) + p[0] = retval + + +def p_end_function(p): + """ + top : top END_FUNCTION + """ + retval = copy.deepcopy(p[1]) + retval.append_statement(AST_EndFunc(None)) + p[0] = retval + + +def p_arg1(p): + """ + arg1 : IDENT + """ + startl, endl = p.linespan(1) + startc, endc = p.lexspan(1) + di = dace.types.DebugInfo(startl, startc, endl, endc) + p[0] = AST_Ident(di, p[1]) + + +def p_arg2(p): + """ + arg1 : NUMBER + | STRING + """ + startl, endl = p.linespan(1) + startc, endc = p.lexspan(1) + di = dace.types.DebugInfo(startl, startc, endl, endc) + p[0] = AST_Constant(di, p[1]) + + +def p_global(p): + """ + arg1 : GLOBAL + """ + raise NotImplementedError("global not implemented") + + +def p_arg_list(p): + """ + arg_list : ident_init_opt + | arg_list COMMA ident_init_opt + """ + if len(p) == 2: + p[0] = [p[1]] + else: + p[0] = p[1] + [p[3]] + + +def p_args(p): + """ + args : arg1 + | args arg1 + """ + raise NotImplementedError("args not implemented") + + +def p_break_stmt(p): + """ break_stmt : BREAK SEMI """ + raise NotImplementedError("break not implemented") + + +def p_case_list(p): + """ + case_list : + | CASE expr sep stmt_list_opt case_list + | CASE expr error stmt_list_opt case_list + | OTHERWISE stmt_list + """ + raise NotImplementedError("case not implemented") + + +def p_cellarray(p): + """ + cellarray : LBRACE RBRACE + | LBRACE matrix_row RBRACE + | LBRACE matrix_row SEMI RBRACE + """ + startl, endl = p.linespan(0) + startc, endc = p.lexspan(0) + di = dace.types.DebugInfo(startl, startc, endl, endc) + + if len(p) == 3: + p[0] = AST_Matrix(di, []) + else: + p[0] = AST_Matrix(di, p[2]) + + +def p_cellarray_2(p): + """ + cellarray : LBRACE expr_list RBRACE + """ + p[0] = AST_Matrix(di, [AST_Matrix_Row(p[2])]) + + +def p_cellarrayref(p): + """expr : expr LBRACE expr_list RBRACE + | expr LBRACE RBRACE + """ + raise NotImplementedError("cellarrayref not implemented") + + +def p_command(p): + """ + command : ident args SEMI + """ + raise NotImplementedError("commands not implemented") + + +#################### + 
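
The grammar rules in this file construct the `AST_*` node classes defined in the sibling `ast_*` modules, and the `parse()` helper at the bottom of the file wires the lexer and parser together. As a rough, hypothetical sketch of the intended end-to-end flow (the actual driver or public entry point is not part of this diff), Octave source could be turned into an SDFG roughly like this:

```python
# Hedged sketch of the assumed Octave-to-SDFG flow; it only chains the pieces
# added in this diff, and the real entry point may differ.
from dace.frontend.octave import parse

stmts = parse.parse("A = [1 2; 3 4];\nB = A';\n")  # -> AST_Statements tree
stmts.provide_parents()    # link parent/prev/next pointers between statements
stmts.specialize()         # e.g. decide whether AST_FunCall is an array access
sdfg = stmts.generate_code()  # builds and returns a dace.SDFG named "dacelab"
```

The calls shown correspond to the methods defined on `AST_Statements` in `ast_node.py` above; `generate_code()` creates one SDFG state per top-level statement and connects them with interstate edges.
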
+ +def p_comment_stmt(p): + """ + comment_stmt : COMMENT + """ + di = None + p[0] = AST_Comment(di, p[1]) + + +def p_concat_list1(p): + """ + matrix_row : expr_list SEMI expr_list + """ + + startl, endl = p.linespan(1) + startc, endc = p.lexspan(1) + di1 = dace.types.DebugInfo(startl, startc, endl, endc) + + startl, endl = p.linespan(3) + startc, endc = p.lexspan(3) + di3 = dace.types.DebugInfo(startl, startc, endl, endc) + + p[0] = [AST_Matrix_Row(di1, p[1]), AST_Matrix_Row(di3, p[3])] + + +def p_concat_list2(p): + """ + matrix_row : matrix_row SEMI expr_list + """ + startl, endl = p.linespan(3) + startc, endc = p.lexspan(3) + di3 = dace.types.DebugInfo(startl, startc, endl, endc) + + p[0] = p[1] + [AST_Matrix_Row(di3, p[3])] + + +def p_continue_stmt(p): + "continue_stmt : CONTINUE SEMI" + raise NotImplementedError("continue needs to be implemented") + + +def p_elseif_stmt(p): + """ + elseif_stmt : + | ELSE stmt_list_opt + | ELSEIF expr sep stmt_list_opt elseif_stmt + | ELSEIF LPAREN expr RPAREN stmt_list_opt elseif_stmt + """ + raise NotImplementedError("elseif needs to be implemented") + + +def p_error_stmt(p): + """ + error_stmt : ERROR_STMT SEMI + """ + raise NotImplementedError("error stmt") + + +def p_expr(p): + """expr : ident + | end + | number + | string + | colon + | NEG + | matrix + | cellarray + | expr2 + | expr1 + | lambda_expr + """ + p[0] = p[1] + + +def p_expr_2(p): + """expr : expr PLUSPLUS + | expr MINUSMINUS + """ + startl, endl = p.linespan(2) + startc, endc = p.lexspan(2) + di2 = dace.types.DebugInfo(startl, startc, endl, endc) + + p[0] = AST_UnaryExpression(di2, p[1], p[2], "post") + + +def p_expr1(p): + """expr1 : MINUS expr %prec UMINUS + | PLUS expr %prec UMINUS + | NEG expr + | HANDLE ident + | PLUSPLUS ident + | MINUSMINUS ident + """ + startl, endl = p.linespan(1) + startc, endc = p.lexspan(1) + di1 = dace.types.DebugInfo(startl, startc, endl, endc) + + p[0] = AST_UnaryExpression(di1, p[2], p[1], "pre") + + +def p_expr2(p): + """expr2 : expr AND expr + | expr ANDAND expr + | expr BACKSLASH expr + | expr COLON expr + | expr DIV expr + | expr DOT expr + | expr DOTDIV expr + | expr DOTDIVEQ expr + | expr DOTEXP expr + | expr DOTMUL expr + | expr DOTMULEQ expr + | expr EQEQ expr + | expr POW expr + | expr EXP expr + | expr EXPEQ expr + | expr GE expr + | expr GT expr + | expr LE expr + | expr LT expr + | expr MINUS expr + | expr MUL expr + | expr NE expr + | expr OR expr + | expr OROR expr + | expr PLUS expr + | expr EQ expr + | expr MULEQ expr + | expr DIVEQ expr + | expr MINUSEQ expr + | expr PLUSEQ expr + | expr OREQ expr + | expr ANDEQ expr + """ + startl, endl = p.linespan(2) + startc, endc = p.lexspan(2) + di2 = dace.types.DebugInfo(startl, startc, endl, endc) + + if p[2] == "=": + p[0] = AST_Assign(di2, p[1], p[3], p[2]) + elif p[2] == ":": + p[0] = AST_RangeExpression(di2, p[1], p[3]) + else: + p[0] = AST_BinExpression(di2, p[1], p[3], p[2]) + + +def p_expr_colon(p): + """ colon : COLON """ + startl, endl = p.linespan(1) + startc, endc = p.lexspan(1) + di1 = dace.types.DebugInfo(startl, startc, endl, endc) + + p[0] = AST_RangeExpression(di1, None, None) + + +def p_expr_end(p): + """ end : END_EXPR """ + raise NotImplementedError("end expression needs to be implemented") + + +def p_expr_ident(p): + """ ident : IDENT """ + startl, endl = p.linespan(1) + startc, endc = p.lexspan(1) + di1 = dace.types.DebugInfo(startl, startc, endl, endc) + + p[0] = AST_Ident(di1, p[1]) + + +def p_ident_init_opt(p): + """ + ident_init_opt : NEG + | ident + | ident EQ expr + """ 
+ if len(p) == 1: + raise NotImplementedError("default args need to be implemented") + if len(p) == 2: + p[0] = p[1] + else: + raise NotImplementedError("default args need to be implemented") + + +def p_expr_list(p): + """ + expr_list : exprs + | exprs COMMA + """ + p[0] = p[1] + + +def p_expr_number(p): + """ number : NUMBER """ + startl, endl = p.linespan(1) + startc, endc = p.lexspan(1) + di1 = dace.types.DebugInfo(startl, startc, endl, endc) + + p[0] = AST_Constant(di1, p[1]) + + +def p_expr_stmt(p): + """ + expr_stmt : expr_list SEMI + """ + p[0] = p[1] + + +def p_expr_string(p): + """ string : STRING """ + startl, endl = p.linespan(1) + startc, endc = p.lexspan(1) + di1 = dace.types.DebugInfo(startl, startc, endl, endc) + + p[0] = AST_Constant(di1, p[1]) + + +def p_exprs(p): + """ + exprs : expr + | exprs COMMA expr + """ + if len(p) == 2: + p[0] = [p[1]] + elif len(p) == 4: + p[0] = p[1] + p[0].append(p[3]) + + +def p_field_expr(p): + """ + expr : expr FIELD + """ + raise NotImplementedError("field expressions needs to be implemented") + + +def p_foo_stmt(p): + """ foo_stmt : expr OROR expr SEMI """ + raise NotImplementedError("foo_stmt needs to be implemented") + + +def p_for_stmt(p): + """ + for_stmt : FOR ident EQ expr SEMI stmt_list END_STMT + | FOR LPAREN ident EQ expr RPAREN SEMI stmt_list END_STMT + | FOR matrix EQ expr SEMI stmt_list END_STMT + """ + di = None + if len(p) == 8: + p[0] = AST_ForLoop(di, p[2], p[4], AST_Statements(di, p[6])) + else: + p[0] = AST_ForLoop(di, p[3], p[5], AST_Statements(di, p[8])) + + +def p_func_stmt(p): + """func_stmt : FUNCTION ident lambda_args SEMI + | FUNCTION ret EQ ident lambda_args SEMI + """ + di = None + if len(p) == 5: + p[0] = AST_Function(di, p[2], args=p[3], retvals=[]) + else: + p[0] = AST_Function(di, p[4], args=p[5], retvals=p[2]) + + +def p_funcall_expr(p): + """expr : expr LPAREN expr_list RPAREN + | expr LPAREN RPAREN + """ + startl, endl = p.linespan(1) + startc, endc = p.lexspan(1) + di1 = dace.types.DebugInfo(startl, startc, endl, endc) + + if len(p) == 4: + p[0] = AST_FunCall(di1, p[1], []) + else: + p[0] = AST_FunCall(di1, p[1], p[3]) + + +def p_global_list(p): + """global_list : ident + | global_list ident + """ + raise NotImplementedError("globals need to be implemented") + + +def p_global_stmt(p): + """ + global_stmt : GLOBAL global_list SEMI + | GLOBAL ident EQ expr SEMI + """ + raise NotImplementedError("globals need to be implemented") + + +def p_if_stmt(p): + """ + if_stmt : IF expr sep stmt_list_opt elseif_stmt END_STMT + | IF LPAREN expr RPAREN stmt_list_opt elseif_stmt END_STMT + """ + raise NotImplementedError("If/else needs to be implemented") + + +def p_lambda_args(p): + """lambda_args : LPAREN RPAREN + | LPAREN arg_list RPAREN + """ + if len(p) == 3: + p[0] = [] + else: + p[0] = p[2] + + +def p_lambda_expr(p): + """lambda_expr : HANDLE lambda_args expr + """ + raise NotImplementedError("lambda needs to be implemented") + + +def p_matrix(p): + """matrix : LBRACKET RBRACKET + | LBRACKET matrix_row RBRACKET + | LBRACKET matrix_row SEMI RBRACKET + """ + startl, endl = p.linespan(0) + startc, endc = p.lexspan(0) + di0 = dace.types.DebugInfo(startl, startc, endl, endc) + + if len(p) == 3: + p[0] = AST_Matrix(di0, []) + else: + p[0] = AST_Matrix(di0, p[2]) + + +def p_matrix_2(p): + """matrix : LBRACKET expr_list RBRACKET + | LBRACKET expr_list SEMI RBRACKET + """ + startl, endl = p.linespan(0) + startc, endc = p.lexspan(0) + di0 = dace.types.DebugInfo(startl, startc, endl, endc) + startl, endl = p.linespan(2) + 
startc, endc = p.lexspan(2) + di2 = dace.types.DebugInfo(startl, startc, endl, endc) + + p[0] = AST_Matrix(di0, [AST_Matrix_Row(di2, p[2])]) + + +def p_null_stmt(p): + """ + null_stmt : SEMI + | COMMA + """ + di = None + p[0] = AST_NullStmt(di) + + +def p_parens_expr(p): + """ + expr : LPAREN expr RPAREN + """ + p[0] = p[2] + + +def p_persistent_stmt(p): + """ + persistent_stmt : PERSISTENT global_list SEMI + | PERSISTENT ident EQ expr SEMI + """ + raise NotImplementedError("persistent needs to be implemented") + + +def p_ret(p): + """ + ret : ident + | LBRACKET RBRACKET + | LBRACKET expr_list RBRACKET + """ + if len(p) == 2: + p[0] = [p[1]] + elif len(p) == 3: + p[0] = [] + else: + p[0] = p[2] + + +def p_return_stmt(p): + """ return_stmt : RETURN SEMI """ + raise NotImplementedError("return needs to be implemented") + + +def p_semi_opt(p): + """ + semi_opt : + | semi_opt SEMI + | semi_opt COMMA + """ + p[0] = AST_NullStmt(None) + + +def p_separator(p): + """ + sep : COMMA + | SEMI + """ + p[0] = p[1] + + +def p_stmt(p): + """ + stmt : continue_stmt + | comment_stmt + | func_stmt + | break_stmt + | expr_stmt + | global_stmt + | persistent_stmt + | error_stmt + | command + | for_stmt + | if_stmt + | null_stmt + | return_stmt + | switch_stmt + | try_catch + | while_stmt + | foo_stmt + | unwind + """ + # END_STMT is intentionally left out + p[0] = copy.deepcopy(p[1]) + + +def p_stmt_list(p): + """ + stmt_list : stmt + | stmt_list stmt + """ + if len(p) == 2: + if p[1] is None: + p[0] = [] + if isinstance(p[1], list): + p[0] = copy.deepcopy(p[1]) + elif len(p) == 3: + p[0] = copy.deepcopy(p[1]) + if p[2] is not None: + if isinstance(p[2], list): + p[0] = p[0] + p[2] + else: + p[0].append(p[2]) + else: + assert 0 + + +def p_stmt_list_opt(p): + """ + stmt_list_opt : + | stmt_list + """ + if len(p) == 1: + p[0] = [] + else: + p[0] = p[1] + + +def p_switch_stmt(p): + """ + switch_stmt : SWITCH expr semi_opt case_list END_STMT + """ + raise NotImplementedError("switch needs to be implemented") + + +def p_transpose_expr(p): + # p[2] contains the exact combination of plain and conjugate + # transpose operators, such as "'.''.''''". 
+ """ expr : expr TRANSPOSE """ + startl, endl = p.linespan(2) + startc, endc = p.lexspan(2) + di2 = dace.types.DebugInfo(startl, startc, endl, endc) + + p[0] = AST_Transpose(di2, p[1], p[2]) + + +def p_try_catch(p): + """ + try_catch : TRY stmt_list CATCH stmt_list END_STMT + """ + raise NotImplementedError("try/catch needs to be implemented") + + +def p_unwind(p): + """ + unwind : UNWIND_PROTECT stmt_list UNWIND_PROTECT_CLEANUP stmt_list END_UNWIND_PROTECT + """ + raise NotImplementedError("unwind needs to be implemented") + + +def p_while_stmt(p): + """ + while_stmt : WHILE expr SEMI stmt_list END_STMT + """ + raise NotImplementedError("while needs to be implemented") + + +def p_error(p): + raise ValueError("Unexpected EOF") + + +parser = yacc.yacc(start="top") + + +def parse(buf, debug=False): + new_lexer = lexer.new() + p = parser.parse(buf, tracking=1, debug=debug, lexer=new_lexer) + return p + + +if __name__ == "__main__": + buf = open(sys.argv[1]).read() + p = parse(buf, debug=False) diff --git a/dace/frontend/operations.py b/dace/frontend/operations.py new file mode 100644 index 0000000000..e5d140f315 --- /dev/null +++ b/dace/frontend/operations.py @@ -0,0 +1,175 @@ +from __future__ import print_function +from functools import partial + +from timeit import default_timer as timer +import ast +import numpy as np +import sympy +import os +import sys + +from dace import types +from dace.config import Config + + +def timethis(program, title, flop_count, f, *args, **kwargs): + """ Runs a function multiple (`DACE_treps`) times, logs the running times + to a file, and prints the median time (with FLOPs if given). + @param program: The title of the measurement. + @param title: A sub-title of the measurement. + @param flop_count: Number of floating point operations in `program`. + If greater than zero, produces a median FLOPS + report. + @param f: The function to measure. + @param args: Arguments to invoke the function with. + @param kwargs: Keyword arguments to invoke the function with. + @return: Latest return value of the function. + """ + + start = timer() + REPS = int(Config.get('treps')) + times = [start] * (REPS + 1) + ret = None + for i in range(REPS): + # Call function + ret = f(*args, **kwargs) + times[i + 1] = timer() + + diffs = np.array([(times[i] - times[i - 1]) for i in range(1, REPS + 1)]) + + problem_size = sys.argv[1] if len(sys.argv) >= 2 else 0 + + if not os.path.isfile('results.log'): + with open('results.log', 'w') as f: + f.write('Program\tOptimization\tProblem_Size\tRuntime_sec\n') + + with open('results.log', 'w') as f: + for d in diffs: + f.write('%s\t%s\t%s\t%.8f\n' % (program, title, problem_size, d)) + + if flop_count > 0: + gflops_arr = (flop_count / diffs) * 1e-9 + time_secs = np.median(diffs) + GFLOPs = (flop_count / time_secs) * 1e-9 + print(title, GFLOPs, 'GFLOP/s (', time_secs * 1000, 'ms)') + else: + time_secs = np.median(diffs) + print(title, time_secs * 1000, 'ms') + + return ret + + +def detect_reduction_type(wcr_str): + """ Inspects a lambda function and tries to determine if it's one of the + built-in reductions that frameworks such as MPI can provide. + + @param wcr_str: A Python string representation of the lambda function. + @return: types.ReductionType if detected, types.ReductionType.Custom + if not detected, or None if no reduction is found. 
+ """ + if wcr_str == '' or wcr_str is None: + return None + + # Get lambda function from string + wcr = eval(wcr_str) + wcr_ast = ast.parse(wcr_str).body[0].value.body + + # Run function through symbolic math engine + a = sympy.Symbol('a') + b = sympy.Symbol('b') + try: + result = wcr(a, b) + except TypeError: # e.g., "Cannot determine truth value of relational" + result = None + + # Check resulting value + if result == sympy.Max(a, b) or (isinstance(wcr_ast, ast.Call) + and isinstance(wcr_ast.func, ast.Name) + and wcr_ast.func.id == 'max'): + return types.ReductionType.Max + elif result == sympy.Min(a, b) or (isinstance(wcr_ast, ast.Call) + and isinstance(wcr_ast.func, ast.Name) + and wcr_ast.func.id == 'min'): + return types.ReductionType.Min + elif result == a + b: + return types.ReductionType.Sum + elif result == a * b: + return types.ReductionType.Product + elif result == a & b: + return types.ReductionType.Bitwise_And + elif result == a | b: + return types.ReductionType.Bitwise_Or + elif result == a ^ b: + return types.ReductionType.Bitwise_Xor + elif isinstance(wcr_ast, ast.BoolOp) and isinstance(wcr_ast.op, ast.And): + return types.ReductionType.Logical_And + elif isinstance(wcr_ast, ast.BoolOp) and isinstance(wcr_ast.op, ast.Or): + return types.ReductionType.Logical_Or + elif (isinstance(wcr_ast, ast.Compare) + and isinstance(wcr_ast.ops[0], ast.NotEq)): + return types.ReductionType.Logical_Xor + + return types.ReductionType.Custom + + +def is_op_commutative(wcr_str): + """ Inspects a custom lambda function and tries to determine whether + it is symbolically commutative (disregarding data type). + @param wcr_str: A string in Python representing a lambda function. + @return: True if commutative, False if not, None if cannot be + determined. + """ + if wcr_str == '' or wcr_str is None: + return None + + # Get lambda function from string + wcr = eval(wcr_str) + + # Run function through symbolic math engine + a = sympy.Symbol('a') + b = sympy.Symbol('b') + try: + aRb = wcr(a, b) + bRa = wcr(b, a) + except TypeError: # e.g., "Cannot determine truth value of relational" + return None + + return aRb == bRa + + +def is_op_associative(wcr_str): + """ Inspects a custom lambda function and tries to determine whether + it is symbolically associative (disregarding data type). + @param wcr_str: A string in Python representing a lambda function. + @return: True if associative, False if not, None if cannot be + determined. + """ + if wcr_str == '' or wcr_str is None: + return None + + # Get lambda function from string + wcr = eval(wcr_str) + + # Run function through symbolic math engine + a = sympy.Symbol('a') + b = sympy.Symbol('b') + c = sympy.Symbol('c') + try: + aRbc = wcr(a, wcr(b, c)) + abRc = wcr(wcr(a, b), c) + except TypeError: # e.g., "Cannot determine truth value of relational" + return None + + return aRbc == abRc + + +def reduce(op, in_array, out_array, axis=None, identity=None): + """ Reduces an array according to an operation `op`, starting with + initial value `identity`, over the given axis (or all axes if none + given), to `out_array`. + + Requires `out_array` with one dimension less than `in_array`, or a + scalar if `axis` is None. 
+ """ + # The function is empty because it is parsed in astparser + return None diff --git a/dace/frontend/python/__init__.py b/dace/frontend/python/__init__.py new file mode 100644 index 0000000000..8b13789179 --- /dev/null +++ b/dace/frontend/python/__init__.py @@ -0,0 +1 @@ + diff --git a/dace/frontend/python/astnodes.py b/dace/frontend/python/astnodes.py new file mode 100644 index 0000000000..0b8122b353 --- /dev/null +++ b/dace/frontend/python/astnodes.py @@ -0,0 +1,196 @@ +""" Support classes for the DaCe Python AST parser. """ + +from collections import OrderedDict +from copy import deepcopy as dcpy + +from dace import data, types +from dace.frontend.python import astutils + + +class _Node(object): + """ SDFG AST node class, generated from the DaCe Python AST parser. """ + + def __init__(self, name, node_ast): + self.name = name + + # Maps: {local variable name: array subscript expression (AST node)} + self.inputs = OrderedDict() + + # Maps: {local variable name: array subscript expression (AST node)} + self.outputs = OrderedDict() + + # All variables in the parent scope + current scope + # Maps: {variable name: value AST node} + self.globals = OrderedDict() + + # All local variables defined in this scope + # Maps: {local variable name: value AST node} + self.locals = OrderedDict() + + # Maps: {transient array name: data.Data} + self.transients = OrderedDict() + + # List of parameter names + self.params = [] + + # Parent _Node object + self.parent = None + + # List of children _Node objects + self.children = [] + + # Is asynchronous + self.is_async = False + + # Node AST + self.ast = node_ast + + def __deepcopy__(self, memo): + n = object.__new__(type(self)) + + n.name = dcpy(self.name) + n.inputs = dcpy(self.inputs) + n.outputs = dcpy(self.outputs) + n.globals = self.globals + n.locals = dcpy(self.locals) + n.transients = dcpy(self.transients) + n.params = dcpy(self.params) + n.parent = None + n.children = [] + n.is_async = dcpy(self.is_async) + + return n + + # Returns the arrays local to this node's context + def arrays(self): + return OrderedDict([(k, v) for k, v in self.globals.items() + if isinstance(v, data.Data)]) + + # Returns all arrays (children included) + def all_arrays(self): + result = self.arrays() + for c in self.children: + result.update(c.all_arrays()) + return result + + def dump(self, indent=0): + print(' ' * indent + self.__class__.__name__ + ': ' + self.name) + for c in self.children: + c.dump(indent + 1) + + +class _ProgramNode(_Node): + """ SDFG AST node class. """ + pass + + +# Dataflow nodes +class _DataFlowNode(_Node): + """ Dataflow AST node superclass. """ + pass + + +class _ScopeNode(_DataFlowNode): + """ Scope (map/consume) AST node superclass. """ + pass + + +class _MapNode(_ScopeNode): + """ Map AST node type. """ + #def __init__(self, name, node_ast, range, ) + pass + + +class _ConsumeNode(_ScopeNode): + """ Consume AST node type. """ + #def __init(self, name, node_ast, stream, ...) + pass + + +class _TaskletNode(_DataFlowNode): + """ Tasklet AST node type. """ + + def __init__(self, + name, + node_ast, + language=types.Language.Python, + global_code=''): + super(_TaskletNode, self).__init__(name, node_ast) + self.language = language + self.extcode = None + self.gcode = global_code + + +class _EmptyTaskletNode(_TaskletNode): + """ Empty Tasklet AST node type. """ + pass + + +class _NestedSDFGNode(_DataFlowNode): + """ Nested SDFG AST node type. 
""" + + def __init__(self, name, node_ast, sdfg): + super(_NestedSDFGNode, self).__init__(name, node_ast) + self.sdfg = sdfg + + +# Operation nodes +class _ReduceNode(_DataFlowNode): + """ Reduce AST node type. """ + pass + + +# Control flow nodes +class _ControlFlowNode(_Node): + """ Control-flow AST node superclass. """ + pass + + +class _IterateNode(_ControlFlowNode): + """ Iteration (for-loop) AST node type. """ + pass + + +class _LoopNode(_ControlFlowNode): + """ Loop (while-loop) AST node type. """ + pass + + +class _ConditionalNode(_ControlFlowNode): + """ Conditional (if/else) AST node superclass. """ + pass + + +class _IfNode(_ConditionalNode): + """ If conditional AST node type. """ + pass + + +class _ElseNode(_ConditionalNode): + """ Else conditional AST node type. """ + pass + + +class _Memlet(object): + """ AST Memlet type. Becomes an SDFG edge. """ + + def __init__(self, data, data_name, attribute, num_accesses, + write_conflict_resolution, wcr_identity, subset, + vector_length, local_name, ast, array_dependencies): + self.data = data # type: Data + self.dataname = data_name # type: str + self.attribute = attribute # type: str + self.num_accesses = num_accesses # type: sympy math + self.wcr = write_conflict_resolution # type: ast._Lambda + self.wcr_identity = wcr_identity # type: memlet type or None + self.subset = subset # type: subsets.Subset + self.veclen = vector_length # type: int (in elements, default 1) + self.local_name = local_name # type: str + self.ast = ast # type: ast._AST + self.otherdeps = array_dependencies # type: dict(str, data.Data) + + def wcr_name(self): + label = astutils.unparse(self.wcr.body) + if self.wcr_identity is not None: + label += ', id: ' + str(self.wcr_identity) + return label diff --git a/dace/frontend/python/astparser.py b/dace/frontend/python/astparser.py new file mode 100644 index 0000000000..8ca4b8ada4 --- /dev/null +++ b/dace/frontend/python/astparser.py @@ -0,0 +1,1667 @@ +from __future__ import print_function +import ast +import astunparse +from collections import OrderedDict +import copy +from functools import wraps +import inspect + +from dace import data, subsets, symbolic, types +from dace.config import Config +from dace.frontend.python import astnodes, astutils +from dace.frontend.python.astutils import * + + +def function_to_ast(f): + """ Obtain the source code of a Python function and create an AST. + @param f: Python function. + @return: A 4-tuple of (AST, function filename, function line-number, + source code as string). + """ + try: + src = inspect.getsource(f) + # TypeError: X is not a module, class, method, function, traceback, frame, + # or code object; OR OSError: could not get source code + except (TypeError, OSError): + raise TypeError('cannot obtain source code for dace program') + + src_file = inspect.getfile(f) + _, src_line = inspect.findsource(f) + src_ast = ast.parse(_remove_outer_indentation(src)) + ast.increment_lineno(src_ast, src_line) + + return src_ast, src_file, src_line, src + + +def _remove_outer_indentation(src: str): + """ Removes extra indentation from a source Python function. + @param src: Source code (possibly indented). + @return: Code after de-indentation. + """ + lines = src.split('\n') + indentation = len(lines[0]) - len(lines[0].lstrip()) + return '\n'.join([line[indentation:] for line in lines]) + + +class FindLocals(ast.NodeVisitor): + """ Python AST node visitor that recovers all left-hand-side (stored) + locals. 
""" + + def __init__(self): + self.locals = {} + + def visit_Name(self, node): + if isinstance(node.ctx, ast.Store): + self.locals[node.id] = node + + +def parse_dace_program(f, argtypes, global_vars, modules): + """ Parses a `@dace.program` function into a _ProgramNode object. + @param f: A Python function to parse. + @param argtypes: An iterable of tuples (name, type) for the given + function's arguments. + @param global_vars: A dictionary of global variables in the closure + of `f`. + @param modules: A dictionary from an imported module name to the + module itself. + @return: Hierarchical tree of `astnodes._Node` objects, where the top + level node is an `astnodes._ProgramNode`. + @rtype: astnodes._ProgramNode + """ + src_ast, src_file, src_line, src = function_to_ast(f) + + # Find local variables + local_finder = FindLocals() + local_finder.visit(src_ast) + local_vars = local_finder.locals + + # 1. Inline all "dace.call"ed functions + inliner = FunctionInliner(global_vars, modules, local_vars) + inliner.visit(src_ast) + + # 2. resolve all the symbols in the AST + allowed_globals = global_vars.copy() + allowed_globals.update(argtypes) + symresolver = SymbolResolver(allowed_globals) + symresolver.visit(src_ast) + + # 3. Parse the DaCe program to a hierarchical dependency representation + ast_parser = ParseDaCe(src_file, src_line, argtypes, global_vars, modules, + symresolver) + ast_parser.visit(src_ast) + pdp = ast_parser.program + pdp.source = src + pdp.filename = src_file + pdp.param_syms = sorted(symbolic.getsymbols(argtypes.values()).items()) + pdp.argtypes = argtypes + + return pdp + + +class MemletRemover(ExtNodeTransformer): + """ A Python AST transformer that removes memlet expressions of the type + `a << b[c]` and `d >> e(f)[g]`. """ + + def visit_TopLevelExpr(self, node): + # This is a DaCe shift, omit it + if isinstance(node.value, ast.BinOp): + if isinstance(node.value.op, ast.LShift) or isinstance( + node.value.op, ast.RShift): + return None + return self.generic_visit(node) + + +class ModuleInliner(ExtNodeTransformer): + """ A Python AST transformer that renames modules from their imported alias + to their actual name. """ + + def __init__(self, modules): + self.modules = modules + + def visit_Attribute(self, node): + attrname = rname(node) + module_name = attrname[:attrname.rfind('.')] + if module_name in self.modules: # math or equivalent modules + modname = self.modules[module_name] + node.value = ast.copy_location( + ast.Name(id=(modname), ctx=ast.Load), node.value) + return node + return self.generic_visit(node) + + +# Parses a DaCe program +class ParseDaCe(ExtNodeVisitor): + """ A Python AST visitor that creates DaCe program trees. 
+ @see: parse_dace_program + """ + + def __init__(self, filename, lineoffset, argtypes, global_vars, modules, + symresolver): + self.curnode = None + self.program_name = None + self.filename = filename + self.lineoffset = lineoffset + self.argtypes = argtypes + self.modules = modules + self.globals = global_vars + self.symresolver = symresolver + + # Maps: {array name: data.Data)} + self.global_arrays = OrderedDict() + self.global_arrays.update(argtypes) + + # Entry point to the program + self.program = None + + ############################################################### + # Helper functions + ############################################################### + def _get_module(self, node): + try: + fullmodname = inspect.getmodule(eval(unparse(node), + self.globals)).__name__ + except NameError: + fullmodname = '' + # Only use the top-level module + if fullmodname.find('.') >= 0: + return fullmodname[:fullmodname.find('.')] + return fullmodname + + def _inner_eval_ast(self, node, additional_syms=None): + code = unparse(node) + syms = {} + syms.update(self.curnode.globals) + if additional_syms is not None: + syms.update(additional_syms) + + # First try to evaluate normally + try: + return eval(code, syms) + except: # Literally anything can happen here + # If doesn't work, try to evaluate as a sympy expression + # Replace subscript expressions with function calls (sympy support) + code = code.replace('[', '(') + code = code.replace(']', ')') + return symbolic.pystr_to_symbolic(code) + + def _compile_ast(self, node_body, line_offset, filename): + self.symresolver.visit(node_body) + wrapper = ast.Module(body=[node_body]) + + if line_offset is not None: + for node in ast.walk(wrapper): + node.lineno = line_offset + node.col_offset = 0 + + codeobj = compile(wrapper, filename, 'exec') + gen_module = {} + gen_module.update(self.globals) + exec(codeobj, gen_module) + return gen_module[node_body.name] + + def _eval_ast(self, node): + if node is None: + return None + elif isinstance(node, ast.Call): + # Only work on allowed functions and external functions according to + # decision flowchart for intra-program function evaluation: + # 1. Does it exist in the same program + already parsed? + # 2. Is it a @dace.external_function? + # 3. Is it one of the standard functions from the allowed module? + # 4. 
If neither of the previous, fail + func = rname(node) + + # Function call to a tasklet defined within the same program + if func in self.curnode.globals and isinstance( + self.curnode.globals[func], ast.FunctionDef): + # Since the function is never compiled by Python, we need to + # do so ourselves + compiled_func = self._compile_ast( + self.curnode.globals[func], self.lineoffset, self.filename) + return self._inner_eval_ast(node, {func: compiled_func}) + + # Standard function call, e.g., int(), math.sin() + elif self._get_module(node.func) in self.modules: + return self._inner_eval_ast(node) + + # External function calls + elif func in self.globals: + if isinstance(self.globals[func], types._external_function): + # External function needs to be recompiled with current + # symbols + src_ast, src_file, src_line, src = function_to_ast( + self.globals[func].func) + compiled_func = self._compile_ast(src_ast.body[0], + src_line, src_file) + return self._inner_eval_ast(node, {func: compiled_func}) + else: + return self._inner_eval_ast(node) + + else: + return self._inner_eval_ast(node) + elif isinstance(node, ast.FunctionDef): + compiled_sdfg = self._compile_ast(node, node.lineno, self.filename) + return compiled_sdfg.to_sdfg() + else: + # Not a function, try to evaluate + return self._inner_eval_ast(node) + + # Track local variables + def _set_locals(self): + if self.curnode.parent is None: + # Handle parameters (first set all to symbols, then set type + # descriptors for arrays) + self.curnode.globals.update( + {k: symbolic.symbol(k) + for k in self.curnode.params}) + self.curnode.globals.update(self.globals) + self.curnode.globals.update(self.global_arrays) + else: + self.curnode.globals.update(self.curnode.parent.globals) + self.curnode.globals.update( + {k: symbolic.symbol(k) + for k in self.curnode.params}) + + # Helper function to find the dtype of an array, either as a keyword or + # as the last parameter + def getarg_or_kwarg(self, node, argoff, argname): + if len(node.args) > argoff: + return node.args[argoff] + for k in node.keywords: + if rname(k) == argname: + return k.value + return None + + ############################################################### + # Parsing functions + ############################################################### + + def _ndslice_to_subset(self, ndslice): + is_tuple = [isinstance(x, tuple) for x in ndslice] + if not any(is_tuple): + return subsets.Indices(ndslice) + else: + if not all(is_tuple): + # If a mix of ranges and indices is found, convert to range + for i in range(len(ndslice)): + if not is_tuple[i]: + ndslice[i] = (ndslice[i], ndslice[i], 1) + return subsets.Range(ndslice) + + def _fill_missing_slices(self, ast_ndslice, array, indices): + # Filling ndslice with default values from array dimensions + # if ranges not specified (e.g., of the form "A[:]") + ndslice = [None] * len(ast_ndslice) + ndslice_size = 1 + offsets = [] + idx = 0 + for i, dim in enumerate(ast_ndslice): + if isinstance(dim, tuple): + rb = self._eval_ast(dim[0]) + re = self._eval_ast(dim[1]) + if re is not None: + re -= 1 + rs = self._eval_ast(dim[2]) + if rb is None: rb = 0 + if re is None: re = array.shape[indices[idx]] - 1 + if rs is None: rs = 1 + ndslice[i] = (rb, re, rs) + offsets.append(i) + idx += 1 + else: + ndslice[i] = self._eval_ast(dim) + + return ndslice, offsets + + # Parses a memlet statement + def ParseMemlet(self, local_name, rhsnode): + rhs = rname(rhsnode) + if rhs.find('.') >= 0: # attribute, form G.out_edges[:] + arrname = rhs[:rhs.find('.')] + arrattr = 
rhs[rhs.find('.') + 1:] + else: # normal memlet, form A(1)[i,j] + arrname = rhs + arrattr = None + + array = self.curnode.globals[arrname] + + # Determine number of accesses to the memlet (default is the slice size) + num_accesses = None + write_conflict_resolution = None + wcr_identity = None + # Detects expressions of the form "A(2)[...]", "A(300)", "A(1, sum)[:]" + if isinstance(rhsnode, ast.Call): + if len(rhsnode.args) < 1 or len(rhsnode.args) > 3: + raise DaCeSyntaxError( + self, rhsnode, + 'Number of accesses in memlet must be a number, symbolic ' + 'expression, or -1') + num_accesses = self._eval_ast(rhsnode.args[0]) + if len(rhsnode.args) >= 2: + write_conflict_resolution = rhsnode.args[1] + if len(rhsnode.args) >= 3: + wcr_identity = ast.literal_eval(rhsnode.args[2]) + elif isinstance(rhsnode, ast.Subscript) and isinstance( + rhsnode.value, ast.Call): + if len(rhsnode.value.args) < 1 or len(rhsnode.value.args) > 3: + raise DaCeSyntaxError( + self, rhsnode, + 'Number of accesses in memlet must be a number, symbolic ' + 'expression, or -1') + num_accesses = self._eval_ast(rhsnode.value.args[0]) + if len(rhsnode.value.args) >= 2: + write_conflict_resolution = rhsnode.value.args[1] + if len(rhsnode.value.args) >= 3: + wcr_identity = ast.literal_eval(rhsnode.value.args[2]) + + array_dependencies = {} + + # Get memlet range + ndslice = [(0, s - 1, 1) for s in array.shape] + if isinstance(rhsnode, ast.Subscript): + # Parse and evaluate ND slice(s) (possibly nested) + ast_ndslices = subscript_to_ast_slice_recursive(rhsnode) + offsets = list(range(len(array.shape))) + + # Loop over nd-slices (A[i][j][k]...) + subset_array = [] + for ast_ndslice in ast_ndslices: + # Loop over the N dimensions + ndslice, offsets = self._fill_missing_slices( + ast_ndslice, array, offsets) + subset_array.append(self._ndslice_to_subset(ndslice)) + + subset = subset_array[0] + + # Compose nested indices, e.g., of the form "A[i,:,j,:][k,l]" + for i in range(1, len(subset_array)): + subset = subset.compose(subset_array[i]) + + # Compute additional array dependencies (as a result of + # indirection) + for dim in subset: + if not isinstance(dim, tuple): dim = [dim] + for r in dim: + for expr in symbolic.swalk(r): + if symbolic.is_sympy_userfunction(expr): + arr = expr.func.__name__ + array_dependencies[arr] = self.curnode.globals[arr] + + else: # Use entire range + subset = self._ndslice_to_subset(ndslice) + + # If undefined, default number of accesses is the slice size + if num_accesses is None: + num_accesses = subset.num_elements() + + # This is a valid DaCe load/store, register it + return astnodes._Memlet( + array, arrname, arrattr, num_accesses, write_conflict_resolution, + wcr_identity, subset, 1, local_name, rhsnode, array_dependencies) + + # Helper function: parses DaCe array statement + def ParseArrayStatement(self, node, bInput): + if self.curnode is None: + raise DaCeSyntaxError( + self, node, + 'DaCe load/store statement declared outside function bounds') + + lhs = rname(node.value.left) + rhs = rname(node.value.right) + + if rhs.find('.') >= 0: # attribute, form G.out_edges[:] + arrname = rhs[:rhs.find('.')] + arrattr = rhs[rhs.find('.') + 1:] + else: # normal memlet, form A(1)[i,j] + arrname = rhs + arrattr = None + + arrays = self.curnode.arrays() + + # If this is not an undefined symbol (and the rhs is not a DaCe array), + # this is just a regular shift + if lhs in self.curnode.locals: + if arrname not in arrays: + return + else: + raise DaCeSyntaxError( + self, node, + 'Cannot load/store DaCe 
variable using an existing symbol') + + if arrname not in arrays: + raise DaCeSyntaxError( + self, node, 'Cannot load/store DaCe variable "' + arrname + + '" from a non-DaCe array') + + lhs_name = lhs + if lhs in arrays: + lhs = arrays[lhs] + + # Make sure the DaCe assignment is unique + if lhs in self.curnode.inputs: + raise DaCeSyntaxError( + self, node, 'Variable already assigned to another input') + if lhs in self.curnode.outputs: + raise DaCeSyntaxError( + self, node, 'Variable already assigned to another output') + + ######################## + # Determine the properties of the memlet + memlet = self.ParseMemlet(lhs_name, node.value.right) + + if bInput: + self.curnode.inputs[lhs_name] = memlet + else: + self.curnode.outputs[lhs_name] = memlet + + def ParseCallAssignment(self, node, target): + funcname = rname(node.func) + modname = self._get_module(node.func) + + ###################################### + # Handle DaCe-specific calls + if modname == 'dace': # modname is already the real name of the module + # Do not allow instantiation of ND arrays and DaCe scalars + if funcname == "ndarray" or funcname == "scalar": + raise DaCeSyntaxError( + self, node, + 'Cannot define a DaCe array within a program, try using ' + 'dace.define_local or dace.define_local_scalar') + + # Handle transient variables + if funcname.endswith(".define_local"): + if len(node.args) < 1: + raise DaCeSyntaxError( + self, node, + 'Invalid call to define_local, at least 1 parameter ' + 'is required') + if self.getarg_or_kwarg(node, 1, 'dtype') is None: + raise DaCeSyntaxError( + self, node, + 'Transient variable declaration must specify type') + + # Construct type descriptor + shape = self._eval_ast(node.args[0]) + dtype = self._eval_ast(self.getarg_or_kwarg(node, 1, 'dtype')) + allow_conflicts = self._eval_ast( + self.getarg_or_kwarg(node, 2, 'allow_conflicts')) + allow_conflicts = False if allow_conflicts is None else True + try: + tdesc = data.Array( + dtype, + shape, + transient=True, + allow_conflicts=allow_conflicts) + except TypeError as ex: + raise DaCeSyntaxError(self, node, str(ex)) + + self.curnode.transients[rname(target)] = tdesc + self.curnode.globals[rname(target)] = tdesc + return None + + elif funcname.endswith(".define_local_scalar"): + if self.getarg_or_kwarg(node, 0, 'dtype') is None: + raise DaCeSyntaxError( + self, node, + 'Transient variable declaration must specify type') + + # Construct type descriptor + dtype = self._eval_ast(self.getarg_or_kwarg(node, 0, 'dtype')) + allow_conflicts = self._eval_ast( + self.getarg_or_kwarg(node, 1, 'allow_conflicts')) + allow_conflicts = False if allow_conflicts is None else True + + tdesc = data.Scalar( + dtype, transient=True, allow_conflicts=allow_conflicts) + + self.curnode.transients[rname(target)] = tdesc + self.curnode.globals[rname(target)] = tdesc + return None + elif funcname.endswith(".define_stream") or funcname.endswith( + ".define_streamarray"): + argOffset = 0 + if funcname.endswith('array'): + # Defined stream array, expecting shape + shape = self._eval_ast( + self.getarg_or_kwarg(node, 0, 'shape')) + argOffset += 1 + else: + shape = [1] + + dtype = self._eval_ast( + self.getarg_or_kwarg(node, argOffset, 'dtype')) + + # Optional parameters + internal_size = self._eval_ast( + self.getarg_or_kwarg(node, argOffset + 1, 'buffer_size')) + + tdesc = data.Stream( + dtype, 1, internal_size, shape=shape, transient=True) + + self.curnode.transients[rname(target)] = tdesc + self.curnode.globals[rname(target)] = tdesc + return None + elif 
(funcname.rfind('.') != -1 + and funcname[funcname.rfind('.') + + 1:] in types.TYPECLASS_STRINGS): + return node + else: + raise DaCeSyntaxError( + self, node, 'Unrecognized function call \'%s\'' % funcname) + ###################################### + # Other calls are treated as memlet functions (independent of arrays, + # inline-able) + else: + return node + + def _add_possible_inputs(self, nodes, prim): + if not isinstance(nodes, list): + nodes = [nodes] + extended_nodes = [] + final_nodes = [] + + # Extract values from lists, tuples and subsets + for node in nodes: + if isinstance(node, tuple): + final_nodes.extend(list(node)) + elif isinstance(node, subsets.Range): + for dim in node.ranges: + final_nodes.extend(list(dim)) + elif isinstance(node, subsets.Indices): + final_nodes.extend(list(node)) + + # Find AST names + for node in extended_nodes: + if isinstance(node, ast.AST): + for subnode in ast.walk(node): + if isinstance(subnode, ast.Name): + final_nodes.append(subnode.id) + else: + final_nodes.append(node) + nodeset = set() + for n in final_nodes: + if symbolic.issymbolic(n): + nodeset.update(str(s) for s in n.free_symbols) + elif isinstance(n, str): + nodeset.add(n) + + arrs = self.curnode.arrays() + for input in nodeset: + if input in arrs: + inName = '__DACEIN_' + input + prim.inputs[inName] = astnodes._Memlet( + arrs[input], input, None, 1, None, None, + subsets.Indices([0]), 1, None, None, {}) + + ############################################################### + # AST visiting functions + ############################################################### + + def visit_FunctionDef(self, node, is_async=False): + # Obtain function name + parent_node = self.curnode + curprim = None + + arrays = OrderedDict() + if self.curnode is not None: + arrays = self.curnode.arrays() + + # Obtain program/primitive name (only one program is allowed) + if (len(node.decorator_list) > 0): + if (len(node.decorator_list) > 1): + raise DaCeSyntaxError(self, node, + 'Only one DaCe decorator is allowed') + + # Make sure that the module is DaCe + dec = node.decorator_list[0] + decname = rname(dec) + if isinstance(dec, ast.Call): + modname = self._get_module(dec.func) + else: + modname = self._get_module(dec) + if modname not in self.modules.values() or modname != 'dace': + raise DaCeSyntaxError( + self, node, + 'Decorators from module \'%s\' not allowed' % modname) + ##################################### + + # Create DaCe program node + if decname.endswith('.program'): + if self.program is not None: + # Parse internal program separately as an SDFG of its own + sdfg = self._eval_ast(node) + curprim = astnodes._NestedSDFGNode(node.name, node, sdfg) + + # Inherit I/O from immediate parent + curprim.inputs = copy.copy(parent_node.inputs) + curprim.outputs = copy.copy(parent_node.outputs) + # Cancel parent node's relevant I/O + parent_node.inputs.clear() + parent_node.outputs.clear() + + # Set children of parent primitive, if it is a primitive + if parent_node is not None and curprim is not None: + parent_node.children.append(curprim) + curprim.parent = parent_node + + # Exit so that child AST nodes will not be parsed + return + + self.program = astnodes._ProgramNode(node.name, node) + curprim = self.program + + # Parse primitives + # Dataflow primitives + elif decname.endswith('map'): + curprim = astnodes._MapNode(node.name, node) + + # If the arguments are defined in the decorator + if 'args' in dir(dec) and len(dec.args) > 0: + curprim.range = subsets.Range( + subscript_to_slice(dec.args[0], arrays)[1]) + 
else: + try: + curprim.range = subsets.Range([ + subscript_to_slice(arg.annotation, arrays)[1][0] + for arg in node.args.args + ]) + except (AttributeError, TypeError, ValueError): + raise DaCeSyntaxError( + self, node, + 'All arguments in DaCe primitive %s must be annotated with a range' + % node.name) + self._add_possible_inputs(curprim.range, curprim) + + elif decname.endswith('consume'): + curprim = astnodes._ConsumeNode(node.name, node) + + # If the arguments are defined in the decorator + if 'args' in dir(dec) and len(dec.args) > 0: + if dec.args[0].id not in self.curnode.globals: + raise DaCeSyntaxError( + self, node, 'Undefined stream %s in consume %s' % + (dec.args[0].id, node.name)) + curprim.stream = self.curnode.globals[rname(dec.args[0])] + ast_memlet = self.ParseMemlet(node.args.args[0].arg, + dec.args[0]) + ast_memlet.num_accesses = -1 + curprim.inputs[node.args.args[0].arg] = ast_memlet + if len(dec.args) < 2: + raise DaCeSyntaxError( + self, node, + 'Consume %s missing required argument: ' + 'number of processing elements' % node.name) + curprim.num_pes = symbolic.pystr_to_symbolic( + unparse(dec.args[1])) + if len(dec.args) > 2: + curprim.condition = unparse(dec.args[2]) + else: + curprim.condition = None + else: + raise DaCeSyntaxError( + self, node, + 'Consume syntax only supports parameters at the ' + 'decorator') + self._add_possible_inputs(curprim.stream, curprim) + + elif decname.endswith('tasklet'): + # Parse arguments + lang = None + gcode = None + if isinstance(dec, ast.Call): + lang = self._eval_ast( + self.getarg_or_kwarg(dec, 0, 'language')) + gcode = self._eval_ast( + self.getarg_or_kwarg(dec, 1, 'global_code')) + + if lang is None: + lang = types.Language.Python + else: + try: + lang = types.Language[lang] + except KeyError: + raise DaCeSyntaxError( + self, node, + 'Unrecognized tasklet language "%s"' % lang) + if gcode is None: + gcode = '' + + curprim = astnodes._TaskletNode(node.name, node, lang, gcode) + + # Control flow primitives + elif decname.endswith('iterate'): + if isinstance(parent_node, astnodes._DataFlowNode): + raise DaCeSyntaxError( + self, node, 'Control flow within data flow disallowed') + + curprim = astnodes._IterateNode(node.name, node) + + if 'args' in dir(dec) and len( + dec.args + ) > 0: # If the arguments are defined in the decorator + curprim.range = subsets.Range( + subscript_to_slice(dec.args[0], arrays)[1]) + else: + try: + curprim.range = subsets.Range([ + subscript_to_slice(arg.annotation, arrays)[1][0] + for arg in node.args.args + ]) + except (AttributeError, TypeError, ValueError): + raise SyntaxError( + 'All arguments in DaCe primitive %s must be annotated with a range' + % node.name) + self._add_possible_inputs(curprim.range, curprim) + + elif decname.endswith('loop'): + if isinstance(parent_node, astnodes._DataFlowNode): + raise DaCeSyntaxError( + self, node, 'Control flow within data flow disallowed') + + curprim = astnodes._LoopNode(node.name, node) + + if 'args' in dir(dec) and len( + dec.args + ) > 0: # If the arguments are defined in the decorator + curprim.condition = dec.args[0] + else: + raise SyntaxError( + 'Condition must be given as argument to decorator in DaCe primitive %s' + % node.name) + self._add_possible_inputs(curprim.condition, curprim) + + elif decname.endswith('conditional'): + if isinstance(parent_node, astnodes._DataFlowNode): + raise DaCeSyntaxError( + self, node, 'Control flow within data flow disallowed') + + curprim = astnodes._ConditionalNode(node.name, node) + + if 'args' in dir(dec) and len( 
+ dec.args + ) > 0: # If the arguments are defined in the decorator + curprim.condition = dec.args[0] + else: + raise SyntaxError( + 'Condition must be given as argument to decorator in DaCe primitive %s' + % node.name) + self._add_possible_inputs(curprim.condition, curprim) + + else: + raise DaCeSyntaxError(self, node, + 'Unrecognized primitive ' + decname) + + if '.async_' in decname or is_async: + curprim.is_async = True + # End of program/primitive name + + # If this is a primitive + if curprim is not None: + # If function definition contains arguments + if 'args' in dir(node): + for arg in node.args.args: + + # If it is not the program, add locals as symbols + if self.program != node.name: + curprim.globals[rname(arg)] = symbolic.symbol( + rname(arg)) + if curprim is not None: + curprim.params.append(rname(arg)) + + # Set children of parent primitive, if it is a primitive + if parent_node is not None and curprim is not None: + parent_node.children.append(curprim) + curprim.parent = parent_node + + self.curnode = curprim + + # Track local variables + self._set_locals() + + # Mandatory (to keep visiting children) + for stmt in node.body: + self.visit_TopLevel(stmt) + + # After traversing the function, pop "function name stack" + self.curnode = parent_node + else: # Not a primitive + self.curnode.locals[node.name] = node + self.curnode.globals[node.name] = node + + # Mandatory (to keep visiting children) + for stmt in node.body: + self.visit_TopLevel(stmt) + + def visit_AsyncFunctionDef(self, node): + # Treat as a plain function + self.visit_FunctionDef(node, is_async=True) + + def visit_Call(self, node): + if (not isinstance(node.func, ast.Attribute) + or node.func.value.id not in self.modules + or self.modules[node.func.value.id] != 'dace'): + self.generic_visit(node) + return + + # Reduce call + if node.func.attr.endswith('reduce'): + dec = node + # Mandatory arguments + wcr = dec.args[0] + src = dec.args[1] + dst = dec.args[2] + # In case the axis argument is given without explicit kwarg + # notation + axisarg = dec.args[3] if len(dec.args) > 3 else None + identityarg = dec.args[4] if len(dec.args) > 4 else None + + curprim = astnodes._ReduceNode('reduce', wcr) + curprim.axes = get_tuple(self, getkwarg(dec, 'axis', axisarg)) + curprim.identity = get_tuple( + self, getkwarg(dec, 'identity', identityarg)) + if curprim.identity is not None: + curprim.identity = curprim.identity[0] + curprim.inputs['input'] = self.ParseMemlet('input', src) + curprim.outputs['output'] = self.ParseMemlet('output', dst) + + # Set children of parent primitive, if it is a primitive + self.curnode.children.append(curprim) + curprim.parent = self.curnode + + def visit_TopLevelExpr(self, node): + if isinstance(node.value, ast.BinOp): + if (isinstance(node.value.op, ast.LShift)): + self.ParseArrayStatement(node, True) + return + if (isinstance(node.value.op, ast.RShift)): + self.ParseArrayStatement(node, False) + return + elif isinstance(node.value, ast.Str): + self.visit_TopLevelStr(node.value) + return + + self.generic_visit(node) + + def visit_TopLevelStr(self, node): + if isinstance(self.curnode, astnodes._TaskletNode): + if self.curnode.extcode != None: + raise DaCeSyntaxError( + self, node, + 'Cannot provide more than one intrinsic implementation ' + + 'for tasklet') + self.curnode.extcode = node.s + return + + self.generic_visit(node) + + # Detect locals and transient variables + def visit_Assign(self, node): + # Don't allow assignment to tuples (for now) + if len(node.targets) > 1: + raise 
DaCeSyntaxError(self, node, + 'Assignment to tuples not supported (yet)') + target = node.targets[0] + if isinstance(target, ast.Tuple): + if len(target.elts) > 1: + raise DaCeSyntaxError( + self, node, 'Assignment to tuples not supported (yet)') + target = target.elts[0] + + # Tasklet code + if self.curnode is not None: + if isinstance(node.value, ast.Call) and\ + not isinstance(self.curnode, astnodes._TaskletNode): + retval = self.ParseCallAssignment(node.value, target) + if retval is not None: + self.curnode.locals[rname(target)] = retval + self.curnode.globals[rname(target)] = retval + + # No need to further visit the node's children + return + else: + if isinstance(self.curnode, astnodes._DataFlowNode): + self.curnode.locals[rname(target)] = None + self.curnode.globals[rname(target)] = None + else: + retval = self._eval_ast(node.value) + self.curnode.locals[rname(target)] = retval + self.curnode.globals[rname(target)] = retval + + # No need to further visit the node's children + return + + self.generic_visit(node) + + # Visit statements that define locals + def visit_Name(self, node): + if self.curnode is None: + arrays = self.global_arrays + else: + arrays = self.curnode.arrays() + + if node.id in arrays and (not isinstance(arrays[node.id], data.Scalar) + or arrays[node.id].transient): + if isinstance(node.ctx, ast.Load) or isinstance( + node.ctx, ast.Store): + raise DaCeSyntaxError( + self, node, + 'Directly reading from and writing to arrays is not ' + 'allowed. Please use memlet notation (a << A[i])') + + self.generic_visit(node) + + # Control flow blocks + ######################### + def visit_For(self, node): + # Syntax: Only accept for loops without 'else'; only accept for loops + # with structure 'for in range()' + if len(node.orelse) > 0: + raise DaCeSyntaxError( + self, node, + 'Loops with \'else\' footer are not allowed in DaCe programs') + + if self.curnode is not None: + # Verify syntax + ######################################################## + # We allow only three types of for loops: + # 1. `for i in range(...)`: Creates a looping state + # 2. `for i in parrange(...)`: Creates a 1D map + # 3. 
`for i,j,k in dace.map[0:M, 0:N, 0:K]`: Creates an ND map + + if isinstance(node.iter, ast.Call): + funcname = rname(node.iter.func) + modname = self._get_module(node.iter.func) + elif isinstance(node.iter, ast.Subscript): + funcname = rname(node.iter.value) + modname = self._get_module(node.iter.value) + else: + funcname, modname = None, None + + # Case 1: Iterate + if (isinstance(node.target, ast.Name) + and isinstance(node.iter, ast.Call) + and isinstance(node.iter.func, ast.Name) + and node.iter.func.id == 'range'): + # If we are inside a dataflow construct, ignore + if isinstance(self.curnode, astnodes._DataFlowNode): + self.generic_visit(node) + return + + # Obtain parameters + varname = node.target.id + nargs = len(node.iter.args) + var_rb = 0 if nargs < 2 else symbolic.pystr_to_symbolic( + unparse(node.iter.args[0])) + var_re = (symbolic.pystr_to_symbolic( + unparse(node.iter.args[1])) + if nargs > 1 else symbolic.pystr_to_symbolic( + unparse(node.iter.args[0]))) - 1 + var_rs = 1 if nargs < 3 else symbolic.pystr_to_symbolic( + unparse(node.iter.args[2])) + + # Create node + curprim = astnodes._IterateNode('iterate_' + str(node.lineno), + node) + curprim.range = [(var_rb, var_re, var_rs)] + curprim.params = [varname] + self._add_possible_inputs(curprim.range, curprim) + self.curnode.children.append(curprim) + curprim.parent = self.curnode + + # Traverse into loop + oldnode = self.curnode + self.curnode = curprim + self._set_locals() + for stmt in node.body: + self.visit(stmt) + self.curnode = oldnode + #################### + return + + # Case 2: 1D map (for i in parrange(...)) + elif (isinstance(node.target, ast.Name) + and isinstance(node.iter, ast.Call) + and isinstance(node.iter.func, ast.Name) + and node.iter.func.id == 'parrange'): + curprim = astnodes._MapNode('map_' + str(node.lineno), node) + + # Get arguments for range + maprange = [] + if len(node.iter.args) == 1: # end only + maprange = [(None, node.iter.args[0], None)] + elif len(node.iter.args) == 2: # begin, end + maprange = [(node.iter.args[0], node.iter.args[1], None)] + elif len(node.iter.args) == 3: # begin, end, skip + maprange = [(node.iter.args[0], node.iter.args[1], + node.iter.args[2])] + else: + raise DaCeSyntaxError( + self, node, + 'Invalid number of arguments for "parrange"') + + curprim.range = subsets.Range( + astrange_to_symrange(maprange, self.curnode.arrays())) + curprim.params = [rname(node.target)] + + self._add_possible_inputs(curprim.range, curprim) + self.curnode.children.append(curprim) + curprim.parent = self.curnode + + # Traverse into loop + oldnode = self.curnode + self.curnode = curprim + self._set_locals() + for stmt in node.body: + self.visit(stmt) + self.curnode = oldnode + #################### + + return + + # Case 3: ND map + elif (isinstance(node.target, ast.Tuple) + and isinstance(node.iter, ast.Subscript) + and isinstance(node.iter.value, ast.Attribute) + and modname == 'dace' and node.iter.value.attr == 'map'): + curprim = astnodes._MapNode('map_' + str(node.lineno), node) + + # Get range from array subscript, check for length mismatch + _, range_values = subscript_to_slice(node.iter, + self.curnode.arrays()) + range_keys = [rname(n) for n in node.target.elts] + if len(range_keys) != len(range_values): + raise DaCeSyntaxError( + self, node, + 'Map range must match tuple length in for loop') + curprim.params = range_keys + curprim.range = subsets.Range(range_values) + + self._add_possible_inputs(curprim.range, curprim) + self.curnode.children.append(curprim) + curprim.parent = 
self.curnode + + # Traverse into loop + oldnode = self.curnode + self.curnode = curprim + self._set_locals() + for stmt in node.body: + self.visit(stmt) + self.curnode = oldnode + #################### + + return + + # No match + else: + raise DaCeSyntaxError( + self, node, 'Invalid loop syntax. Supported options are:\n' + ' for in range()\n' + ' for in parrange()\n' + ' for in dace.map[ranges]') + ####################################################### + + self.generic_visit(node) + + def visit_While(self, node): + # Syntax: Only accept while loops without 'else' + if len(node.orelse) > 0: + raise DaCeSyntaxError( + self, node, + 'Loops with \'else\' footer are not allowed in DaCe programs') + + if self.curnode is not None: + # If we are inside a dataflow construct, ignore + if not isinstance(self.curnode, astnodes._DataFlowNode): + # Obtain parameters + cond = node.test + + # Create node + curprim = astnodes._LoopNode('while_' + str(node.lineno), node) + curprim.condition = cond + self._add_possible_inputs(curprim.condition, curprim) + self.curnode.children.append(curprim) + curprim.parent = self.curnode + + # Traverse into loop + oldnode = self.curnode + self.curnode = curprim + self._set_locals() + for stmt in node.body: + self.visit(stmt) + self.curnode = oldnode + #################### + return + + self.generic_visit(node) + + def visit_If(self, node): + if self.curnode is not None: + # If we are inside a dataflow construct, ignore + if not isinstance(self.curnode, astnodes._DataFlowNode): + # Obtain parameters + cond = node.test + + # Create node + curprim = astnodes._IfNode('if_' + str(node.lineno), node) + curprim.condition = cond + self._add_possible_inputs(curprim.condition, curprim) + self.curnode.children.append(curprim) + curprim.parent = self.curnode + + # Traverse into condition + oldnode = self.curnode + self.curnode = curprim + self._set_locals() + for stmt in node.body: + self.visit(stmt) + self.curnode = oldnode + + # Process 'else'/'elif' statements + if len(node.orelse) > 0: + # Create node + curprim = astnodes._ElseNode( + 'else_' + str(node.orelse[0].lineno), node) + # Negate condition + curprim.condition = astutils.negate_expr(cond) + self.curnode.children.append(curprim) + curprim.parent = self.curnode + + # Traverse into condition + oldnode = self.curnode + self.curnode = curprim + self._set_locals() + for stmt in node.orelse: + self.visit(stmt) + self.curnode = oldnode + + return + + self.generic_visit(node) + + def visit_With(self, node, is_async=False): + # "with dace.tasklet" syntax + if len(node.items) == 1: + dec = node.items[0].context_expr + if isinstance(dec, ast.Call): + funcname = rname(dec.func) + modname = self._get_module(dec.func) + elif isinstance(dec, ast.Attribute): + funcname = rname(dec) + modname = self._get_module(dec) + else: + funcname, modname = None, None + + if modname == 'dace' and funcname.endswith('.tasklet'): + # Parse as tasklet + # NOTE: This is almost a direct copy of the tasklet parser + # above. 
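+                # Illustrative sketch of the user-facing syntax this branch
+                # parses (hypothetical names A, B, i used only for the
+                # example; they are not part of the parser):
+                #
+                #     with dace.tasklet:
+                #         a << A[i]
+                #         b >> B[i]
+                #         b = a + 1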
+ lang = None + gcode = None + if isinstance(dec, ast.Call): + lang = self._eval_ast( + self.getarg_or_kwarg(dec, 0, 'language')) + gcode = self._eval_ast( + self.getarg_or_kwarg(dec, 1, 'global_code')) + + if lang is None: + lang = types.Language.Python + else: + try: + lang = types.Language[lang] + except KeyError: + raise DaCeSyntaxError( + self, node, + 'Unrecognized tasklet language "%s"' % lang) + if gcode is None: + gcode = '' + + curprim = astnodes._TaskletNode('tasklet_' + str(node.lineno), + node, lang, gcode) + if self.curnode is not None: + self.curnode.children.append(curprim) + curprim.parent = self.curnode + + # Traverse into tasklet + oldnode = self.curnode + self.curnode = curprim + self._set_locals() + for stmt in node.body: + self.visit_TopLevel(stmt) + self.curnode = oldnode + return + + raise DaCeSyntaxError( + self, node, + 'General "with" statements disallowed in DaCe programs') + + ######################### + + ## Disallowed statements + def visit_Global(self, node): + raise DaCeSyntaxError( + self, node, '"global" statements disallowed in DaCe sub-programs') + + def visit_Delete(self, node): + raise DaCeSyntaxError(self, node, + '"del" statements disallowed in DaCe programs') + + def visit_Import(self, node): + raise DaCeSyntaxError(self, node, + 'imports disallowed in DaCe programs') + + def visit_ImportFrom(self, node): + raise DaCeSyntaxError(self, node, + 'imports disallowed in DaCe programs') + + def visit_Assert(self, node): + raise DaCeSyntaxError( + self, node, '"assert" statements disallowed in DaCe programs') + + def visit_Pass(self, node): + raise DaCeSyntaxError(self, node, + '"pass" statements disallowed in DaCe programs') + + def visit_Exec(self, node): + raise DaCeSyntaxError(self, node, + '"exec" statements disallowed in DaCe programs') + + def visit_Print(self, node): + raise DaCeSyntaxError( + self, node, '"print" statements disallowed in DaCe programs') + + def visit_Nonlocal(self, node): + raise DaCeSyntaxError( + self, node, '"nonlocal" statements disallowed in DaCe programs') + + def visit_Yield(self, node): + raise DaCeSyntaxError( + self, node, '"yield" statements disallowed in DaCe programs') + + def visit_YieldFrom(self, node): + raise DaCeSyntaxError( + self, node, '"yield" statements disallowed in DaCe programs') + + def visit_Raise(self, node): + raise DaCeSyntaxError(self, node, + 'exceptions disallowed in DaCe programs') + + def visit_Try(self, node): + raise DaCeSyntaxError(self, node, + 'exceptions disallowed in DaCe programs') + + def visit_TryExcept(self, node): + raise DaCeSyntaxError(self, node, + 'exceptions disallowed in DaCe programs') + + def visit_TryFinally(self, node): + raise DaCeSyntaxError(self, node, + 'exceptions disallowed in DaCe programs') + + def visit_ExceptHandler(self, node): + raise DaCeSyntaxError(self, node, + 'exceptions disallowed in DaCe programs') + + def visit_AsyncWith(self, node): + self.visit_With(node, is_async=True) + + def visit_Starred(self, node): + raise DaCeSyntaxError( + self, node, 'starred statements disallowed in DaCe programs') + + def visit_Ellipsis(self, node): + raise DaCeSyntaxError(self, node, + '"..." 
statements disallowed in DaCe programs') + + # disallowed for now + def visit_ClassDef(self, node): + raise DaCeSyntaxError(self, node, + 'classes disallowed (for now) in DaCe programs') + + def visit_AsyncFor(self, node): + raise DaCeSyntaxError( + self, node, + 'asynchronous loops disallowed (for now) in DaCe programs') + + def visit_Await(self, node): + raise DaCeSyntaxError(self, node, + 'await disallowed (for now) in DaCe programs') + + #Data structures + def visit_Bytes(self, node): + raise DaCeSyntaxError( + self, node, 'bytestrings disallowed (for now) in DaCe programs') + + def visit_Set(self, node): + raise DaCeSyntaxError(self, node, + 'sets disallowed (for now) in DaCe programs') + + def visit_Dict(self, node): + raise DaCeSyntaxError( + self, node, 'dictionaries disallowed (for now) in DaCe programs') + + #Comprehensions + def visit_ListComp(self, node): + raise DaCeSyntaxError(self, node, + 'comprehensions disallowed in DaCe programs') + + def visit_GeneratorExp(self, node): + raise DaCeSyntaxError(self, node, + 'comprehensions disallowed in DaCe programs') + + def visit_SetComp(self, node): + raise DaCeSyntaxError(self, node, + 'comprehensions disallowed in DaCe programs') + + def visit_DictComp(self, node): + raise DaCeSyntaxError(self, node, + 'comprehensions disallowed in DaCe programs') + + def visit_comprehension(self, node): + raise DaCeSyntaxError(self, node, + 'comprehensions disallowed in DaCe programs') + + def visit_ImportFrom(self, node): + raise DaCeSyntaxError(self, node, + 'imports disallowed in DaCe programs') + + +class ASTFindAndReplace(ast.NodeTransformer): + """ A Python AST transformer utility that finds and replaces names. """ + + def __init__(self, replacements, skip_subscripts=True): + self.replacement_dict = replacements + self.skip_subscripts = skip_subscripts + + def visit_Subscript(self, node): + # Do not visit subscripts that contain a replacement + if rname(node) in self.replacement_dict and self.skip_subscripts: + return node + self.generic_visit(node) + + def visit_Name(self, node): + if node.id in self.replacement_dict: + return ast.copy_location( + ast.Name(id=self.replacement_dict[node.id], ctx=node.ctx), + node) + + return self.generic_visit(node) + + +class SymbolResolver(astutils.ExtNodeTransformer): + """ Python AST transformer that resolves symbols to their name or + value. 
""" + + def __init__(self, symbols): + self.symbols = symbols + self.locals = {} + self.top_function = True + + def resolve(self, node): + if node is None: + return None + if isinstance(node, tuple): + return tuple(self.resolve(n) for n in node) + return unparse(self.visit(node)) + + def visit_FunctionDef(self, node): + oldlocals = {} + oldlocals.update(self.locals) + oldtop = self.top_function + + # Register parameters as locals + if not self.top_function: + for arg in node.args.args: + self.locals[rname(arg)] = arg + + self.top_function = False + result = self.generic_visit(node) + self.top_function = oldtop + + self.locals = oldlocals + + return result + + def visit_TopLevelExpr(self, node): + if isinstance(node.value, ast.BinOp): + if isinstance(node.value.op, ast.LShift) or isinstance( + node.value.op, ast.RShift): + self.locals[rname(node.value.left)] = node.value.left + + node.value.right = self.visit(node.value.right) + return node + + return self.generic_visit(node) + + def visit_Name(self, node): + # Defining a local + if isinstance(node.ctx, ast.Store): + # TODO(later): Scope management + # Example: + # n = 5 + # @dace.program + # def prog(): + # def inner(): + # n = dace.define_local(...) + # use n (should be "n") + # use n (should be 5) + + self.locals[node.id] = node + return node + + if node.id not in self.symbols: + return node + if node.id in self.locals: + return node + + sym = self.symbols[node.id] + if isinstance(sym, symbolic.symbol): + return ast.copy_location(ast.Name(id=sym.name, ctx=node.ctx), node) + elif isinstance(sym, types.typeclass): + # Find dace module name + dacemodule = next( + k for k, v in self.symbols.items() + if isinstance(v, type(types)) and v.__name__ == 'dace') + + return ast.copy_location( + ast.Attribute( + value=ast.Name(id=dacemodule, ctx=ast.Load()), + attr=sym.to_string(), + ctx=node.ctx), node) + elif types.isconstant(sym): + return ast.copy_location(ast.Num(n=sym, ctx=node.ctx), node) + elif isinstance(sym, ast.Name): + return ast.copy_location(ast.Name(id=sym.id, ctx=node.ctx), node) + elif isinstance(sym, ast.AST): + return ast.copy_location(sym, node) + else: + return node + + +########################################################################## +# Function inlining + + +class CounterDict(object): + """ Dictionary object that counts how many times a value was added to + it. """ + + def __init__(self): + self.values = {} + + def get(self, key): + if key in self.values: + return self.values[key] + else: + return 0 + + def add(self, key, count=1): + if key not in self.values: + self.values[key] = count + else: + self.values[key] += count + + +class FunctionInliner(ExtNodeTransformer): + """ A Python AST transformer that inlines functions called (e.g., with + "dace.call") in an existing AST. """ + + def __init__(self, global_vars, modules, local_vars={}): + self.globals = global_vars + self.locals = local_vars + self.modules = modules + self.function_inline_counter = CounterDict() + + def visit_Call(self, node): + cnode = node + + # First, visit arguments and (possibly) inline them. 
This takes care + # of "dace.call(func, dace.call(f2, arg), ...)" cases + node = self.generic_visit(node) + + # Only accept "dace.call" calls + if isinstance(cnode.func, ast.Attribute) and cnode.func.attr == 'call': + # Verify that the module is DaCe + if self.modules[cnode.func.value.id] == 'dace': + # INLINE + if len(cnode.args) < 1: + raise SyntaxError( + 'dace.call must have at least one parameter') + return self.inline_function(cnode, cnode.args[0]) + + return node + + # Inline top-level calls as well + def visit_TopLevelExpr(self, node): + if isinstance(node.value, ast.Call): + node.value = self.visit_TopLevelCall(node.value) + return node + return self.generic_visit(node) + + def _fname_and_module(self, funcnode): + funcmodule = None + if isinstance(funcnode, ast.Attribute): + funcmodule = funcnode.value.id + funcname = funcnode.attr + else: + funcname = funcnode.id + return (funcmodule, funcname) + + def visit_TopLevelCall(self, node): + # If dace.call(...) + if isinstance(node.func, ast.Attribute) and node.func.attr == 'call': + return self.visit_Call(node) + + funcmodule, funcname = self._fname_and_module(node.func) + if funcmodule is None and funcname in self.globals: + # First, visit arguments and (possibly) inline them. This takes care + # of "dace.call(func, dace.call(f2, arg), ...)" cases + node = self.generic_visit(node) + + return self.inline_function(node, node.func) + + return self.generic_visit(node) + + def _transients_from_ast(self, src_ast): + results = set() + for astnode in ast.walk(src_ast): + if (isinstance(astnode, ast.Assign) + and isinstance(astnode.value, ast.Call)): + modulename, _ = self._fname_and_module(astnode.value.func) + if (modulename is not None + and self.modules[modulename] == 'dace'): + # Don't allow assignment to tuples (for now) + if len(astnode.targets) > 1: + raise DaCeSyntaxError( + self, astnode, + 'Assignment to tuples not supported (yet)') + target = astnode.targets[0] + if isinstance(target, ast.Tuple): + if len(target.elts) > 1: + raise DaCeSyntaxError( + self, node, + 'Assignment to tuples not supported (yet)') + target = target.elts[0] + + results.add(rname(target)) + return results + + def inline_function(self, cnode, funcnode): + funcmodule, funcname = self._fname_and_module(funcnode) + if funcmodule is None and funcname not in self.globals: + raise SyntaxError( + 'Function %s not found (is it declared as @dace.external_function?)' + % funcname) + if funcmodule is not None: + raise SyntaxError('External DaCe functions should be' + + ' imported directly using "from ' + + ' import ..."') + + self.function_inline_counter.add(funcname) + + # Obtain the function object + f = None + if isinstance(self.globals[funcname], types._external_function): + f = self.globals[funcname].func + else: + f = self.globals[funcname] + + # Parse that function's AST + src_ast, src_file, src_line, src = function_to_ast(f) + + # Inline the function's intenal dace.calls recursively + for astnode in ast.walk(src_ast): + if isinstance(astnode, ast.Call): + src_ast = FunctionInliner(self.globals, + self.modules).visit(src_ast) + break + + # Replace the function's parameters with the values in arguments + func_args = src_ast.body[0].args.args + + if cnode.func == funcnode: # In case of calling a function directly + call_args = cnode.args[:] + else: # In case of calling a function through dace.call + call_args = cnode.args[1:] + if len(func_args) != len(call_args): + raise SyntaxError( + 'Mismatch in arguments to call %s. 
Expecting %d, got %d' % + (f.__name__, len(func_args), len(call_args))) + + replacement_map = { # parameter replacement map + rname(k): v + for k, v in zip(func_args, call_args) + } + + # Obtain and rename transients as well. "tmp" --> "func0_tmp" + local_replacement_map = { + k: ast.Name( + ctx=ast.Load(), + id='%s%d_%s' % (funcname, + self.function_inline_counter.get(funcname), k)) + for k in self._transients_from_ast(src_ast) + } + for replacement in local_replacement_map.values(): + for repl_ast in ast.walk(replacement): + if isinstance(repl_ast, ast.Name): + if (repl_ast.id in self.globals + or repl_ast.id in self.locals): + raise SyntaxError( + ('Cannot name a symbol %s due to function ' + + 'inlining, please choose another name') % + repl_ast.id) + replacement_map.update(local_replacement_map) + + src_ast = SymbolResolver(replacement_map).visit(src_ast) + + # If the function has a return statement, then we need to + # evaluate the AST instead + if any(isinstance(stmt, ast.Return) for stmt in ast.walk(src_ast)): + if len(src_ast.body[0].body) > 1: + raise NotImplementedError( + "Functions with return value and more than one statement are not implemented" + ) + + # Inline the function by replacing the return value + return src_ast.body[0].body[0].value + + return src_ast.body[0].body diff --git a/dace/frontend/python/astutils.py b/dace/frontend/python/astutils.py new file mode 100644 index 0000000000..ac16dffe6f --- /dev/null +++ b/dace/frontend/python/astutils.py @@ -0,0 +1,317 @@ +""" Various AST parsing utilities for DaCe. """ +import ast +import astunparse +import sympy + +from dace import types, symbolic + + +def rname(node): + """ Obtains names from different types of AST nodes. """ + + if isinstance(node, str): + return node + if isinstance(node, ast.Num): + return str(node.n) + if isinstance(node, ast.Name): # form x + return node.id + if isinstance(node, ast.Subscript): # form A[a:b,...,c:d] + return rname(node.value) + if isinstance(node, ast.Attribute): # form @dace.attr_noparams + return rname(node.value) + '.' + rname(node.attr) + if isinstance(node, ast.Call): # form @dace.attr(...) + if isinstance(node.func, ast.Name): + return node.func.id + return node.func.value.id + '.' + node.func.attr + if isinstance(node, ast.FunctionDef): # form def func(...) + return node.name + if isinstance(node, ast.keyword): + return node.arg + try: + if isinstance(node, ast.arg): # form func(..., arg, ...) + return node.arg + except AttributeError: + pass + + raise TypeError('Invalid AST node type: ' + str(type(node))) + + +def getkwarg(node, argname, default=None): + """ Helper function to get AST node of a keyword argument (of form + "argname="). """ + + for kw in node.keywords: + if rname(kw) == argname: + return kw.value + return default + + +def DaCeSyntaxError(visitor, node, err): + """ Reports errors with their corresponding file/line information. """ + + try: + line = node.lineno + col = node.col_offset + except AttributeError: + line = 0 + col = 0 + + return SyntaxError(err + "\n in File " + str(visitor.filename) + + ", line " + str(line) + ":" + str(col) + + ", in function " + str(visitor.curnode.name)) + + +def get_tuple(visitor, node): + """ Parses and returns a tuple from an AST node, including explicit None + values. """ + if isinstance(node, ast.Num): # A number (single axis) + return (node.n, ) + if isinstance( + node, + ast.Tuple): # Tuple. Assumes all axes are values. 
Form: (2,3,5) + for v in node.elts: + if not isinstance(v, ast.Num): + raise DaCeSyntaxError(visitor, node, + 'Axis tuple can only contain integers') + return tuple(value.n for value in node.elts) + if isinstance(node, ast.Name): # Python 2.x variant of explicit "None" + if node.id == 'None': + return None + if node is None: # Argument not given + return None + + # Python 3 only + try: + if isinstance(node, ast.NameConstant + ): # Explicit None (or True). Example: "axis=None" + if node.value is None: + return node.value + except AttributeError: + pass + + raise DaCeSyntaxError( + visitor, node, + 'Invalid expression, expected tuple of constant numbers or None') + + +def subscript_to_ast_slice(node, without_array=False): + """ Converts an AST subscript to slice on the form + (, [<3-tuples of AST nodes>]). If an ast.Name is passed, returns + (name, None), implying the full range. + @param node: The AST node to convert. + @param without_array: If True, returns only the slice. Otherwise, + returns a 2-tuple of (array, range). + """ + + result_arr = None + result_slice = None + + if isinstance(node, ast.Name): + # None implies the full array. We can't create artificial + # (None, None, None) tuples, because we don't know the dimensionality of + # the array at this point + result_arr, result_slice = node.id, None + return result_slice if without_array else (result_arr, result_slice) + + if not isinstance(node, ast.Subscript): + raise TypeError('AST node is not a subscript') + + # ND Index + if isinstance(node.slice, ast.Index): + if isinstance(node.slice.value, ast.Tuple): + result_slice = [dim for dim in node.slice.value.elts] + else: + result_slice = [node.slice.value] + # 1D slice + elif isinstance(node.slice, ast.Slice): + result_slice = [(node.slice.lower, node.slice.upper, node.slice.step)] + else: # ND slice + result_slice = [] + + for d in node.slice.dims: + if isinstance(d, ast.Index): + result_slice.append(d.value) + else: + result_slice.append((d.lower, d.upper, d.step)) + + if without_array: + return result_slice + else: + return (rname(node.value), result_slice) + + +def subscript_to_ast_slice_recursive(node): + """ Converts an AST subscript to a slice in a recursive manner into nested + subscripts. + @see: subscript_to_ast_slice + """ + result = [] + while isinstance(node, ast.Subscript): + result.insert(0, subscript_to_ast_slice(node, True)) + node = node.value + + return result + + +def unparse(node): + """ Unparses an AST node to a Python string, chomping trailing newline. """ + if node is None: + return None + return astunparse.unparse(node).strip() + + +# Helper function to convert an ND subscript AST node to a list of 3-tuple +# slice strings +def subscript_to_slice(node, arrays, without_array=False): + """ Converts an AST subscript to slice on the form + (, [<3-tuples of indices>]). If an ast.Name is passed, return + (name, None), implying the full range. """ + + name, ast_slice = subscript_to_ast_slice(node) + if name in arrays: + arrname = name + else: + arrname = None + + rng = astrange_to_symrange(ast_slice, arrays, arrname) + if without_array: + return rng + else: + return name, rng + + +def astrange_to_symrange(astrange, arrays, arrname=None): + """ Converts an AST range (array, [(start, end, skip)]) to a symbolic math + range, using the obtained array sizes and resolved symbols. 
""" + if arrname is not None: + arrdesc = arrays[arrname] + + # If the array is a scalar, return None + if arrdesc.shape is None: + return None + + # If range is the entire array, use the array descriptor to obtain the + # entire range + if astrange is None: + return [ + (symbolic.pystr_to_symbolic(0), + symbolic.pystr_to_symbolic(types.symbol_name_or_value(s)) - 1, + symbolic.pystr_to_symbolic(1)) for s in arrdesc.shape + ] + + result = [None] * len(astrange) + for i, r in enumerate(astrange): + if isinstance(r, tuple): + begin, end, skip = r + # Default values + if begin is None: + begin = symbolic.pystr_to_symbolic(0) + else: + begin = symbolic.pystr_to_symbolic(unparse(begin)) + if end is None and arrname is None: + raise SyntaxError('Cannot define range without end') + elif end is not None: + end = symbolic.pystr_to_symbolic(unparse(end)) - 1 + else: + end = symbolic.pystr_to_symbolic( + types.symbol_name_or_value(arrdesc.shape[i])) - 1 + if skip is None: + skip = symbolic.pystr_to_symbolic(1) + else: + skip = symbolic.pystr_to_symbolic(unparse(skip)) + else: + # In the case where a single element is given + begin = symbolic.pystr_to_symbolic(unparse(r)) + end = begin + skip = symbolic.pystr_to_symbolic(1) + + result[i] = (begin, end, skip) + + return result + + +def negate_expr(node): + """ Negates an AST expression by adding a `Not` AST node in front of it. + """ + if hasattr(node, "__len__"): + if len(node) > 1: + raise ValueError("negate_expr only expects " + "single expressions, got: {}".format(node)) + expr = node[0] + else: + expr = node + newexpr = ast.Expr(value=ast.UnaryOp(op=ast.Not(), operand=expr)) + newexpr = ast.copy_location(newexpr, expr) + return ast.fix_missing_locations(newexpr) + + +class ExtNodeTransformer(ast.NodeTransformer): + """ A `NodeTransformer` subclass that walks the abstract syntax tree and + allows modification of nodes. As opposed to `NodeTransformer`, + this class is capable of traversing over top-level expressions in + bodies in order to discern DaCe statements from others. + """ + + # Default implementation of TopLevelExpr + def visit_TopLevelExpr(self, node): + return self.visit(node) + + def generic_visit(self, node): + for field, old_value in ast.iter_fields(node): + if isinstance(old_value, list): + new_values = [] + for value in old_value: + if isinstance(value, ast.AST): + if (field == 'body' + or field == 'orelse') and isinstance( + value, ast.Expr): + value = self.visit_TopLevelExpr(value) + else: + value = self.visit(value) + if value is None: + continue + elif not isinstance(value, ast.AST): + new_values.extend(value) + continue + new_values.append(value) + old_value[:] = new_values + elif isinstance(old_value, ast.AST): + new_node = self.visit(old_value) + if new_node is None: + delattr(node, field) + else: + setattr(node, field, new_node) + return node + + +class ExtNodeVisitor(ast.NodeVisitor): + """ A `NodeVisitor` subclass that walks the abstract syntax tree. + As opposed to `NodeVisitor`, this class is capable of traversing over + top-level expressions in bodies in order to discern DaCe statements + from others. 
""" + + def visit_TopLevel(self, node): + clsname = type(node).__name__ + if getattr(self, "visit_TopLevel" + clsname, False): + getattr(self, "visit_TopLevel" + clsname)(node) + else: + self.visit(node) + + def generic_visit(self, node): + for field, old_value in ast.iter_fields(node): + if isinstance(old_value, list): + for value in old_value: + if isinstance(value, ast.AST): + if (field == 'body' or field == 'orelse'): + clsname = type(value).__name__ + if getattr(self, "visit_TopLevel" + clsname, + False): + getattr(self, + "visit_TopLevel" + clsname)(value) + else: + self.visit(value) + else: + self.visit(value) + elif isinstance(old_value, ast.AST): + self.visit(old_value) + return node diff --git a/dace/frontend/python/decorators.py b/dace/frontend/python/decorators.py new file mode 100644 index 0000000000..f6e980446d --- /dev/null +++ b/dace/frontend/python/decorators.py @@ -0,0 +1,109 @@ +""" Python decorators for DaCe functions. """ + +from __future__ import print_function +from functools import wraps + +from dace import types +from dace.frontend.python import parser + + +def paramdec(dec): + """ Parameterized decorator meta-decorator. Enables using `@decorator`, + `@decorator()`, and `@decorator(...)` with the same function. """ + + @wraps(dec) + def layer(*args, **kwargs): + + # Allows the use of @decorator, @decorator(), and @decorator(...) + if len(kwargs) == 0 and len(args) == 1 and callable( + args[0]) and not isinstance(args[0], types.typeclass): + return dec(*args, **kwargs) + + @wraps(dec) + def repl(f): + return dec(f, *args, **kwargs) + + return repl + + return layer + + +############################################# + + +@paramdec +def program(f, *args, **kwargs): + """ DaCe program, entry point to a data-centric program. """ + + # Parses a python @dace.program function and returns an object that can + # be translated + return parser.DaceProgram(f, args, kwargs) + + +############################################# + + +@paramdec +def external_function(f, **alternative_implementations): + """ External functions that may be called within a DaCe program. """ + return types._external_function(f, alternative_implementations) + + +# Internal DaCe decorators, these are not actually run, but rewritten + + +# Dataflow constructs +@paramdec +def map(f, rng): + """ A Map is representation of parallel execution, containing + an integer set (Python range) for which its contents are run + concurrently. + @param rng: The map's range. + """ + return None + + +@paramdec +def consume(f, stream, pes): + """ Consume is a scope, like `Map`, that creates parallel execution. + Unlike `Map`, it creates a producer-consumer relationship between an + input stream and the contents. The contents are run by the given number + of processing elements, who will try to pop elements from the input + stream until a given quiescence condition is reached. + @param stream: The stream to pop from. + @param pes: The number of processing elements to use. + """ + return None + + +def tasklet(f): + """ A general procedure that cannot access any memory apart from incoming + and outgoing memlets. The DaCe framework cannot analyze these tasklets + for optimization. """ + return None + + +# Control-flow constructs +@paramdec +def iterate(f, rng): + """ A decorator version of a for loop, with a range of `rng`. + @param rng: The range of the for loop. + """ + return None + + +@paramdec +def loop(f, cond): + """ A decorator version of a while loop, with a looping condition `cond`. 
+ @param cond: The condition of the while loop. + """ + return None + + +@paramdec +def conditional(f, cond): + """ A decorator version of conditional execution, with an if-condition + `cond`. + @param cond: The condition of the branch. + """ + return None diff --git a/dace/frontend/python/depanalysis.py b/dace/frontend/python/depanalysis.py new file mode 100644 index 0000000000..a7c4e880d6 --- /dev/null +++ b/dace/frontend/python/depanalysis.py @@ -0,0 +1,796 @@ +""" Data dependency analysis functionality, as well as functions to convert + an AST-parsed data-centric Python program into an SDFG. """ +import ast +from collections import deque, OrderedDict +from copy import deepcopy as dcpy +import sympy + +from dace import data as dt, types, symbolic +from dace.graph import edges as ed +from dace.graph import nodes as nd +from dace import subsets as sbs +from dace import sdfg +from dace.memlet import EmptyMemlet, Memlet +from dace.frontend.python import astnodes, astutils +from dace.frontend.python.astparser import MemletRemover, ModuleInliner + + +def create_states_simple(pdp, + out_sdfg, + start_state=None, + end_state=None, + start_edge=None): + """ Creates a state per primitive, with the knowledge that they can be + optimized later. + @param pdp: A parsed dace program. + @param out_sdfg: The output SDFG. + @param start_state: The starting/parent state to connect from (for + recursive calls). + @param end_state: The end/parent state to connect to (for + recursive calls). + @return: A dictionary mapping between a state and the list of dace + primitives included in it. + """ + state_to_primitives = OrderedDict() + + # Create starting state and edge + if start_state is None: + start_state = out_sdfg.add_state('start') + state_to_primitives[start_state] = [] + if start_edge is None: + start_edge = ed.InterstateEdge() + + previous_state = start_state + previous_edge = start_edge + + for i, primitive in enumerate(pdp.children): + state = out_sdfg.add_state(primitive.name) + state_to_primitives[state] = [] + # Edge that can be created on entry to control flow children + entry_edge = None + + ######################################### + # Cases depending on primitive type + ######################################### + + # Nothing special happens with a dataflow node (nested states are + # handled with a separate call to create_states_simple) + if isinstance(primitive, astnodes._DataFlowNode): + out_sdfg.add_edge(previous_state, state, previous_edge) + state_to_primitives[state] = [primitive] + previous_state = state + previous_edge = ed.InterstateEdge() + + # Control flow needs to traverse into children nodes + elif isinstance(primitive, astnodes._ControlFlowNode): + # Iteration has >=3 states - begin, loop[...], end; and connects the + # loop states, as well as the begin to end directly if the condition + # did not evaluate to true + if isinstance(primitive, astnodes._IterateNode): + + condition = ast.parse( + '(%s %s %s)' % (primitive.params[0], '<' + if primitive.range[0][2] >= 0 else '>', + primitive.range[0][1] + 1)).body[0] + condition_neg = astutils.negate_expr(condition) + + # Loop-start state + lstart_state = out_sdfg.add_state(primitive.name + '_start') + state_to_primitives[lstart_state] = [] + out_sdfg.add_edge(previous_state, lstart_state, previous_edge) + out_sdfg.add_edge( + lstart_state, + state, + ed.InterstateEdge( + assignments={ + primitive.params[0]: primitive.range[0][0] + })) + + # Loop-end state that jumps back to `state` + loop_state = out_sdfg.add_state(primitive.name + 
'_end') + state_to_primitives[loop_state] = [] + # Connect loop + out_sdfg.add_edge( + loop_state, + state, + ed.InterstateEdge( + assignments={ + primitive.params[0]: + symbolic.pystr_to_symbolic(primitive.params[0]) + + primitive.range[0][2] + })) + + # End connection + previous_state = state + previous_edge = ed.InterstateEdge(condition=condition_neg) + + # Create children states + cmap = create_states_simple( + primitive, + out_sdfg, + state, + loop_state, + ed.InterstateEdge(condition=condition)) + state_to_primitives.update(cmap) + + # Loop is similar to iterate, but more general w.r.t. conditions + elif isinstance(primitive, astnodes._LoopNode): + loop_condition = primitive.condition + + # Entry + out_sdfg.add_edge(previous_state, state, previous_edge) + + # Loop-end state that jumps back to `state` + loop_state = out_sdfg.add_state(primitive.name + '_end') + state_to_primitives[loop_state] = [] + + # Loopback + out_sdfg.add_edge(loop_state, state, ed.InterstateEdge()) + # End connection + previous_state = state + previous_edge = ed.InterstateEdge( + condition=astutils.negate_expr(loop_condition)) + entry_edge = ed.InterstateEdge(condition=loop_condition) + + # Create children states + cmap = create_states_simple(primitive, out_sdfg, state, + loop_state, entry_edge) + state_to_primitives.update(cmap) + + elif isinstance(primitive, astnodes._IfNode): + if_condition = primitive.condition + # Check if we have an else node, otherwise add a skip condition + # ourselves + if (i + 1) < len(pdp.children) and isinstance( + pdp.children[i + 1], astnodes._ElseNode): + has_else = True + else_prim = pdp.children[i + 1] + else_condition = else_prim.condition + else: + has_else = False + else_condition = astutils.negate_expr(primitive.condition) + + # End-of-branch state (converge to this) + bend_state = out_sdfg.add_state(primitive.name + '_end') + state_to_primitives[bend_state] = [] + + # Entry + out_sdfg.add_edge(previous_state, state, previous_edge) + + # Create children states + cmap = create_states_simple( + primitive, + out_sdfg, + state, + bend_state, + ed.InterstateEdge(condition=if_condition)) + state_to_primitives.update(cmap) + + # Handle 'else' condition + if not has_else: + out_sdfg.add_edge( + state, + bend_state, + ed.InterstateEdge(condition=else_condition)) + else: + # Recursively parse 'else' primitive's children + cmap = create_states_simple( + else_prim, + out_sdfg, + state, + bend_state, + ed.InterstateEdge(condition=else_condition)) + state_to_primitives.update(cmap) + + # Exit + previous_state = bend_state + previous_edge = ed.InterstateEdge() + + elif isinstance(primitive, astnodes._ElseNode): + if i - 1 < 0 or not isinstance(pdp.children[i - 1], + astnodes._IfNode): + raise SyntaxError('Found else state without matching if') + + # If 'else' state is correct, we already processed it + del state_to_primitives[state] + out_sdfg.remove_node(state) + + # Connect to end_state (and create it if necessary) + if end_state is None: + end_state = out_sdfg.add_state('end') + state_to_primitives[end_state] = [] + out_sdfg.add_edge(previous_state, end_state, previous_edge) + + return state_to_primitives + + +def _make_full_range(memlet: astnodes._Memlet): + fullRange = sbs.Range([(0, s - 1, 1) for s in memlet.data.shape]) + fullMemlet = astnodes._Memlet(memlet.data, + memlet.dataname, memlet.attribute, + fullRange.num_elements(), None, None, + fullRange, memlet.veclen, None, None, {}) + return fullMemlet + + +def _full_memlet_from_array(arrayname, array): + fullRange = sbs.Range([(0, 
s - 1, 1) for s in array.shape]) + fullMemlet = astnodes._Memlet(array, arrayname, None, + fullRange.num_elements(), None, None, + fullRange, 1, None, None, {}) + return fullMemlet + + +def inherit_dependencies(prim): + + # Inject tasklets for map nodes and push down dependencies + if (isinstance(prim, (astnodes._MapNode, astnodes._ConsumeNode)) + and len(prim.children) == 0): + tasklet = astnodes._TaskletNode(prim.name, prim.ast) + tasklet.parent = prim + tasklet.inputs = OrderedDict( + [(k, v) for k, v in prim.inputs.items() if '__DACEIN_' not in k]) + tasklet.outputs = OrderedDict( + [(k, v) for k, v in prim.outputs.items() if '__DACEIN_' not in k]) + prim.inputs = OrderedDict( + [(k, v) for k, v in prim.inputs.items() if '__DACEIN_' in k]) + prim.outputs = OrderedDict( + [(k, v) for k, v in prim.outputs.items() if '__DACEIN_' in k]) + prim.children.append(tasklet) + + # The recursive dependencies of this node which we will return + dependIn = OrderedDict() + dependOut = OrderedDict() + + # Add own dependencies (input) + inputQueue = deque(prim.inputs.items()) + while len(inputQueue) > 0: + arrname, memlet = inputQueue.popleft() + fullMemlet = _make_full_range(memlet) + dependIn[fullMemlet.dataname] = fullMemlet + # Additional dependencies (e.g., as a result of indirection) + for aname, additional_arr in memlet.otherdeps.items(): + additional_astmemlet = _full_memlet_from_array( + aname, additional_arr) + dependIn[additional_astmemlet.dataname] = additional_astmemlet + + # Add own dependencies (output) + outputQueue = deque(prim.outputs.items()) + while len(outputQueue) > 0: + arrname, memlet = outputQueue.popleft() + fullMemlet = _make_full_range(memlet) + dependOut[fullMemlet.dataname] = fullMemlet + if isinstance(memlet.subset, astnodes._Memlet): + outputQueue.push(memlet.subset) + + # Add recursively inherited dependencies + inheritIn = OrderedDict() + inheritOut = OrderedDict() + arrs = prim.transients.keys() + for child in prim.children: + childIn, childOut = inherit_dependencies(child) + # Only inherit dependencies from arrays defined in this scope + inheritIn.update( + OrderedDict([(k, v) for k, v in childIn.items() if k not in arrs])) + inheritOut.update( + OrderedDict( + [(k, v) for k, v in childOut.items() if k not in arrs])) + + # We should not overwrite an explicit dependency with an inherited one: + # this is most likely a programming mistake + for key in inheritIn.keys(): + if key in prim.inputs: + raise ValueError("Inherited dependency from '" + child.name + + "' overwrites explicit dependency in '" + + prim.name + "' (" + str(prim.inputs[key]) + ")") + for key in inheritOut.keys(): + if key in prim.outputs: + raise ValueError("Inherited dependency from '" + child.name + + "' overwrites explicit dependency in '" + + prim.name + "' (" + str(prim.outputs[key]) + ")") + prim.inputs.update(inheritIn) + prim.outputs.update(inheritOut) + dependIn.update(dcpy(inheritIn)) + dependOut.update(dcpy(inheritOut)) + + if isinstance(prim, astnodes._ControlFlowNode): + # Don't inherit dependencies across control flow boundaries + return OrderedDict(), OrderedDict() + else: + return dependIn, dependOut + + +def _subset_has_indirection(subset): + for dim in subset: + if not isinstance(dim, tuple): + dim = [dim] + for r in dim: + if symbolic.contains_sympy_functions(r): + return True + return False + + +def _add_astmemlet_edge(sdfg, + state, + src_node, + src_conn, + dst_node, + dst_conn, + ast_memlet, + data=None, + wcr=None, + wcr_identity=None): + try: + if src_node.data == 
dst_node.data: + raise RuntimeError("Added edge connection data nodes " + "with same descriptor: {} to {}".format( + src_node, dst_node)) + except AttributeError: + pass + if _subset_has_indirection(ast_memlet.subset): + add_indirection_subgraph(sdfg, state, src_node, dst_node, ast_memlet) + return + + if data is not None: + raise NotImplementedError('This should never happen') + + memlet = Memlet(ast_memlet.dataname, ast_memlet.num_accesses, + ast_memlet.subset, ast_memlet.veclen, wcr, wcr_identity) + state.add_edge(src_node, src_conn, dst_node, dst_conn, memlet) + + +def _get_input_symbols(inputs, freesyms): + syminputs = set( + str(i)[9:] for i in inputs.keys() if str(i).startswith('__DACEIN_')) + return freesyms & syminputs + + +# TODO: The following two functions can be replaced with better dataflow +# generation procedures + + +def input_node_for_array(state, data: str): + # If the node appears as one of the source nodes, return it first + for n in state.source_nodes(): + if isinstance(n, nd.AccessNode): + if n.data == data: + return n + # Otherwise, if the node is located elsewhere, return it + for n in state.nodes(): + if isinstance(n, nd.AccessNode): + if n.data == data: + return n + + return nd.AccessNode(data) + + +def output_node_for_array(state, data: str): + for n in state.sink_nodes(): + if isinstance(n, nd.AccessNode): + if n.data == data: + return n + + return nd.AccessNode(data) + + +def _build_dataflow_graph_recurse(sdfg, state, primitives, modules, superEntry, + super_exit): + # Array of pairs (exit node, memlet) + exit_nodes = [] + + if len(primitives) == 0: + # Inject empty tasklets into empty states + primitives = [astnodes._EmptyTaskletNode("Empty Tasklet", None)] + + for prim in primitives: + label = prim.name + + # Expand node to get entry and exit points + if isinstance(prim, astnodes._MapNode): + if len(prim.children) == 0: + raise ValueError("Map node expected to have children") + mapNode = nd.Map( + label, prim.params, prim.range, is_async=prim.is_async) + # Add connectors for inputs that exist as array nodes + entry = nd.MapEntry( + mapNode, + _get_input_symbols(prim.inputs, prim.range.free_symbols)) + exit = nd.MapExit(mapNode) + elif isinstance(prim, astnodes._ConsumeNode): + if len(prim.children) == 0: + raise ValueError("Consume node expected to have children") + consumeNode = nd.Consume(label, (prim.params[1], prim.num_pes), + prim.condition) + entry = nd.ConsumeEntry(consumeNode) + exit = nd.ConsumeExit(consumeNode) + elif isinstance(prim, astnodes._ReduceNode): + rednode = nd.Reduce(prim.ast, prim.axes, prim.identity) + state.add_node(rednode) + entry = rednode + exit = rednode + elif isinstance(prim, astnodes._TaskletNode): + if isinstance(prim, astnodes._EmptyTaskletNode): + tasklet = nd.EmptyTasklet(prim.name) + else: + # Remove memlets from tasklet AST + if prim.language == types.Language.Python: + clean_code = MemletRemover().visit(prim.ast) + clean_code = ModuleInliner(modules).visit(clean_code) + else: # Use external code from tasklet definition + if prim.extcode is None: + raise SyntaxError("Cannot define an intrinsic " + "tasklet without an implementation") + clean_code = prim.extcode + tasklet = nd.Tasklet( + prim.name, + set(prim.inputs.keys()), + set(prim.outputs.keys()), + code=clean_code, + language=prim.language, + code_global=prim.gcode) # TODO: location=prim.location + + # Need to add the tasklet in case we're in an empty state, where no + # edge will be drawn to it + state.add_node(tasklet) + entry = tasklet + exit = tasklet + + elif 
isinstance(prim, astnodes._NestedSDFGNode): + prim.sdfg.parent = state + prim.sdfg._parent_sdfg = sdfg + prim.sdfg.update_sdfg_list([]) + nsdfg = nd.NestedSDFG(prim.name, prim.sdfg, + set(prim.inputs.keys()), + set(prim.outputs.keys())) + state.add_node(nsdfg) + entry = nsdfg + exit = nsdfg + + elif isinstance(prim, astnodes._ProgramNode): + return + elif isinstance(prim, astnodes._ControlFlowNode): + continue + else: + raise TypeError("Node type not implemented: " + + str(prim.__class__)) + + # Add incoming edges + for varname, memlet in prim.inputs.items(): + arr = memlet.dataname + if (prim.parent is not None + and memlet.dataname in prim.parent.transients.keys()): + node = input_node_for_array(state, memlet.dataname) + + # Add incoming edge into transient as well + # FIXME: A bit hacked? + if arr in prim.parent.inputs: + astmem = prim.parent.inputs[arr] + _add_astmemlet_edge(sdfg, state, superEntry, None, node, + None, astmem) + + # Remove local name from incoming edge to parent + prim.parent.inputs[arr].local_name = None + elif superEntry: + node = superEntry + else: + node = input_node_for_array(state, memlet.dataname) + + # Destination connector inference + # Connected to a tasklet or a nested SDFG + dst_conn = (memlet.local_name + if isinstance(entry, nd.CodeNode) else None) + # Connected to a scope as part of its range + if str(varname).startswith('__DACEIN_'): + dst_conn = str(varname)[9:] + # Handle special case of consume input stream + if (isinstance(entry, nd.ConsumeEntry) + and memlet.data == prim.stream): + dst_conn = 'IN_stream' + + # If a memlet that covers this input already exists, skip + # generating this one; otherwise replace memlet with ours + skip_incoming_edge = False + remove_edge = None + for e in state.edges_between(node, entry): + if e.data.data != memlet.dataname or dst_conn != e.dst_conn: + continue + if e.data.subset.covers(memlet.subset): + skip_incoming_edge = True + break + elif memlet.subset.covers(e.data.subset): + remove_edge = e + break + else: + print('WARNING: Performing bounding-box union on', + memlet.subset, 'and', e.data.subset, '(in)') + e.data.subset = sbs.bounding_box_union( + e.data.subset, memlet.subset) + e.data.num_accesses += memlet.num_accesses + skip_incoming_edge = True + break + + if remove_edge is not None: + state.remove_edge(remove_edge) + + if skip_incoming_edge == False: + _add_astmemlet_edge(sdfg, state, node, None, entry, dst_conn, + memlet) + + # If there are no inputs, generate a dummy edge + if superEntry and len(prim.inputs) == 0: + state.add_edge(superEntry, None, entry, None, EmptyMemlet()) + + if len(prim.children) > 0: + # Recurse + inner_outputs = _build_dataflow_graph_recurse( + sdfg, state, prim.children, modules, entry, exit) + # Infer output node for each memlet + for i, (out_src, mem) in enumerate(inner_outputs): + # If there is no such array in this primitive's outputs, + # it's an external array (e.g., a map in a map). In this case, + # connect to the exit node + if mem.dataname in prim.outputs: + inner_outputs[i] = (out_src, prim.outputs[mem.dataname]) + else: + inner_outputs[i] = (out_src, mem) + else: + inner_outputs = [(exit, mem) for mem in prim.outputs.values()] + + # Add outgoing edges + for out_src, astmem in inner_outputs: + + data = astmem.data + dataname = astmem.dataname + + # If WCR is not none, it needs to be handled in the code. Check for + # this after, as we only expect it for one distinct case + wcr_was_handled = astmem.wcr is None + + # TODO: This is convoluted. 
We should find a more readable + # way of connecting the outgoing edges. + + if super_exit is None: + + # Assert that we're in a top-level node + if ((not isinstance(prim.parent, astnodes._ProgramNode)) and + (not isinstance(prim.parent, astnodes._ControlFlowNode))): + raise RuntimeError("Expected to be at the top node") + + # Looks hacky + src_conn = (astmem.local_name if isinstance( + out_src, (nd.Tasklet, nd.NestedSDFG)) else None) + + # Here we just need to connect memlets directly to their + # respective data nodes + out_tgt = output_node_for_array(state, astmem.dataname) + + # If a memlet that covers this output already exists, skip + # generating this one; otherwise replace memlet with ours + skip_outgoing_edge = False + remove_edge = None + for e in state.edges_between(out_src, out_tgt): + if e.data.data != astmem.dataname or src_conn != e.src_conn: + continue + if e.data.subset.covers(astmem.subset): + skip_outgoing_edge = True + break + elif astmem.subset.covers(e.data.subset): + remove_edge = e + break + else: + print('WARNING: Performing bounding-box union on', + astmem.subset, 'and', e.data.subset, '(out)') + e.data.subset = sbs.bounding_box_union( + e.data.subset, astmem.subset) + e.data.num_accesses += astmem.num_accesses + skip_outgoing_edge = True + break + + if skip_outgoing_edge == True: + continue + if remove_edge is not None: + state.remove_edge(remove_edge) + + _add_astmemlet_edge( + sdfg, + state, + out_src, + src_conn, + out_tgt, + None, + astmem, + wcr=astmem.wcr, + wcr_identity=astmem.wcr_identity) + wcr_was_handled = (True if astmem.wcr is not None else + wcr_was_handled) + + # If the program defines another output, connect it too. + # This refers to the case where we have streams, which + # must define an input and output, and sometimes this output + # is defined in pdp.outputs + if (isinstance(out_tgt, nd.AccessNode) + and isinstance(out_tgt.desc(sdfg), dt.Stream)): + try: + stream_memlet = next( + v for k, v in prim.parent.outputs.items() + if k == out_tgt.data) + stream_output = output_node_for_array( + state, stream_memlet.dataname) + _add_astmemlet_edge(sdfg, state, out_tgt, None, + stream_output, None, stream_memlet) + except StopIteration: # Stream output not found, skip + pass + + else: # We're in a nest + + if isinstance(prim, astnodes._ScopeNode): + # We're a map or a consume node, that needs to connect our + # exit to either an array or to the super_exit + if data.transient and dataname in prim.parent.transients: + # Connect the exit directly + out_tgt = output_node_for_array(state, data.dataname) + _add_astmemlet_edge(sdfg, state, out_src, None, + out_tgt, None, astmem) + else: + # This is either a transient defined in an outer scope, + # or an I/O array, so redirect through the exit node + _add_astmemlet_edge(sdfg, state, out_src, None, + super_exit, None, astmem) + # Instruct outer recursion layer to continue the route + exit_nodes.append((super_exit, astmem)) + elif isinstance( + prim, + (astnodes._TaskletNode, astnodes._NestedSDFGNode)): + # We're a tasklet, and need to connect either to the exit + # if the array is I/O or is defined in a scope further out, + # or directly to the transient if it's defined locally + if dataname in prim.parent.transients: + # This is a local transient variable, so connect to it + # directly + out_tgt = output_node_for_array(state, data.dataname) + _add_astmemlet_edge(sdfg, state, out_src, + astmem.local_name, out_tgt, None, + astmem) + else: + # This is an I/O array, or an outer level transient, so + # redirect 
through the exit node + _add_astmemlet_edge( + sdfg, + state, + out_src, + astmem.local_name, + super_exit, + None, + astmem, + wcr=astmem.wcr, + wcr_identity=astmem.wcr_identity) + exit_nodes.append((super_exit, astmem)) + if astmem.wcr is not None: + wcr_was_handled = True # Sanity check + else: + raise TypeError("Unexpected node type: {}".format( + type(out_src).__name__)) + + if not wcr_was_handled and not isinstance(prim, + astnodes._ScopeNode): + raise RuntimeError("Detected unhandled WCR for primitive '{}' " + "of type {}. WCR is only expected for " + "tasklets in a map/consume scope.".format( + prim.name, + type(prim).__name__)) + + return exit_nodes + + +def build_dataflow_graph(sdfg, state, primitives, modules): + _build_dataflow_graph_recurse(sdfg, state, primitives, modules, None, None) + + +def add_indirection_subgraph(sdfg, graph, src, dst, memlet): + """ Replaces the specified edge in the specified graph with a subgraph that + implements indirection without nested AST memlet objects. """ + if not isinstance(memlet, astnodes._Memlet): + raise TypeError("Expected memlet to be astnodes._Memlet") + + indirect_inputs = set() + indirect_outputs = set() + + # Scheme for multi-array indirection: + # 1. look for all arrays and accesses, create set of arrays+indices + # from which the index memlets will be constructed from + # 2. each separate array creates a memlet, of which num_accesses = len(set) + # 3. one indirection tasklet receives them all + original array and + # produces the right output index/range memlet + ######################### + # Step 1 + accesses = OrderedDict() + newsubset = dcpy(memlet.subset) + for dimidx, dim in enumerate(memlet.subset): + # Range/Index disambiguation + direct_assignment = False + if not isinstance(dim, tuple): + dim = [dim] + direct_assignment = True + + for i, r in enumerate(dim): + for expr in sympy.preorder_traversal(r): + if symbolic.is_sympy_userfunction(expr): + fname = expr.func.__name__ + if fname not in accesses: + accesses[fname] = [] + + # Replace function with symbol (memlet local name to-be) + if expr.args in accesses[fname]: + aindex = accesses[fname].index(expr.args) + toreplace = 'index_' + fname + '_' + str(aindex) + else: + accesses[fname].append(expr.args) + toreplace = 'index_' + fname + '_' + str( + len(accesses[fname]) - 1) + + if direct_assignment: + newsubset[dimidx] = r.subs(expr, toreplace) + else: + newsubset[dimidx][i] = r.subs(expr, toreplace) + ######################### + # Step 2 + ind_inputs = {'__ind_' + memlet.local_name} + ind_outputs = {'lookup'} + # Add accesses to inputs + for arrname, arr_accesses in accesses.items(): + for i in range(len(arr_accesses)): + ind_inputs.add('index_%s_%d' % (arrname, i)) + + tasklet = nd.Tasklet("Indirection", ind_inputs, ind_outputs) + + input_index_memlets = [] + for arrname, arr_accesses in accesses.items(): + arr = memlet.otherdeps[arrname] + for i, access in enumerate(arr_accesses): + # Memlet to load the indirection index + indexMemlet = Memlet(arrname, 1, sbs.Indices(list(access)), 1) + input_index_memlets.append(indexMemlet) + graph.add_edge(src, None, tasklet, "index_%s_%d" % (arrname, i), + indexMemlet) + + ######################### + # Step 3 + # Create new tasklet that will perform the indirection + indirection_ast = ast.parse("lookup = {arr}[{index}]".format( + arr='__ind_' + memlet.local_name, + index=', '.join([symbolic.symstr(s) for s in newsubset]))) + # Conserve line number of original indirection code + tasklet.code = 
ast.copy_location(indirection_ast.body[0], memlet.ast) + + # Create transient variable to trigger the indirected load + if memlet.num_accesses == 1: + storage = sdfg.add_scalar( + '__' + memlet.local_name + '_value', + memlet.data.dtype, + transient=True) + else: + storage = sdfg.add_array( + '__' + memlet.local_name + '_value', + memlet.data.dtype, + storage=types.StorageType.Default, + transient=True, + shape=memlet.bounding_box_size()) + indirectRange = sbs.Range([(0, s - 1, 1) for s in storage.shape]) + dataNode = nd.AccessNode('__' + memlet.local_name + '_value') + + # Create memlet that depends on the full array that we look up in + fullRange = sbs.Range([(0, s - 1, 1) for s in memlet.data.shape]) + fullMemlet = Memlet(memlet.dataname, memlet.num_accesses, fullRange, + memlet.veclen) + graph.add_edge(src, None, tasklet, '__ind_' + memlet.local_name, + fullMemlet) + + # Memlet to store the final value into the transient, and to load it into + # the tasklet that needs it + indirectMemlet = Memlet('__' + memlet.local_name + '_value', + memlet.num_accesses, indirectRange, memlet.veclen) + graph.add_edge(tasklet, 'lookup', dataNode, None, indirectMemlet) + + valueMemlet = Memlet('__' + memlet.local_name + '_value', + memlet.num_accesses, indirectRange, memlet.veclen) + graph.add_edge(dataNode, None, dst, memlet.local_name, valueMemlet) diff --git a/dace/frontend/python/ndarray.py b/dace/frontend/python/ndarray.py new file mode 100644 index 0000000000..792e5c961c --- /dev/null +++ b/dace/frontend/python/ndarray.py @@ -0,0 +1,187 @@ +""" Array types and wrappers used in DaCe's Python frontend. """ +from __future__ import print_function +import ctypes +import enum +import inspect +import numpy +import itertools +from collections import deque + +from dace import symbolic, types + +########################################################### +# NDArray type + + +class ndarray(numpy.ndarray): + """ An N-dimensional array wrapper around `numpy.ndarray` that enables + symbolic sizes. """ + + def __new__(cls, + shape, + dtype=types.float32, + materialize_func=None, + allow_conflicts=False, + *args, + **kwargs): + """ Initializes a DaCe ND-array. + @param shape: The array shape (may contain symbols). + @param dtype: The array data type. + @param materialize_func: An optional string that contains a method + to materialize array contents on demand. + If not None, the array is not allocated + within the DaCe program. + @param allow_conflicts: If True, suppresses warnings on conflicting + array writes in DaCe programs without a + matching conflict resolution memlet. + """ + # Avoiding import loops + from dace import data + + tmpshape = shape + shape = [symbolic.eval(s, 0) for s in shape] + + kwargs.update({'dtype': dtype.type}) + + res = numpy.ndarray.__new__(cls, shape, *args, **kwargs) + res._symlist = symbolic.symlist(tmpshape) + for _, sym in res._symlist.items(): + sym._arrays_to_update.append(res) + + if not isinstance(dtype, types.typeclass): + dtype = types.typeclass(dtype.type) + + res.descriptor = data.Array( + dtype, + tmpshape, + materialize_func=materialize_func, + transient=False, + allow_conflicts=allow_conflicts) + return res + + def update_resolved_symbol(self, sym): + """ Notifies an array that a symbol has been resolved so that it + can be resized. 
""" + self.resize( + [symbolic.eval(s, 0) for s in self.descriptor.shape], + refcheck=False) + self._symlist = symbolic.symlist(self.descriptor.shape) + + def missing_syms(self): + return ','.join( + [s for s, v in self._symlist.items() if not v.is_initialized()]) + + def __setitem__(self, key, value): + if self.descriptor.materialize_func is not None: + raise PermissionError( + "You cannot write into an Immaterial storage.") + return numpy.ndarray.__setitem__(self, key, value) + + def __getitem__(self, key): + if 0 in self.shape: + self.update_resolved_symbol(None) + if 0 in self.shape: + raise IndexError( + 'Cannot create sub-array, not all symbols are set " "(missing symbols: %s)' + % self.missing_syms()) + return numpy.ndarray.__getitem__(self, key) + + # Python 2.x compatibility + def __getslice__(self, *args): + if 0 in self.shape: + raise IndexError( + 'Cannot create sub-array, not all symbols are set (missing symbols: %s)' + % self.missing_syms()) + return numpy.ndarray.__getslice__(self, *args) + + def __array_finalize__(self, obj): + if obj is None: + return + from dace import data + + # Create a new descriptor + self.descriptor = data.Array( + types.typeclass(obj.dtype.type), + obj.shape, + materialize_func=None, + transient=False, + allow_conflicts=False) + + self._symlist = {} + + def __lshift__(self, other): + pass + + def __rshift__(self, other): + pass + + def __hash__(self): + return hash(self.data.tobytes()) + + def __call__(self, *args): + return self + + +class transient(ndarray): + """ Transient DaCe array subclass. """ + + def __new__(cls, *args, **kwargs): + res = ndarray.__new__(cls, *args, **kwargs) + res.descriptor.transient = True + return res + + +class stream(object): + """ Stream array object in Python. Mostly used in the Python SDFG + simulator. """ + + def __init__(self, dtype, shape): + from dace import data + + self._type = dtype + self._shape = shape + self.descriptor = data.Stream(dtype, 1, 0, shape, True) + self.queue_array = numpy.ndarray(shape, dtype=deque) + for i in itertools.product(*(range(s) for s in shape)): + self.queue_array[i] = deque() + + @property + def shape(self): + return self.shape + + def __getitem__(self, key): + return self.queue_array.__getitem__(key) + + def __getslice__(self, *args): + return self.queue_array.__getslice__(*args) + + +def scalar(dtype=types.float32, allow_conflicts=False): + """ Convenience function that defines a scalar (array of size 1). """ + return ndarray([1], dtype, allow_conflicts=allow_conflicts) + + +def define_local(dimensions, dtype=types.float32, allow_conflicts=False): + """ Defines a transient array in a DaCe program. """ + return transient(dimensions, dtype=dtype, allow_conflicts=allow_conflicts) + + +def define_local_scalar(dtype=types.float32, allow_conflicts=False): + """ Defines a transient scalar (array of size 1) in a DaCe program. """ + return transient([1], dtype=dtype, allow_conflicts=allow_conflicts) + + +def define_stream(dtype=types.float32, buffer_size=0): + """ Defines a local stream in a DaCe program. """ + return define_streamarray([1], dtype=dtype, buffer_size=buffer_size) + + +def define_streamarray(dimensions, dtype=types.float32, buffer_size=0): + """ Defines a local stream array in a DaCe program. """ + return stream(dtype, dimensions) + + +def asarray(array): + """ Converts an existing Numpy NDArray to DaCe NDArray. 
""" + obj = numpy.asarray(array).view(ndarray) + return obj diff --git a/dace/frontend/python/ndloop.py b/dace/frontend/python/ndloop.py new file mode 100644 index 0000000000..eed068a9b7 --- /dev/null +++ b/dace/frontend/python/ndloop.py @@ -0,0 +1,63 @@ +""" A single generator that creates an N-dimensional for loop in Python. """ +import itertools +from dace.frontend.python import ndarray + +# Python 3 compatibility for xrange +try: + xxrange = xrange +except NameError: + xxrange = range + + +def slicetoxrange(s): + """ Helper function that turns a slice into a range (for iteration). """ + if isinstance(s, int): + return xxrange(s, s + 1) + + ifnone = lambda a, b: b if a is None else a + + return xxrange(ifnone(s.start, 0), s.stop + 1, ifnone(s.step, 1)) + + +def tupletoxrange(s): + """ Helper function that turns a tuple into a range (for iteration). """ + if isinstance(s, int): + return xxrange(s, s + 1) + + ifnone = lambda a, b: b if a is None else a + ifscalar = lambda a: a[0] if isinstance(a, ndarray.ndarray) else a + allconds = lambda a, b: ifnone(ifscalar(a), b) + + return xxrange(allconds(s[0], 0), ifscalar(s[1]) + 1, allconds(s[2], 1)) + + +def NDLoop(ndslice, internal_function, *args, **kwargs): + """ Wrapped generator that calls an internal function in an N-dimensional + for-loop in Python. + @param ndslice: Slice or list of slices (`slice` objects) to loop over. + @param internal_function: Function to call in loop. + @param *args: Arguments to `internal_function`. + @param **kwargs: Keyword arguments to `internal_function`. + @return: N-dimensional loop index generator. + """ + if isinstance(ndslice, int) or isinstance(ndslice, slice): + ndxrange = (slicetoxrange(ndslice), ) + else: + ndxrange = tuple(slicetoxrange(d) for d in ndslice) + for indices in itertools.product(*ndxrange): + internal_function(*(indices + args), **kwargs) + + +def ndrange(slice_list): + """ Generator that creates an N-dimensional for loop in Python. + @param slice_list: Slice or list of slices (as tuples or `slice`s) + to loop over. + @return: N-dimensional loop index generator. + """ + if not isinstance(slice_list, list): + ndxrange = (tupletoxrange(slice_list), ) + else: + ndxrange = tuple(tupletoxrange(d) for d in slice_list) + + for indices in itertools.product(*ndxrange): + yield indices diff --git a/dace/frontend/python/parser.py b/dace/frontend/python/parser.py new file mode 100644 index 0000000000..1410395fae --- /dev/null +++ b/dace/frontend/python/parser.py @@ -0,0 +1,320 @@ +""" DaCe Python parsing functionality and entry point to Python frontend. """ +from __future__ import print_function +from collections import OrderedDict +from functools import wraps +import inspect +import ast +import copy +import sys +import numpy + +from dace import data, symbolic, types +from dace.config import Config +from dace.frontend.python import astparser, astutils, depanalysis +from dace.sdfg import SDFG +from dace.graph import labeling + + +def _create_datadescriptor(obj): + """ Creates a data descriptor from various types of objects. 
+ @see: dace.data.Data + """ + if isinstance(obj, data.Data): + return obj + + try: + return obj.descriptor + except AttributeError: + if isinstance(obj, numpy.ndarray): + return data.Array( + dtype=types.typeclass(obj.dtype.type), shape=obj.shape) + if symbolic.issymbolic(obj): + return data.Scalar(symbolic.symtype(obj)) + if isinstance(obj, types.typeclass): + return data.Scalar(obj) + return data.Scalar(types.typeclass(type(obj))) + + +def _get_type_annotations(f, f_argnames, decorator_args): + """ Obtains types from decorator or from type annotations in a function. + """ + type_annotations = {} + if hasattr(f, '__annotations__'): + type_annotations.update(f.__annotations__) + + # Type annotation conditions + has_args = len(decorator_args) > 0 + has_annotations = len(type_annotations) > 0 + if 'return' in type_annotations: + raise TypeError('DaCe programs do not have a return type') + if has_args and has_annotations: + raise SyntaxError('DaCe programs can only have decorator arguments ' + + '(\'@dace.program(...)\') or type annotations ' + + '(\'def program(arr: type, ...)\'), but not both') + + # Alert if there are any discrepancies between annotations and arguments + if has_args: + # Make sure all arguments are annotated + if len(decorator_args) != len(f_argnames): + raise SyntaxError( + 'Decorator arguments must match number of DaCe ' + + 'program parameters (expecting ' + str(len(f_argnames)) + ')') + # Return arguments and their matched decorator annotation + return { + k: _create_datadescriptor(v) + for k, v in zip(f_argnames, decorator_args) + } + elif has_annotations: + # Make sure all arguments are annotated + if len(type_annotations) != len(f_argnames): + raise SyntaxError( + 'Either none or all DaCe program parameters must ' + + 'have type annotations') + return {k: _create_datadescriptor(v) for k, v in type_annotations.items()} + + +def _get_argnames(f): + """ Returns a Python function's argument names. """ + try: + return inspect.getfullargspec(f).args + except AttributeError: + return inspect.getargspec(f).args + + +def _compile_module(s, name=''): + """ Compiles a string representing a python module (file or code) and + returns the resulting global objects as a dictionary mapping name->val. + @param name: Optional name for better error message handling. + """ + + gen_module = {} + code = compile(s, name, 'exec') + exec(code, gen_module) + return gen_module + + +def parse_from_file(filename, *compilation_args): + """ Try to parse all DaCe programs in `filename` and return a list of + obtained SDFGs. Raises exceptions in case of compilation errors. + Also accepts optional compilation arguments containing types and symbol + values. + """ + + with open(filename, 'r') as f: + code = f.read() + + mod = _compile_module(code, filename) + + programs = [ + program for program in mod.values() + if isinstance(program, DaceProgram) + ] + + return [parse_function(p, *compilation_args) for p in programs] + + +def parse_from_function(function, *compilation_args, strict=None): + """ Try to parse a DaceProgram object and return the `dace.SDFG` object + that corresponds to it. + @param function: DaceProgram object (obtained from the `@dace.program` + decorator). + @param compilation_args: Various compilation arguments e.g. types. + @param strict: Whether to apply strict transformations or not (None + uses configuration-defined value). + @return: The generated SDFG object. 
+ """ + if not isinstance(function, DaceProgram): + raise TypeError( + 'Function must be of type dace.frontend.python.DaceProgram') + + # Obtain parsed DaCe program + pdp, modules = function.generate_pdp(*compilation_args) + + # Create an empty SDFG + sdfg = SDFG(pdp.name, pdp.argtypes) + + sdfg.set_sourcecode(pdp.source, 'python') + + # Populate SDFG with states and nodes, according to the parsed DaCe program + + # 1) Inherit dependencies and inject tasklets + # 2) Traverse program graph and recursively split into states, + # annotating edges with their transition conditions. + # 3) Add arrays, streams, and scalars to the SDFG array store + # 4) Eliminate empty states with no conditional outgoing transitions + # 5) Label states in topological order + # 6) Construct dataflow graph for each state + + # Step 1) + for primitive in pdp.children: + depanalysis.inherit_dependencies(primitive) + + # Step 2) + state_primitives = depanalysis.create_states_simple(pdp, sdfg) + + # Step 3) + for dataname, datadesc in pdp.all_arrays().items(): + sdfg.add_datadesc(dataname, datadesc) + + # Step 4) Absorb next state into current, if possible + oldstates = list(sdfg.topological_sort(sdfg.start_state)) + for state in oldstates: + if state not in sdfg.nodes(): # State already removed + continue + if sdfg.out_degree(state) == 1: + edge = sdfg.out_edges(state)[0] + nextState = edge.dst + if not edge.data.is_unconditional(): + continue + if sdfg.in_degree(nextState) > 1: # If other edges point to state + continue + if len(state_primitives[nextState]) > 0: # Don't fuse full states + continue + + outEdges = list(sdfg.out_edges(nextState)) + for e in outEdges: + # Construct new edge from the current assignments, new + # assignments, and new conditions + newEdge = copy.deepcopy(edge.data) + newEdge.assignments.update(e.data.assignments) + newEdge.condition = e.data.condition + sdfg.add_edge(state, e.dst, newEdge) + sdfg.remove_node(nextState) + + # Step 5) + stateList = sdfg.topological_sort(sdfg.start_state) + for i, state in enumerate(stateList): + if state.label is None or state.label == "": + state.set_label("s" + str(i)) + + # Step 6) + for i, state in enumerate(stateList): + depanalysis.build_dataflow_graph(sdfg, state, state_primitives[state], + modules) + + # Fill in scope entry/exit connectors + sdfg.fill_scope_connectors() + + # Memlet propagation + if sdfg.propagate: + labeling.propagate_labels_sdfg(sdfg) + + # Drawing the SDFG before strict transformations + sdfg.draw_to_file(recursive=True) + + # Apply strict transformations automatically + if (strict == True + or (strict is None + and Config.get_bool('optimizer', 'automatic_state_fusion'))): + sdfg.apply_strict_transformations() + + # Drawing the SDFG (again) to a .dot file + sdfg.draw_to_file(recursive=True) + + # Validate SDFG + sdfg.validate() + + return sdfg + + +class DaceProgram: + """ A data-centric program object, obtained by decorating a function with + `@dace.program`. """ + + def __init__(self, f, args, kwargs): + self.f = f + self.args = args + self.kwargs = kwargs + self._name = f.__name__ + + @property + def name(self): + return self._name + + def to_sdfg(self, *args, strict=None): + """ Parses the DaCe function into an SDFG. """ + return parse_from_function(self, *args, strict=strict) + + def compile(self, *args, strict=None, specialize=None): + """ Convenience function that parses and compiles a DaCe program. 
""" + sdfg = parse_from_function(self, *args, strict=strict) + return sdfg.compile(specialize=specialize) + + def __call__(self, *args, strict=None, specialize=None): + """ Convenience function that parses, compiles, and runs a DaCe + program. """ + binaryobj = self.compile(*args, strict=strict, specialize=specialize) + return binaryobj(*args) + + def generate_pdp(self, *compilation_args): + """ Generates the parsed AST representation of a DaCe program. + @param compilation_args: Various compilation arguments e.g., types. + @return: A 2-tuple of (program, modules), where `program` is a + `dace.astnodes._ProgramNode` representing the parsed DaCe + program, and `modules` is a dictionary mapping imported + module names to their actual module names (for maintaining + import aliases). + """ + dace_func = self.f + args = self.args + argnames = _get_argnames(dace_func) + + if not argnames: + raise SyntaxError( + 'DaCe program must contain at least one parameter') + + # If exist, obtain type annotations (for compilation) + argtypes = _get_type_annotations(dace_func, argnames, args) + + # Parse argument types from call + if not argtypes: + if not compilation_args: + raise SyntaxError( + 'DaCe program compilation requires either type annotations ' + 'or arrays') + + # Parse compilation arguments + if len(compilation_args) != len(argnames): + raise SyntaxError( + 'Arguments must match DaCe program parameters (expecting ' + + str(len(argnames)) + ')') + argtypes = { + k: _create_datadescriptor(v) + for k, v in zip(argnames, compilation_args) + } + ############################################# + + # Parse allowed global variables + # (for inferring types and values in the DaCe program) + global_vars = { + k: v + for k, v in dace_func.__globals__.items() if types.isallowed(v) + } + modules = { + k: v.__name__ + for k, v in dace_func.__globals__.items() + if types.ismodule_and_allowed(v) + } + modules['builtins'] = '' + + # Add symbols as globals with their actual names (sym_0 etc.) + global_vars.update({ + v.name: v + for k, v in global_vars.items() if isinstance(v, symbolic.symbol) + }) + + # Add keyword arguments as additional globals + global_vars.update( + {k: v + for k, v in self.kwargs.items() if types.isallowed(v)}) + + argtypes_ordered = OrderedDict() + for param in argnames: + argtypes_ordered[param] = argtypes[param] + + # Parse AST to create the SDFG + pdp = astparser.parse_dace_program(dace_func, argtypes_ordered, + global_vars, modules) + + # Transform parsed DaCe code into a DaCe program (Stateful DFG) + return pdp, modules diff --git a/dace/frontend/python/simulator.py b/dace/frontend/python/simulator.py new file mode 100644 index 0000000000..4e9f3a3354 --- /dev/null +++ b/dace/frontend/python/simulator.py @@ -0,0 +1,703 @@ +""" A Python simulator for DaCe programs. Currently reads and runs Python + functions rather than any SDFG. """ + +from __future__ import print_function +import ast +import copy +from functools import wraps +import inspect +import numpy +import sys +import numpy + +from dace import data, symbolic, types +from dace.config import Config +from dace.frontend.python import astparser, astnodes, astutils, ndloop, ndarray +from dace.frontend.python.astutils import unparse +from dace.frontend.python.parser import DaceProgram + + +def simulate(dace_program: DaceProgram, *args): + """ Simulate a DaCe program using Python. + @param dace_program: A program function annotated with `@dace.program`. + @param *args: Program arguments to pass. 
+ """ + pdp, modules = dace_program.generate_pdp() + + # Transform the decorated AST into working python code (annotated so + # that debugging works) + simulated_ast = SimulatorTransformer(pdp).visit(pdp.ast) + mod = ast.Module(body=simulated_ast, lineno=1) + mod = ast.fix_missing_locations(mod) + + # Compile the transformed AST + codeobj = compile(mod, pdp.filename, 'exec') + + fname = dace_program.name + + if Config.get_bool('debugprint'): + print("Simulating DaCe program with name", fname) + + param_symbols = {} + + if len(pdp.params) != len(args): + raise SyntaxError('Argument number mismatch in \'' + fname + + '\', expecting ' + str(len(args))) + + ################################################################## + # Disallow external variables + # EXCEPTIONS: + # * The dace module ('import dace') + # * The math module ('import math') + # * Constants (types int, float, dace.int*, dace.float*) + # * DaCe symbols that have been defined in @dace.program args + ################################################################## + + f_globals = {} + + # WORKAROUND: Works around a bug in CPython 2.x where True and + # False are undefined + f_globals['True'] = True + f_globals['False'] = False + ###################### + + # Allow certain namespaces/modules and constants + f_globals.update(pdp.globals) + + # Resolve symbols + symbols = {} + symbols.update(symbolic.getsymbols( + args)) # from parameter values (externally defined as "dace.symbol") + symbols.update(param_symbols) # from parameter values (constant inputs) + + resolve = {} + for gname, gval in f_globals.items(): + if isinstance(gval, symbolic.symbol): + if gval.name in symbols: + resolve[gname] = gval.get() # Raise exception if undefined + else: + resolve[gname] = None # Mark unrelated symbols for removal + + f_globals.update(resolve) + + # Remove unrelated symbols from globals + for rk, rv in resolve.items(): + if rv is None: + del f_globals[rk] + + # Resolve symbols in arguments as well + newargs = tuple(symbolic.eval(a) for a in args) + ################################################################## + + # Store parameter objects + pdp.arrayobjs = { + k: v + for k, v in zip(pdp.params, newargs) if isinstance(v, ndarray.ndarray) + } + + # Simulate f + ################################ + # Obtain function object + gen_module = {} + gen_module.update(f_globals) + exec(codeobj, gen_module) + cfunc = gen_module[fname] + + # Run function + result = cfunc(*newargs) + ################################ + + return result + + +class RangeStorage: + """ Range storage object that is injected to the `_` variable in order to + determine DaCe primitive extents at runtime. """ + + def __init__(self): + self.range = [] + + def __getitem__( + self, + key): # Set object's range every time it is called with a range + self.range = key + return self + + +def converttype(argument, cvt_type, argname): + """ Helper function to convert a scalar argument to its type. """ + if isinstance(argument, ndarray.ndarray): + return argument + + # Convert type + converted = cvt_type.type(argument) + + # Try to cast back to the original type. 
If the value has changed + # (e.g., out of bounds, lost precision), raise exception + origtype = type(argument) + if origtype(converted) != argument: + raise TypeError('Type conversion of argument \'' + argname + + '\' resulted in loss of precision, please ' + + 'cast explicitly before calling program') + + return converted + + +def _copy_location(newnode, node): + return ast.fix_missing_locations(ast.copy_location(newnode, node)) + + +class SimulatorTransformer(ast.NodeTransformer): + """ A Python AST transformer that converts a DaCe program into runnable + Python code for the simulator. """ + + def __init__(self, pdp): + self.pdp = pdp + self.curprim = None + self.module_name = None + self.storeOnAssignment = {} # Mapping from local names to memlets + self.accumOnAssignment = {} # Mapping from local names to memlets + self.curchild = -1 + + # Visiting a DaCe primitive + def visit_FunctionDef(self, node): + after_nodes = [] + + if self.curprim is None: + self.curprim = self.pdp + self.curchild = -1 + if isinstance(node.decorator_list[0], ast.Call): + self.module_name = node.decorator_list[0].func.value.id + else: + self.module_name = node.decorator_list[0].value.id + # Strip decorator + del node.decorator_list[0] + + oldchild = self.curchild + oldprim = self.curprim + + else: + if len(node.decorator_list) == 0: + return self.generic_visit(node) + dec = node.decorator_list[0] + if isinstance(dec, ast.Call): + decname = astparser.rname(dec.func.attr) + else: + decname = astparser.rname(dec.attr) + + if decname in [ + 'map', 'async_map', 'reduce', 'async_reduce', 'consume', + 'async_consume', 'tasklet', 'async_tasklet', 'iterate', + 'loop', 'conditional' + ]: + self.curchild += 1 + + oldchild = self.curchild + oldprim = self.curprim + self.curprim = self.curprim.children[self.curchild] + self.curchild = -1 + + if isinstance(self.curprim, astnodes._MapNode): + newnode = \ + _copy_location(ast.For(target=ast.Tuple(ctx=ast.Store(), + elts=[ast.Name(id=name, ctx=ast.Store()) for name in self.curprim.params]), + iter=ast.parse('%s.ndrange(%s)' % (self.module_name, self.curprim.range.pystr())).body[0].value, + body=node.body, orelse=[]), + node) + node = newnode + elif isinstance(self.curprim, astnodes._ConsumeNode): + stream = self.curprim.stream + if isinstance(self.curprim.stream, ast.AST): + stream = unparse(self.curprim.stream) + if '[' not in stream: + stream += '[0]' + + newnode = \ + _copy_location(ast.While( + test=ast.parse('len(%s) > 0' % stream).body[0].value, + body=node.body, orelse=[]), + node) + node = newnode + node.body.insert( + 0, + _copy_location( + ast.parse('%s = %s.popleft()' % (str( + self.curprim.params[0]), stream)).body[0], + node)) + + elif isinstance(self.curprim, astnodes._TaskletNode): + # Strip decorator + del node.decorator_list[0] + + newnode = \ + _copy_location(ast.parse('if True: pass').body[0], node) + newnode.body = node.body + newnode = ast.fix_missing_locations(newnode) + node = newnode + elif isinstance(self.curprim, astnodes._ReduceNode): + in_memlet = self.curprim.inputs['input'] + out_memlet = self.curprim.outputs['output'] + # Create reduction call + params = [unparse(p) for p in node.decorator_list[0].args] + params.extend([ + unparse(kp) for kp in node.decorator_list[0].keywords + ]) + reduction = ast.parse( + '%s.simulator.simulate_reduce(%s, %s)' % + (self.module_name, node.name, + ', '.join(params))).body[0] + reduction = _copy_location(reduction, node) + reduction = ast.increment_lineno(reduction, + len(node.body) + 1) + reduction = 
ast.fix_missing_locations(reduction) + + # Strip decorator + del node.decorator_list[0] + + after_nodes.append(reduction) + elif isinstance(self.curprim, astnodes._IterateNode): + newnode = \ + _copy_location(ast.For(target=ast.Tuple(ctx=ast.Store(), + elts=[ast.Name(id=name, ctx=ast.Store()) for name in self.curprim.params]), + iter=ast.parse('%s.ndrange(%s)' % (self.module_name, self.curprim.range.pystr())).body[0].value, + body=node.body, orelse=[]), + node) + newnode = ast.fix_missing_locations(newnode) + node = newnode + elif isinstance(self.curprim, astnodes._LoopNode): + newnode = \ + _copy_location(ast.While(test=node.decorator_list[0].args[0], + body=node.body, orelse=[]), + node) + newnode = ast.fix_missing_locations(newnode) + node = newnode + else: + raise RuntimeError('Unimplemented primitive %s' % decname) + else: + return self.generic_visit(node) + + newbody = [] + end_stmts = [] + substitute_stmts = [] + # Incrementally build new body from original body + for stmt in node.body: + if isinstance(stmt, ast.Expr): + res, append, prepend = self.VisitTopLevelExpr(stmt) + if res is not None: + newbody.append(res) + if append is not None: + end_stmts.extend(append) + if prepend is not None: + substitute_stmts.extend(prepend) + else: + subnodes = self.visit(stmt) + if subnodes is not None: + if isinstance(subnodes, list): + newbody.extend(subnodes) + else: + newbody.append(subnodes) + node.body = newbody + end_stmts + + self.curchild = oldchild + self.curprim = oldprim + + substitute_stmts.append(node) + if len(after_nodes) > 0: + return substitute_stmts + after_nodes + return substitute_stmts + + def VisitTopLevelExpr(self, node): + # DaCe memlet expression + if isinstance(node.value, ast.BinOp): + rhs = node.value.right + lhs = node.value.left + arrays = self.curprim.arrays() + + if isinstance(node.value.op, ast.LShift): + # Dynamic access. Emit nothing and load memory on encounter + if isinstance(rhs, ast.Call) and ast.literal_eval( + rhs.args[0]) == -1: + array_name = rhs.func.id + stripped_subscript = '%s[:]' % (array_name) + self.storeOnAssignment[node.value.left.id] = \ + ast.parse(stripped_subscript).body[0].value + return None, None, None + + if isinstance(rhs, ast.Subscript) and isinstance( + rhs.value, ast.Call): + + # Dynamic access. 
Emit nothing and load memory on encounter + if ast.literal_eval(rhs.value.args[0]) == -1: + array_name = rhs.value.func.id + stripped_subscript = '%s[%s]' % (array_name, + unparse(rhs.slice)) + self.storeOnAssignment[node.value.left.id] = \ + ast.parse(stripped_subscript).body[0].value + return None, None, None + + rhs = ast.Subscript( + value=rhs.value.func, ctx=ast.Load(), slice=rhs.slice) + + result = _copy_location( + ast.Assign(targets=[node.value.left], value=rhs), node) + result.targets[0].ctx = ast.Store() + return result, None, None + # END of "a << b" + elif isinstance(node.value.op, ast.RShift): + # If the memlet refers to a sub-array (view), also add an expression to initialize it + init_expr = None + result = None + prefix = [] + + if isinstance(rhs, ast.Subscript): + # Index subscript expression ("tmp >> b(1, sum)[i,j,k,l]") + if isinstance(rhs.value, ast.Call): + # Only match expressions with possible write-conflict resolution, such as "A(...)[...]" + array_name = rhs.value.func.id + stripped_subscript = '%s[%s]' % (array_name, + unparse(rhs.slice)) + + # WCR initialization with identity value + if len(rhs.value.args) >= 3: + prefix.append( + _copy_location( + ast.parse( + '%s = %s' % + (stripped_subscript, + unparse(rhs.value.args[2]))).body[0], + node)) + + # Dynamic access. Emit nothing and store memory on assignment + if ast.literal_eval(rhs.value.args[0]) == -1: + if len(rhs.value.args) >= 2: + self.accumOnAssignment[node.value.left.id] = \ + (stripped_subscript, rhs.value.args[1]) + else: + self.storeOnAssignment[node.value.left.id] = \ + ast.parse(stripped_subscript).body[0].value + return init_expr, None, prefix + + # Make sure WCR function exists + if len(rhs.value.args) >= 2: + result = ast.parse( + '%s = (%s)(%s, %s)' % + (stripped_subscript, unparse( + rhs.value.args[1]), stripped_subscript, + node.value.left.id)).body[0] + result = _copy_location(result, node) + else: + result = ast.parse( + '%s = %s' % (stripped_subscript, + node.value.left.id)).body[0] + result = _copy_location(result, node) + else: + array_name = rhs.value.id + + if not isinstance(rhs.slice, ast.Index): + init_expr = _copy_location( + ast.Assign( + targets=[ + ast.Name( + id=node.value.left.id, ctx=ast.Store()) + ], + value=ast.Subscript( + value=ast.Name( + id=array_name, ctx=ast.Load()), + slice=rhs.slice, + ctx=ast.Load())), node) + elif not isinstance(rhs, ast.Subscript): + if isinstance(rhs, ast.Call): + array_name = rhs.func + else: + array_name = rhs + + lhs_name = lhs.id + + # In case of "tmp >> array", write "array[:]" + if node.value.left.id in self.curprim.transients: + init_expr = None + # If reading from a single stream ("b << stream") + elif (array_name.id in arrays + and isinstance(arrays[array_name.id], data.Stream)): + if arrays[array_name.id].shape == [1]: + init_expr = _copy_location( + ast.parse('{v} = {q}[0]'.format( + v=lhs_name, q=array_name.id)).body[0], + node) + return init_expr, None, [] + else: + init_expr = _copy_location( + ast.Assign( + targets=[ + ast.Name(id=lhs_name, ctx=ast.Store()) + ], + value=ast.Subscript( + value=ast.Name( + id=array_name.id, ctx=ast.Load()), + slice=ast.Slice( + lower=None, upper=None, step=None), + ctx=ast.Load())), node) + + # If we are setting a stream's sink + if lhs_name in arrays and isinstance( + arrays[lhs_name], data.Stream): + result = ast.parse( + '{arr}[0:len({q}[0])] = list({q}[0])'.format( + arr=rhs.id, q=lhs.id)).body[0] + result = _copy_location(result, node) + + # If WCR function exists + elif isinstance(rhs, ast.Call) 
and len(rhs.args) >= 2: + # WCR initialization with identity value + if len(rhs.args) >= 3: + prefix.append( + _copy_location( + ast.parse('%s[:] = %s' % + (array_name.id, + unparse(rhs.args[2]))).body[0], + node)) + + # Dynamic access. Emit nothing and store memory on assignment + if ast.literal_eval(rhs.args[0]) == -1: + self.accumOnAssignment[lhs.id] = (array_name.id, + rhs.args[1]) + return init_expr, None, prefix + + result = ast.parse( + '%s[:] = (%s)(%s[:], %s)' % + (array_name.id, unparse(rhs.args[1]), + array_name.id, node.value.left.id)).body[0] + result = _copy_location(result, node) + + else: + result = _copy_location( + ast.Assign( + targets=[ + ast.Subscript( + value=ast.Name( + id=array_name.id, ctx=ast.Load()), + slice=ast.Slice( + lower=None, upper=None, step=None), + ctx=ast.Store()) + ], + value=node.value.left), node) + + if result is None: + result = _copy_location( + ast.Assign( + targets=[node.value.right], value=node.value.left), + node) + result.targets[0].ctx = ast.Store() + return init_expr, [result], prefix + # END of "a >> b" + + return self.generic_visit(node), [], None + + def visit_Name(self, node): + if node.id in self.storeOnAssignment: + subscript = self.storeOnAssignment[node.id] + newnode = copy.deepcopy(subscript) + newnode.ctx = node.ctx + return _copy_location(newnode, node) + + return self.generic_visit(node) + + def visit_Assign(self, node): + if astutils.rname(node.targets[0]) in self.accumOnAssignment: + var_name = astutils.rname(node.targets[0]) + array_name, accum = self.accumOnAssignment[var_name] + if isinstance(node.targets[0], ast.Subscript): + array_name += '[' + unparse(node.targets[0].slice) + ']' + if '[' not in array_name: + array_name += '[:]' + + newnode = ast.parse('{out} = {accum}({out}, {val})'.format( + out=array_name, accum=unparse(accum), + val=unparse(node.value))).body[0] + newnode = _copy_location(newnode, node) + return newnode + + return self.generic_visit(node) + + def visit_Call(self, node): + if '.push' in astutils.rname(node.func): + node.func.attr = 'append' + return self.generic_visit(node) + + # Control flow: for-loop is the same as dace.iterate in the right context + def visit_For(self, node): + if not isinstance(self.curprim, astnodes._DataFlowNode): + self.curchild += 1 + + oldchild = self.curchild + oldprim = self.curprim + self.curprim = self.curprim.children[self.curchild] + self.curchild = -1 + + newbody = [] + end_stmts = [] + substitute_stmts = [] + # Incrementally build new body from original body + for stmt in node.body: + if isinstance(stmt, ast.Expr): + res, append, prepend = self.VisitTopLevelExpr(stmt) + if res is not None: + newbody.append(res) + if append is not None: + end_stmts.extend(append) + if prepend is not None: + substitute_stmts.extend(prepend) + else: + subnodes = self.visit(stmt) + if subnodes is not None: + if isinstance(subnodes, list): + newbody.extend(subnodes) + else: + newbody.append(subnodes) + node.body = newbody + end_stmts + substitute_stmts.append(node) + + self.curchild = oldchild + self.curprim = oldprim + return substitute_stmts + return self.generic_visit(node) + + # Control flow: while-loop is the same as dace.loop in the right context + def visit_While(self, node): + return self.visit_For(node) + + # Control flow: if-condition is the same as dace.conditional in the right context + def visit_If(self, node): + if not isinstance(self.curprim, astnodes._DataFlowNode): + self.curchild += 1 + + oldchild = self.curchild + oldprim = self.curprim + self.curprim = 
self.curprim.children[self.curchild] + self.curchild = -1 + + newbody = [] + end_stmts = [] + substitute_stmts = [] + # Incrementally build new body from original body + for stmt in node.body: + if isinstance(stmt, ast.Expr): + res, append, prepend = self.VisitTopLevelExpr(stmt) + if res is not None: + newbody.append(res) + if append is not None: + end_stmts.extend(append) + if prepend is not None: + substitute_stmts.extend(prepend) + else: + subnodes = self.visit(stmt) + if subnodes is not None: + if isinstance(subnodes, list): + newbody.extend(subnodes) + else: + newbody.append(subnodes) + node.body = newbody + end_stmts + + self.curchild = oldchild + self.curprim = oldprim + + # Process 'else'/'elif' statements + if len(node.orelse) > 0: + self.curchild += 1 + + oldchild = self.curchild + oldprim = self.curprim + self.curprim = self.curprim.children[self.curchild] + self.curchild = -1 + + newbody = [] + end_stmts = [] + # Incrementally build new body from original body + for stmt in node.orelse: + if isinstance(stmt, ast.Expr): + res, append, prepend = self.VisitTopLevelExpr(stmt) + if res is not None: + newbody.append(res) + if append is not None: + end_stmts.extend(append) + if prepend is not None: + substitute_stmts.extend(prepend) + else: + subnodes = self.visit(stmt) + if subnodes is not None: + if isinstance(subnodes, list): + newbody.extend(subnodes) + else: + newbody.append(subnodes) + node.orelse = newbody + end_stmts + + self.curchild = oldchild + self.curprim = oldprim + + substitute_stmts.append(node) + return substitute_stmts + + return self.generic_visit(node) + + +def simulate_reduce(op, in_array, out_array, axis=None, identity=None): + inshape = numpy.shape(in_array) + outshape = numpy.shape(out_array) + + # Argument validation + if axis is None and (len(outshape) != 1 or outshape[0] != 1): + raise RuntimeError("Cannot reduce to non-scalar value") + if axis is not None and (axis < 0 or axis >= len(in_array.shape)): + raise RuntimeError("Cannot reduce in nonexistent axis " + str(axis)) + + unreduced = outshape[:axis] + (inshape[axis], ) + outshape[axis:] + if unreduced != inshape: + raise RuntimeError("Incompatible shapes in reduction: " + + str(inshape) + " -> " + str(outshape)) + # End of argument validation + + # Reduce everything + if axis is None: + storevalue = True + + # If we have an initial value to insert + if identity is not None: + out_array[0] = identity + storevalue = False + + for i in numpy.nditer(in_array): + if storevalue: # If no identity value given, store first value as output + out_array[0] = i + storevalue = False + else: + out_array[0] = op(out_array[0], i) + + else: # Reduce a single axis + storevalue = True + + # If we have an initial value to insert + if identity is not None: + # Store identity scalar in output array + out_array[:] = identity + storevalue = False + + # Determine reduction slice (A[:,:,...,:,i,:,...,:]) + red_slice = [slice(None, None, None) for i in inshape] + for i in ndloop.xxrange(inshape[axis]): + red_slice[axis] = slice(i, i + 1, None) + + inslice = in_array[red_slice] + + if storevalue: + # Store initial value + for arrout, arrin in zip( + numpy.nditer(out_array, op_flags=['readwrite']), + numpy.nditer(inslice)): + arrout[...] = arrin + storevalue = False + else: + # Reduce entire (N-1)-dimensional tensor for the given slice + for arrout, arrin in zip( + numpy.nditer(out_array, op_flags=['readwrite']), + numpy.nditer(inslice)): + arrout[...] 
= op(arrout, arrin) diff --git a/dace/frontend/tensorflow/__init__.py b/dace/frontend/tensorflow/__init__.py new file mode 100644 index 0000000000..910095ba68 --- /dev/null +++ b/dace/frontend/tensorflow/__init__.py @@ -0,0 +1 @@ +from .tensorflow import * diff --git a/dace/frontend/tensorflow/tensorflow.py b/dace/frontend/tensorflow/tensorflow.py new file mode 100644 index 0000000000..939506da21 --- /dev/null +++ b/dace/frontend/tensorflow/tensorflow.py @@ -0,0 +1,2579 @@ +# -*- coding: utf-8 -*- +# Author: Roman Haag + +# TODO: This code should undergo major refactoring + +import dace +from dace.memlet import Memlet, EmptyMemlet +from dace import SDFG, SDFGState +from dace.graph.nodes import Tasklet, NestedSDFG + +import numpy as np +from collections import OrderedDict +import re + +try: + import tensorflow as tf +except ImportError: + raise ImportError('Cannot use Tensorflow frontend without Tensorflow, ' + + 'please install: https://www.tensorflow.org/install/') + +from tensorflow.python.framework import tensor_util + + +# http://stackoverflow.com/q/3844948/ +def _checkEqualIvo(lst): + return not lst or lst.count(lst[0]) == len(lst) + + +def _tensortype(tensor: tf.Tensor): + """ Returns a numpy type from a given TF tensor. """ + + # Heuristics to determine op type + if isinstance(tensor, tf.Operation): + if len(tensor.outputs) == 1: + tensor = tensor.outputs[0] + elif len(tensor.inputs) == 1: + tensor = tensor.inputs[0] + elif _checkEqualIvo([inp.dtype for inp in tensor.inputs]): + tensor = tensor.inputs[0] + else: + try: + dtype = tensor.get_attr('T') + if dtype.as_numpy_dtype == object: + raise NotImplementedError( + 'Type %s is not a valid numpy type' % str(dtype)) + return dtype.as_numpy_dtype + except ValueError: + pass + raise TypeError('Ambiguous type for operation %s' % tensor) + + if tensor.dtype.as_numpy_dtype == object: + raise NotImplementedError( + 'Type %s is not a valid numpy type' % str(tensor.dtype)) + + if (tensor.dtype.is_bool): + return np.int32 + + return tensor.dtype.as_numpy_dtype + + +def _tensorshape(tensor: tf.Tensor): + if tensor.shape.dims is None or tensor.shape.dims == []: + return 1 # Scalar + return tensor.shape + + +def _string_builder(string): + """ To match DaCe variable naming conventions, replaces all undesired + characters with "_". + """ + newstring = string + if (string[0].isdigit()): + newstring = "_" + string + out = re.sub('[^a-zA-Z0-9_]', '_', newstring) + return out + + +def _name(tensor_or_op): + if isinstance(tensor_or_op, tf.Operation): + return None + return _string_builder(tensor_or_op.name) + + +_LASTSESSION = 0 + + +class TFSession: + def __init__(self, name: str = 'tfsession', seed: int = None, config=None): + """ Creates a DaCe Tensorflow session. + @param name: (optional) The name of the resulting SDFG. + @param seed: (optional) Fix random seed. 
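+
+ Example (an illustrative sketch, not part of the original docstring; `x`,
+ `y`, and `input_array` stand for an existing TensorFlow placeholder, an
+ output tensor, and a NumPy array, respectively):
+
+     with TFSession() as sess:
+         result = sess.run(y, feed_dict={x: input_array})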
+ """ + self._internal_session = tf.Session(config=config) + + # Set for bookkeeping of already visited nodes + self.visitedNodes = set() + + # Reinit state only used in training mode + self.reinitState = None + + # Different input dictionaries + self.constDict = dict() + self.varDict = dict() + self.inpDict = dict() + self.reinitDict = dict() + self.initDict = dict() + + self.training = False + self.iterations = 1 + self.seed = seed + self.graph = SDFG(name) + self.kill = False + + def __enter__(self): + return self + + def __exit__(self, exception_type, exception_value, traceback): + pass + + def train(self, optimizer, initializer, iterations, feed_dict, nodes=None): + """ Trains a subgraph for the specified number of iterations and + returns requested nodes after training. + + @param optimizer: A TensorFlow tf.Optimizer node. + @param initializer: Either a list of global and local initializers + or one initializer. + @param iterations: Number of training steps. + @param feed_dict: Dictionary representing input values and arrays + to feed into the evaluator. + @param nodes: (optional) A TensorFlow node or an iterable + (e.g. list) of nodes to evaluate. + @return: A 2-tuple of (varDict, values) - the first is a dictionary + of all variables used in the network in arbitrary order, + and the second is a tuple of values in the same order as + `nodes`. + """ + + # Initialize a new SDFG + self.graph = SDFG(self.graph.name) + self.graph.propagate = False + self.state = SDFGState("s0", self.graph) + self.graph.add_node(self.state) + self.iterations = iterations + state = self.state + sdfg = self.graph + outputs = [] + output_names = [] + # init state + s0 = state + # computational state + s1 = sdfg.add_state('s1') + # empty exit state + s2 = sdfg.add_state('s2') + # As output arrays of conflict resolution currently do not automatically + # get reinitialized in each state iteration, we have to do it manually + # in this state.
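+ # The training SDFG built below is, roughly, the following state machine:
+ #   s0 (init) --[__dacet1 = 0]--> s1 (compute)
+ #   s1 --[if __dacet1 < iterations-1; __dacet1 += 1]--> reinit --> s1
+ #   s1 --[if __dacet1 >= iterations-1]--> s2 (empty exit state)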
+ reinitState = sdfg.add_state("reinitialization") + self.reinitState = reinitState + #set training mode + + self.training = True + + #add edges between states + sdfg.add_edge( + s0, + s1, + dace.graph.edges.InterstateEdge(assignments=dict(__dacet1=0))) + sdfg.add_edge( + s1, reinitState, + dace.graph.edges.InterstateEdge( + condition=dace.properties.CodeProperty.from_string( + "__dacet1 <" + str(iterations - 1), + dace.types.Language.Python), + assignments={'__dacet1': '__dacet1+1'})) + sdfg.add_edge(reinitState, s1, dace.graph.edges.InterstateEdge()) + sdfg.add_edge( + s1, + s2, + dace.graph.edges.InterstateEdge( + condition=dace.properties.CodeProperty.from_string( + "__dacet1 >= " + + str(iterations - 1), dace.types.Language.Python))) + + try: + iter(initializer) + initializer = list(initializer) + except TypeError: + initializer = [initializer] + + try: + iter(nodes) + nodes = list(nodes) + except TypeError: + nodes = [nodes] + + if (not nodes == None): + try: + iter(optimizer) + optimizer = list(optimizer) + except TypeError: + optimizer = [optimizer] + + ########################### + # Prepare subgraph to process + # If only one node was given, construct a list from it + if (not nodes == [None]): + ops = [ + node if isinstance(node, tf.Operation) else node.op + for node in nodes + ] + output_names = [ + _string_builder(node.name) + if not isinstance(node, tf.Operation) else None + for node in nodes + ] + + # Visit initializer and create subgraph for init state + # If only one node was given, construct a list from it + + init = [ + i if isinstance(i, tf.Operation) else i.op for i in initializer + ] + self.visit_backwards(init) + + # Visit the rest of the nodes + self.state = s1 + state = s1 + # As we are in a new state, all variable nodes should be revisited + self.visitedNodes.clear() + self.visit_backwards(optimizer) + if (not nodes == [None]): + self.visit_backwards(ops) + ############################ + + # Remove orphan nodes and register node types + node_types = {} + for state in self.graph.nodes(): + for node in state.nodes(): + if state.in_degree(node) + state.out_degree(node) == 0: + state.remove_node(node) + if node.label in self.constDict: + del self.constDict[node.label] + elif isinstance(node, dace.graph.nodes.AccessNode): + node_types[node.data] = node.desc(self.graph).dtype.type + ############################ + # Set up arguments + sdfg_args = {} + sdfg_args.update(self.constDict) + sdfg_args.update(self.varDict) + sdfg_args.update(self.inpDict) + sdfg_args.update(self.reinitDict) + sdfg_args.update(self.initDict) + + sdfg_args.update({(k if isinstance(k, str) else + _string_builder(k.name + "_Inp")): v + for k, v in feed_dict.items()}) + + # Set scalar arguments to appropriate arrays of size 1 + sdfg_args.update({ + k: (v if isinstance(v, np.ndarray) else np.array( + v, dtype=node_types[k])) + for k, v in sdfg_args.items() + }) + + ############################ + # Create output numpy arrays + if (not nodes == [None]): + outputs = { + name: np.zeros(_tensorshape(node), dtype=_tensortype(node)) + for node, name in zip(nodes, output_names) + if name is not None and name not in sdfg_args + } + outputs.update( + {k: v + for k, v in sdfg_args.items() if k in output_names}) + + sdfg_args.update(outputs) + + ############################ + # Mark outputs as non-transients + for output in outputs: + self.graph.arrays[output].transient = False + ############################ + + # Compile and call the SDFG + self.graph.draw_to_file() + compiled_sdfg = 
self.graph.compile(optimizer=False) + compiled_sdfg(**sdfg_args) + ############################ + + # Return the outputs and weights + + return self.varDict, tuple( + outputs[output] if output is not None else None + for output in output_names) + + def compile(self, nodes, name=None): + """ Compiles a subgraph into a callable function, which is equivalent + to calling `run()`. + @param nodes: Node or an iterable (e.g. list) of nodes to evaluate. + @param name: Name of the SDFG to create, or None for a unique name. + @return: A function that receives a feed_dict, evaluates the nodes, + and returns a tuple of values in the same order as nodes. + """ + # Create a unique name for this session + if name is None: + global _LASTSESSION + _LASTSESSION += 1 + name = "tfsession%d" % _LASTSESSION + + # Initialize a new SDFG + self.graph = SDFG(name) + self.graph.propagate = False + self.state = SDFGState("s0", self.graph) + self.graph.add_node(self.state) + self.visitedNodes.clear() + ############################ + + # Prepare subgraph to process + total_nodes = [] + + # Determine output type + output_type = None + if not isinstance(nodes, + (list, tuple, dict)): # iter() works in TensorFlow + output_type = object + total_nodes.append(nodes) + output_names = _name(nodes) + elif isinstance(nodes, dict): + output_type = type(nodes) + output_names = {} + for k, node in nodes.items(): + try: + iter(node) + if isinstance(node, dict): + raise TypeError( + 'Dictionaries of dictionaries unsupported') + total_nodes.extend(node) + output_names[k] = type(node)(_name(n) for n in node) + except TypeError: + total_nodes.append(node) + output_names[k] = _name(node) + elif isinstance(nodes, (list, tuple)): + output_type = type(nodes) + total_nodes.extend(nodes) + output_names = output_type(_name(node) for node in nodes) + else: + raise TypeError('Unsupported type for fetches: ' + + str(type(nodes))) + + ops = [ + node if isinstance(node, tf.Operation) else node.op + for node in total_nodes + ] + total_output_names = [ + _string_builder(node.name) + if not isinstance(node, tf.Operation) else None + for node in total_nodes + ] + + self.kill = False + self.visit_backwards(ops) + if self.kill: + raise NotImplementedError('Nodes listed above are not implemented') + ############################ + + # Remove orphan nodes and register node types + node_types = {} + for state in self.graph.nodes(): + for node in state.nodes(): + if state.in_degree(node) + state.out_degree(node) == 0: + state.remove_node(node) + if node.label in self.constDict: + del self.constDict[node.label] + elif isinstance(node, dace.graph.nodes.AccessNode): + node_types[node.data] = node.desc(self.graph).dtype.type + ############################ + + # Set up arguments + sdfg_args = {} + sdfg_args.update(self.constDict) + sdfg_args.update(self.varDict) + sdfg_args.update(self.inpDict) + sdfg_args.update(self.initDict) + + # Set scalar arguments to appropriate arrays of size 1 + sdfg_args.update({ + k: (v if isinstance(v, np.ndarray) else np.array( + v, dtype=node_types[k])) + for k, v in sdfg_args.items() + }) + ############################ + + # Create output numpy arrays + outputs = { + name: np.zeros(_tensorshape(node), dtype=_tensortype(node)) + for node, name in zip(total_nodes, total_output_names) + if name is not None and name not in sdfg_args + } + outputs.update( + {k: v + for k, v in sdfg_args.items() if k in total_output_names}) + + sdfg_args.update(outputs) + + ############################ + # Mark outputs as non-transients + for output in 
outputs: + self.graph.arrays[output].transient = False + ############################ + + # Compile the SDFG + self.graph.fill_scope_connectors() + self.graph.draw_to_file() + compiled_sdfg = self.graph.compile(optimizer=False) + + ############################ + # Create the function that invokes the SDFG + def call_func(feed_dict={}): + invoke_args = dict( + sdfg_args, **{(k if isinstance(k, str) else + _string_builder(k.name)): v + for k, v in feed_dict.items()}) + + compiled_sdfg(**invoke_args) + + # Single output + if output_type is object: + return outputs[ + output_names] if output_names is not None else None + # Dictionary of lists/single outputs + elif output_type is dict: + out_dict = {} + for k, v in output_names.items(): + if isinstance(v, (list, tuple)): + out_dict[k] = type(v)( + outputs[vname] if vname is not None else None + for vname in v) + else: + out_dict[k] = outputs[v] if v is not None else None + return out_dict + # List of outputs + else: + return output_type( + outputs[output] if output is not None else None + for output in output_names) + + # Return the function + return call_func + + def run(self, nodes, feed_dict={}, name=None): + """ Evaluates a subgraph and returns a tuple of the evaluated nodes + (behaves similarly to sess.run). + @param nodes: Node or an iterable (e.g. list) of nodes to evaluate. + @param feed_dict: Dictionary representing input values and arrays + to feed in to the evaluator. + @param name: Name of the SDFG to create, or None for a unique name. + + @return: Tuple or dictionary of values in the same order as `nodes`. + """ + callfunc = self.compile(nodes, name=name) + return callfunc(feed_dict=feed_dict) + + def dfs_nodes(self, source): + """ Produce nodes in a depth-first-search (DFS) on a TensorFlow graph. + @param source: The source node to start from. + @return: A generator of nodes in the depth-first-search. + @note: Based on http://www.ics.uci.edu/~eppstein/PADS/DFS.py + by D. Eppstein, July 2004. + """ + + # If source is a list of nodes (or any iterable), start from all + try: + iter(source) + nodes = list(source) + except TypeError: + nodes = [source] + + visited = set() + + for start in nodes: + if start in visited: + continue + visited.add(start) + yield start + + inputSet = [inp.op for inp in start.inputs] + inputSet.extend(list(start.control_inputs)) + stack = [(start, iter(inputSet))] + while stack: + parent, children = stack[-1] + try: + child = next(children) + + if child not in visited: + yield child + visited.add(child) + + inputSet = [inp.op for inp in child.inputs] + inputSet.extend(list(child.control_inputs)) + stack.append((child, iter(inputSet))) + except StopIteration: + stack.pop() + + def visit_backwards(self, node): + """ Visit a graph from an output node backwards to the inputs. """ + for node in self.dfs_nodes(node): + if node not in self.visitedNodes: + self.visit(node) + + def visit(self, node): + """ Visit a specific node in the graph, creating the SDFG. """ + try: + func = getattr(self, "visit_" + node.type) + except AttributeError: + # Only stop processing after all node types have been visited, + # so that we know which implementations are missing. 
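+ # (Dispatch is name-based: a graph node of type T is handled by a
+ # visit_T method, e.g. visit_MatMul for "MatMul" ops; reaching this
+ # branch means no such method exists for this node.type.)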
+ self.kill = True + print('MISSING IMPLEMENTATION:', node.type) + if self.kill == False: + func(node) + #mark node as visited + self.visitedNodes.add(node) + + ###################################################################### + # Operator (TensorFlow graph node) visitors + + def visit_Add(self, node): + self.visit_element_wise_op(node, "+") + + def visit_Mul(self, node): + self.visit_element_wise_op(node, "*") + + def visit_Sub(self, node): + self.visit_element_wise_op(node, "-") + + def visit_RealDiv(self, node): + self.visit_element_wise_op(node, "/") + + def visit_Equal(self, node): + self.visit_element_wise_op(node, "==") + + def visit_Const(self, node): + state = self.state + label = _string_builder(node.name + "_0") + + # Create DaCe shape + shape = dace.properties.ShapeProperty.from_string( + str(_tensorshape(node.outputs[0]))) + # Create np array from tensor value + npArray = tensor_util.MakeNdarray( + node.get_attr('value')).reshape(shape) + + # Add to constDict so that it can be fed to the program + self.constDict[label] = npArray.astype(_tensortype(node)) + + nodeArray = list( + filter(lambda a: a.label == label, self.state.nodes())) + + # If node already present set it non transient, otherwise add node + if (not nodeArray): + dtype = dace.typeclass(_tensortype(node)) + state.add_array(label, shape, dtype, toplevel=True) + else: + nodeArray[0].desc(self.graph).transient = False + + def visit_NoOp(self, node): + # no op case where nothing happens + pass + + def visit_Pack(self, node): + # we do nothing with this op + pass + + def visit_StridedSlice(self, node): + # we do nothing with this op + pass + + def visit_VariableV2(self, node): + + state = self.state + label = _string_builder(node.name) + "_0" + shape = dace.properties.ShapeProperty.from_string( + str(_tensorshape(node.outputs[0]))) + + try: + outputNode = state.find_node(label) + outputNode.desc(self.graph).transient = False + except (LookupError): + dtype = dace.typeclass(_tensortype(node)) + state.add_array(label, shape, dtype) + + # If not already added to the varDict, add a placeholder + # zero-initialized array to it so a value error is not triggered. 
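+ # For example, a variable op named "w1" with shape (784, 10) would be
+ # keyed as "w1_0" and mapped to a zero ndarray of that shape in the
+ # op's dtype (the name and shape here are purely illustrative).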
+ if (label not in self.varDict.keys()): + npArray = np.zeros(shape=shape) + self.varDict[label] = npArray.astype(_tensortype(node)) + + def visit_Assign(self, node): + # Simple memcopy from input1 to input0 as assign has no outputlist but + # input0 is the variable we want to assign + state = self.state + inputList = [] + inputNodes = [] + inputParams = [] + inputDims = [] + + for count, inp in enumerate(node.inputs): + inputNode, params, dims = self.create_and_add_input_node(inp) + inputList.append(inputNode.desc(self.graph)) + inputNodes.append(inputNode) + inputParams.append(params) + inputDims.append(dims) + + memlet = Memlet.simple(inputNodes[0], ",".join(inputDims[0])) + state.add_edge(inputNodes[1], None, inputNodes[0], None, memlet) + + def visit_Placeholder(self, node): + + outputShape = [] + outputParams = [] + outputDims = [] + inputShape = [] + inputParams = [] + inputDims = [] + outputTensor = node.outputs[0] + state = self.state + label = _string_builder(node.name + "_0") + + # Check if the node is already in the graph and get as a list + try: + outputNode = state.find_node(label) + + except (LookupError): + outputNode = self.create_and_add_output_node(node) + + dtype = _tensortype(node) + + # If we are in training mode, we set up another map to reduce the huge + # (iterations x batchsize x size of input) input to one dimension less + if (self.training): + # Output dimensions of the map + + outputDims = self.get_default_dims(outputTensor) + outputParams = self.get_default_params(outputTensor, 1) + outputShape = list(map(str, _tensorshape(outputTensor))) + + # Prepend the iterations dimension to the input (t1=iterations) + inputShape.append(str(self.iterations)) + inputShape.extend(outputShape) + inputParams.append("i0") + inputParams.extend(outputParams) + inputDims.append("__dacet1:__dacet1+1") + inputDims.extend(outputDims) + + #create node for the training examples + shape = dace.properties.ShapeProperty.from_string( + ",".join(inputShape)) + dtype = _tensortype(node) + inputNode = state.add_array( + name=label + "_Inp", shape=shape, dtype=dace.typeclass(dtype)) + + #create and add mapp + mapDict = dict(zip(inputParams, inputDims)) + inMemletDict = dict( + j0=Memlet.simple(inputNode, ",".join(inputParams))) + outMemletDict = dict( + out=Memlet.simple(outputNode, ",".join(outputParams))) + code = "out = j0" + tasklet, map_entry, map_exit = state.add_mapped_tasklet( + label, mapDict, inMemletDict, code, outMemletDict) + state.add_edge(inputNode, None, map_entry, None, + Memlet.simple(inputNode, ",".join(inputDims))) + state.add_edge(map_exit, None, outputNode, None, + Memlet.simple(outputNode, ",".join(outputDims))) + + # If training example node is not already in inputDict, add a + # zero array. This prevents DaCe from raising a key error when + # trying to call the dace function if we only execute a subgraph + # where it does not appear. This might not be necessary any longer. + if (label + "_Inp" not in self.inpDict.keys()): + self.inpDict[label + "_Inp"] = np.zeros( + tuple(map(int, (inputShape))), dtype=dtype) + + # If we are not training, set the output non transient and add to + # input dict + else: + outputNode.desc(self.graph).transient = False + self.inpDict[label] = np.zeros( + tuple(map(int, (outputNode.desc(self.graph).shape))), + dtype=dtype) + + def visit_TruncatedNormal(self, node): + # Creates a truncated normal array and adds it to initDict + state = self.state + label = _string_builder(node.name + "_0") + # Check if already in graph, set non-transient. 
Otherwise add to graph. + try: + outputNode = state.find_node(label) + outputNode.desc(self.graph).transient = False + + except (LookupError): + self.create_and_add_output_node(node) + + seed = 0 if self.seed is None else self.seed + + array = tf.truncated_normal( + node.outputs[0].shape, + seed=seed).eval(session=self._internal_session) + self.initDict[label] = array.astype(_tensortype(node)) + + def visit_RandomStandardNormal(self, node): + + state = self.state + label = _string_builder(node.name + "_0") + + try: + outputNode = state.find_node(label) + outputNode.desc(self.graph).transient = False + + except (LookupError): + self.create_and_add_output_node(node) + + array = tf.random_normal( + node.outputs[0].shape, + seed=self.seed).eval(session=self._internal_session) + self.initDict[label] = array.astype(_tensortype(node)) + + def visit_RandomUniform(self, node): + # Creates a random uniform array and adds it to initDict + state = self.state + label = _string_builder(node.name + "_0") + # Check if already in graph, set non-transient. Otherwise add to graph. + try: + outputNode = state.find_node(label) + outputNode.desc(self.graph).transient = False + + except (LookupError): + self.create_and_add_output_node(node) + + seed = 0 if self.seed is None else self.seed + + array = tf.random_uniform( + node.outputs[0].shape, + seed=seed).eval(session=self._internal_session) + self.initDict[label] = array.astype(_tensortype(node)) + + def visit_RandomUniformInt(self, node): + # Creates a random uniform array and adds it to initDict + state = self.state + label = _string_builder(node.name + "_0") + # Check if already in graph, set non-transient. Otherwise add to graph. + try: + outputNode = state.find_node(label) + outputNode.desc(self.graph).transient = False + + except (LookupError): + self.create_and_add_output_node(node) + + seed = 0 if self.seed is None else self.seed + + array = tf.random_uniform( + node.outputs[0].shape, + dtype=tf.as_dtype(_tensortype(node)), + minval=node.inputs[1], + maxval=node.inputs[2], + seed=seed).eval(session=self._internal_session) + self.initDict[label] = array.astype(_tensortype(node)) + + def visit_Fill(self, node): + # Fills an array with a scalar input value + state = self.state + inputList = [] + inputNodes = [] + outputList = [] + mapParams = [] + mapRange = [] + outputParams = [] + outputDims = [] + inputParams = [] + inputDims = [] + + for count, inp in enumerate(node.inputs): + # Scalar input is at position 1 + if (count == 1): + inp, params, dims = self.create_and_add_input_node(inp) + inputList.append(inp.desc(self.graph)) + inputNodes.append(inp) + inputParams.append(params) + inputDims.append(dims) + + outputList = self.create_and_add_output_node(node) + + for out in node.outputs: + params = self.get_default_params(out, 1) + dims = self.get_default_dims(out) + outputParams.append(params) + outputDims.append(dims) + + mapLabel = _string_builder(node.type) + mapParams = inputParams[0] + outputParams[0] + mapRange = inputDims[0] + outputDims[0] + mapEntry, mapExit = state.add_map(mapLabel, + dict(zip(mapParams, mapRange))) + tasklet = state.add_tasklet(mapLabel, {'j0'}, {'out'}, "out = j0") + self.add_out_memlets(outputList, mapExit, tasklet, outputDims, + outputParams) + self.add_in_memlets(inputNodes, mapEntry, tasklet, inputDims, + inputParams) + + def visit_Mean(self, node): + + inputList = [] + inputNodes = [] + outputList = [] + state = self.state + mapParams = [] + mapRange = [] + outputParams = [] + outputDims = [] + inputParams = [] + 
inputDims = [] + + for count, inp in enumerate(node.inputs): + if (count == 0): + inputNode, params, dims = self.create_and_add_input_node(inp) + inputList.append(inputNode.desc(self.graph)) + inputNodes.append(inputNode) + inputParams.append(params) + inputDims.append(dims) + # Need to get total size of array + n = 1 + for i in (inputNode.desc(self.graph).shape): + n *= i + # Output is scalar + outputParams = ["i1"] + outputDims = ["0:1"] + outputShape = dace.properties.ShapeProperty.from_string( + str(_tensorshape(node.outputs[0]))) + outputNode = state.add_transient( + _string_builder(node.outputs[0].name), + outputShape, + dace.typeclass(_tensortype(inp)), + toplevel=True) + outputList = [] + outputList.append(outputNode) + + mapLabel = _string_builder(node.type) + mapParams = inputParams[0] + outputParams + mapRange = inputDims[0] + outputDims + + mapEntry, mapExit = state.add_map(mapLabel, + dict(zip(mapParams, mapRange))) + self.reinitCR(outputList[0], [["i0"]], [["0:1"]], "0") + tasklet = state.add_tasklet(mapLabel, {'j0'}, {'out'}, + "out = j0/" + str(n)) + self.add_out_memlets(outputList, mapExit, tasklet, [outputDims], + [outputParams], "lambda a, b: (a + b)", 0) + self.add_in_memlets(inputNodes, mapEntry, tasklet, inputDims, + inputParams) + + def visit_Tile(self, node): + # Replicates input multiple times + inputList = [] + inputNodes = [] + + state = self.state + + for inp in node.inputs: + + label = _string_builder(inp.name) + try: + inputNode = state.find_node(label) + except (LookupError): + + inputNode = self.create_and_add_input_node(inp)[0] + + inputNodes.append(inputNode) + inputList.append(inputNode.desc(self.graph)) + + outputList = self.create_and_add_output_node(node) + + mapLabel = _string_builder(node.type) + outputDims = self.get_default_dims(node.outputs[0]) + outputParams = self.get_default_params(node.outputs[0]) + inputDims = self.get_default_dims(node.inputs[0]) + inputParams = [] + + for i, dim in enumerate(inputList[0].shape): + inputParams.append("i" + str(i) + "%" + str(dim)) + + mapDict = dict(zip(outputParams, outputDims)) + inMemletDict = dict( + j0=Memlet.simple(inputNodes[0], ",".join(inputParams))) + outMemletDict = dict( + out=Memlet.simple(outputList[0], ",".join(outputParams))) + code = "out = j0" + tasklet, map_entry, map_exit = state.add_mapped_tasklet( + mapLabel, mapDict, inMemletDict, code, outMemletDict) + state.add_edge(inputNodes[0], None, map_entry, None, + Memlet.simple(inputNodes[0], ",".join(inputDims))) + state.add_edge(map_exit, None, outputList[0], None, + Memlet.simple(outputList[0], ",".join(outputDims))) + + def visit_PreventGradient(self, node): + # Just a memcopy, works like visit_assign or visit_identity + state = self.state + inputList = [] + inputNodes = [] + outputList = [] + outputParams = [] + outputDims = [] + inputParams = [] + inputDims = [] + + for count, inp in enumerate(node.inputs): + #relevant input is at position 0 + if (count == 0): + inputNode, params, dims = self.create_and_add_input_node(inp) + inputList.append(inputNode.desc(self.graph)) + inputNodes.append(inputNode) + inputParams.append(params) + inputDims.append(dims) + + outputList = self.create_and_add_output_node(node) + + for count, out in enumerate(node.outputs): + + dims = self.get_default_dims(out) + params = self.get_default_params(out) + outputParams.append(params) + outputDims.append(dims) + + memlet = Memlet.simple(inputNodes[0], ",".join(inputDims[0])) + state.add_edge(inputNodes[0], None, outputList[0], None, memlet) + + def 
visit_ExpandDims(self, node): + # Takes an N-dimensional array and adds one dimension to it with a + # length of 1. Example: (M,K) -> (1,M,K). + # We can just use DaCe memory copy to do the same + state = self.state + inputList = [] + inputNodes = [] + inputDims = [] + inputParams = [] + + for count, inp in enumerate(node.inputs): + if (count == 0): + inputNode, params, dims = self.create_and_add_input_node(inp) + inputList.append(inputNode.desc(self.graph)) + inputNodes.append(inputNode) + inputDims.append(dims) + inputParams.append(params) + + outputList = self.create_and_add_output_node(node) + memlet = Memlet.simple(inputNodes[0], ",".join(inputDims[0])) + state.add_edge(inputNodes[0], None, outputList[0], None, memlet) + + def visit_ApplyGradientDescent(self, node): + + state = self.state + inputList = [] + inputNodes = [] + mapParams = [] + mapRange = [] + inputParams = [] + inputDims = [] + + for count, inp in enumerate(node.inputs): + + inputNode, params, dims = self.create_and_add_input_node(inp) + inputParams.append(params) + inputDims.append(dims) + inputList.append(inputNode.desc(self.graph)) + inputNodes.append(inputNode) + + mapLabel = _string_builder(node.type) + #inputList[1] is learning rate which needs its own parameter + inputParams[1] = ["i4"] + # This is the variable which is input and output of this map at the same + # time. We create the output version of it here + out = node.inputs[0] + shape = dace.properties.ShapeProperty.from_string( + str(_tensorshape(out))) + outName = _string_builder(out.name) + dtype = _tensortype(out) + outputNode = state.add_array(outName, shape, dtype) + dims = self.get_default_dims(out) + params = self.get_default_params(out) + outputList = [outputNode] + outputParams = [params] + outputDims = [dims] + + mapLabel = _string_builder(node.type) + mapParams = inputParams[0] + ["i4"] + mapRange = inputDims[0] + ["0:1"] + mapEntry, mapExit = state.add_map(mapLabel, + dict(zip(mapParams, mapRange))) + tasklet = state.add_tasklet(mapLabel, {'j0', 'j1', 'j2'}, {'out'}, + "out = j0-(j1*j2)") + self.add_in_memlets(inputNodes, mapEntry, tasklet, inputDims, + inputParams) + self.add_out_memlets(outputList, mapExit, tasklet, outputDims, + outputParams) + + def visit_MatMul(self, node): + # 2d Matrix Multiplication + inputList = [] + inputNodes = [] + state = self.state + mapParams = [] + outputParams = [[]] + mapRange = [] + outputDims = [[]] + inputParams = [[], []] + inputDims = [[], []] + + for inp in node.inputs: + inputNode = self.create_and_add_input_node(inp)[0] + inputList.append(inputNode.desc(self.graph)) + inputNodes.append(inputNode) + + outputList = self.create_and_add_output_node(node) + + ndims = len(outputList[0].desc(self.graph).shape) + # Params for higher dimensions (not verified) + # (for 2d it works) + for i in range(0, ndims + 1): + if (i == ndims): + mapParams.append("i" + str(i)) + inputParams[1].append("i" + str(i)) + outputParams[0].append("i" + str(i)) + + elif (i == ndims - 1): + mapParams.append("i" + str(i)) + inputParams[0].append("i" + str(i)) + inputParams[1].append("i" + str(i)) + + elif (i == ndims - 2): + mapParams.append("i" + str(i)) + inputParams[0].append("i" + str(i)) + outputParams[0].append("i" + str(i)) + + else: + mapParams.append("i" + str(i)) + inputParams[0].append("i" + str(i)) + inputParams[1].append("i" + str(i)) + outputParams[0].append("i" + str(i)) + + for i in range(0, ndims): + inputDims[0].append(str(0) + ":" + str(node.inputs[0].shape[i])) + inputDims[1].append(str(0) + ":" + 
str(node.inputs[1].shape[i])) + outputDims[0].append(str(0) + ":" + str(node.outputs[0].shape[i])) + mapRange.append(str(0) + ":" + str(node.inputs[0].shape[i])) + + mapRange.append(str(0) + ":" + str(node.outputs[0].shape[ndims - 1])) + #if first input needs to be transposed + if (node.get_attr("transpose_a")): + mapRange[0], mapRange[1] = mapRange[1], mapRange[0] + inputParams[0][0], inputParams[0][1] = inputParams[0][ + 1], inputParams[0][0] + #if second input needs to be transposed + if (node.get_attr("transpose_b")): + inputParams[1][0], inputParams[1][1] = inputParams[1][ + 1], inputParams[1][0] + + mentry, mexit = state.add_map('matmul_outer', + {mapParams[1]: mapRange[1]}, + dace.ScheduleType.Sequential) + minentry, minexit = state.add_map('matmul_inner', { + mapParams[0]: mapRange[0], + mapParams[2]: mapRange[2] + }, dace.ScheduleType.CPU_Multicore) + tasklet = state.add_tasklet('mm_code', {'j0', 'j1'}, {'out'}, + 'out = j0*j1') + + for i, inp in enumerate(inputNodes): + name = "j" + str(i) + memlet = Memlet.simple(inp, ",".join(inputParams[i])) + state.add_edge(minentry, None, tasklet, name, memlet) + + for i, out in enumerate(outputList): + name = "out" + memlet = Memlet.simple( + out, + ",".join(outputParams[i]), + wcr_str='lambda a,b: a+b', + wcr_identity=0) + state.add_edge(tasklet, name, minexit, None, memlet) + + self.reinitCR(outputList[0], outputParams, outputDims, '0') + self.add_out_memlets(outputList, mexit, minexit, outputDims, + outputParams, 'lambda a,b: a+b', 0) + self.add_in_memlets(inputNodes, mentry, minentry, inputDims, + inputParams) + + def visit_element_wise_op(self, node, operation): + """ Handles all the element wise operations, supports broadcasting. """ + inputList = [] + inputNodes = [] + mapParams = [] + outputParams = [] + mapRange = [] + outputDims = [] + inputParams = [] + inputDims = [] + state = self.state + + for inp in node.inputs: + + inputNode, _, dims = self.create_and_add_input_node(inp) + inputList.append(inputNode.desc(self.graph)) + inputNodes.append(inputNode) + inputDims.append(dims) + + outputNodes = self.create_and_add_output_node(node) + mapLabel = _string_builder(node.type) + #create params + for inp in inputList: + inputParamsString = [] + for i, dim in enumerate(inp.shape): + #scalar case that we want to broadcast + if (str(dim) == "1"): + inputParamsString.append("0") + else: + inputParamsString.append("i" + str(i)) + + inputParams.append(inputParamsString) + + params = self.get_default_params(node.outputs[0]) + dims = self.get_default_dims(node.outputs[0]) + outputParams.append(params) + outputDims.append(dims) + + mapParams = outputParams[0] + mapRange = outputDims[0] + mapEntry, mapExit = state.add_map(mapLabel, + dict(zip(mapParams, mapRange))) + tasklet = state.add_tasklet(mapLabel, {'j0', 'j1'}, {'out'}, + "out = j0 " + operation + " j1") + self.add_out_memlets(outputNodes, mapExit, tasklet, outputDims, + outputParams) + self.add_in_memlets(inputNodes, mapEntry, tasklet, inputDims, + inputParams) + + def visit_Conv2D(self, node): + + inputList = [] + inputNodes = [] + ndims = 0 + strides = node.get_attr("strides")[1] + state = self.state + + for inp in node.inputs: + + inputNode = self.create_and_add_input_node(inp)[0] + inputList.append(inputNode.desc(self.graph)) + inputNodes.append(inputNode) + + outputList = self.create_and_add_output_node(node) + ndims = len(outputList[0].desc(self.graph).shape) + mapLabel = _string_builder(node.type) + + mapParams = [] + outputParams = [] + mapRange = [] + outputDims = [[]] + 
inputParams = [] + inputDims = [[], []] + #create conv params + inputParams.append([ + "i0", "i1*" + str(strides) + "+i5", "i2*" + str(strides) + "+i6", + "i3" + ]) + inputParams.append(["i5", "i6", "i3", "i4"]) + outputParams.append(["i0", "i1", "i2", "i4"]) + #create conv dims + for i in range(0, ndims): + inputDims[0].append(str(0) + ":" + str(node.inputs[0].shape[i])) + inputDims[1].append(str(0) + ":" + str(node.inputs[1].shape[i])) + outputDims[0].append(str(0) + ":" + str(node.outputs[0].shape[i])) + # add a padding map for same padding(zero padding so that input and + # output of convolution have the same size) + if (str(node.get_attr("padding"))[2:-1] == "SAME"): + paddedInput, paddedDims = self.inputPadding( + node, inputNodes[0], inputList[0], outputList[0].desc( + self.graph).shape[1], inputList[1].shape[0], strides, + inputDims[0]) + inputDims[0] = paddedDims + inputList[0] = paddedInput + + mapParams = outputParams[0] + mapParams2 = inputParams[1][:-1] + mapRange = outputDims[0] + mapRange2 = inputDims[1][:-1] + + mapEntry, mapExit = state.add_map(mapLabel + "_outer", + dict(zip(mapParams, mapRange))) + mapEntry2, mapExit2 = state.add_map(mapLabel + "_inner", + dict(zip(mapParams2, mapRange2))) + self.reinitCR(outputList[0], outputParams, outputDims, "0") + tasklet = state.add_tasklet(mapLabel, {'j0', 'j1'}, {'out'}, + "out = j0 * j1") + self.add_out_memlets(outputList, mapExit, mapExit2, outputDims, + outputParams, 'lambda a,b: a+b', 0) + self.add_in_memlets(inputNodes, mapEntry, mapEntry2, inputDims, + inputParams) + #add memlets from inner map to tasklet + for i, inp in enumerate(inputNodes): + name = "j" + str(i) + memlet = Memlet.simple(inp, ",".join(inputParams[i])) + state.add_edge(mapEntry2, None, tasklet, name, memlet) + #add memelets from tasklet to cr + for i, out in enumerate(outputList): + name = "out" + memlet = Memlet.simple( + out, + ",".join(outputParams[i]), + wcr_str='lambda a,b: a+b', + wcr_identity=0) + state.add_edge(tasklet, name, mapExit2, None, memlet) + + def visit_BiasAdd(self, node): + + inputList = [] + inputNodes = [] + state = self.state + + for inp in node.inputs: + inputNode = self.create_and_add_input_node(inp)[0] + inputList.append(inputNode.desc(self.graph)) + inputNodes.append(inputNode) + + outputList = self.create_and_add_output_node(node) + dims = outputList[0].desc(self.graph).shape + + mapLabel = _string_builder(node.type) + mapParams = [] + outputParams = [] + mapRange = [] + outputDims = [] + inputParams = [[], []] + inputDims = [[], []] + + params = self.get_default_params(node.outputs[0]) + dims = self.get_default_dims(node.outputs[0]) + outputParams.append(params) + outputDims.append(dims) + + mapParams = outputParams[0] + inputParams[0] = outputParams[0] + #the bias matches the last dimension of input resp. 
output + inputParams[1] = [mapParams[-1]] + mapRange = outputDims[0] + inputDims[0] = outputDims[0] + inputDims[1] = ["0:" + str(node.inputs[1].shape[0])] + + mapEntry, mapExit = state.add_map(mapLabel, + dict(zip(mapParams, mapRange))) + tasklet = state.add_tasklet(mapLabel, {'j0', 'j1'}, {'out'}, + "out = j0 + j1") + self.add_out_memlets(outputList, mapExit, tasklet, outputDims, + outputParams) + self.add_in_memlets(inputNodes, mapEntry, tasklet, inputDims, + inputParams) + + def visit_MaxPool(self, node): + + inputList = [] + inputNodes = [] + dims = [] + inputDims = [] + strides = node.get_attr("strides")[1] + ksize = node.get_attr("ksize")[1] + state = self.state + + for inp in node.inputs: + inputNode, _, dims = self.create_and_add_input_node(inp) + inputList.append(inputNode.desc(self.graph)) + inputNodes.append(inputNode) + inputDims.append(dims) + inputParams = [[ + "i0", "i1*" + str(strides) + "+i4", "i2*" + str(strides) + "+i5", + "i3" + ]] + + outputParams = [] + outputDims = [] + outputList = self.create_and_add_output_node(node) + dims = self.get_default_dims(node.outputs[0]) + params = self.get_default_params(node.outputs[0]) + outputDims.append(dims) + outputParams.append(params) + + mapLabel = _string_builder(node.type) + mapParams1 = outputParams[0] + mapRange1 = outputDims[0] + mapParams2 = ["i4", "i5"] + mapRange2 = ["0:" + str(ksize), "0:" + str(ksize)] + + mapEntry, mapExit = state.add_map(mapLabel + "_outer", + dict(zip(mapParams1, mapRange1))) + mapEntry2, mapExit2 = state.add_map(mapLabel + "_inner", + dict(zip(mapParams2, mapRange2))) + tasklet = state.add_tasklet(mapLabel, {'j0'}, {'out'}, "out = j0") + self.reinitCR(outputList[0], outputParams, outputDims, "-9999999999") + self.add_out_memlets(outputList, mapExit, mapExit2, outputDims, + outputParams, 'lambda a,b: max(a,b)', -9999999999) + self.add_in_memlets(inputNodes, mapEntry, mapEntry2, inputDims, + inputParams) + #add memlets from inner map to tasklet + for i, inp in enumerate(inputNodes): + name = "j" + str(i) + memlet = Memlet.simple(inp, ",".join(inputParams[i])) + state.add_edge(mapEntry2, None, tasklet, name, memlet) + #add memelets from tasklet to cr + for i, out in enumerate(outputList): + name = "out" + memlet = Memlet.simple( + out, + ",".join(outputParams[i]), + wcr_str='lambda a,b: max(a,b)', + wcr_identity=-9999999999) + state.add_edge(tasklet, name, mapExit2, None, memlet) + + def visit_Relu(self, node): + + inputList = [] + inputNodes = [] + state = self.state + inputParams = [] + inputDims = [] + + for inp in node.inputs: + + inputNode, params, dims = self.create_and_add_input_node(inp) + inputList.append(inputNode.desc(self.graph)) + inputNodes.append(inputNode) + inputParams.append(params) + inputDims.append(dims) + + outputList = self.create_and_add_output_node(node) + + mapLabel = _string_builder(node.type) + mapParams = [] + mapRange = [] + mapParams = inputParams[0] + mapRange = inputDims[0] + + mapEntry, mapExit = state.add_map(mapLabel, + dict(zip(mapParams, mapRange))) + tasklet = state.add_tasklet(mapLabel, {'j0'}, {'out'}, + "out = max(dace.float32(0),j0)") + self.add_out_memlets(outputList, mapExit, tasklet, inputDims, + inputParams) + self.add_in_memlets(inputNodes, mapEntry, tasklet, inputDims, + inputParams) + + def visit_ShapeN(self, node): + inputList = [] + inputNodes = [] + state = self.state + inputParams = [] + inputDims = [] + + for inp in node.inputs: + inputNode, params, dims = self.create_and_add_input_node(inp) + inputList.append(inputNode.desc(self.graph)) + 
inputNodes.append(inputNode) + inputParams.append(params) + inputDims.append(dims) + + outputList = self.create_and_add_output_node(node) + + mapLabel = _string_builder(node.type) + for i, node in enumerate(outputList): + tasklet = state.add_tasklet( + mapLabel + str(i), {}, {'out'}, '\n'.join([ + 'out[%d] = %s' % (j, dim) + for j, dim in enumerate(inputList[i].shape) + ])) + self.state.add_edge( + tasklet, 'out', node, None, + Memlet.simple(node, '0:' + str(len(inputDims[i])))) + + def visit_Reshape(self, node): + + state = self.state + inputList = [] + inputNodes = [] + + inp = node.inputs[0] + inputParams = [] + inputDims = [] + inputNode, params, dims = self.create_and_add_input_node(inp) + inputParams.append(params) + inputDims.append(dims) + inDims = max(inp.shape.ndims, 1) + inputList.append(inputNode.desc(self.graph)) + inputNodes.append(inputNode) + + outputDims = [] + outputList = self.create_and_add_output_node(node) + dims = outputList[0].desc(self.graph).shape + outDims = len(dims) + outputDims.append(self.get_default_dims(node.outputs[0])) + + mapLabel = _string_builder(node.type) + mapParams = [] + outputParams = [[]] + mapRange = [] + mapParams = inputParams[0] + mapRange = inputDims[0] + + # Reshape from 4 to 2 dimensions + if (inDims > outDims): + outputParams[0] = [ + "i0", "i1*" + str(node.inputs[0].shape[2]) + "*" + str( + node.inputs[0].shape[3]) + "+i2*" + str( + node.inputs[0].shape[3]) + "+i3" + ] + # Reshape from 2 to 4 dimensions + elif (inDims < outDims): + outputParams[0] = [ + "i0", "i1/(" + str(node.outputs[0].shape[2]) + "*" + str( + node.outputs[0].shape[3]) + ")", + "(i1%" + "(" + str(node.outputs[0].shape[2]) + "*" + str( + node.outputs[0].shape[3]) + "))/" + str( + node.outputs[0].shape[3]), + "i1%" + str(node.outputs[0].shape[3]) + ] + # If they have the same dimension + else: + outputParams[0] = mapParams + mapRange = outputDims[0] + inputDims[0] = outputDims[0] + + mapEntry, mapExit = state.add_map(mapLabel, + dict(zip(mapParams, mapRange))) + tasklet = state.add_tasklet(mapLabel, {'j0'}, {'out'}, "out = j0") + self.add_out_memlets(outputList, mapExit, tasklet, outputDims, + outputParams) + self.add_in_memlets(inputNodes, mapEntry, tasklet, inputDims, + inputParams) + + def visit_MaxPoolGrad(self, node): + # TODO: Currently only supports 2x2 maxpooling + state = self.state + mapParams = [] + mapRange = [] + outputParams = [] + outputDims = [] + inputParams = [] + inputDims = [] + inputList = [] + inputNodes = [] + + for count, inp in enumerate(node.inputs): + + inputNode, _, dims = self.create_and_add_input_node(inp) + inputList.append(inputNode.desc(self.graph)) + inputNodes.append(inputNode) + params = [] + + for ndims, dim in enumerate(inp.shape): + if ((not count == 0) and (ndims == 1 or ndims == 2)): + params.append("i" + str(ndims) + "/2") + + else: + params.append("i" + str(ndims)) + + inputParams.append(params) + inputDims.append(dims) + + outputList = self.create_and_add_output_node(node) + mapLabel = _string_builder(node.type) + + dtype = dace.typeclass(_tensortype(node)) + shape = dace.properties.ShapeProperty.from_string( + str(inputList[0].shape)) + + tempNode = state.add_transient( + _string_builder(node.name + "_tmp"), shape, dtype, toplevel=True) + tempList = [tempNode] + + outputDims = inputDims + outputParams = inputParams + # Copy as we manipulate inputParams but don't want map params/range to + # change + mapParams = inputParams[0].copy() + mapRange = inputDims[0].copy() + + mapEntry, mapExit = state.add_map(mapLabel + "_map1", + 
dict(zip(mapParams, mapRange))) + tasklet = state.add_tasklet( + mapLabel + "_map1", {'j0', 'j1', 'j2'}, {'out'}, + "if (j0==j1):\n\tout = j2\nelse:\n\tout = 0") + + self.add_out_memlets(tempList, mapExit, tasklet, outputDims, + outputParams) + self.add_in_memlets(inputNodes, mapEntry, tasklet, inputDims, + inputParams) + + # Second map: + # as we don't have the indicies of the maxpooling we need to manually + # figure out which one contributed. If it is ambigious we break the + # tie by the following priority k[i,j]0):\n\tout = j0\nelse:\n\tout = 0") + self.add_out_memlets(outputList, mapExit, tasklet, outputDims, + outputParams) + self.add_in_memlets(inputNodes, mapEntry, tasklet, inputDims, + inputParams) + + def visit_BiasAddGrad(self, node): + + state = self.state + inputList = [] + inputNodes = [] + outputList = [] + mapParams = [] + mapRange = [] + outputParams = [] + outputDims = [] + inputParams = [] + inputDims = [] + + for count, inp in enumerate(node.inputs): + inputNode, params, dims = self.create_and_add_input_node(inp) + inputList.append(inputNode.desc(self.graph)) + inputNodes.append(inputNode) + inputParams.append(params) + inputDims.append(dims) + + outputList = self.create_and_add_output_node(node) + for out in node.outputs: + outputParams.append([inputParams[0][-1]]) + outputDims.append([inputDims[0][-1]]) + + mapLabel = _string_builder(node.type) + mapParams = inputParams[0] + mapRange = inputDims[0] + + mapEntry, mapExit = state.add_map(mapLabel, + dict(zip(mapParams, mapRange))) + tasklet = state.add_tasklet(mapLabel, {'j0'}, {'out'}, "out = j0") + self.reinitCR(outputList[0], outputParams, outputDims, "0") + self.add_out_memlets(outputList, mapExit, tasklet, outputDims, + outputParams, 'lambda a,b: a+b', 0) + self.add_in_memlets(inputNodes, mapEntry, tasklet, inputDims, + inputParams) + + def visit_Conv2DBackpropInput(self, node): + + inputList = [] + inputNodes = [] + mapParams = [] + outputParams = [] + mapRange = [] + outputDims = [[]] + inputParams = [] + inputDims = [[], []] + strides = node.get_attr("strides")[1] + state = self.state + + for count, inp in enumerate(node.inputs): + if (not count == 0): + inputNode = self.create_and_add_input_node(inp)[0] + inputList.append(inputNode.desc(self.graph)) + inputNodes.append(inputNode) + + outputList = self.create_and_add_output_node(node) + + for i in range(0, 7): + mapParams.append("i" + str(i)) + ndims = len(outputList[0].desc(self.graph).shape) + for i in range(0, ndims): + inputDims[1].append(str(0) + ":" + str(inputList[1].shape[i])) + inputDims[0].append(str(0) + ":" + str(inputList[0].shape[i])) + outputDims[0].append( + str(0) + ":" + str(outputList[0].desc(self.graph).shape[i])) + + ksize = inputList[0].shape[0] + paddedInput, paddedDims = self.inputPadding( + node, inputNodes[1], inputList[1], outputList[0].desc( + self.graph).shape[1], ksize, strides, inputDims[1]) + inputDims[1] = paddedDims + inputList[1] = paddedInput + inputParams.append( + ["-1-i5+" + str(ksize), "-1-i6+" + str(ksize), "i3", "i4"]) + inputParams.append([ + "i0", "i1*" + str(strides) + "+i5", "i2*" + str(strides) + "+i6", + "i4" + ]) + + outputParams.append(["i0", "i1", "i2", "i3"]) + + mapLabel = _string_builder(node.type) + mapParams = ["i0", "i1", "i2", "i3"] + mapParams2 = ["i5", "i6", "i4"] + mapRange = outputDims[0] + mapRange2 = inputDims[0][:-2] + mapRange2.append(inputDims[1][-1]) + mapEntry, mapExit = state.add_map(mapLabel + "_outer", + dict(zip(mapParams, mapRange))) + mapEntry2, mapExit2 = state.add_map(mapLabel + 
"_inner", + dict(zip(mapParams2, mapRange2))) + + tasklet = state.add_tasklet(mapLabel, {'j0', 'j1'}, {'out'}, + "out = j0 * j1") + self.reinitCR(outputList[0], outputParams, outputDims, "0") + + self.add_out_memlets(outputList, mapExit, mapExit2, outputDims, + outputParams, 'lambda a,b: a+b', 0) + self.add_in_memlets(inputNodes, mapEntry, mapEntry2, inputDims, + inputParams) + for i, inp in enumerate(inputNodes): + name = "j" + str(i) + memlet = Memlet.simple(inp, ",".join(inputParams[i])) + state.add_edge(mapEntry2, None, tasklet, name, memlet) + for i, out in enumerate(outputList): + name = "out" + memlet = Memlet.simple( + out, + ",".join(outputParams[i]), + wcr_str='lambda a,b: a+b', + wcr_identity=0) + state.add_edge(tasklet, name, mapExit2, None, memlet) + + def visit_Conv2DBackpropFilter(self, node): + + state = self.state + inputList = [] + inputNodes = [] + outputList = [] + outputParams = [] + outputDims = [] + inputParams = [] + inputDims = [] + + for count, inp in enumerate(node.inputs): + if (count != 1): + inputNode, _, dims = self.create_and_add_input_node(inp) + inputList.append(inputNode.desc(self.graph)) + inputNodes.append(inputNode) + inputDims.append(dims) + inputParams.append(["i0", "i1+i5", "i2+i6", "i3"]) + inputParams.append(["i0", "i1", "i2", "i4"]) + + outputList = self.create_and_add_output_node(node) + for count, out in enumerate(node.outputs): + params = ["i5", "i6", "i3", "i4"] + dims = self.get_default_dims(out) + outputParams.append(params) + outputDims.append(dims) + + mapParams = outputParams[0] + mapParams2 = inputParams[1][:-1] + mapRange = outputDims[0] + mapRange2 = inputDims[1][:-1] + mapLabel = _string_builder(node.type) + mapEntry, mapExit = state.add_map(mapLabel + "_outer", + dict(zip(mapParams, mapRange))) + mapEntry2, mapExit2 = state.add_map(mapLabel + "_inner", + dict(zip(mapParams2, mapRange2))) + + tasklet = state.add_tasklet(mapLabel, {'j0', 'j1'}, {'out'}, + "out = j0*j1") + + self.reinitCR(outputList[0], outputParams, outputDims, "0") + + self.add_out_memlets(outputList, mapExit, mapExit2, outputDims, + outputParams, 'lambda a,b: a+b', 0) + self.add_in_memlets(inputNodes, mapEntry, mapEntry2, inputDims, + inputParams) + + for i, inp in enumerate(inputNodes): + name = "j" + str(i) + memlet = Memlet.simple(inp, ",".join(inputParams[i])) + state.add_edge(mapEntry2, None, tasklet, name, memlet) + + for i, out in enumerate(outputList): + name = "out" + memlet = Memlet.simple( + out, + ",".join(outputParams[i]), + wcr_str='lambda a,b: a+b', + wcr_identity=0) + state.add_edge(tasklet, name, mapExit2, None, memlet) + + def visit_SparseSoftmaxCrossEntropyWithLogits(self, node): + + state = self.state + inputList = [] + inputNodes = [] + outputList = [] + inputParams = [] + inputDims = [] + + for inp in node.inputs: + inputNode, params, dims = self.create_and_add_input_node(inp) + inputList.append(inputNode.desc(self.graph)) + inputNodes.append(inputNode) + inputDims.append(dims) + inputParams.append(params) + + for out in node.outputs: + label = _string_builder(out.name) + try: + outputNode = state.find_node(label) + except (LookupError): + dtype = dace.typeclass(_tensortype(node)) + shape = dace.properties.ShapeProperty.from_string( + str(_tensorshape(out))) + outputNode = state.add_transient( + label, shape, dtype, toplevel=True) + outputList.append(outputNode) + + mapLabel = _string_builder(node.type) + mapParams = inputParams[0] + mapRange = inputDims[0] + + #1st map, get maximum in each batchsize dimension + dtype = 
dace.typeclass(_tensortype(node)) + shape = dace.properties.ShapeProperty.from_string( + str(inputList[1].shape)) + + temp1Node = state.add_transient( + mapLabel + "_max_tmp", shape, dtype, toplevel=True) + mapEntry, mapExit = state.add_map(mapLabel + "_max", + dict(zip(mapParams, mapRange))) + tasklet = state.add_tasklet(mapLabel + "_max", {'j0'}, {'out'}, + "out = j0") + self.reinitCR(temp1Node, [inputParams[1]], [inputDims[1]], + "-999999999999") + self.add_in_memlets([inputNodes[0]], mapEntry, tasklet, [inputDims[0]], + [inputParams[0]]) + self.add_out_memlets([temp1Node], mapExit, tasklet, [inputDims[1]], + [inputParams[1]], 'lambda a,b: max(a,b)', + -9999999999) + + # 2nd map, calculate the denominator sum + temp2Node = state.add_transient( + mapLabel + "_denominator_tmp", shape, dtype, toplevel=True) + mapEntry, mapExit = state.add_map(mapLabel + "_denominator", + dict(zip(mapParams, mapRange))) + tasklet = state.add_tasklet( + mapLabel + "_denominator", {'j0', 'j1'}, {'out'}, + "out = dace::math::exp(j0-j1);", + language=dace.types.Language.CPP) + self.reinitCR(temp2Node, [inputParams[1]], [inputDims[1]], "0") + inList = [inputNodes[0], temp1Node] + self.add_in_memlets(inList, mapEntry, tasklet, inputDims, inputParams) + self.add_out_memlets([temp2Node], mapExit, tasklet, [inputDims[1]], + [inputParams[1]], 'lambda a,b: a+b', 0) + + # 3rd map, calculate the sofmax + shape = dace.properties.ShapeProperty.from_string( + str(inputList[0].shape)) + temp3Node = state.add_transient( + mapLabel + "_softmax_tmp", shape, dtype, toplevel=True) + mapEntry, mapExit = state.add_map(mapLabel + "_softmax", + dict(zip(mapParams, mapRange))) + tasklet = state.add_tasklet( + mapLabel + "_softmax", {'j0', 'j1', 'j2'}, {'out'}, + "out = (dace::math::exp(j0-j1))/j2;", + language=dace.types.Language.CPP) + inList = [inputNodes[0], temp1Node, temp2Node] + paramsList = inputParams + [inputParams[1]] + dimsList = inputDims + [inputDims[1]] + self.add_in_memlets(inList, mapEntry, tasklet, dimsList, paramsList) + self.add_out_memlets([temp3Node], mapExit, tasklet, [inputDims[0]], + [inputParams[0]]) + + # 4th map, calculate the cross-entropy loss for an optional loss output + mapEntry, mapExit = state.add_map(mapLabel + "_loss", + dict(zip(mapParams, mapRange))) + tasklet = state.add_tasklet( + mapLabel + "_loss", {'j0', 'j1'}, {'out'}, + "if (int(j1) == i1) {\n\tout=-(dace::math::log(j0));}\nelse{\n\tout=0;}", + language=dace.types.Language.CPP) + self.reinitCR(outputList[0], [inputParams[1]], [inputDims[1]], "0") + self.add_in_memlets([temp3Node, inputNodes[1]], mapEntry, tasklet, + inputDims, inputParams) + self.add_out_memlets([outputList[0]], mapExit, tasklet, [inputDims[1]], + [inputParams[1]], 'lambda a,b: a+b', 0) + + # 5th map, gradient of the whole layer + mapEntry, mapExit = state.add_map(mapLabel + "_gradient", + dict(zip(mapParams, mapRange))) + tasklet = state.add_tasklet( + mapLabel + "_gradient", {'j0', 'j1'}, {'out'}, + "if(int(j1)==i1):\n\tout = j0-1\nelse:\n\tout = j0") + self.add_out_memlets([outputList[1]], mapExit, tasklet, [inputDims[0]], + [inputParams[0]]) + self.add_in_memlets([temp3Node, inputNodes[1]], mapEntry, tasklet, + inputDims, inputParams) + + def visit_Identity(self, node): + + state = self.state + inputList = [] + inputNodes = [] + outputList = [] + inputParams = [] + inputDims = [] + + # Create input node and its params + for count, inp in enumerate(node.inputs): + if (count == 0): + inputNode, params, dims = self.create_and_add_input_node(inp) + 
inputList.append(inputNode.desc(self.graph)) + inputNodes.append(inputNode) + inputParams.append(params) + inputDims.append(dims) + + outputList = self.create_and_add_output_node(node) + memlet = Memlet.simple(inputNodes[0], ",".join(inputDims[0])) + state.add_edge(inputNodes[0], None, outputList[0], None, memlet) + + def visit_LRNGrad(self, node): + + inputList = [] + inputNodes = [] + outputList = [] + state = self.state + + alpha = str(node.get_attr("alpha")) + beta = str(node.get_attr("beta")) + bias = str(node.get_attr("bias")) + depth_radius = str(node.get_attr("depth_radius")) + + for count, inp in enumerate(node.inputs): + inputNode = self.create_and_add_input_node(inp)[0] + inputList.append(inputNode.desc(self.graph)) + inputNodes.append(inputNode) + if (count == 0): + shortDims = [] + shortAccesses = [] + for dim in inp.shape: + shortDims.append("0:" + str(dim)) + shortAccesses.append(str(dim)) + longDims = [] + longDims = shortDims + ["0:" + depth_radius + "*2+1"] + paddedDims = [] + paddedDims += shortDims + paddedDims[-1] += "+" + depth_radius + "*2" + + label = _string_builder(node.name) + outputList = self.create_and_add_output_node(node) + longParams = ["i0", "i1", "i2", "i3", "i4"] + shortParams = ["i0", "i1", "i2", "i3"] + copyParams = ["i0", "i1", "i2", "i3+" + depth_radius] + normParams = ["i0", "i1", "i2", "i3+i4"] + + paddedShape = [] + paddedShape += shortAccesses + paddedShape[-1] += "+" + depth_radius + paddedInput = state.add_transient( + label + "_paddedInput", + paddedShape, + dace.typeclass(_tensortype(node)), + toplevel=True) + mapEntry, mapExit = state.add_map(label + "_padding", + dict(zip(shortParams, shortDims))) + tasklet = state.add_tasklet(label + "_padding", {'j0'}, {'out'}, + "out=j0") + self.add_in_memlets([inputNodes[2]], mapEntry, tasklet, [shortDims], + [shortParams]) + self.add_out_memlets([paddedInput], mapExit, tasklet, [paddedDims], + [copyParams]) + + sqrsum = state.add_transient( + label + "_Sqrsum", shortAccesses, _tensortype(node), toplevel=True) + mapEntry, mapExit = state.add_map(label + "_sqrsum", + dict(zip(longParams, longDims))) + tasklet = state.add_tasklet(label + "_sqrsum", {'j0'}, {'out'}, + "out=j0*j0") + self.reinitCR(sqrsum, [shortParams], [shortDims], "0") + self.add_in_memlets([paddedInput], mapEntry, tasklet, [paddedDims], + [normParams]) + self.add_out_memlets([sqrsum], mapExit, tasklet, [shortDims], + [shortParams], 'lambda a,b: a+b', 0) + + label = _string_builder(node.name) + norm = state.add_transient( + label + "_Norm", shortAccesses, _tensortype(node), toplevel=True) + mapEntry, mapExit = state.add_map(label + "_norm", + dict(zip(shortParams, shortDims))) + tasklet = state.add_tasklet(label + "_norm", {'j0'}, {'out'}, + "out=" + alpha + "*j0+" + bias) + self.add_in_memlets([sqrsum], mapEntry, tasklet, [shortDims], + [shortParams]) + self.add_out_memlets([norm], mapExit, tasklet, [shortDims], + [shortParams]) + + preOut = state.add_transient( + label + "_preOut", shortAccesses, _tensortype(node), toplevel=True) + mapEntry, mapExit = state.add_map(label, dict( + zip(longParams, longDims))) + taskletCode = "if (i4==" + depth_radius + "){\n out = pow(j2," + beta + ")-2*" + alpha + "*" + beta + "*j1*j0/j2;}\n else{\n out = -2*" + alpha + "*" + beta + "*j1*j0/j2;}" + tasklet = state.add_tasklet( + label, {'j0', 'j1', 'j2'}, {'out'}, + taskletCode, + language=dace.types.Language.CPP) + self.reinitCR(preOut, [shortParams], [shortDims], "0") + inList = [inputNodes[1]] + inList.append(paddedInput) + inList.append(norm) + 
self.add_in_memlets(inList, mapEntry, tasklet, + [shortDims, paddedDims, shortDims], + [shortParams, normParams, shortParams]) + self.add_out_memlets([preOut], mapExit, tasklet, [shortDims], + [shortParams], 'lambda a,b: a+b', 0) + + mapEntry, mapExit = state.add_map(label + "_out", + dict(zip(shortParams, shortDims))) + tasklet = state.add_tasklet(label + "_out", {'j0', 'j1'}, {'out'}, + "out=j0*j1") + self.add_in_memlets([inputNodes[0], preOut], mapEntry, tasklet, + [shortDims, shortDims], [shortParams, shortParams]) + self.add_out_memlets(outputList, mapExit, tasklet, [shortDims], + [shortParams]) + + def visit_LRN(self, node): + + inputList = [] + inputNodes = [] + outputList = [] + state = self.state + alpha = str(node.get_attr("alpha")) + beta = str(node.get_attr("beta")) + bias = str(node.get_attr("bias")) + depth_radius = str(node.get_attr("depth_radius")) + + for count, inp in enumerate(node.inputs): + inputNode = self.create_and_add_input_node(inp)[0] + inputList.append(inputNode.desc(self.graph)) + inputNodes.append(inputNode) + if (count == 0): + shortDims = [] + shortAccesses = [] + for dim in inp.shape: + shortDims.append("0:" + str(dim)) + shortAccesses.append(str(dim)) + longDims = [] + longDims = shortDims + ["0:" + depth_radius + "*2+1"] + paddedDims = [] + paddedDims += shortDims + paddedDims[-1] += "+" + depth_radius + "*2" + + label = _string_builder(node.name) + outputList = self.create_and_add_output_node(node) + longParams = ["i0", "i1", "i2", "i3", "i4"] + shortParams = ["i0", "i1", "i2", "i3"] + copyParams = ["i0", "i1", "i2", "i3+" + depth_radius] + normParams = ["i0", "i1", "i2", "i3+i4"] + + paddedShape = [] + paddedShape += shortAccesses + paddedShape[-1] += "+" + depth_radius + paddedInput = state.add_transient( + label + "_paddedInput", + paddedShape, + dace.typeclass(_tensortype(node)), + toplevel=True) + mapEntry, mapExit = state.add_map(label + "_padding", + dict(zip(shortParams, shortDims))) + tasklet = state.add_tasklet(label + "_padding", {'j0'}, {'out'}, + "out=j0") + self.add_in_memlets([inputNodes[0]], mapEntry, tasklet, [shortDims], + [shortParams]) + self.add_out_memlets([paddedInput], mapExit, tasklet, [paddedDims], + [copyParams]) + + sqrsum = state.add_transient( + label + "_Sqrsum", shortAccesses, _tensortype(node), toplevel=True) + mapEntry, mapExit = state.add_map(label + "_sqrsum", + dict(zip(longParams, longDims))) + tasklet = state.add_tasklet(label + "_sqrsum", {'j0'}, {'out'}, + "out=j0*j0") + self.reinitCR(sqrsum, [shortParams], [shortDims], "0") + self.add_in_memlets([paddedInput], mapEntry, tasklet, [paddedDims], + [normParams]) + self.add_out_memlets([sqrsum], mapExit, tasklet, [shortDims], + [shortParams], 'lambda a,b: a+b', 0) + + mapEntry, mapExit = state.add_map(label, + dict(zip(shortParams, shortDims))) + tasklet = state.add_tasklet( + _string_builder(node.name), {'j0', 'j1'}, {'out'}, + "out = j0/(pow(" + bias + "+" + alpha + "*j1," + beta + "));", + language=dace.types.Language.CPP) + self.add_in_memlets((inputNodes + [sqrsum]), mapEntry, tasklet, + [shortDims, shortDims], [shortParams, shortParams]) + self.add_out_memlets(outputList, mapExit, tasklet, [shortDims], + [shortParams]) + + def visit_ArgMax(self, node): + + state = self.state + inputList = [] + inputNodes = [] + + for count, inp in enumerate(node.inputs): + if (count == 0): + inputNode = self.create_and_add_input_node(inp)[0] + inputList.append(inputNode.desc(self.graph)) + inputNodes.append(inputNode) + + inputAccesses = [[], []] + inputDims = [[], []] + 
inputParams = [[], []] + for i, dim in enumerate(inp.shape): + if (i == 0): + inputAccesses[1].append(str(dim)) + inputParams[1].append("i" + str(i)) + inputDims[1].append("0:" + str(dim)) + inputAccesses[0].append(str(dim)) + inputParams[0].append("i" + str(i)) + inputDims[0].append("0:" + str(dim)) + + outputList = self.create_and_add_output_node(node) + + mapLabel = _string_builder(node.name) + mapEntry, mapExit = state.add_map( + mapLabel + "_max", dict(zip(inputParams[0], inputDims[0]))) + dtype = dace.typeclass(_tensortype(node)) + shape = dace.properties.ShapeProperty.from_string(",".join( + inputAccesses[1])) + temp1Node = state.add_transient( + mapLabel + "_max_tmp", shape, dtype, toplevel=True) + + tasklet = state.add_tasklet(mapLabel + "_max", {'j0'}, {'out'}, + "out = j0") + self.reinitCR(temp1Node, [inputParams[1]], [inputDims[1]], + "-999999999999") + self.add_in_memlets([inputNodes[0]], mapEntry, tasklet, [inputDims[0]], + [inputParams[0]]) + self.add_out_memlets([temp1Node], mapExit, tasklet, [inputDims[1]], + [inputParams[1]], 'lambda a,b: max(a,b)', + -999999999999) + + mapEntry, mapExit = state.add_map( + mapLabel + "_arg", dict(zip(inputParams[0], inputDims[0]))) + outputNode = outputList[0] + tasklet = state.add_tasklet(mapLabel + "_map2", {'j0', 'j1'}, {'out'}, + "if (j0==j1):\n\tout=i1") + self.add_in_memlets([inputNodes[0], temp1Node], mapEntry, tasklet, + inputDims, inputParams) + self.add_out_memlets([outputNode], mapExit, tasklet, [inputDims[1]], + [inputParams[1]]) + + def visit_Cast(self, node): + + state = self.state + inputList = [] + inputNodes = [] + outputList = [] + mapParams = [] + mapRange = [] + outputParams = [] + outputDims = [] + inputParams = [] + inputDims = [] + castType = None + + dtype = node.get_attr("DstT") + if dtype.as_numpy_dtype == object: + raise NotImplementedError( + 'Type %s is not a valid numpy type' % str(dtype)) + castType = dace.typeclass(dtype.as_numpy_dtype).ctype + + for count, inp in enumerate(node.inputs): + if (count == 0): + inputNode, params, dims = self.create_and_add_input_node(inp) + inputList.append(inputNode.desc(self.graph)) + inputNodes.append(inputNode) + inputParams.append(params) + inputDims.append(dims) + + outputList = self.create_and_add_output_node(node) + for out in node.outputs: + params = self.get_default_params(out) + dims = self.get_default_dims(out) + outputParams.append(params) + outputDims.append(dims) + + mapLabel = _string_builder(node.type) + mapParams = inputParams[0] + mapRange = inputDims[0] + mapEntry, mapExit = state.add_map(mapLabel, + dict(zip(mapParams, mapRange))) + tasklet = state.add_tasklet(mapLabel, {'j0'}, {'out'}, + "out = " + castType + "(j0)") + self.add_in_memlets(inputNodes, mapEntry, tasklet, inputDims, + inputParams) + self.add_out_memlets(outputList, mapExit, tasklet, outputDims, + outputParams) + + def visit_Print(self, node): + inputList = [] + inputNodes = [] + outputList = [] + state = self.state + mapParams = [] + mapRange = [] + outputParams = [] + outputDims = [] + inputParams = [] + inputDims = [] + + for count, inp in enumerate(node.inputs): + if (count == 0): + inputNode, params, dims = self.create_and_add_input_node(inp) + inputList.append(inputNode.desc(self.graph)) + inputNodes.append(inputNode) + inputParams.append(params) + inputDims.append(dims) + + outputList = self.create_and_add_output_node(node) + for out in node.outputs: + params = self.get_default_params(out) + dims = self.get_default_dims(out) + outputParams.append(params) + outputDims.append(dims) + + 
mapLabel = _string_builder(node.type)
+        mapParams = inputParams[0]
+        mapRange = inputDims[0]
+        mapEntry, mapExit = state.add_map(mapLabel,
+                                          dict(zip(mapParams, mapRange)))
+
+        ifClause = "if ("
+        for param in mapParams:
+            ifClause += param + "==1 and "
+
+        ifClause = ifClause[:-4] + "):"
+        taskletCode = "out = j0\n" + ifClause + "\n\tprintf(\"" + inputList[0].label + "\")\n"
+        taskletCode = "out = j0\nif(True):\n\tprintf(\"%f\\n\",out)"
+        tasklet = state.add_tasklet(mapLabel, {'j0'}, {'out'}, taskletCode)
+        self.add_out_memlets(outputList, mapExit, tasklet, outputDims,
+                             outputParams)
+        self.add_in_memlets(inputNodes, mapEntry, tasklet, inputDims,
+                            inputParams)
+
+    def visit_Softmax(self, node):
+
+        inputList = []
+        inputNodes = []
+        state = self.state
+
+        for inp in node.inputs:
+            label = _string_builder(inp.name)
+            try:
+                inputNode = state.find_node(label)
+            except (LookupError):
+                inputNode = self.create_and_add_input_node(inp)[0]
+            inputList.append(inputNode.desc(self.graph))
+            inputNodes.append(inputNode)
+
+        outputList = self.create_and_add_output_node(node)
+
+        inputDims = [[], []]
+        inputParams = [[], []]
+
+        for i, dim in enumerate(inp.shape):
+            if (i == 0):
+                inputParams[1].append("i" + str(i))
+                inputDims[1].append("0:" + str(dim))
+            inputParams[0].append("i" + str(i))
+            inputDims[0].append("0:" + str(dim))
+
+        mapLabel = _string_builder(node.name)
+        mapEntry, mapExit = state.add_map(
+            mapLabel + "_map1", dict(zip(inputParams[0], inputDims[0])))
+        mapParams = inputParams[0]
+        mapRange = inputDims[0]
+
+        # 1st map, get maximum in each batchsize dimension
+        dtype = dace.typeclass(_tensortype(node))
+        shape = dace.properties.ShapeProperty.from_string(
+            str(node.inputs[0].shape.dims[0]))
+        temp1Node = state.add_transient(
+            mapLabel + "_max_tmp", shape, dtype, toplevel=True)
+        mapEntry, mapExit = state.add_map(mapLabel + "_max",
+                                          dict(zip(mapParams, mapRange)))
+        tasklet = state.add_tasklet(mapLabel + "_max", {'j0'}, {'out'},
+                                    "out = j0")
+        self.reinitCR(temp1Node, [inputParams[1]], [inputDims[1]],
+                      "-999999999999")
+        self.add_in_memlets([inputNodes[0]], mapEntry, tasklet, [inputDims[0]],
+                            [inputParams[0]])
+        self.add_out_memlets([temp1Node], mapExit, tasklet, [inputDims[1]],
+                             [inputParams[1]], 'lambda a,b: max(a,b)',
+                             -999999999999)
+
+        # 2nd map, calculate the denominator sum
+        temp2Node = state.add_transient(
+            mapLabel + "_denominator_tmp", shape, dtype, toplevel=True)
+        mapEntry, mapExit = state.add_map(mapLabel + "_denominator",
+                                          dict(zip(mapParams, mapRange)))
+        tasklet = state.add_tasklet(
+            mapLabel + "_denominator", {'j0', 'j1'}, {'out'},
+            "out = dace::math::exp(j0-j1);",
+            language=dace.types.Language.CPP)
+        self.reinitCR(temp2Node, [inputParams[1]], [inputDims[1]], "0")
+        inList = [inputNodes[0], temp1Node]
+        self.add_in_memlets(inList, mapEntry, tasklet, inputDims, inputParams)
+        self.add_out_memlets([temp2Node], mapExit, tasklet, [inputDims[1]],
+                             [inputParams[1]], 'lambda a,b: a+b', 0)
+
+        # 3rd map, calculate the softmax
+        mapEntry, mapExit = state.add_map(mapLabel + "_softmax",
+                                          dict(zip(mapParams, mapRange)))
+        tasklet = state.add_tasklet(
+            mapLabel + "_softmax", {'j0', 'j1', 'j2'}, {'out'},
+            "out = (dace::math::exp(j0-j1))/j2;",
+            language=dace.types.Language.CPP)
+        inList = [inputNodes[0], temp1Node, temp2Node]
+        paramsList = inputParams + [inputParams[1]]
+        dimsList = inputDims + [inputDims[1]]
+        self.add_in_memlets(inList, mapEntry, tasklet, dimsList, paramsList)
+        self.add_out_memlets(outputList, mapExit, tasklet, [inputDims[0]], +
[inputParams[0]]) + + def add_in_memlets(self, inputList, otherNode, tasklet, inputDims, + inputParams): + """ Convenience function that adds two memlets for each input of the + node: external and internal to a given map. + @param inputList: list of inputNodes (DaCe access node) + @param otherNode: DaCe node (mostly map_entry) + @param tasklet: Normally a tasklet node, but it can also be another + mapEntry, for example map in map. + @param inputDims: List of list of strings dimension of the + respective input. Example: + [["0:5","0:7"],["0:2","0:4"]] + @param inputParams: List of list of strings params of respective + input. Example: [["i0","i1"],["i2","i3"]] + """ + state = self.state + connected_nodes = set() + for i, inp in enumerate(inputList): + assert isinstance(inputDims[i], list) + if inp.data not in connected_nodes: + outerMemlet = Memlet.simple(inp, ",".join(inputDims[i])) + state.add_edge(inp, None, otherNode, None, outerMemlet) + connected_nodes.add(inp.data) + name = "j" + str(i) + innerMemlet = Memlet.simple(inp, ",".join(inputParams[i])) + + if isinstance(tasklet, (Tasklet, NestedSDFG)): + state.add_edge(otherNode, None, tasklet, name, innerMemlet) + else: + state.add_edge(otherNode, None, tasklet, None, innerMemlet) + + def add_out_memlets(self, + outputList, + otherNode, + tasklet, + outputDims, + outputParams, + wcr=None, + wcr_identity=None): + """ Convenience function that adds two memlets for each output of the + node: external and internal to a given map. + @param outputList: list of outputNodes (DaCe access node) + @param otherNode: DaCe node (mostly map_entry) + @param tasklet: Normally a tasklet node, but it can also be another + mapEntry, for example map in map. + @param outputDims: List of list of strings dimension of the + respective output. Example: + [["0:5","0:7"],["0:2","0:4"]] + @param outputParams: List of list of strings params of respective + output. Example: [["i0","i1"],["i2","i3"]] + @param wcr: (optional) Write-conflict resolution function (as + string). + @param wcr_identity: (optional) Identity element for write-conflict + resolution. + """ + + connected_nodes = set() + + state = self.state + for i, out in enumerate(outputList): + assert isinstance(outputDims[i], list) + if (len(outputList) > 1): + name = "out" + str(i) + else: + name = "out" + + if out.data not in connected_nodes: + outerMemlet = Memlet.simple( + out, + ",".join(outputDims[i]), + wcr_str=wcr, + wcr_identity=wcr_identity) + state.add_edge(otherNode, None, out, None, outerMemlet) + connected_nodes.add(out.data) + innerMemlet = Memlet.simple( + out, + ",".join(outputParams[i]), + wcr_str=wcr, + wcr_identity=wcr_identity) + + if isinstance(tasklet, (Tasklet, NestedSDFG)): + state.add_edge(tasklet, name, otherNode, None, innerMemlet) + else: + state.add_edge(tasklet, None, otherNode, None, innerMemlet) + + def create_and_add_input_node(self, inp): + """ Creates a DaCe access node for each input of `inp`, adds it to the + state, and returns it. + If the node already exists, returns the pre-existing node. + @param inp: tf.Operation + @return: A 3-tuple of (input DaCe access node, + list of parameter strings, + list of dimension strings). 
+ """ + + state = self.state + # Get DaCe name of the operation + label = _string_builder(inp.name) + # Try to find node in DaCe graph + try: + # If successful, use the existing node + inputNode = state.find_node(label) + except (LookupError): + # Get type and shape of the input tensor + dtype = dace.typeclass(_tensortype(inp)) + shape = dace.properties.ShapeProperty.from_string( + str(_tensorshape(inp))) + # Create and add array, default is transient, toplevel =True + inputNode = state.add_transient( + name=label, shape=shape, dtype=dtype, toplevel=True) + + params = self.get_default_params(inp) + dims = self.get_default_dims(inp) + + return inputNode, params, dims + + def create_and_add_output_node(self, node): + """ Creates a DaCe access node for each output of `node`, adds it to + the state, and returns it. + If the node already exists, returns the pre-existing node. + @param node: tf.Operation + @return: List of DaCe access node. + """ + outputList = [] + state = self.state + # Iterate over all output nodes + for count, out in enumerate(node.outputs): + label = _string_builder(out.name) + # Try to find node in DaCe graph + try: + # If successful, use the existing node + outputNode = state.find_node(label) + except (LookupError): + # Get type and shape of the tensor + dtype = dace.typeclass(_tensortype(out)) + shape = dace.properties.ShapeProperty.from_string( + str(_tensorshape(out))) + outputNode = state.add_transient( + label, shape, dtype, toplevel=True) + outputList.append(outputNode) + return outputList + + def reinitCR(self, inp, params, dims, identity): + """ Adds a reinitialization map to a `reinit` state, setting inputs + to their initial values. Only used in training mode. + @param inp: DaCe access node. + @param params: List of string parameters to `inp`. + @param dims: List of strings dimensions of `inp`. + @param identity: Identity value of the CR node (as a string) + """ + + if self.training: + # Swap current state and reinitState + self.state, self.reinitState = self.reinitState, self.state + node = inp + state = self.state + dtype = node.desc(self.graph).dtype + label = node.label + + # Mark node as non-transient as we need to set it from the outside + # the SDFG. + node.desc(self.graph).transient = False + + shape = dace.properties.ShapeProperty.from_string( + str(inp.desc(self.graph).shape)) + # Add input, output and map to reinitState + inputNode = state.add_array(label, shape, dtype) + outputNode = state.add_array(label, shape, dtype) + mapEntry, mapExit = state.add_map(label, + dict(zip(params[0], dims[0]))) + + # Output is set to identity + tasklet = state.add_tasklet(label, set(), {'out'}, + "out = " + identity) + state.add_edge(mapEntry, None, tasklet, None, EmptyMemlet()) + self.add_out_memlets([outputNode], mapExit, tasklet, dims, params) + # Add numpy array with identity value to the reinit dict. + npArray = np.full(shape, int(identity)).astype( + node.desc(self.graph).dtype.type) + self.reinitDict.update({label: npArray}) + # Swap state back + self.reinitState, self.state = self.state, self.reinitState + else: + pass + + def inputPadding(self, node, inpnode, inp, outputSize, kernelSize, strides, + inputDims): + """ Zero-pads the input to fit the outputSize. + @param node: tf.Operation + @param inpnode: DaCe access node to pad + @param outputSize: Output size. + @param kernelSize: Kernel size. + @param strides: Strides. + @param inputDims: List of strings (e.g.["0:N","0:M"]). 
+ @return: A 2-tuple (output DaCe access node with padded input, + list of dimension strings of the padded data). + """ + state = self.state + paddingUp = 0 + paddingDown = 0 + label = inpnode.label + inputSize = inp.shape[1] + # Calculate padding according to paper + padding = strides * (outputSize - 1) + kernelSize - inputSize + # If padding is even (padding is on each side the same) + if (padding % 2 == 0): + paddingUp = padding // 2 + paddingDown = padding // 2 + # If padding is uneven, we pad more on the bottom and on the right side + # of an image (matching TensorFlow behavior) + else: + paddingUp = padding // 2 + paddingDown = paddingUp + 1 + + # Set up the different padding dimensions, accesses and params. + outputDims = inputDims.copy() + outputDims[1] = str(paddingUp) + ":" + str( + inp.shape[1]) + "+" + str(paddingUp) + outputDims[2] = str(paddingUp) + ":" + str( + inp.shape[2]) + "+" + str(paddingUp) + outputAccesses = list(map(str, list(inp.shape))) + outputAccesses[1] += "+" + str(paddingUp) + "+" + str(paddingDown) + outputAccesses[2] += "+" + str(paddingUp) + "+" + str(paddingDown) + outputDims = [] + inputParams = [] + for i, dim in enumerate(outputAccesses): + inputParams.append("i" + str(i)) + outputDims.append("0:" + dim) + + outputParams = inputParams.copy() + outputParams[1] += "+" + str(paddingUp) + outputParams[2] += "+" + str(paddingUp) + + # Add the padded input to the graph, set it to zero, and add the map. + shape = dace.properties.ShapeProperty.from_string( + ",".join(outputAccesses)) + output = state.add_transient( + label + "_padded", shape=shape, dtype=inp.dtype, toplevel=True) + output.desc(self.graph).setzero = True + + mapParams = inputParams + mapRange = inputDims + mapLabel = _string_builder(node.type) + mapEntry, mapExit = state.add_map(mapLabel, + dict(zip(mapParams, mapRange))) + tasklet = state.add_tasklet(mapLabel, {'j0'}, {'out'}, "out = j0") + self.add_in_memlets([inpnode], mapEntry, tasklet, [inputDims], + [inputParams]) + self.add_out_memlets([output], mapExit, tasklet, [outputDims], + [outputParams]) + return output, outputDims + + def get_default_params(self, tensor, start=0): + """ Returns the default parameters of a tensor starting at `start`, + e.g., ["i0","i1",...]. + @param tensor: tf.Tensor. + @param start: Starting position for the iteration. + @return: List of parameters as strings ["i0",i"1",...]. + """ + params = [] + shape = _tensorshape(tensor) + if shape == 1: + shape = [1] + for i, dim in enumerate(shape, start): + params.append("i" + str(i)) + return params + + def get_default_dims(self, tensor): + """ Returns the default dimensions of a tensor e.g., ["0:N","0:M"] + @param tensor: tf.Tensor. 
+        @return: List of dimensions as strings ["0:N","0:M"]
+        """
+        dims = []
+        shape = _tensorshape(tensor)
+        if shape == 1:
+            shape = [1]
+        for dim in shape:
+            dims.append("0:" + str(dim))
+        return dims
diff --git a/dace/graph/__init__.py b/dace/graph/__init__.py
new file mode 100644
index 0000000000..e69de29bb2
diff --git a/dace/graph/dot.py b/dace/graph/dot.py
new file mode 100644
index 0000000000..d7839dba05
--- /dev/null
+++ b/dace/graph/dot.py
@@ -0,0 +1,201 @@
+import copy
+import html
+from dace import data, memlet
+from dace.graph import graph as gr, edges
+
+
+def draw_edge_explicit(srcName, dstName, edge, sdfg, graph, **extraOpts):
+    opts = {}
+    if isinstance(edge.data, memlet.Memlet):
+        if getattr(edge.data, '__label__', False):
+            opts["label"] = edge.data.__label__(sdfg, graph)
+        else:
+            opts["label"] = str(edge.data)
+        if edge.data.wcr is not None:
+            opts['style'] = 'dashed'
+    elif isinstance(edge.data, edges.InterstateEdge):
+        opts.update(edge.data.dotOpts)
+    # Unhandled properties
+    elif edge.data != None:
+        raise ValueError("Unhandled edge: " + str(edge.data))
+    if extraOpts:
+        opts.update(extraOpts) # Custom options will overwrite default
+
+    if isinstance(edge, gr.MultiConnectorEdge):
+        sconn = '' if edge.src_conn is None else (':' + edge.src_conn)
+        dconn = '' if edge.dst_conn is None else (':' + edge.dst_conn)
+    else:
+        sconn = ''
+        dconn = ''
+
+    return ("\"{}\"{sconn} -> \"{}\"{dconn}".format(
+        srcName, dstName, sconn=sconn, dconn=dconn) + ((" [" + ", ".join(
+            ["{}=\"{}\"".format(key, value)
+             for key, value in opts.items()]) + "];") if opts else ";"))
+
+
+def draw_edge(sdfg, graph, edge, **extraOpts):
+    srcName = 's%d_%d' % (sdfg.node_id(graph), graph.node_id(edge.src))
+    dstName = 's%d_%d' % (sdfg.node_id(graph), graph.node_id(edge.dst))
+
+    return draw_edge_explicit(srcName, dstName, edge, sdfg, graph)
+
+
+def draw_interstate_edge(sdfg, src_graph, dst_graph, edge, **extraOpts):
+    srcName = 's%d_%d' % (sdfg.node_id(src_graph), src_graph.node_id(edge.src))
+    dstName = 's%d_%d' % (sdfg.node_id(dst_graph), dst_graph.node_id(edge.dst))
+    if isinstance(edge, gr.MultiConnectorEdge):
+        if edge.src_conn is not None:
+            srcName += '@' + edge.src_conn
+        if edge.dst_conn is not None:
+            dstName += '@' + edge.dst_conn
+
+    return draw_edge_explicit(srcName, dstName, edge, sdfg, src_graph,
+                              **extraOpts)
+
+
+def draw_interstate_edge_by_name(srcName, dstName, edge, sdfg, src_graph,
+                                 **extraOpts):
+    return draw_edge_explicit(srcName, dstName, edge, sdfg, src_graph,
+                              **extraOpts)
+
+
+def draw_node(sdfg, graph, obj, **kwargs):
+    name = 's%d_%d' % (sdfg.node_id(graph), graph.node_id(obj))
+    if getattr(obj, '__label__', False):
+        opts = {"label": obj.__label__(sdfg, graph)}
+    else:
+        opts = {"label": str(obj)}
+    opts.update(kwargs)
+    opts["label"] = "\"{}\"".format(opts["label"])
+
+    if 'fillcolor' not in opts:
+        opts['fillcolor'] = '"#ffffff"'
+    if 'style' not in opts:
+        opts['style'] = 'filled'
+    else:
+        opts['style'] = '"filled,%s"' % opts['style']
+
+    ############################################
+    if getattr(obj, 'in_connectors', False) != False and len(
+            obj.in_connectors) + len(obj.out_connectors) > 0:
+        # Header
+        code = '{name} [label=<<table>'
+        code = code.format(name=name)
+        # Input connectors
+        code += '<tr><td>'
+        code += '<table><tr>'
+        connector_code = []
+        for conn in sorted(obj.in_connectors):
+            connector_code.append(
+                '<td port="{conn}">{conn}</td>'.
+                format(conn=conn))
+        code += ''.join(connector_code)
+        code += '</tr></table></td></tr>'
+
+        # Contents
+        html_label = html.escape(opts['label'][1:-1])
+        code += '<tr><td>{label}</td></tr>'.format(
+            label=html_label)
+
+        # Output connectors
+        code += '<tr><td>'
+        code += '<table><tr>'
+        connector_code = []
+        for conn in sorted(obj.out_connectors):
+            connector_code.append(
+                '<td port="{conn}">{conn}</td>'.
+                format(conn=conn))
+        code += ''.join(connector_code)
+        code += '</tr></table></td></tr>'
+
+        # Footer
+        code += '</table>>'
+
+        filtered_opts = {k: v for k, v in opts.items() if k != 'label'}
+        if len(filtered_opts.items()) > 0:
+            ostr = ", ".join([
+                str(key) + "=" + str(val)
+                for key, val in filtered_opts.items()
+            ])
+            code += ', ' + ostr
+        code += '];\n'
+
+        return code
+    ############################################
+
+    return "\"{}\" [{}];".format(
+        name,
+        ", ".join([str(key) + "=" + str(val) for key, val in opts.items()]))
+
+
+def draw_invisible_node(name, **kwargs):
+    opts = dict(label='\"\"', style="invisible")
+    opts.update(kwargs)
+    return "\"{}\" [{}];".format(
+        name,
+        ", ".join([str(key) + "=" + str(val) for key, val in opts.items()]))
+
+
+def draw_graph(sdfg, graph, standalone=True):
+    """ Creates a graphviz dot file from a networkx graph input.
+
+        If standalone is set, return a full dot string including header and footer.
+    """
+    state_id = sdfg.node_id(graph)
+    sdfg = copy.deepcopy(sdfg)
+    graph = sdfg.nodes()[state_id]
+
+    sdict = graph.scope_dict()
+    sdict_children = graph.scope_dict(True)
+
+    # Omit collapsed nodes out of nodes to draw
+    def is_collapsed(node):
+        scope = sdict[node]
+        while scope is not None:
+            if scope.is_collapsed:
+                return True
+            scope = sdict[scope]
+        return False
+
+    nodes_to_draw = set(
+        node for node in graph.nodes() if not is_collapsed(node))
+
+    # Collect edges to draw for collapsed nodes (we also need edges coming out of scope exits)
+    nodes_for_edges = set()
+    nodes_for_edges.update(nodes_to_draw)
+
+    def add_exit_nodes(scope):
+        for node in sdict_children[scope]:
+            if node in sdict_children and node.is_collapsed:
+                nodes_for_edges.add(graph.exit_nodes(node)[0])
+            elif node in sdict_children:
+                add_exit_nodes(node)
+
+    add_exit_nodes(None)
+
+    edges_to_draw = set(
+        e for e in graph.edges()
+        if e.src in nodes_for_edges and e.dst in nodes_for_edges)
+
+    # Take care of scope entry connectors
+    for node in nodes_to_draw:
+        if node in sdict_children and node.is_collapsed:
+            node._out_connectors.clear()
+
+    # Take care of scope exit edges and connectors
+    for e in edges_to_draw:
+        if e.src in nodes_for_edges and e.src not in nodes_to_draw:
+            newsrc = sdict[e.src]
+            if newsrc is None:
+                continue
+            e._src = newsrc
+            newsrc._out_connectors.add(e.src_conn)
+
+    nodes = [x.draw_node(sdfg, graph) for x in nodes_to_draw]
+    edges = [draw_edge(sdfg, graph, e) for e in edges_to_draw]
+
+    if not standalone:
+        return nodes, edges
+
+    return "digraph DaCe {{\n {}\n}}".format("\n ".join(nodes + edges))
diff --git a/dace/graph/edges.py b/dace/graph/edges.py
new file mode 100644
index 0000000000..67faa234c9
--- /dev/null
+++ b/dace/graph/edges.py
@@ -0,0 +1,285 @@
+import ast
+import copy
+import enum
+import re
+
+import dace
+from dace import types
+from dace.graph.graph import Edge
+from dace.frontend.python import astutils
+from dace.properties import Property, CodeProperty, make_properties
+
+
+def assignments_from_string(astr):
+    """ Returns a dictionary of assignments from a semicolon-delimited
+        string of expressions. """
+
+    result = {}
+    for aitem in astr.split(';'):
+        aitem = aitem.strip()
+        m = re.search(r'([^=\s]+)\s*=\s*([^=]+)', aitem)
+        result[m.group(1)] = m.group(2)
+
+    return result
+
+
+def assignments_to_string(assdict):
+    """ Returns a semicolon-delimited string from a dictionary of assignment
+        expressions. """
+    return '; '.join(['%s=%s' % (k, v) for k, v in assdict.items()])
+
+
+@make_properties
+class InterstateEdge(object):
+    """ An SDFG state machine edge.
These edges can contain a condition + (which may include data accesses for data-dependent decisions) and + zero or more assignments of values to inter-state variables (e.g., + loop iterates). + """ + + assignments = Property( + dtype=dict, + desc="Assignments to perform upon transition (e.g., 'x=x+1; y = 0')", + from_string=assignments_from_string, + to_string=assignments_to_string) + condition = CodeProperty(desc="Transition condition") + language = Property(enum=types.Language, default=types.Language.Python) + + def __init__(self, condition=None, assignments=None): + + if condition is None: + condition = ast.parse("1").body[0] + + if assignments is None: + assignments = {} + + self.condition = condition + self.assignments = assignments + + self._dotOpts = {"minlen": 3, "color": "blue", "fontcolor": "blue"} + + def is_unconditional(self): + """ Returns True if the state transition is unconditional. """ + return (self.condition == None or InterstateEdge.condition.to_string( + self.condition).strip() == "1") + + def condition_sympy(self): + cond_ast = self.condition + return symbolic.pystr_to_symbolic(astutils.unparse(cond_ast)) + + def condition_symbols(self): + return dace.symbolic.symbols_in_ast(self.condition[0]) + + def toJSON(self, indent=0): + json = str(self.label) + # get rid of newlines (why are they there in the first place?) + json = re.sub(r"\n", " ", json) + return "\"" + json + "\"" + + @property + def label(self): + assignments = ','.join( + ['%s=%s' % (k, v) for k, v in self.assignments.items()]) + + # Edge with assigment only (no condition) + if astutils.unparse(self.condition) == '1': + # Edge without conditions or assignments + if len(self.assignments) == 0: + return '' + return assignments + + # Edge with condition only (no assignment) + if len(self.assignments) == 0: + return astutils.unparse(self.condition) + + # Edges with assigments and conditions + return assignments + '; ' + astutils.unparse(self.condition) + + @property + def dotOpts(self): + result = {} + result.update(self._dotOpts) + result.update({'label': self.label}) + return result + + +class RedirectEdge(InterstateEdge): + """ An inter-state edge type used for rendering self-looping edges + on graph clusters in GraphViz. """ + + def __init__(self): + super(RedirectEdge, self).__init__() + self._dotOpts["arrowhead"] = "none" + + +############################################################################### +# Various classes to facilitate the detection of control flow elements (e.g., +# `for`, `if`, `while`) from state machines in SDFGs. 
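For orientation, here is a minimal sketch of the kind of state machine these control-flow classes describe: a counted loop is a small cycle of `InterstateEdge`s around a guard state, with the loop condition on the entry and exit edges and the increment carried as an assignment on the back edge. The `SDFG`, `add_state`, and `add_edge` calls and the string-form conditions below are assumptions about the surrounding DaCe API, not definitions made in this patch:

```python
import dace
from dace.graph.edges import InterstateEdge

# Hypothetical guard/body/exit layout for "for i in range(N)".
sdfg = dace.SDFG('loop_example')
init = sdfg.add_state('init')
guard = sdfg.add_state('guard')
body = sdfg.add_state('body')
after = sdfg.add_state('after')

# Loop initialization: set the iteration variable on the way into the guard.
sdfg.add_edge(init, guard, InterstateEdge(assignments={'i': '0'}))
# Enter the loop body while the condition holds.
sdfg.add_edge(guard, body, InterstateEdge(condition='i < N'))
# Back edge: increment and return to the guard.
sdfg.add_edge(body, guard, InterstateEdge(assignments={'i': 'i + 1'}))
# Exit the loop once the condition fails.
sdfg.add_edge(guard, after, InterstateEdge(condition='i >= N'))
```

The wrapper classes that follow (scopes, loop entry/back/exit edges, if/then/else scopes) label exactly these pieces when such patterns are detected in an SDFG's state graph.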
+ + +@make_properties +class ControlFlowScope: + + nodes_in_scope = Property( + dtype=set, + desc="Nodes contained in this scope, " + "including entry and exit nodes, in topological order.") + + def __init__(self, nodes_in_scope): + self.nodes_in_scope = nodes_in_scope + + def __contains__(self, node): + return node in self.nodes_in_scope + + def __iter__(self): + return iter(self.nodes_in_scope) + + +# make_properties will be called after adding cyclic class reference members +class LoopScope(ControlFlowScope): + def __init__(self, *args, **kwargs): + super().__init__(*args, **kwargs) + self.assignment = None + self.entry = None + self.back = None + self.exit = None + + +class ControlFlow: + pass + + +@make_properties +class LoopAssignment(ControlFlow): + + scope = Property(dtype=LoopScope) + edge = Property(dtype=Edge) + + def __init__(self, scope, edge, *args, **kwargs): + self.scope = scope + self.edge = edge + scope.assignment = self + super().__init__(*args, **kwargs) + + +@make_properties +class LoopEntry(ControlFlow): + + scope = Property(dtype=LoopScope) + edge = Property(dtype=Edge) + + def __init__(self, scope, edge, *args, **kwargs): + self.scope = scope + self.edge = edge + scope.entry = self + super().__init__(*args, **kwargs) + + +@make_properties +class LoopExit(ControlFlow): + + scope = Property(dtype=LoopScope) + edge = Property(dtype=Edge) + + def __init__(self, scope, edge, *args, **kwargs): + self.scope = scope + self.edge = edge + scope.exit = self + super().__init__(*args, **kwargs) + + +@make_properties +class LoopBack(ControlFlow): + + scope = Property(dtype=LoopScope) + edge = Property(dtype=Edge) + + def __init__(self, scope, edge, *args, **kwargs): + self.scope = scope + self.edge = edge + scope.back = self + super().__init__(*args, **kwargs) + + +# These will be assigned when the various control flow objects are created +LoopScope.assignment = Property(dtype=LoopAssignment, allow_none=True) +LoopScope.entry = Property(dtype=LoopEntry, allow_none=True) +LoopScope.back = Property(dtype=LoopBack, allow_none=True) +LoopScope.exit = Property(dtype=LoopExit, allow_none=True) +LoopScope = make_properties(LoopScope) + + +# Extra meta-object binding together then and else scopes. 
+# make_properties will be called after adding cyclic class reference members +class IfThenElse: + + entry = Property() + exit = Property() + + def __init__(self, entry, exit): + self.entry = entry + self.exit = exit + self.then_scope = None + self.else_scope = None + + +@make_properties +class IfEntry(ControlFlow): + + scope = Property(dtype=ControlFlowScope) + edge = Property(dtype=Edge) + + def __init__(self, scope, edge, *args, **kwargs): + self.scope = scope + self.edge = edge + scope.entry = self + super().__init__(*args, **kwargs) + + +@make_properties +class IfExit(ControlFlow): + + scope = Property(dtype=ControlFlowScope) + edge = Property(dtype=Edge) + + def __init__(self, scope, edge, *args, **kwargs): + self.scope = scope + self.edge = edge + scope.exit = self + super().__init__(*args, **kwargs) + + +@make_properties +class IfThenScope(ControlFlowScope): + + if_then_else = Property(dtype=IfThenElse) + entry = Property(dtype=IfEntry, allow_none=True) + exit = Property(dtype=IfExit, allow_none=True) + + def __init__(self, if_then_else, *args, **kwargs): + self.if_then_else = if_then_else + if_then_else.then_scope = self + self.entry = None + self.exit = None + super().__init__(*args, **kwargs) + + +@make_properties +class IfElseScope(ControlFlowScope): + + if_then_else = Property(dtype=IfThenElse) + entry = Property(dtype=IfEntry, allow_none=True) + exit = Property(dtype=IfExit, allow_none=True) + + def __init__(self, if_then_else, *args, **kwargs): + self.if_then_else = if_then_else + if_then_else.else_scope = self + self.entry = None + self.exit = None + super().__init__(*args, **kwargs) + + +# Cyclic class reference +IfThenElse.then_scope = Property(dtype=IfThenScope, allow_none=True) +IfThenElse.else_scope = Property(dtype=IfElseScope, allow_none=True) +IfThenElse = make_properties(IfThenElse) diff --git a/dace/graph/graph.py b/dace/graph/graph.py new file mode 100644 index 0000000000..03c487bd48 --- /dev/null +++ b/dace/graph/graph.py @@ -0,0 +1,711 @@ +""" Graph and multigraph implementations for DaCe. """ + +from collections import deque, OrderedDict +import itertools +import networkx as nx +from dace.types import deduplicate + + +class NodeNotFoundError(Exception): + pass + + +class EdgeNotFoundError(Exception): + pass + + +class Edge(object): + def __init__(self, src, dst, data): + self._src = src + self._dst = dst + self._data = data + + @property + def src(self): + return self._src + + @property + def dst(self): + return self._dst + + @property + def data(self): + return self._data + + def __iter__(self): + yield self._src + yield self._dst + yield self._data + + def toJSON(self, indent=0): + if self._data is None: + return "null" + return self._data.toJSON(indent) + + @staticmethod + def __len__(): + return 3 + + def reverse(self): + self._src, self._dst = self._dst, self._src + + +class MultiEdge(Edge): + def __init__(self, src, dst, data, key): + super(MultiEdge, self).__init__(src, dst, data) + self._key = key + + def toJSON(self, indent=0): + # we loose the key here, what is that even? + if self._data is None: + return "null" + return self._data.toJSON(indent) + + @property + def key(self): + return self._key + + +class MultiConnectorEdge(MultiEdge): + def __init__(self, src, src_conn, dst, dst_conn, data, key): + super(MultiConnectorEdge, self).__init__(src, dst, data, key) + self._src_conn = src_conn + self._dst_conn = dst_conn + + def toJSON(self, indent=0): + # we lose the key here, what is that even? 
+ return ('%s' % ("null" + if self._data is None else self._data.toJSON(indent))) + + @property + def src_conn(self): + return self._src_conn + + @property + def src_connector(self): + return self._src_conn + + @property + def dst_conn(self): + return self._dst_conn + + @property + def dst_connector(self): + return self._dst_conn + + def __iter__(self): + yield self._src + yield self._src_conn + yield self._dst + yield self._dst_conn + yield self._data + + @staticmethod + def __len__(): + return 5 + + +class Graph(object): + def _not_implemented_error(self): + return NotImplementedError("Not implemented for " + str(type(self))) + + def toJSON(self, indent=0): + json = " " * indent + "{\n" + indent += 2 + json += " " * indent + "\"type\": \"" + type(self).__name__ + "\",\n" + json += " " * indent + "\"nodes\": [\n" + indent += 2 + for n in self.nodes(): + json += " " * indent + "{\n" + indent += 2 + json += " " * indent + "\"id\" : \"" + str( + self.node_id(n)) + "\",\n" + json += " " * indent + "\"attributes\" : " + n.toJSON(indent) + "\n" + indent -= 2 + if n == self.nodes()[-1]: + json += " " * indent + "}\n" + else: + json += " " * indent + "},\n" + indent -= 2 + json += " " * indent + "],\n" + + json += " " * indent + "\"edges\": [\n" + for e in self.edges(): + json += " " * indent + "{\n" + indent += 2 + json += " " * indent + "\"src\" : \"" + str(self.node_id( + e.src)) + "\",\n" + if isinstance(e, MultiConnectorEdge): + json += " " * indent + '"src_connector" : "%s",\n' % e.src_conn + json += " " * indent + "\"dst\" : \"" + str(self.node_id( + e.dst)) + "\",\n" + if isinstance(e, MultiConnectorEdge): + json += " " * indent + '"dst_connector" : "%s",\n' % e.dst_conn + json += " " * indent + "\"attributes\" : " + e.toJSON(indent) + "\n" + indent -= 2 + if e == self.edges()[-1]: + json += " " * indent + "}\n" + else: + json += " " * indent + "},\n" + indent -= 2 + json += " " * indent + "]\n" + json += " " * indent + "}\n" + return json + + def nodes(self): + """Returns an iterable to internal graph nodes.""" + raise self._not_implemented_error() + + def edges(self): + """Returns an iterable to internal graph edges.""" + raise self._not_implemented_error() + + def in_edges(self, node): + """Returns an iterable to Edge objects.""" + raise self._not_implemented_error() + + def out_edges(self, node): + """Returns an iterable to Edge objects.""" + raise self._not_implemented_error() + + def all_edges(self, *nodes): + """Returns an iterable to incoming and outgoing Edge objects.""" + result = set() + for node in nodes: + result.update(self.in_edges(node)) + result.update(self.out_edges(node)) + return list(result) + + def add_node(self, node): + """Adds node to the graph.""" + raise self._not_implemented_error() + + def add_nodes_from(self, node_list): + """Adds nodes from an iterable to the graph""" + for node in node_list: + self.add_node(node) + + def node_id(self, node): + """Returns a numeric node ID that corresponds to the node index in the + internal graph representation (unique).""" + for i, n in enumerate(self.nodes()): + if node == n: + return i + raise NodeNotFoundError(node) + + def add_edge(self, source, destination, data): + """Adds an edge to the graph containing the specified data. 
+ Returns the added edge.""" + raise self._not_implemented_error() + + def remove_node(self, node): + """Removes the specified node.""" + raise self._not_implemented_error() + + def remove_nodes_from(self, node_list): + """Removes the nodes specified in an iterable.""" + for node in node_list: + self.remove_node(node) + + def remove_edge(self, edge): + """Removes the specified Edge object.""" + raise self._not_implemented_error() + + def edges_between(self, source, destination): + """Returns all edges that connect source and destination directly""" + raise self._not_implemented_error() + + def predecessors(self, node): + """Returns an iterable of nodes that have edges leading to the passed + node""" + return deduplicate([e.src for e in self.in_edges(node)]) + + def successors(self, node): + """Returns an iterable of nodes that have edges leading to the passed + node""" + return deduplicate([e.dst for e in self.out_edges(node)]) + + def neighbors(self, node): + return itertools.chain(self.predecessors(node), self.successors(node)) + + def in_degree(self, node): + """Returns the number of incoming edges to the specified node.""" + raise self._not_implemented_error() + + def out_degree(self, node): + """Returns the number of outgoing edges from the specified node.""" + raise self._not_implemented_error() + + def number_of_nodes(self): + """Returns the total number of nodes in the graph.""" + raise self._not_implemented_error() + + def number_of_edges(self): + """Returns the total number of edges in the graph.""" + raise self._not_implemented_error() + + def is_directed(self): + raise self._not_implemented_error() + + def is_multigraph(self): + raise self._not_implemented_error() + + def __iter__(self): + return iter(self.nodes()) + + def __len__(self): + """ Returns the total number of nodes in the graph (nx compatibility)""" + return self.number_of_nodes() + + def bfs_edges(self, node, reverse=False): + """Returns a generator over edges in the graph originating from the + passed node in BFS order""" + if isinstance(node, (tuple, list)): + queue = deque(node) + else: + queue = deque([node]) + visited = set() + while len(queue) > 0: + node = queue.popleft() + if node in visited: + continue + visited.add(node) + edges = (self.out_edges(node) + if not reverse else self.in_edges(node)) + for e in edges: + next_node = e.dst if not reverse else e.src + if next_node not in visited: + queue.append(next_node) + yield e + + def dfs_edges(G, source, condition=None): + """Traverse a graph (DFS) with an optional condition to filter out nodes + """ + if isinstance(source, list): nodes = source + else: nodes = [source] + visited = set() + for start in nodes: + if start in visited: + continue + visited.add(start) + stack = [(start, G.out_edges(start).__iter__())] + while stack: + parent, children = stack[-1] + try: + e = next(children) + if e.dst not in visited: + visited.add(e.dst) + if condition is None or condition( + e.src, e.dst, e.data): + yield e + stack.append((e.dst, + G.out_edges(e.dst).__iter__())) + except StopIteration: + stack.pop() + + def source_nodes(self): + """Returns nodes with no incoming edges.""" + return [n for n in self.nodes() if self.in_degree(n) == 0] + + def sink_nodes(self): + """Returns nodes with no outgoing edges.""" + return [n for n in self.nodes() if self.out_degree(n) == 0] + + def topological_sort(self, source=None): + """Returns nodes in topological order iff the graph contains exactly + one node with no incoming edges.""" + if source is not None: + sources = [source] + 
else: + sources = self.source_nodes() + if len(sources) == 0: + sources = [self.nodes()[0]] + #raise RuntimeError("No source nodes found") + if len(sources) > 1: + sources = [self.nodes()[0]] + #raise RuntimeError("Multiple source nodes found") + seen = OrderedDict() # No OrderedSet in Python + queue = deque(sources) + while len(queue) > 0: + node = queue.popleft() + seen[node] = None + for e in self.out_edges(node): + succ = e.dst + if succ not in seen: + seen[succ] = None + queue.append(succ) + return seen.keys() + + def all_simple_paths(self, source_node, dest_node): + """ Finds all simple paths (with no repeating nodes) from source_node + to dest_node """ + return nx.all_simple_paths(self._nx, source_node, dest_node) + + +class SubgraphView(Graph): + def __init__(self, graph, subgraph_nodes): + self._graph = graph + self._subgraph_nodes = subgraph_nodes + self._parallel_parent = None + + def is_parallel(self): + return self._parallel_parent != None + + def set_parallel_parent(self, parallel_parent): + self._parallel_parent = parallel_parent + + def get_parallel_parent(self): + return self._parallel_parent + + def nodes(self): + return self._subgraph_nodes + + def edges(self): + return [ + e for e in self._graph.edges() + if e.src in self._subgraph_nodes and e.dst in self._subgraph_nodes + ] + + def in_edges(self, node): + if node not in self._subgraph_nodes: + raise NodeNotFoundError + + return [ + e for e in self._graph.in_edges(node) + if e.src in self._subgraph_nodes + ] + + def out_edges(self, node): + if node not in self._subgraph_nodes: + raise NodeNotFoundError + + return [ + e for e in self._graph.out_edges(node) + if e.dst in self._subgraph_nodes + ] + + def add_node(self, node): + raise PermissionError + + def add_nodes_from(self, node_list): + raise PermissionError + + def node_id(self, node): + if node not in self._subgraph_nodes: + raise NodeNotFoundError + return self._graph.node_id(node) + + def add_edge(self, source, destination, data): + raise PermissionError + + def remove_node(self, node): + raise PermissionError + + def remove_nodes_from(self, node_list): + raise PermissionError + + def remove_edge(self, edge): + raise PermissionError + + def edges_between(self, source, destination): + if source not in self._subgraph_nodes or \ + destination not in self._subgraph_nodes: + raise NodeNotFoundError + return self._graph.edges_between(source, destination) + + def in_degree(self, node): + return len(self.in_edges(node)) + + def out_degree(self, node): + return len(self.out_edges(node)) + + def number_of_nodes(self): + return len(self._subgraph_nodes) + + def number_of_edges(self): + return len(self.edges()) + + def is_directed(self): + return self._graph.is_directed() + + def is_multigraph(self): + return self._graph.is_multigraph() + + +class DiGraph(Graph): + def __init__(self): + self._nx = nx.DiGraph() + + def nodes(self): + return self._nx.nodes() + + @staticmethod + def _from_nx(edge): + return Edge(edge[0], edge[1], edge[2]["data"]) + + def edges(self): + return [DiGraph._from_nx(e) for e in self._nx.edges()] + + def in_edges(self, node): + return [DiGraph._from_nx(e) for e in self._nx.in_edges()] + + def out_edges(self, node): + return [DiGraph._from_nx(e) for e in self._nx.out_edges()] + + def add_node(self, node): + return self._nx.add_node(node) + + def add_edge(self, source, destination, data): + return self._nx.add_edge(source, destination, data=data) + + def remove_node(self, node): + self._nx.remove_node(node) + + def remove_edge(self, edge): + 
self._nx.remove_edge(edge[0], edge[1]) + + def in_degree(self, node): + return self._nx.in_degree(node) + + def out_degree(self, node): + return self._nx.out_degree(node) + + def number_of_nodes(self): + return self._nx.number_of_nodes() + + def number_of_edges(self): + return self._nx.number_of_edges() + + def is_directed(self): + return True + + def is_multigraph(self): + return False + + def edges_between(self, source, destination): + return [e for e in self.out_edges(source) if e.dst == destination] + + def find_cycles(self): + return nx.simple_cycles(self._nx) + + +class MultiDiGraph(DiGraph): + def __init__(self): + self._nx = nx.MultiDiGraph() + + @staticmethod + def _from_nx(edge): + return MultiEdge(edge[0], edge[1], edge[3]["data"], edge[2]) + + def add_edge(self, source, destination, data): + key = self._nx.add_edge(source, destination, data=data) + return (source, destination, data, key) + + def remove_edge(self, edge): + self._nx.remove_edge(edge[0], edge[1], edge.key) + + def is_multigraph(self): + return True + + +class MultiDiConnectorGraph(MultiDiGraph): + def __init__(self): + super().__init__() + + @staticmethod + def _from_nx(edge): + return MultiConnectorEdge(edge[0], edge[3]["src_conn"], edge[1], + edge[3]["dst_conn"], edge[3]["data"], + edge[2]) + + def add_edge(self, source, src_connector, destination, dst_connector, + data): + key = self._nx.add_edge( + source, + destination, + data=data, + src_conn=src_connector, + dst_conn=dst_connector) + return (source, src_connector, destination, dst_connector, data, key) + + def remove_edge(self, edge): + self._nx.remove_edge(edge[0], edge[1], edge.key) + + def is_multigraph(self): + return True + + +class OrderedDiGraph(Graph): + """ Directed graph where nodes and edges are returned in the order they + were added. 
""" + + def __init__(self): + self._nx = nx.DiGraph() + # {node: ({in edge: None}, {out edges: None})} + self._nodes = OrderedDict() + # {(src, dst): edge} + self._edges = OrderedDict() + + @property + def nx(self): + return self._nx + + def node(self, id): + return list(self._nodes.keys())[id] + + def nodes(self): + return list(self._nodes.keys()) + + def edges(self): + return list(self._edges.values()) + + def in_edges(self, node): + return list(self._nodes[node][0].values()) + + def out_edges(self, node): + return list(self._nodes[node][1].values()) + + def add_node(self, node): + if node in self._nodes: + raise RuntimeError("Duplicate node added") + self._nodes[node] = (OrderedDict(), OrderedDict()) + self._nx.add_node(node) + + def add_edge(self, src, dst, data): + t = (src, dst) + if t in self._edges: + raise RuntimeError("Duplicate edge added") + if src not in self._nodes: + self.add_node(src) + if dst not in self._nodes: + self.add_node(dst) + edge = Edge(src, dst, data) + self._edges[t] = edge + self._nodes[src][1][t] = edge + self._nodes[dst][0][t] = edge + return self._nx.add_edge(src, dst, data=data) + + def remove_node(self, node): + for edge in itertools.chain(self.in_edges(node), self.out_edges(node)): + self.remove_edge(edge) + del self._nodes[node] + self._nx.remove_node(node) + + def remove_edge(self, edge): + src = edge.src + dst = edge.dst + t = (src, dst) + self._nx.remove_edge(src, dst) + del self._nodes[src][1][t] + del self._nodes[dst][0][t] + del self._edges[t] + + def in_degree(self, node): + return len(self._nodes[node][0]) + + def out_degree(self, node): + return len(self._nodes[node][1]) + + def number_of_nodes(self): + return len(self._nodes) + + def number_of_edges(self): + return len(self._edges) + + def is_directed(self): + return True + + def is_multigraph(self): + return False + + def find_cycles(self): + return nx.simple_cycles(self._nx) + + def edges_between(self, source, destination): + if source not in self.nodes(): return [] + return [e for e in self.out_edges(source) if e.dst == destination] + + def reverse(self): + """Reverses source and destination of all edges in the graph""" + raise self._not_implemented_error() + + +class OrderedMultiDiGraph(OrderedDiGraph): + """ Directed multigraph where nodes and edges are returned in the order + they were added. """ + + def __init__(self): + self._nx = nx.MultiDiGraph() + # {node: ({in edge: edge}, {out edge: edge})} + self._nodes = OrderedDict() + # {edge: edge} + self._edges = OrderedDict() + + def add_edge(self, src, dst, data): + key = self._nx.add_edge(src, dst, data=data) + edge = MultiEdge(src, dst, data, key) + if src not in self._nodes: + self.add_node(src) + if dst not in self._nodes: + self.add_node(dst) + self._nodes[src][1][edge] = edge + self._nodes[dst][0][edge] = edge + self._edges[edge] = edge + return edge + + def remove_edge(self, edge): + del self._edges[edge] + del self._nodes[edge.src][1][edge] + del self._nodes[edge.dst][0][edge] + self._nx.remove_edge(edge.src, edge.dst, edge.key) + + def reverse(self): + self._nx.reverse(False) + for e in self._edges.keys(): + e.reverse() + for n, (in_edges, out_edges) in self._nodes.items(): + self._nodes[n] = (out_edges, in_edges) + + def is_multigraph(self): + return True + + +class OrderedMultiDiConnectorGraph(OrderedMultiDiGraph): + """ Directed multigraph with node connectors (SDFG states), where nodes + and edges are returned in the order they were added. 
""" + + def __init__(self): + super().__init__() + + def add_edge(self, src, src_conn, dst, dst_conn, data): + key = self._nx.add_edge( + src, dst, data=data, src_conn=src_conn, dst_conn=dst_conn) + edge = MultiConnectorEdge(src, src_conn, dst, dst_conn, data, key) + if src not in self._nodes: + self.add_node(src) + if dst not in self._nodes: + self.add_node(dst) + self._nodes[src][1][edge] = edge + self._nodes[dst][0][edge] = edge + self._edges[edge] = edge + return edge + + def add_nedge(self, src, dst, data): + """ Adds an edge without (value=None) connectors. """ + return self.add_edge(src, None, dst, None, data) + + def remove_edge(self, edge): + del self._edges[edge] + del self._nodes[edge.src][1][edge] + del self._nodes[edge.dst][0][edge] + self._nx.remove_edge(edge.src, edge.dst, edge.key) + + def reverse(self): + self._nx.reverse(False) + for e in self._edges.keys(): + e.reverse() + for n, (in_edges, out_edges) in self._nodes.items(): + self._nodes[n] = (out_edges, in_edges) + + def is_multigraph(self): + return True diff --git a/dace/graph/labeling.py b/dace/graph/labeling.py new file mode 100644 index 0000000000..c47734cb48 --- /dev/null +++ b/dace/graph/labeling.py @@ -0,0 +1,813 @@ +""" Functionality relating to Memlet propagation (deducing external memlets + from internal memory accesses and scope ranges). """ + +import copy +import itertools +import functools +import networkx as nx +import sympy +import unittest +import math + +from dace import data, subsets, symbolic, types +from dace.memlet import Memlet +from dace.graph import nodes, nxutil +from dace.graph.graph import OrderedMultiDiGraph +from dace.transformation import pattern_matching + + +class MemletPattern(object): + """ A pattern match on a memlet subset that can be used for propagation. + """ + s_patterns = [] + s_dependencies = {} + + @staticmethod + def patterns(): + return [p() for p in MemletPattern.s_patterns] + + @staticmethod + def register_pattern(clazz, depends=None): + if not issubclass(clazz, MemletPattern): + raise TypeError + MemletPattern.s_patterns.append(clazz) + + @staticmethod + def unregister_pattern(clazz): + if not issubclass(clazz, MemletPattern): + raise TypeError + MemletPattern.s_patterns.remove(clazz) + + #################################################### + + def match(self, expressions, variable_context, node_range, orig_edges): + raise NotImplementedError + + def propagate(self, array, expressions, node_range): + raise NotImplementedError + + +class SeparableMemletPattern(object): + """ Memlet pattern that can be applied to each of the dimensions + separately. """ + + s_smpatterns = [] + + @staticmethod + def register_pattern(cls): + if not issubclass(cls, SeparableMemletPattern): raise TypeError + if cls not in SeparableMemletPattern.s_smpatterns: + SeparableMemletPattern.s_smpatterns.append(cls) + + @staticmethod + def unregister_pattern(cls): + SeparableMemletPattern.s_smpatterns.remove(cls) + + def match(self, dim_exprs, variable_context, node_range, orig_edges, + dim_index, total_dims): + raise NotImplementedError + + def propagate(self, array, dim_exprs, node_range): + raise NotImplementedError + + +class SeparableMemlet(MemletPattern): + """ Meta-memlet pattern that applies all separable memlet patterns. 
""" + + def match(self, expressions, variable_context, node_range, orig_edges): + # Assuming correct dimensionality in each of the expressions + data_dims = len(expressions[0]) + self.patterns_per_dim = [None] * data_dims + + overapprox_range = subsets.Range([(rb.approx if isinstance( + rb, symbolic.SymExpr) else rb, re.approx if isinstance( + re, symbolic.SymExpr) else re, rs.approx if isinstance( + rs, symbolic.SymExpr) else rs) + for rb, re, rs in node_range]) + + for dim in range(data_dims): + + dexprs = [] + for expr in expressions: + if isinstance(expr[dim], symbolic.SymExpr): + dexprs.append(expr[dim].approx) + elif isinstance(expr[dim], tuple): + dexprs.append( + (expr[dim][0].approx + if isinstance(expr[dim][0], symbolic.SymExpr) else + expr[dim][0], expr[dim][1].approx + if isinstance(expr[dim][1], symbolic.SymExpr) else + expr[dim][1], expr[dim][2].approx + if isinstance(expr[dim][2], + symbolic.SymExpr) else expr[dim][2])) + else: + dexprs.append(expr[dim]) + + for pattern_class in SeparableMemletPattern.s_smpatterns: + smpattern = pattern_class() + if smpattern.match(dexprs, variable_context, overapprox_range, + orig_edges, dim, data_dims): + self.patterns_per_dim[dim] = smpattern + break + + return None not in self.patterns_per_dim + + def propagate(self, array, expressions, node_range): + result = [(None, None, None)] * len(self.patterns_per_dim) + + overapprox_range = subsets.Range([(rb.approx if isinstance( + rb, symbolic.SymExpr) else rb, re.approx if isinstance( + re, symbolic.SymExpr) else re, rs.approx if isinstance( + rs, symbolic.SymExpr) else rs) + for rb, re, rs in node_range]) + + for i, smpattern in enumerate(self.patterns_per_dim): + + dexprs = [] + for expr in expressions: + if isinstance(expr[i], symbolic.SymExpr): + dexprs.append(expr[i].approx) + elif isinstance(expr[i], tuple): + dexprs.append((expr[i][0].approx if isinstance( + expr[i][0], + symbolic.SymExpr) else expr[i][0], expr[i][1].approx + if isinstance(expr[i][1], symbolic.SymExpr) + else expr[i][1], expr[i][2].approx + if isinstance(expr[i][2], symbolic.SymExpr) + else expr[i][2], expr.tile_sizes[i])) + else: + dexprs.append(expr[i]) + + result[i] = smpattern.propagate(array, dexprs, overapprox_range) + + # TODO(later): Not necessarily Range (general integer sets) + return subsets.Range(result) + + +MemletPattern.register_pattern(SeparableMemlet) + + +class AffineSMemlet(SeparableMemletPattern): + """ Separable memlet pattern that matches affine expressions, i.e., + of the form `a * {index} + b`. + """ + + def match(self, dim_exprs, variable_context, node_range, orig_edges, + dim_index, total_dims): + + params = variable_context[-1] # Why only last element? 
+ # Create wildcards for multiplication and addition + a = sympy.Wild('a', exclude=params) + b = sympy.Wild('b', exclude=params) + + self.param = None + self.paramind = None + self.mult = None + self.add_min = None + self.add_max = None + self.constant_min = None + self.constant_max = None + + # Obtain vector length + self.veclen = None + if dim_index == total_dims - 1: + for e in orig_edges: + self.veclen = e.veclen + if self.veclen is None: + self.veclen = 1 + ###################### + + # Special case: Get the total internal access range + # If this range matches (0, rs), we say that the propagated skip is 1 + self.internal_range = set() + + for dexpr in dim_exprs: + subexprs = None + step = None + if isinstance(dexpr, sympy.Basic): # Affine index + subexprs = [dexpr] + + elif isinstance(dexpr, tuple) and len(dexpr) == 3: # Affine range + subexprs = [dexpr[0], dexpr[1]] + step = dexpr[2] + + if subexprs is None: # Something else + return False + + for i, subexpr in enumerate(subexprs): + # Try to match an affine expression with a parameter + param = None + pind = -1 + for indp, p in enumerate(params): + matches = subexpr.match(a * p + b) + if param is None and matches is None: + continue + elif param is not None and matches is not None: + return False # Only one parameter may match + elif matches is not None: + multiplier = matches[a] + addition = matches[b] + param = p + pind = indp + + if param is None: + return False # A parameter must match + if self.param is not None and param != self.param: + return False # There can only be one parameter + if self.mult is not None and multiplier != self.mult: + return False # Multiplier must be the same + + self.param = param + self.paramind = pind + self.multiplier = multiplier + + # If this is one expression + if len(subexprs) == 1: + self.internal_range.add(addition) + elif i == 0: # Range begin + brb = addition + elif i == 1: # Range end + bre = addition + + if len(subexprs) > 1: + self.internal_range.add((brb, bre)) + + if step is not None: + if self.param in step.free_symbols: + return False # Step must be independent of parameter + + node_rb, node_re, node_rs = node_range[self.paramind] + if node_rs != 1: + # Map ranges where the last index is not known + # exactly are not supported by this pattern. 
+ return False + + if self.param is None: # and self.constant_min is None: + return False + + return True + + def propagate(self, array, dim_exprs, node_range): + # Compute last index in map according to range definition + node_rb, node_re, node_rs = node_range[self.paramind] # node_rs = 1 + node_rlen = node_re - node_rb + 1 + + if isinstance(dim_exprs, list): + dim_exprs = dim_exprs[0] + + if isinstance(dim_exprs, tuple): + + if len(dim_exprs) == 3: + rb, re, rs = dim_exprs + rt = '1' + elif len(dim_exprs) == 4: + rb, re, rs, rt = dim_exprs + else: + raise NotImplementedError + + rb = symbolic.pystr_to_symbolic(rb).expand() + re = symbolic.pystr_to_symbolic(re).expand() + rs = symbolic.pystr_to_symbolic(rs).expand() + rt = symbolic.pystr_to_symbolic(rt).expand() + else: + rb, re = (dim_exprs.expand(), dim_exprs.expand()) + rs = 1 + rt = 1 + + result_begin = rb.subs(self.param, node_rb).expand() + result_end = re.subs(self.param, node_re).expand() + + # Experimental + # This should be using sympy.floor + memlet_start_pts = ((re - rt + 1 - rb) / rs) + 1 + memlet_rlen = memlet_start_pts.expand() * rt + interval_len = (result_end - result_begin + 1) * self.veclen + num_elements = node_rlen * memlet_rlen + + if (interval_len == num_elements + or interval_len.expand() == num_elements): + # Continuous access + result_skip = 1 + result_tile = 1 + else: + if rt == 1: + result_skip = (result_end - result_begin - re + rb) / ( + node_re - node_rb) + try: + if result_skip < 1: + result_skip = 1 + except: + pass + result_tile = result_end - result_begin + 1 - ( + node_rlen - 1) * result_skip + else: + candidate_skip = rs + candidate_tile = rt * node_rlen + candidate_lstart_pt = result_end - result_begin + 1 - candidate_tile + if (candidate_lstart_pt / (num_elements / candidate_tile - 1) + ).simplify() == candidate_skip: + result_skip = rs + result_tile = rt * node_rlen + else: + result_skip = rs / node_rlen + result_tile = rt + + if result_skip == result_tile or result_skip == 1: + result_skip = 1 + result_tile = 1 + + result_begin = sympy.simplify(result_begin) + result_end = sympy.simplify(result_end) + result_skip = sympy.simplify(result_skip) + result_tile = sympy.simplify(result_tile) + + return (result_begin, result_end, result_skip, result_tile) + + +SeparableMemletPattern.register_pattern(AffineSMemlet) + + +class ModuloSMemlet(SeparableMemletPattern): + """ Separable memlet pattern that matches modulo expressions, i.e., + of the form `f(x) % N`. + + Acts as a meta-pattern: Finds the underlying pattern for `f(x)`. 
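As a concrete check of the affine propagation above (a standalone sketch, not part of this diff): the subset `2*i + 3` accessed inside a map over `i = 0..N-1` propagates to the strided range `3 : 2*N+1 : 2` with tile size 1.

```python
import sympy

i, N = sympy.symbols('i N')
index = 2 * i + 3            # internal access in the map dimension
node_rb, node_re = 0, N - 1  # map range for i

result_begin = index.subs(i, node_rb)              # 3
result_end = sympy.expand(index.subs(i, node_re))  # 2*N + 1
result_skip = sympy.simplify(
    (result_end - result_begin) / (node_re - node_rb))  # 2
print(result_begin, result_end, result_skip)
```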
+ """ + + def match(self, dim_exprs, variable_context, node_range, orig_edges, + dim_index, total_dims): + # Pattern does not support unions of expressions + if len(dim_exprs) > 1: return False + dexpr = dim_exprs[0] + # Pattern does not support ranges + if not isinstance(dexpr, sympy.Basic): return False + + # Create wildcards + val = sympy.Wild('val') + mod = sympy.Wild('mod', exclude=variable_context[-1]) + + # Try to match an affine expression + matches = dexpr.match(val % mod) + if matches is None or len(matches) != 2: + return False + + self.subexpr = matches[val] + self.modulo = matches[mod] + + self.subpattern = None + for pattern_class in SeparableMemletPattern.s_smpatterns: + smpattern = pattern_class() + if smpattern.match([self.subexpr], variable_context, node_range, + orig_edges, dim_index, total_dims): + self.subpattern = smpattern + + return self.subpattern is not None + + def propagate(self, array, dim_exprs, node_range): + se_range = self.subpattern.propagate(array, [self.subexpr], node_range) + + # Apply modulo on start and end ranges + try: + if se_range[0] < 0: + se_range = (0, self.modulo, se_range[2]) + except TypeError: # cannot determine truth value of Relational + print('WARNING: Cannot evaluate relational %s, assuming true.' % + (se_range[0] < 0)) + try: + if se_range[1] > self.modulo: + se_range = (0, self.modulo, se_range[2]) + except TypeError: # cannot determine truth value of Relational + print('WARNING: Cannot evaluate relational %s, assuming true.' % + (se_range[1] > self.modulo)) + + return se_range + + +SeparableMemletPattern.register_pattern(ModuloSMemlet) + + +class ConstantSMemlet(SeparableMemletPattern): + """ Separable memlet pattern that matches constant (i.e., unrelated to + current scope) expressions. + """ + + def match(self, dim_exprs, variable_context, node_range, orig_edges, + dim_index, total_dims): + # Pattern does not support unions of expressions. TODO: Support + if len(dim_exprs) > 1: return False + dexpr = dim_exprs[0] + + # Create a wildcard that excludes current map's parameters + cst = sympy.Wild('cst', exclude=variable_context[-1]) + + # Range case + if isinstance(dexpr, tuple) and len(dexpr) == 3: + # Try to match a constant expression for the range + for rngelem in dexpr: + if types.isconstant(rngelem): + continue + + matches = rngelem.match(cst) + if matches is None or len(matches) != 1: + return False + if not matches[cst].is_constant(): + return False + + else: # Single element case + # Try to match a constant expression + if not types.isconstant(dexpr): + matches = dexpr.match(cst) + if matches is None or len(matches) != 1: + return False + if not matches[cst].is_constant(): + return False + + return True + + def propagate(self, array, dim_exprs, node_range): + if isinstance(dim_exprs[0], tuple): + return dim_exprs[0] # Already in range format + # Convert index to range format + return (dim_exprs[0], dim_exprs[0], 1) + + +SeparableMemletPattern.register_pattern(ConstantSMemlet) + + +class GenericSMemlet(SeparableMemletPattern): + """ Separable memlet pattern that detects any expression, and propagates + interval bounds. Used as a last resort. 
""" + + def match(self, dim_exprs, variable_context, node_range, orig_edges, + dim_index, total_dims): + + self.params = variable_context[-1] + + # Always matches + return True + + def propagate(self, array, dim_exprs, node_range): + + result_begin = None + result_end = None + + # Iterate over the node dimensions + for idx, node_r in enumerate(node_range): + + # Get dimension range + if len(node_r) == 3: + node_rb, node_re, node_rs = node_r + elif len(node_r) == 4: + node_rb, node_re, node_rs, _ = node_r + else: + raise NotImplementedError + + # Get true range end + lastindex = node_re + if node_rs != 1: + lastindex = symbolic.pystr_to_symbolic( + '%s + int_floor(%s - %s, %s) * %s' % + (symbolic.symstr(node_rb), symbolic.symstr(node_re), + symbolic.symstr(node_rb), symbolic.symstr(node_rs), + symbolic.symstr(node_rs))) + + if isinstance(dim_exprs, list): + dim_exprs = dim_exprs[0] + + if isinstance(dim_exprs, tuple): + + if len(dim_exprs) == 3: + rb, re, rs = dim_exprs + elif len(dim_exprs) == 4: + rb, re, rs, _ = dim_exprs + else: + raise NotImplementedError + + rb = symbolic.pystr_to_symbolic(rb) + re = symbolic.pystr_to_symbolic(re) + rs = symbolic.pystr_to_symbolic(rs) + + else: + rb, re = (dim_exprs, dim_exprs) + + if result_begin is None: + result_begin = rb.subs(self.params[idx], node_rb) + else: + result_begin = result_begin.subs(self.params[idx], node_rb) + if result_end is None: + result_end = re.subs(self.params[idx], lastindex) + else: + result_end = result_end.subs(self.params[idx], lastindex) + + result_skip = 1 + result_tile = 1 + + return (result_begin, result_end, result_skip, result_tile) + + +SeparableMemletPattern.register_pattern(GenericSMemlet) + + +def _subexpr(dexpr, repldict): + if isinstance(dexpr, tuple): + return tuple(_subexpr(d, repldict) for d in dexpr) + elif isinstance(dexpr, symbolic.SymExpr): + return dexpr.expr.subs(repldict) + else: + return dexpr.subs(repldict) + + +class ConstantRangeMemlet(MemletPattern): + """ Memlet pattern that matches arbitrary expressions with constant range. + """ + + def match(self, expressions, variable_context, node_range, orig_edges): + constant_range = True + for dim in node_range: + for rngelem in dim: # For (begin, end, skip) + if not types.isconstant(rngelem) and not isinstance( + rngelem, sympy.Number): + constant_range = False + break + if not constant_range: + return False + + self.params = variable_context[-1] + + return True + + # TODO: An integer set library should shine here (unify indices) + def propagate(self, array, expressions, node_range): + rng = [(None, None, 1)] * len(array.shape) + node_range_gen = (range(rb, re, rs) for rb, re, rs in node_range) + for ndind in itertools.product(*tuple(node_range_gen)): + repldict = {p: ndind[i] for i, p in enumerate(self.params)} + for expr in expressions: + for dim, dexpr in enumerate(expr): + evaldexpr = _subexpr(dexpr, repldict) + rb, re, rs = rng[dim] + if rb is None: + rng[dim] = (evaldexpr, evaldexpr, 1) + else: + if evaldexpr < rb: + rng[dim] = (evaldexpr, re, rs) + if evaldexpr > re: # The +1 is because ranges are exclusive + rng[dim] = (rb, evaldexpr, rs) + + return subsets.Range(rng) + + +# ConstantRangeMemlet is slow, so it should be evaluated last +MemletPattern.register_pattern(ConstantRangeMemlet) + + +def propagate_labels_sdfg(sdfg): + """ Propagates memlets throughout an entire given SDFG. + @note: This is an in-place operation on the SDFG. 
+ """ + for state in sdfg.nodes(): + _propagate_labels(state, sdfg) + + +def _propagate_labels(g, sdfg): + """ Propagates memlets throughout one SDFG state. + @param g: The state to propagate in. + @param sdfg: The SDFG in which the state is situated. + @note: This is an in-place operation on the SDFG state. + """ + patterns = MemletPattern.patterns() + + # Algorithm: + # 1. Start propagating information from tasklets outwards (their edges + # are hardcoded). + # NOTE: This process can be performed in parallel. + # 2. Traverse the neighboring nodes (topological sort, first forward to + # outputs and then backward to inputs). + # There are four possibilities: + # a. If the neighboring node is a tasklet, skip (such edges are + # immutable) + # b. If the neighboring node is an array, make sure it is the correct + # array. Otherwise, throw a mismatch exception. + # c. If the neighboring node is a scope node, and its other edges are + # not set, set the results per-array, using the union of the + # obtained ranges in the previous depth. + # d. If the neighboring node is a scope node, and its other edges are + # already set, verify the results per-array, using the union of the + # obtained ranges in the previous depth. + # NOTE: The SDFG creation process ensures that all edges in the + # multigraph are tagged with the appropriate array. In any case + # of ambiguity, the function raises an exception. + # 3. For each edge in the multigraph, collect results and group by array assigned to edge. + # Accumulate information about each array in the target node. + scope_dict = g.scope_dict() + + def stop_at(parent, child): + # Transients should only propagate in the direction of the + # non-transient data + if isinstance(parent, + nodes.AccessNode) and parent.desc(sdfg).transient: + for _, _, _, _, memlet in g.edges_between(parent, child): + if parent.data != memlet.data: + return True + return False + if isinstance(child, nodes.AccessNode): + return False + return True + + array_data = {} # type: dict(node -> dict(data -> list(Subset))) + tasklet_nodes = [ + node for node in g.nodes() if (isinstance(node, nodes.CodeNode) or ( + isinstance(node, nodes.AccessNode) and node.desc(sdfg).transient)) + ] + # Step 1: Direction - To output + for start_node in tasklet_nodes: + for node in nxutil.dfs_topological_sort( + g, start_node, condition=stop_at): + _propagate_node(sdfg, g, node, array_data, patterns, scope_dict, + True) + # Step 1: Direction - To input + array_data = {} + g.reverse() + for node in nxutil.dfs_topological_sort( + g, tasklet_nodes, condition=stop_at): + _propagate_node(sdfg, g, node, array_data, patterns, scope_dict) + + # To support networkx 1.11 + g.reverse() + + +# External API +def propagate_memlet(dfg_state, memlet: Memlet, scope_node: nodes.EntryNode, + union_inner_edges: bool): + """ Tries to propagate a memlet through a scope (computes the image of + the memlet function applied on an integer set of, e.g., a map range) + and returns a new memlet object. + @param dfg_state: An SDFGState object representing the graph. + @param memlet: The memlet adjacent to the scope node from the inside. + @param scope_node: A scope entry or exit node. + @param union_inner_edges: True if the propagation should take other + neighboring internal memlets within the same + scope into account. 
+ """ + if isinstance(scope_node, nodes.EntryNode): + neighboring_edges = dfg_state.out_edges(scope_node) + elif isinstance(scope_node, nodes.ExitNode): + neighboring_edges = dfg_state.in_edges(scope_node) + else: + raise TypeError('Trying to propagate through a non-scope node') + + # Find other adjacent edges within the connected to the scope node + # and union their subsets + if union_inner_edges: + aggdata = [ + e.data for e in neighboring_edges + if e.data.data == memlet.data and e.data != memlet + ] + else: + aggdata = [] + + aggdata.append(memlet) + + new_subset = _propagate_edge(dfg_state.parent, None, + scope_node, None, memlet, aggdata, + MemletPattern.patterns(), None) + + new_memlet = copy.copy(memlet) + new_memlet.subset = new_subset + new_memlet.other_subset = None + + # Number of accesses in the propagated memlet is the sum of the internal + # number of accesses times the size of the map range set + new_memlet.num_accesses = ( + sum(m.num_accesses for m in aggdata) * functools.reduce( + lambda a, b: a * b, scope_node.map.range.size(), 1)) + + return new_memlet + + +def _propagate_node(sdfg, + g, + node, + array_data, + patterns, + scope_dict, + write=False): + # Step 2: Propagate edges + # If this is a tasklet, we only propagate to adjacent nodes and not modify edges + # Special case: starting from reduction, no need for external nodes to compute edges + if (not isinstance(node, nodes.CodeNode) + and not isinstance(node, nodes.AccessNode) and node in array_data): + # Otherwise (if primitive), use current node information and accumulated data + # on arrays to set the memlets per edge + for _, _, target, _, memlet in g.out_edges(node): + # Option (a) + if (isinstance(target, nodes.CodeNode)): + continue + + if not isinstance(memlet, Memlet): + raise AttributeError('Edge does not contain a memlet') + + aggdata = None + if node in array_data: + if memlet.data in array_data[node]: + aggdata = array_data[node][memlet.data] + + wcr = None + if aggdata is not None: + for m in aggdata: + if m.wcr is not None: + wcr = (m.wcr, m.wcr_identity) + break + + # Compute candidate edge + candidate = _propagate_edge(sdfg, g, node, target, memlet, aggdata, + patterns, not write) + if candidate is None: + continue + + # Option (b) + if isinstance(target, nodes.AccessNode): + # Check for data mismatch + if target.data != memlet.data: #and not target.desc.transient: + raise LookupError( + 'Mismatch between edge data %s and data node %s' % + (memlet.data, target.data)) + + # Options (c), (d) + else: + pass + + # Set new edge value + memlet.subset = candidate + + # Number of accesses in the propagated memlet is the sum of the internal + # number of accesses times the size of the map range set + memlet.num_accesses = ( + sum(m.num_accesses for m in aggdata) * functools.reduce( + lambda a, b: a * b, node.map.range.size(), 1)) + + # Set WCR, if necessary + if wcr is not None: + memlet.wcr, memlet.wcr_identity = wcr + + # Step 3: Accumulate edge information in adjacent node, grouped by array + for _, _, target, _, memlet in g.out_edges(node): + if (isinstance(target, nodes.CodeNode)): + continue + + if not isinstance(memlet, Memlet): + raise AttributeError('Edge does not contain a memlet') + + # Transients propagate only towards the data they are writing to + if isinstance(node, nodes.AccessNode) and node.data == memlet.data: + continue + + # No data + if memlet.subset is None: + continue + #if isinstance(memlet, subsets.SequentialDependency): + # continue + + # Accumulate data information on target node 
+ if target not in array_data: + array_data[target] = {} + if memlet.data not in array_data[target]: + array_data[target][memlet.data] = [] + array_data[target][memlet.data].append(memlet) + + +def _propagate_edge(sdfg, g, u, v, memlet, aggdata, patterns, reversed): + if ((isinstance(u, nodes.EntryNode) or isinstance(u, nodes.ExitNode))): + mapnode = u.map + + if aggdata is None: + return None + + # Collect data about edge + data = memlet.data + expr = [edge.subset for edge in aggdata] + + if memlet.data not in sdfg.arrays: + raise KeyError('Data descriptor (Array, Stream) "%s" not defined ' + 'in SDFG.' % memlet.data) + + for pattern in patterns: + if pattern.match( + expr, + [[symbolic.pystr_to_symbolic(p) for p in mapnode.params]], + mapnode.range, aggdata): # Only one level of context + return pattern.propagate(sdfg.arrays[memlet.data], expr, + mapnode.range) + + # No patterns found. Emit a warning and propagate the entire array + print('WARNING: Cannot find appropriate memlet pattern to propagate %s' + % str(expr)) + + return subsets.Range.from_array(sdfg.arrays[memlet.data]) + elif isinstance(u, nodes.ConsumeEntry) or isinstance(u, nodes.ConsumeExit): + + # Nothing to analyze/propagate in consume + return subsets.Range.from_array(sdfg.arrays[memlet.data]) + + else: + raise NotImplementedError('Unimplemented primitive: %s' % type(u)) diff --git a/dace/graph/nodes.py b/dace/graph/nodes.py new file mode 100644 index 0000000000..58a30a6f9c --- /dev/null +++ b/dace/graph/nodes.py @@ -0,0 +1,749 @@ +""" Contains classes implementing the different types of nodes of the stateful + dataflow multigraph representation. """ + +import ast +from copy import deepcopy as dcpy +import itertools +from typing import Set +from dace.graph import dot, graph +from dace.frontend.python.astutils import unparse +from dace.properties import (Property, CodeProperty, LambdaProperty, + ParamsProperty, RangeProperty, DebugInfoProperty, + SetProperty, make_properties, indirect_properties, + DataProperty, SymbolicProperty) +from dace.frontend.operations import detect_reduction_type +from dace import data, subsets as sbs, types +import pickle + +# ----------------------------------------------------------------------------- + + +@make_properties +class Node(object): + """ Base node class. """ + + in_connectors = SetProperty( + str, default=set(), desc="A set of input connectors for this node.") + out_connectors = SetProperty( + str, default=set(), desc="A set of output connectors for this node.") + + def __init__(self, in_connectors=set(), out_connectors=set()): + self.in_connectors = in_connectors + self.out_connectors = out_connectors + + def __str__(self): + if hasattr(self, 'label'): + return self.label + else: + return type(self).__name__ + + def validate(self, sdfg, state): + pass + + def toJSON(self, indent=0): + labelstr = str(self) + typestr = str(type(self).__name__) + inconn = "[" + ",".join( + ['"' + str(x) + '"' for x in self.in_connectors]) + "]" + outconn = "[" + ",".join( + ['"' + str(x) + '"' for x in self.out_connectors]) + "]" + json = " " * indent + "{ \"label\": \"" + labelstr + json += "\", \"type\": \"" + typestr + "\", \"in_connectors\": " + inconn + json += ", \"out_connectors\" :" + outconn + json += "}\n" + return json + + def __repr__(self): + return type(self).__name__ + ' (' + self.__str__() + ')' + + def add_in_connector(self, connector_name: str): + """ Adds a new input connector to the node. 
The operation will fail if + a connector (either input or output) with the same name already + exists in the node. + + @param connector_name: The name of the new connector. + @return: True if the operation is successful, otherwise False. + """ + + if (connector_name in self.in_connectors + or connector_name in self.out_connectors): + return False + connectors = self.in_connectors + connectors.add(connector_name) + self.in_connectors = connectors + return True + + def add_out_connector(self, connector_name: str): + """ Adds a new output connector to the node. The operation will fail if + a connector (either input or output) with the same name already + exists in the node. + + @param connector_name: The name of the new connector. + @return: True if the operation is successful, otherwise False. + """ + + if (connector_name in self.in_connectors + or connector_name in self.out_connectors): + return False + connectors = self.out_connectors + connectors.add(connector_name) + self.out_connectors = connectors + return True + + def remove_in_connector(self, connector_name: str): + """ Removes an input connector from the node. + @param connector_name: The name of the connector to remove. + @return: True if the operation was successful. + """ + + if connector_name in self.in_connectors: + connectors = self.in_connectors + connectors.remove(connector_name) + self.in_connectors = connectors + return True + + def remove_out_connector(self, connector_name: str): + """ Removes an output connector from the node. + @param connector_name: The name of the connector to remove. + @return: True if the operation was successful. + """ + + if connector_name in self.out_connectors: + connectors = self.out_connectors + connectors.remove(connector_name) + self.out_connectors = connectors + return True + + def _next_connector_int(self) -> int: + """ Returns the next unused connector ID (as an integer). Used for + filling connectors when adding edges to scopes. """ + next_number = 1 + for conn in itertools.chain(self.in_connectors, self.out_connectors): + if conn.startswith('IN_'): + cconn = conn[3:] + elif conn.startswith('OUT_'): + cconn = conn[4:] + else: + continue + try: + curconn = int(cconn) + if curconn >= next_number: + next_number = curconn + 1 + except TypeError: # not integral + continue + return next_number + + def next_connector(self) -> str: + """ Returns the next unused connector ID (as a string). Used for + filling connectors when adding edges to scopes. """ + return str(self._next_connector_int()) + + def last_connector(self) -> str: + """ Returns the last used connector ID (as a string). Used for + filling connectors when adding edges to scopes. """ + return str(self._next_connector_int() - 1) + + +# ------------------------------------------------------------------------------ + + +@make_properties +class AccessNode(Node): + """ A node that accesses data in the SDFG. Denoted by a circular shape. 
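A hedged usage sketch of the connector helpers defined above, using the `Tasklet` node from later in this file (assumes the `dace` package from this patch is importable; names are illustrative).

```python
from dace.graph.nodes import Tasklet

t = Tasklet('scale', inputs={'a'}, outputs={'b'}, code='b = 2 * a')
print(t.add_in_connector('a'))        # False: 'a' is already an input connector
print(t.add_in_connector('IN_1'))     # True
print(t.next_connector())             # '2' -- next unused IN_/OUT_ number
print(t.remove_in_connector('IN_1'))  # True
```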
""" + + access = Property( + enum=types.AccessType, + desc="Type of access to this array", + default=types.AccessType.ReadWrite) + setzero = Property(dtype=bool, desc="Initialize to zero", default=False) + debuginfo2 = DebugInfoProperty() + data = DataProperty(desc="Data (array, stream, scalar) to access") + + def __init__(self, data, access=types.AccessType.ReadWrite, + debuginfo=None): + super(AccessNode, self).__init__() + + # Properties + self.debuginfo2 = debuginfo + self.access = access + if not isinstance(data, str): + raise TypeError('Data for AccessNode must be a string') + self.data = data + + def __deepcopy__(self, memo): + node = object.__new__(AccessNode) + node._access = self._access + node._data = self._data + node._setzero = self._setzero + node._in_connectors = self._in_connectors + node._out_connectors = self._out_connectors + node.debuginfo2 = dcpy(self.debuginfo2) + return node + + @property + def label(self): + return self.data + + def __label__(self, sdfg, state): + return self.data + + def desc(self, sdfg): + from dace.sdfg import SDFGState, ScopeSubgraphView + if isinstance(sdfg, (SDFGState, ScopeSubgraphView)): + sdfg = sdfg.parent + return sdfg.arrays[self.data] + + def draw_node(self, sdfg, graph): + desc = self.desc(sdfg) + if isinstance(desc, data.Stream): + return dot.draw_node( + sdfg, graph, self, shape="oval", style='dashed') + elif desc.transient: + return dot.draw_node(sdfg, graph, self, shape="oval") + else: + return dot.draw_node(sdfg, graph, self, shape="oval", style='bold') + + def validate(self, sdfg, state): + if self.data not in sdfg.arrays: + raise KeyError('Array "%s" not found in SDFG' % self.data) + + +# ------------------------------------------------------------------------------ + + +class CodeNode(Node): + """ A node that contains runnable code with acyclic external data + dependencies. May either be a tasklet or a nested SDFG, and + denoted by an octagonal shape. """ + pass + + +@make_properties +class Tasklet(CodeNode): + """ A node that contains a tasklet: a functional computation procedure + that can only access external data specified using connectors. + + Tasklets may be implemented in Python, C++, or any supported + language by the code generator. 
+ """ + + label = Property(dtype=str, desc="Name of the tasklet") + language = Property(enum=types.Language, default=types.Language.Python) + code = CodeProperty(desc="Tasklet code") + code_global = CodeProperty( + desc="Global scope code needed for tasklet execution", default="") + code_init = CodeProperty( + desc="Extra code that is called on DaCe runtime initialization", + default="") + code_exit = CodeProperty( + desc="Extra code that is called on DaCe runtime cleanup", default="") + location = Property( + dtype=str, desc="Tasklet execution location descriptor") + debuginfo = DebugInfoProperty() + + def __init__(self, + label, + inputs=set(), + outputs=set(), + code="", + language=types.Language.Python, + code_global="", + code_init="", + code_exit="", + location="-1", + debuginfo=None): + super(Tasklet, self).__init__(inputs, outputs) + + # Properties + self.label = label + self.language = language + self.code = code + self.location = location + self.code_global = code_global + self.code_init = code_init + self.code_exit = code_exit + self.debuginfo = debuginfo + + @property + def name(self): + return self._label + + def draw_node(self, sdfg, graph): + return dot.draw_node(sdfg, graph, self, shape="octagon") + + def validate(self, sdfg, state): + if not data.validate_name(self.label): + raise NameError('Invalid tasklet name "%s"' % self.label) + for in_conn in self.in_connectors: + if not data.validate_name(in_conn): + raise NameError('Invalid input connector "%s"' % in_conn) + for out_conn in self.out_connectors: + if not data.validate_name(out_conn): + raise NameError('Invalid output connector "%s"' % out_conn) + + def __str__(self): + if not self.label: + return "--Empty--" + else: + return self.label + + +class EmptyTasklet(Tasklet): + """ A special tasklet that contains no code. Used for filling empty states + in an SDFG. """ + + def __init__(self, label=""): + super(EmptyTasklet, self).__init__(label) + + def draw_node(self, sdfg, graph): + return dot.draw_node(sdfg, graph, self, style="invis", shape="octagon") + + def validate(self, sdfg, state): + pass + + +# ------------------------------------------------------------------------------ + + +@make_properties +class NestedSDFG(CodeNode): + """ An SDFG state node that contains an SDFG of its own, runnable using + the data dependencies specified using its connectors. + + It is encouraged to use nested SDFGs instead of coarse-grained tasklets + since they are analyzable with respect to transformations. + + @note: A nested SDFG cannot create recursion (one of its parent SDFGs). 
+ """ + + label = Property(dtype=str, desc="Name of the SDFG") + # NOTE: We cannot use SDFG as the type because of an import loop + sdfg = Property(dtype=graph.OrderedDiGraph, desc="The SDFG") + schedule = Property( + dtype=types.ScheduleType, + desc="SDFG schedule", + enum=types.ScheduleType, + from_string=lambda x: types.ScheduleType[x]) + location = Property(dtype=str, desc="SDFG execution location descriptor") + debuginfo = DebugInfoProperty() + is_collapsed = Property( + dtype=bool, + desc="Show this node/scope/state as collapsed", + default=False) + + def __init__(self, + label, + sdfg, + inputs: Set[str], + outputs: Set[str], + schedule=types.ScheduleType.Default, + location="-1", + debuginfo=None): + super(NestedSDFG, self).__init__(inputs, outputs) + + # Properties + self.label = label + self.sdfg = sdfg + self.schedule = schedule + self.location = location + self.debuginfo = debuginfo + + def draw_node(self, sdfg, graph): + return dot.draw_node(sdfg, graph, self, shape="doubleoctagon") + + def __str__(self): + if not self.label: + return "SDFG" + else: + return self.label + + def validate(self, sdfg, state): + if not data.validate_name(self.label): + raise NameError('Invalid nested SDFG name "%s"' % self.label) + for in_conn in self.in_connectors: + if not data.validate_name(in_conn): + raise NameError('Invalid input connector "%s"' % in_conn) + for out_conn in self.out_connectors: + if not data.validate_name(out_conn): + raise NameError('Invalid output connector "%s"' % out_conn) + + # Recursively validate nested SDFG + self.sdfg.validate() + + +# ------------------------------------------------------------------------------ + + +# Scope entry class +class EntryNode(Node): + """ A type of node that opens a scope (e.g., Map or Consume). """ + + def validate(self, sdfg, state): + self.map.validate(sdfg, state, self) + + +# ------------------------------------------------------------------------------ + + +# Scope exit class +class ExitNode(Node): + """ A type of node that closes a scope (e.g., Map or Consume). """ + + def validate(self, sdfg, state): + self.map.validate(sdfg, state, self) + + +# ------------------------------------------------------------------------------ + + +class MapEntry(EntryNode): + """ Node that opens a Map scope. + @see: Map + """ + + def __init__(self, map, dynamic_inputs=set()): + super(MapEntry, self).__init__(dynamic_inputs) + if map is None: + raise ValueError("Map for MapEntry can not be None.") + self._map = map + self._map_depth = 0 + + @property + def map(self): + return self._map + + @map.setter + def map(self, val): + self._map = val + + def draw_node(self, sdfg, graph): + if self.is_collapsed: + return dot.draw_node(sdfg, graph, self, shape="hexagon") + return dot.draw_node(sdfg, graph, self, shape="trapezium") + + def __str__(self): + return str(self.map) + + +class MapExit(ExitNode): + """ Node that closes a Map scope. 
+ @see: Map + """ + + def __init__(self, map): + super(MapExit, self).__init__() + if map is None: + raise ValueError("Map for MapExit can not be None.") + self._map = map + + @property + def map(self): + return self._map + + @map.setter + def map(self, val): + self._map = val + + def draw_node(self, sdfg, graph): + return dot.draw_node(sdfg, graph, self, shape="invtrapezium") + + def __str__(self): + return str(self.map) + + +@make_properties +class Map(object): + """ A Map is a two-node representation of parametric graphs, containing + an integer set by which the contents (nodes dominated by an entry + node and post-dominated by an exit node) are replicated. + + Maps contain a `schedule` property, which specifies how the scope + should be scheduled (execution order). Code generators can use the + schedule property to generate appropriate code, e.g., GPU kernels. + """ + from dace.codegen.instrumentation.perfsettings import PerfSettings + + # List of (editable) properties + label = Property(dtype=str, desc="Label of the map") + params = ParamsProperty(desc="Mapped parameters") + range = RangeProperty(desc="Ranges of map parameters") + # order = OrderProperty(desc="Order of map dimensions", unmapped=True) + schedule = Property( + dtype=types.ScheduleType, + desc="Map schedule", + enum=types.ScheduleType, + from_string=lambda x: types.ScheduleType[x]) + is_async = Property(dtype=bool, desc="Map asynchronous evaluation") + unroll = Property(dtype=bool, desc="Map unrolling") + flatten = Property(dtype=bool, desc="Map loop flattening") + fence_instrumentation = Property( + dtype=bool, desc="Disable instrumentation in all subnodes") + papi_counters = Property( + dtype=list, + desc="List of PAPI counter preset identifiers.", + default=PerfSettings.perf_default_papi_counters()) + debuginfo = DebugInfoProperty() + is_collapsed = Property( + dtype=bool, + desc="Show this node/scope/state as collapsed", + default=False) + + # We cannot have multiple consecutive papi start/stops inside the same thread. The following variable is used to recognize the map that started the counters. + _has_papi_counters = False + _can_be_supersection_start = True # We must have supersections synchronized. + + def __init__(self, + label, + params, + ndrange, + schedule=types.ScheduleType.Default, + unroll=False, + is_async=False, + flatten=False, + fence_instrumentation=False, + debuginfo=None): + super(Map, self).__init__() + + # Assign properties + self.label = label + self.schedule = schedule + self.unroll = unroll + self.is_async = is_async + self.flatten = flatten + self.params = params + self.range = ndrange + self.debuginfo = debuginfo + self._fence_instrumentation = fence_instrumentation + + def __str__(self): + return self.label + "[" + ", ".join([ + "{}={}".format(i, r) + for i, r in zip(self._params, + [sbs.Range.dim_to_string(d) for d in self._range]) + ]) + "]" + + def validate(self, sdfg, state, node): + if not data.validate_name(self.label): + raise NameError('Invalid map name "%s"' % self.label) + + def get_param_num(self): + """ Returns the number of map dimension parameters/symbols. """ + return len(self.params) + + +# Indirect Map properties to MapEntry and MapExit +MapEntry = indirect_properties(Map, lambda obj: obj.map)(MapEntry) +MapExit = indirect_properties(Map, lambda obj: obj.map)(MapExit) + +# ------------------------------------------------------------------------------ + + +class ConsumeEntry(EntryNode): + """ Node that opens a Consume scope. 
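A hedged sketch of building a `Map` (defined above) and its paired entry/exit nodes directly (assumes the `dace` package from this patch is importable; label and ranges are illustrative).

```python
from dace import subsets
from dace.graph.nodes import Map, MapEntry, MapExit

m = Map('compute', ['i', 'j'], subsets.Range([(0, 31, 1), (0, 31, 1)]))
map_entry, map_exit = MapEntry(m), MapExit(m)
print(m.get_param_num())  # 2
print(str(m))             # e.g. compute[i=0:31, j=0:31]
```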
+ @see: Consume + """ + + def __init__(self, consume, dynamic_inputs=set()): + super(ConsumeEntry, self).__init__(dynamic_inputs) + if consume is None: + raise ValueError("Consume for ConsumeEntry can not be None.") + self._consume = consume + self.add_in_connector('IN_stream') + self.add_out_connector('OUT_stream') + + @property + def map(self): + return self._consume.as_map() + + @property + def consume(self): + return self._consume + + @consume.setter + def consume(self, val): + self._consume = val + + def draw_node(self, sdfg, graph): + if self.is_collapsed: + return dot.draw_node( + sdfg, graph, self, shape="hexagon", style='dashed') + return dot.draw_node( + sdfg, graph, self, shape="trapezium", style='dashed') + + def __str__(self): + return str(self.consume) + + +class ConsumeExit(ExitNode): + """ Node that closes a Consume scope. + @see: Consume + """ + + def __init__(self, consume): + super(ConsumeExit, self).__init__() + if consume is None: + raise ValueError("Consume for ConsumeExit can not be None.") + self._consume = consume + + @property + def map(self): + return self._consume.as_map() + + @property + def consume(self): + return self._consume + + @consume.setter + def consume(self, val): + self._consume = val + + def draw_node(self, sdfg, graph): + return dot.draw_node( + sdfg, graph, self, shape="invtrapezium", style='dashed') + + def __str__(self): + return str(self.consume) + + +@make_properties +class Consume(object): + """ Consume is a scope, like `Map`, that is a part of the parametric + graph extension of the SDFG. It creates a producer-consumer + relationship between the input stream and the scope subgraph. The + subgraph is scheduled to a given number of processing elements + for processing, and they will try to pop elements from the input + stream until a given quiescence condition is reached. """ + + # Properties + label = Property(dtype=str, desc="Name of the consume node") + pe_index = Property(dtype=str, desc="Processing element identifier") + num_pes = SymbolicProperty(desc="Number of processing elements") + condition = CodeProperty(desc="Quiescence condition", allow_none=True) + language = Property(enum=types.Language, default=types.Language.Python) + schedule = Property( + dtype=types.ScheduleType, + desc="Consume schedule", + enum=types.ScheduleType, + from_string=lambda x: types.ScheduleType[x]) + chunksize = Property( + dtype=int, + desc="Maximal size of elements to consume at a time", + default=1) + debuginfo = DebugInfoProperty() + is_collapsed = Property( + dtype=bool, + desc="Show this node/scope/state as collapsed", + default=False) + + def as_map(self): + """ Compatibility function that allows to view the consume as a map, + mainly in memlet propagation. 
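A hedged sketch of the producer/consumer scope described above, with four processing elements and no quiescence condition (the constructor signature follows the `__init__` below; all names are illustrative).

```python
from dace.graph.nodes import Consume, ConsumeEntry, ConsumeExit

c = Consume('worker', ('p', 4), condition=None)  # pe_tuple = (pe_index, num_pes)
centry, cexit = ConsumeEntry(c), ConsumeExit(c)
print(c.get_param_num())                    # 1
print('IN_stream' in centry.in_connectors)  # True: added by ConsumeEntry
```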
""" + return Map(self.label, [self.pe_index], + sbs.Range([(0, self.num_pes - 1, 1)]), self.schedule) + + def __init__(self, + label, + pe_tuple, + condition, + schedule=types.ScheduleType.Default, + chunksize=1, + debuginfo=None): + super(Consume, self).__init__() + + # Properties + self.label = label + self.pe_index, self.num_pes = pe_tuple + self.condition = condition + self.schedule = schedule + self.chunksize = chunksize + self.debuginfo = debuginfo + + def __str__(self): + if self.condition is not None: + return ("%s [%s=0:%s], Condition: %s" % + (self._label, self.pe_index, self.num_pes, + CodeProperty.to_string(self.condition))) + else: + return ( + "%s [%s=0:%s]" % (self._label, self.pe_index, self.num_pes)) + + def validate(self, sdfg, state, node): + if not data.validate_name(self.label): + raise NameError('Invalid consume name "%s"' % self.label) + + def get_param_num(self): + """ Returns the number of consume dimension parameters/symbols. """ + return 1 + + +# Redirect Consume properties to ConsumeEntry and ConsumeExit +ConsumeEntry = indirect_properties(Consume, + lambda obj: obj.consume)(ConsumeEntry) +ConsumeExit = indirect_properties(Consume, + lambda obj: obj.consume)(ConsumeExit) + +# ------------------------------------------------------------------------------ + + +@make_properties +class Reduce(Node): + """ An SDFG node that reduces an N-dimensional array to an + (N-k)-dimensional array, with a list of axes to reduce and + a reduction binary function. """ + from dace.codegen.instrumentation.perfsettings import PerfSettings + + # Properties + axes = Property(dtype=tuple, allow_none=True) + wcr = LambdaProperty() + identity = Property(dtype=object, allow_none=True) + schedule = Property( + dtype=types.ScheduleType, + desc="Reduction execution policy", + enum=types.ScheduleType, + from_string=lambda x: types.ScheduleType[x]) + + papi_counters = Property( + dtype=list, + desc="List of PAPI counter preset identifiers.", + default=PerfSettings.perf_default_papi_counters()) + debuginfo = DebugInfoProperty() + + def __init__(self, + wcr, + axes, + wcr_identity=None, + schedule=types.ScheduleType.Default, + debuginfo=None): + super(Reduce, self).__init__() + self.wcr = wcr # type: ast._Lambda + self.axes = axes + self.identity = wcr_identity + self.schedule = schedule + self.debuginfo = debuginfo + + def draw_node(self, sdfg, state): + return dot.draw_node(sdfg, state, self, shape="invtriangle") + + def __str__(self): + # Autodetect reduction type + redtype = detect_reduction_type(self.wcr) + if redtype == types.ReductionType.Custom: + wcrstr = unparse(ast.parse(self.wcr).body[0].value.body) + else: + wcrstr = str(redtype) + wcrstr = wcrstr[wcrstr.find('.') + 1:] # Skip "ReductionType." + + return 'Op: {op}, Axes: {axes}'.format( + axes=('all' if self.axes is None else str(self.axes)), op=wcrstr) + + def __label__(self, sdfg, state): + # Autodetect reduction type + redtype = detect_reduction_type(self.wcr) + if redtype == types.ReductionType.Custom: + wcrstr = unparse(ast.parse(self.wcr).body[0].value.body) + else: + wcrstr = str(redtype) + wcrstr = wcrstr[wcrstr.find('.') + 1:] # Skip "ReductionType." 
+ + return 'Op: {op}\nAxes: {axes}'.format( + axes=('all' if self.axes is None else str(self.axes)), op=wcrstr) diff --git a/dace/graph/nxutil.py b/dace/graph/nxutil.py new file mode 100644 index 0000000000..feab768075 --- /dev/null +++ b/dace/graph/nxutil.py @@ -0,0 +1,668 @@ +from ast import Subscript +from collections import deque +import copy +import itertools +import re +import os +from typing import Callable, List, Union +from string import ascii_uppercase +import networkx as nx + +import dace +from dace import sdfg, types, symbolic +from dace.config import Config +from dace.graph import nodes, graph as gr + +params = List[dace.symbolic.symbol] +ranges = List[Union[dace.subsets.Range, dace.subsets.Indices]] + + +class CannotExpand(Exception): + pass + + +def node_path_graph(*args): + """ Generates a path graph passing through the input nodes. + + The function generates a graph using as nodes the input arguments. + Subsequently, it creates a path passing through all the nodes, in + the same order as they were given in the function input. + + @param *args: Variable number of nodes or a list of nodes. + @return: A directed graph based on the input arguments. + @rtype: gr.OrderedDiGraph + """ + + # 1. Create new networkx directed graph. + path = gr.OrderedDiGraph() + # 2. Place input nodes in a list. + if len(args) == 1 and isinstance(args[0], list): + # Input is a single list of nodes. + input_nodes = args[0] + else: + # Input is a variable number of nodes. + input_nodes = list(args) + # 3. Add nodes to the graph. + path.add_nodes_from(input_nodes) + # 4. Add path edges to the graph. + for i in range(len(input_nodes) - 1): + path.add_edge(input_nodes[i], input_nodes[i + 1], None) + # 5. Return the graph. + return path + + +def depth_limited_search(source, depth): + """ Return best node and its value using a limited-depth Search (depth- + limited DFS). """ + value = source.evaluate() + if depth == 0: + return source, value + + candidate = source + candidate_value = value + + # Node, depth, children generator + stack = [(source, 0, source.children_iter())] + while stack: + node, cur_depth, children = stack[-1] + try: + child = next(children) + child_val = child.evaluate() + # Check for best candidate + if child_val > candidate_value: + candidate = child + candidate_value = child_val + + if cur_depth < depth - 1: + stack.append((child, cur_depth + 1, child.children_iter())) + except StopIteration: + stack.pop() + + # Return maximal candidate + return candidate, candidate_value + + +def depth_limited_dfs_iter(source, depth): + """ Produce nodes in a Depth-Limited DFS. """ + if depth == 0: + yield source + return + + # Node, depth, children generator + stack = [(source, 0, source.children_iter())] + while stack: + node, cur_depth, children = stack[-1] + try: + child = next(children) + yield child + + if cur_depth < depth - 1: + stack.append((child, cur_depth + 1, child.children_iter())) + except StopIteration: + stack.pop() + + +def dfs_topological_sort(G, sources=None, parent=False, condition=None): + """ Produce nodes in a depth-first topological ordering. + + The function produces nodes in a depth-first topological ordering + (DFS to make sure maps are visited properly), with the condition + that each node visited had all its predecessors visited. Applies + for DAGs only. + + @param G: An input DiGraph (assumed acyclic). + @param sources: (optional) node or list of nodes that + specify starting point(s) for depth-first search and return + edges in the component reachable from source. 
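A hedged usage sketch of `node_path_graph` defined above; any hashable objects can serve as nodes, and a single list argument works as well (assumes the `dace` package from this patch is importable).

```python
from dace.graph.nxutil import node_path_graph

path = node_path_graph('a', 'b', 'c')  # a -> b -> c
print(list(path.nodes()))              # ['a', 'b', 'c']
print(len(list(path.edges())))         # 2
```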
+ @return: A generator of edges in the lastvisit depth-first-search. + + @note: Based on http://www.ics.uci.edu/~eppstein/PADS/DFS.py + by D. Eppstein, July 2004. + + @note: If a source is not specified then a source is chosen arbitrarily and + repeatedly until all components in the graph are searched. + + """ + if sources is None: + # produce edges for all components + nodes = G + else: + # produce edges for components with source + try: + nodes = iter(sources) + except TypeError: + nodes = [sources] + + visited = set() + for start in nodes: + if start in visited: + continue + yield start + visited.add(start) + stack = [(start, iter(G.neighbors(start)))] + while stack: + parent, children = stack[-1] + try: + child = next(children) + if child not in visited: + # Make sure that all predecessors have been visited + skip = False + for pred in G.predecessors(child): + if pred not in visited: + skip = True + break + if skip: + continue + + visited.add(child) + if condition is None or condition(parent, child): + yield child + stack.append((child, iter(G.neighbors(child)))) + except StopIteration: + stack.pop() + + +def dfs_conditional(G, source, condition, reversed=False): + """ Traverse a graph (DFS) only through edges that match a condition. """ + if isinstance(source, list): nodes = source + else: nodes = [source] + + def in_edges_reversed(graph): + def _in_edges_reversed(node): + for e in graph.in_edges(node): + ecpy = copy.copy(e) + ecpy.reverse() + yield ecpy + + return _in_edges_reversed + + get_children = G.out_edges if not reversed else in_edges_reversed(G) + + visited = set() + for start in nodes: + if start in visited: + continue + visited.add(start) + stack = [(start, get_children(start).__iter__())] + while stack: + parent, children = stack[-1] + try: + e = next(children) + if e.dst not in visited: + visited.add(e.dst) + if condition(e.src, e.dst, e.data): + yield e + stack.append((e.dst, get_children(e.dst).__iter__())) + except StopIteration: + stack.pop() + + +def bfs_conditional(G, source, condition): + """ Traverse a graph (BFS) only through edges that match a condition. """ + + visited = set([source]) + queue = deque([(source, G.out_edges(source).__iter__())]) + while queue: + parent, children = queue[0] + try: + e = next(children) + if e.dst not in visited: + visited.add(e.dst) + if condition(e.src, e.dst, e.data): + yield e + queue.append((e.dst, G.out_edges(child).__iter__())) + except StopIteration: + queue.popleft() + + +def traverse_sdfg_scope(G, source, yield_edges=True): + """ Traverse an SDFG scope (nodes dominated by a ScopeEntry and + post-dominated by a ScopeExit). + @param G: Input graph (assumed SDFGState). + @param source: Source node. + @param yield_edges: If True, returned generator yields edges + as well as nodes. + @return: A generator that iterates over the scope nodes (or edges). 
+ """ + + if not isinstance(source, nodes.EntryNode): + raise SyntaxError('Source should be an entry node') + + visited = set() + visited.add(source) + + if yield_edges: + for e in G.out_edges(source): + yield tuple(e) + (1, ) + else: + yield source, 1 + + stack = [(1, source, iter(G.out_edges(source)))] + while stack: + scope, parent, children = stack[-1] + try: + e = next(children) + child = e.dst + if child not in visited: + # Make sure that all predecessors have been visited + skip = False + for pred in G.predecessors(child): + if pred not in visited: + skip = True + break + if skip: + continue + + if yield_edges: + if not (isinstance(child, nodes.ExitNode) and scope == 1): + for e in G.out_edges(child): + yield tuple(e) + (scope, ) + else: + yield child, scope + + visited.add(child) + if isinstance(child, nodes.EntryNode): + stack.append((scope + 1, child, iter(G.out_edges(child)))) + elif isinstance(child, nodes.ExitNode): + if scope > 1: # Don't traverse beyond scope + stack.append((scope - 1, child, iter( + G.out_edges(child)))) + else: + stack.append((scope, child, iter(G.out_edges(child)))) + except StopIteration: + stack.pop() + + +def gen_label(prefix=""): + """ Generates a label as A,B,C,...,Z,AA,AB,... """ + indices = [0] + while True: + label = "".join([ascii_uppercase[i] for i in indices]) + yield prefix + label + indices[0] += 1 + for pos, val in enumerate(indices): + if val == len(ascii_uppercase): + indices[pos] = 0 + if len(indices) == pos + 1: + indices.append(1) + else: + indices[pos + 1] += 1 + + +def indstr(x): + try: + return int(x) + except TypeError: # int() argument must be a string, a bytes-like object or a number, not [X] + return str(x) + + +def range_to_str(ranges, limit_length=50): + """ Converts one or multiple range tuples to a string. """ + + try: + len(ranges[0]) + except TypeError: + ranges = [ranges] + + def convert_index(r): + if len(r) == 3: + if r[2] != 1: + return "{}:{}:{}".format( + symbolic.symstr(r[0]), symbolic.symstr(r[1]), + symbolic.symstr(r[2])) + else: + return "{}:{}".format( + symbolic.symstr(r[0]), symbolic.symstr(r[1])) + else: + raise ValueError("Unsupported range: " + str(r)) + + s = [] + for r in ranges: + s.append(convert_index(r)) + + res = ', '.join(s) + if limit_length is not None: + if not Config.get_bool('renderer', 'fulledges') and \ + len(res) > limit_length: + res = '...' + + return "[" + res + "]" + + +def str_to_range(rangeStr): + """ Converts a range string into a range tuple. """ + if rangeStr[0] != "[" or rangeStr[-1] != "]": + raise ValueError("Invalid range " + rangeStr) + rangeStr = re.sub("[\[\] ]", "", rangeStr) + dimensions = rangeStr.split(",") + ranges = [None] * len(dimensions) + for i, r in enumerate(dimensions): + entries = r.split(":") + numArgs = len(entries) + if numArgs < 2 or numArgs > 3: + raise ValueError( + "Range string should contain one or two separators (received " + + str(r) + ")") + iMin = None + iMax = None + step = None + if entries[0]: + iMin = entries[0] + if entries[1]: + iMax = entries[1] + if numArgs == 3: + if not entries[2]: + raise ValueError("Stride for range cannot be empty") + step = entries[2] + ranges[i] = (iMin, iMax, step) + return ranges + + +def make_list(val): + """ If a scalar or string is passed make it a list, otherwise do nothing. + """ + try: + len(val) + if not isinstance(val, str): + return val + except TypeError: + pass + return [val] + + +def make_2d(ranges): + """ If a 1D list is passed, make it 2D, otherwise do nothing. 
""" + if isinstance(ranges, Subscript): + return [ranges] + firstElem = ranges[0] + try: + if isinstance(firstElem, Subscript): + return ranges + len(firstElem) + if not isinstance(firstElem, str): + return ranges + except TypeError: + pass + return [ranges] + + +def label_of(obj): + """ Fetches the label of an object, or generates one if it doesn't exist. + """ + try: + return obj.label + except AttributeError: + try: + return obj.name + except AttributeError: + try: + return next(type(obj)._nameGen) + except AttributeError: + type(obj)._nameGen = gen_label(type(obj).__name__ + " ") + obj.label = next(type(obj)._nameGen) + return obj.label + + +def fullrange(ndslice, var_size): + """ Returns True iff the ND-slice represents the full array size. """ + for dim, (b, e, s) in zip(var_size, ndslice): + if b != 0 or e != symbolic.pystr_to_symbolic( + types.symbol_name_or_value(dim)) or s != 1: + return False + return True + + +def change_edge_dest( + graph: dace.graph.graph.OrderedDiGraph, + node_a: Union[dace.graph.nodes.Node, + dace.graph.graph.OrderedMultiDiConnectorGraph], + node_b: Union[dace.graph.nodes.Node, + dace.graph.graph.OrderedMultiDiConnectorGraph]): + """ Changes the destination of edges from node A to node B. + + The function finds all edges in the graph that have node A as their + destination. It then creates a new edge for each one found, + using the same source nodes and data, but node B as the destination. + Afterwards, it deletes the edges found and inserts the new ones into + the graph. + + @param graph: The graph upon which the edge transformations will be + applied. + @param node_a: The original destination of the edges. + @param node_b: The new destination of the edges to be transformed. + """ + + # Create new incoming edges to node B, by copying the incoming edges to + # node A and setting their destination to node B. + edges = list(graph.in_edges(node_a)) + for e in edges: + # Delete the incoming edges to node A from the graph. + graph.remove_edge(e) + # Insert the new edges to the graph. + if isinstance(e, gr.MultiConnectorEdge): + # dst_conn = e.dst_conn + # if e.dst_conn is not None: + # # Remove connector from node A. + # node_a.remove_in_connector(e.dst_conn) + # # Insert connector to node B. + # if (not node_b.add_in_connector(dst_conn) and isinstance( + # node_b, (dace.graph.nodes.CodeNode, + # dace.graph.nodes.MapEntry))): + # while not node_b.add_in_connector(dst_conn): + # dst_conn = dst_conn + '_' + # graph.add_edge(e.src, e.src_conn, node_b, dst_conn, e.data) + graph.add_edge(e.src, e.src_conn, node_b, e.dst_conn, e.data) + else: + graph.add_edge(e.src, node_b, e.data) + + +def change_edge_src( + graph: dace.graph.graph.OrderedDiGraph, + node_a: Union[dace.graph.nodes.Node, + dace.graph.graph.OrderedMultiDiConnectorGraph], + node_b: Union[dace.graph.nodes.Node, + dace.graph.graph.OrderedMultiDiConnectorGraph]): + """ Changes the sources of edges from node A to node B. + + The function finds all edges in the graph that have node A as their + source. It then creates a new edge for each one found, using the same + destination nodes and data, but node B as the source. Afterwards, it + deletes the edges + found and inserts the new ones into the graph. + + @param graph: The graph upon which the edge transformations will be + applied. + @param node_a: The original source of the edges to be transformed. + @param node_b: The new source of the edges to be transformed. 
+ """ + + # Create new outgoing edges from node B, by copying the outgoing edges from + # node A and setting their source to node B. + edges = list(graph.out_edges(node_a)) + for e in edges: + # Delete the outgoing edges from node A from the graph. + graph.remove_edge(e) + # Insert the new edges to the graph. + if isinstance(e, gr.MultiConnectorEdge): + # src_conn = e.src_conn + # if e.src_conn is not None: + # # Remove connector from node A. + # node_a.remove_out_connector(e.src_conn) + # # Insert connector to node B. + # if (not node_b.add_out_connector(src_conn) and isinstance( + # node_b, (dace.graph.nodes.CodeNode, + # dace.graph.nodes.MapExit))): + # while not node_b.add_out_connector(src_conn): + # src_conn = src_conn + '_' + # graph.add_edge(node_b, src_conn, e.dst, e.dst_conn, e.data) + graph.add_edge(node_b, e.src_conn, e.dst, e.dst_conn, e.data) + else: + graph.add_edge(node_b, e.dst, e.data) + + +def find_source_nodes(graph): + """ Finds the source nodes of a graph. + + The function finds the source nodes of a graph, i.e. the nodes with + zero in-degree. + + @param graph: The graph whose source nodes are being searched for. + @return: A list of the source nodes found. + """ + return [n for n in graph.nodes() if graph.in_degree(n) == 0] + + +def find_sink_nodes(graph): + """ Finds the sink nodes of a graph. + + The function finds the sink nodes of a graph, i.e. the nodes with zero out-degree. + + @param graph: The graph whose sink nodes are being searched for. + @return: A list of the sink nodes found. + """ + return [n for n in graph.nodes() if graph.out_degree(n) == 0] + + +def replace_subgraph(graph: dace.graph.graph.OrderedDiGraph, + old: dace.graph.graph.OrderedDiGraph, + new: dace.graph.graph.OrderedDiGraph): + """ Replaces a subgraph of a graph with a new one. If replacement is not + possible, it returns False. + + The function replaces the 'old' subgraph of the input graph with the + 'new' subgraph. Both the 'old' and the 'new' subgraphs must have + unique source and sink nodes. Graph edges incoming to the source of + the 'old' subgraph have their destination changed to the source of + the 'new subgraph. Likewise, graph edges outgoing from the sink of + the 'old subgraph have their source changed to the sink of the 'new' + subgraph. + + @param graph: The graph upon which the replacement will be applied. + @param old: The subgraph to be replaced. + @param new: The replacement subgraph. + + @return: True if the replacement succeeded, otherwise False. + """ + + # 1. Find the source node of 'old' subgraph. + # 1.1. Retrieve the source nodes of the 'old' subgraph. + old_source_nodes = find_source_nodes(old) + # 1.2. Verify the existence of a unique source in the 'old' subgraph. + if len(old_source_nodes) != 1: + return False + old_source = old_source_nodes[0] + + # 2. Find the sink node of the 'old' subgraph. + # 2.1. Retrieve the sink nodes of the 'old' subgraph. + old_sink_nodes = find_sink_nodes(old) + # 2.2. Verify the existence of a unique sink in the 'old' subgraph. + if len(old_sink_nodes) != 1: + return False + old_sink = old_sink_nodes[0] + + # 3. Find the source node of 'new' subgraph. + # 3.1. Retrieve the source nodes of the 'new' subgraph. + new_source_nodes = find_source_nodes(new) + # 3.2. Verify the existence of a unique source in the 'new' subgraph. + if len(new_source_nodes) != 1: + return False + new_source = new_source_nodes[0] + + # 4. Find the sink node of the 'new' subgraph. + # 4.1. Retrieve the sink nodes of the 'new' subgraph. 
+ new_sink_nodes = find_sink_nodes(new) + # 4.2. Verify the existence of a unique sink in the 'new' subgraph. + if len(new_sink_nodes) != 1: + return False + new_sink = new_sink_nodes[0] + + # 5. Add the 'new' subgraph to the graph. + # 5.1. Add the nodes of the 'new' subgraph to the graph. + graph.add_nodes_from(new.nodes()) + # 5.2. Add the edges of the 'new' subgraph to the graph. + for e in new.edges(): + graph.add_edge(*e) + + # 6. Create new incoming edges to the source of the 'new' subgraph. + change_edge_dest(graph, old_source, new_source) + + # 7. Create new outgoing edges from the sink of the 'new' subgraph. + change_edge_src(graph, old_sink, new_sink) + + # 8. Remove all nodes of the 'old' subgraph from the graph. + graph.remove_nodes_from(old.nodes()) + + # 10. Subgraph replacement has succeeded. Return true. + return True + + +def merge_maps(graph: dace.graph.graph.OrderedMultiDiConnectorGraph, + outer_map_entry: dace.graph.nodes.MapEntry, + outer_map_exit: dace.graph.nodes.MapExit, + inner_map_entry: dace.graph.nodes.MapEntry, + inner_map_exit: dace.graph.nodes.MapExit, + param_merge: Callable[[params, params], + params] = lambda p1, p2: p1 + p2, + range_merge: Callable[[ + ranges, ranges + ], ranges] = lambda r1, r2: type(r1)(r1.ranges + r2.ranges) + ) -> (dace.graph.nodes.MapEntry, dace.graph.nodes.MapExit): + """ Merges two maps (their entries and exits). It is assumed that the + operation is valid. """ + + outer_map = outer_map_entry.map + inner_map = inner_map_entry.map + + # Create merged map by inheriting attributes from outer map and using + # the merge functions for parameters and ranges. + merged_map = dace.graph.nodes.Map( + label='_merged_' + outer_map.label + '_' + inner_map.label, + params=param_merge(outer_map.params, inner_map.params), + ndrange=range_merge(outer_map.range, inner_map.range), + schedule=outer_map.schedule, + unroll=outer_map.unroll, + is_async=outer_map.is_async, + flatten=outer_map.flatten, + debuginfo=outer_map.debuginfo) + + merged_entry = dace.graph.nodes.MapEntry(merged_map) + merged_entry.in_connectors = outer_map_entry.in_connectors + merged_entry.out_connectors = outer_map_entry.out_connectors + + merged_exit = dace.graph.nodes.MapExit(merged_map) + merged_exit.in_connectors = outer_map_exit.in_connectors + merged_exit.out_connectors = outer_map_exit.out_connectors + + graph.add_nodes_from([merged_entry, merged_exit]) + + # Redirect inner in edges. + inner_in_edges = graph.out_edges(inner_map_entry) + for edge in graph.edges_between(outer_map_entry, inner_map_entry): + in_conn_num = edge.dst_conn[3:] + out_conn = 'OUT_' + in_conn_num + inner_edge = [e for e in inner_in_edges if e.src_conn == out_conn][0] + graph.remove_edge(edge) + graph.remove_edge(inner_edge) + graph.add_edge(merged_entry, edge.src_conn, inner_edge.dst, + inner_edge.dst_conn, inner_edge.data) + + # Redirect inner out edges. + inner_out_edges = graph.in_edges(inner_map_exit) + for edge in graph.edges_between(inner_map_exit, outer_map_exit): + out_conn_num = edge.src_conn[4:] + in_conn = 'IN_' + out_conn_num + inner_edge = [e for e in inner_out_edges if e.dst_conn == in_conn][0] + graph.remove_edge(edge) + graph.remove_edge(inner_edge) + graph.add_edge(inner_edge.src, inner_edge.src_conn, merged_exit, + edge.dst_conn, inner_edge.data) + + # Redirect outer edges. 
+ change_edge_dest(graph, outer_map_entry, merged_entry) + change_edge_src(graph, outer_map_exit, merged_exit) + + # Clean-up + graph.remove_nodes_from( + [outer_map_entry, outer_map_exit, inner_map_entry, inner_map_exit]) + + return merged_entry, merged_exit diff --git a/dace/memlet.py b/dace/memlet.py new file mode 100644 index 0000000000..3950e465d9 --- /dev/null +++ b/dace/memlet.py @@ -0,0 +1,278 @@ +import ast +from functools import reduce +import operator +import copy as cp + +import dace +from dace import data as dt, subsets, symbolic, types +from dace.frontend.operations import detect_reduction_type +from dace.frontend.python.astutils import unparse +from dace.properties import ( + Property, make_properties, DataProperty, ShapeProperty, SubsetProperty, + SymbolicProperty, TypeClassProperty, DebugInfoProperty, LambdaProperty) + + +@make_properties +class Memlet(object): + """ Data movement object. Represents the data, the subset moved, and the + manner it is reindexed (`other_subset`) into the destination. + If there are multiple conflicting writes, this object also specifies + how they are resolved with a lambda function. + """ + + # Properties + veclen = Property(dtype=int, desc="Vector length") + num_accesses = SymbolicProperty() + subset = SubsetProperty() + other_subset = SubsetProperty(allow_none=True) + data = DataProperty() + debuginfo = DebugInfoProperty() + wcr = LambdaProperty(allow_none=True) + wcr_identity = Property(dtype=object, default=None, allow_none=True) + wcr_conflict = Property(dtype=bool, default=True) + + def __init__(self, + data, + num_accesses, + subset, + vector_length, + wcr=None, + wcr_identity=None, + other_subset=None, + debuginfo=None, + wcr_conflict=True): + """ Constructs a Memlet. + @param data: The data object or name to access. B{Note:} this + parameter will soon be deprecated. + @type data: Either a string of the data descriptor name or an + AccessNode. + @param num_accesses: The number of times that the moved data + will be subsequently accessed. If + `dace.types.DYNAMIC` (-1), + designates that the number of accesses is + unknown at compile time. + @param subset: The subset of `data` that is going to be accessed. + @param vector_length: The length of a single unit of access to + the data (used for vectorization + optimizations). + @param wcr: A lambda function specifying how write-conflicts + are resolved. The syntax of the lambda function receives two elements: `current` value and `new` value, + and returns the value after resolution. For example, + summation is `lambda cur, new: cur + new`. + @param wcr_identity: Identity value used for the first write + conflict. B{Note:} this parameter will soon + be deprecated. + @param other_subset: The reindexing of `subset` on the other + connected data. + @param debuginfo: Source-code information (e.g., line, file) + used for debugging. + @param wcr_conflict: If False, forces non-locked conflict + resolution when generating code. The default + is to let the code generator infer this + information from the SDFG. 
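+
+            Example (a minimal sketch; assumes the enclosing SDFG defines an
+            array named 'A' with at least four elements):
+
+                # Four accesses to A[0:4], vector length 1 (unvectorized).
+                m = Memlet('A', 4, subsets.Range.from_string('0:4'), 1)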
+ """ + + # Properties + self.num_accesses = num_accesses # type: sympy math + self.subset = subset # type: subsets.Subset + self.veclen = vector_length # type: int (in elements, default 1) + if hasattr(data, 'data'): + data = data.data + self.data = data # type: str + + # Annotates memlet with _how_ writing is performed in case of conflict + self.wcr = wcr + self.wcr_identity = wcr_identity + self.wcr_conflict = wcr_conflict + + # The subset of the other endpoint we are copying from/to (note: + # carries the dimensionality of the other endpoint too!) + self.other_subset = other_subset + + self.debuginfo = debuginfo + + def toJSON(self, indent=0): + json = " " * indent + "{\n" + indent += 2 + json += " " * indent + "\"type\" : \"" + type(self).__name__ + "\",\n" + json += " " * indent + "\"label\" : \"" + str(self) + "\"\n" + indent -= 2 + json += " " * indent + "}\n" + return json + + @staticmethod + def simple(data, + subset_str, + veclen=1, + wcr_str=None, + wcr_identity=None, + other_subset_str=None, + wcr_conflict=True, + num_accesses=None, + debuginfo=None): + """ Constructs a Memlet from string-based expressions. + @param data: The data object or name to access. B{Note:} this + parameter will soon be deprecated. + @type data: Either a string of the data descriptor name or an + AccessNode. + @param subset_str: The subset of `data` that is going to + be accessed in string format. Example: '0:N'. + @param veclen: The length of a single unit of access to + the data (used for vectorization optimizations). + @param wcr_str: A lambda function (as a string) specifying + how write-conflicts are resolved. The syntax + of the lambda function receives two elements: + `current` value and `new` value, + and returns the value after resolution. For + example, summation is + `'lambda cur, new: cur + new'`. + @param wcr_identity: Identity value used for the first write + conflict. B{Note:} this parameter will soon + be deprecated. + @param other_subset_str: The reindexing of `subset` on the other + connected data (as a string). + @param wcr_conflict: If False, forces non-locked conflict + resolution when generating code. The default + is to let the code generator infer this + information from the SDFG. + @param num_accesses: The number of times that the moved data + will be subsequently accessed. If + `dace.types.DYNAMIC` (-1), + designates that the number of accesses is + unknown at compile time. + @param debuginfo: Source-code information (e.g., line, file) + used for debugging. + + """ + subset = SubsetProperty.from_string(subset_str) + if num_accesses is not None: + na = num_accesses + else: + na = subset.num_elements() + + if wcr_str is not None: + wcr = LambdaProperty.from_string(wcr_str) + else: + wcr = None + + if other_subset_str is not None: + other_subset = SubsetProperty.from_string(other_subset_str) + else: + other_subset = None + + # If it is an access node or another memlet + if hasattr(data, 'data'): + data = data.data + + return Memlet( + data, + na, + subset, + veclen, + wcr=wcr, + wcr_identity=wcr_identity, + other_subset=other_subset, + wcr_conflict=wcr_conflict, + debuginfo=debuginfo) + + @staticmethod + def from_array(dataname, datadesc): + """ Constructs a Memlet that transfers an entire array's contents. + @param dataname: The name of the data descriptor in the SDFG. + @param datadesc: The data descriptor object. + @type datadesc: Data. 
+ """ + range = subsets.Range.from_array(datadesc) + return Memlet(dataname, range.num_elements(), range, 1) + + def __hash__(self): + return hash((self.data, self.num_accesses, self.subset, self.veclen, + str(self.wcr), self.wcr_identity, self.other_subset)) + + def __eq__(self, other): + return all([ + self.data == other.data, self.num_accesses == other.num_accesses, + self.subset == other.subset, self.veclen == other.veclen, + self.wcr == other.wcr, self.wcr_identity == other.wcr_identity, + self.other_subset == other.other_subset + ]) + + def num_elements(self): + """ Returns the number of elements in the Memlet subset. """ + return self.subset.num_elements() + + def bounding_box_size(self): + """ Returns a per-dimension upper bound on the maximum number of + elements in each dimension. + + This bound will be tight in the case of Range. + """ + return self.subset.bounding_box_size() + + def validate(self, sdfg, state): + if self.data not in sdfg.arrays: + raise KeyError('Array "%s" not found in SDFG' % self.data) + + def __label__(self, sdfg, state): + """ Returns a string representation of the memlet for display in a + graph. + + @param sdfg: The SDFG in which the memlet resides. + @param state: An SDFGState object in which the memlet resides. + """ + if self.data is None: + return self._label(None) + return self._label(sdfg.arrays[self.data].shape) + + def __str__(self): + return self._label(None) + + def _label(self, shape): + result = '' + if self.data is not None: + result = self.data + + if self.subset is None: + return result + + num_elements = self.subset.num_elements() + if self.num_accesses != num_elements: + result += '(%s) ' % str(self.num_accesses) + arrayNotation = True + try: + if shape is not None and reduce(operator.mul, shape, 1) == 1: + # Don't draw array if we're accessing a single element + arrayNotation = False + except TypeError: + # Will fail if trying to check the truth value of a sympy expr + pass + if arrayNotation: + result += '[%s]' % str(self.subset) + if self.wcr is not None and str(self.wcr) != '': + # Autodetect reduction type + redtype = detect_reduction_type(self.wcr) + if redtype == types.ReductionType.Custom: + wcrstr = unparse(ast.parse(self.wcr).body[0].value.body) + else: + wcrstr = str(redtype) + wcrstr = wcrstr[wcrstr.find('.') + 1:] # Skip "ReductionType." + + result += ' (CR: %s' % wcrstr + if self.wcr_identity is not None: + result += ', id: %s' % str(self.wcr_identity) + result += ')' + + if self.other_subset is not None: + result += ' -> [%s]' % str(self.other_subset) + return result + + def __repr__(self): + return "Memlet (" + self.__str__() + ")" + + +class EmptyMemlet(Memlet): + """ A memlet without data. Primarily used for connecting nodes to scopes + without transferring data to them. 
""" + + def __init__(self): + super(EmptyMemlet, self).__init__(None, 0, None, 1) diff --git a/dace/properties.py b/dace/properties.py new file mode 100644 index 0000000000..9dd11432fe --- /dev/null +++ b/dace/properties.py @@ -0,0 +1,846 @@ +import ast +import astunparse +from collections import OrderedDict +import copy +from dace.frontend.python.astutils import unparse +import itertools +import pydoc +import re +import sympy as sp +import numpy as np +import dace.subsets as sbs +import dace +from dace.symbolic import pystr_to_symbolic +from dace.types import DebugInfo + +############################################################################### +# External interface to guarantee correct usage +############################################################################### + + +def set_property_from_string(prop, obj, string, sdfg=None): + """ Interface function that guarantees that a property will always be + correctly set, if possible, by accepting all possible input arguments to + from_string. """ + + # If the property is a string (property name), obtain it from the object + if isinstance(prop, str): + prop = type(obj).__properties__[prop] + + if isinstance(prop, CodeProperty): + val = prop.from_string(string, obj.language) + elif isinstance(prop, (ReferenceProperty, DataProperty)): + if sdfg is None: + raise ValueError( + "You cannot pass sdfg=None when editing a ReferenceProperty!") + val = prop.from_string(string, sdfg) + else: + val = prop.from_string(string) + setattr(obj, prop.attr_name, val) + + +############################################################################### +# Property base implementation +############################################################################### + + +class PropertyError(Exception): + """Exception type for errors related to internal functionality of + these properties.""" + pass + + +class Property: + """ Class implementing properties of DaCe objects that conform to strong + typing, and allow conversion to and from strings to be edited. 
""" + + def __init__( + self, + getter=None, + setter=None, + dtype=None, + default=None, + from_string=None, + to_string=None, + enum=None, # Values must be present in this enum + unmapped=False, # Don't enforce 1:1 mapping with a member variable + allow_none=False, + indirected=False, # This property belongs to a different class + desc=""): + + self._getter = getter + self._setter = setter + self._dtype = dtype + self._default = default + if from_string is not None: + self._from_string = from_string + elif enum is not None: + self._from_string = lambda s: enum[s] + else: + self._from_string = self.dtype + if to_string is not None: + self._to_string = to_string + elif enum is not None: + self._to_string = lambda val: val._name_ + else: + self._to_string = str + self._enum = enum + self._unmapped = unmapped + self._allow_none = allow_none + self._indirected = indirected + self._desc = desc + + def __get__(self, obj, objtype=None): + if obj is None: + # Called on the class rather than an instance, so return the + # property object itself + return self + # If a custom getter is specified, use it + if self.getter: + return self.getter(obj) + if not hasattr(self, "attr_name"): + raise RuntimeError("Attribute name not set") + # Otherwise look for attribute prefixed by "_" + return getattr(obj, "_" + self.attr_name) + + def __set__(self, obj, val): + # If custom setter is specified, use it + if self.setter: + return self.setter(obj, val) + if not hasattr(self, "attr_name"): + raise RuntimeError("Attribute name not set") + # Fail on None unless explicitly allowed + if val is None and not self.allow_none: + raise ValueError( + "None not allowed for property {} in class {}".format( + self.attr_name, + type(obj).__name__)) + + # Accept all DaCe/numpy typeclasses as Python native types + if isinstance(val, np.number): + val = val.item() + + # Check if type matches before setting + if (self.dtype is not None and not isinstance(val, self.dtype) + and not (val is None and self.allow_none)): + if isinstance(val, str): + raise TypeError( + "Received str for property {} of type {}. Use " + "dace.properties.set_property_from_string or the " + "from_string method of the property.".format( + self.attr_name, self.dtype)) + raise TypeError( + "Invalid type \"{}\" for property {}: expected {}".format( + type(val).__name__, self.attr_name, self.dtype.__name__)) + # If the value has not yet been set, we cannot pass it to the enum + # function. 
Fail silently if this happens + if self.enum is not None and isinstance(self.enum, (list, tuple, set)): + if val not in self.enum: + raise ValueError("Value {} not present in enum: {}".format( + val, self.enum)) + setattr(obj, "_" + self.attr_name, val) + + # Property-ception >:-) + + @property + def getter(self): + return self._getter + + @getter.setter + def getter(self, val): + self._getter = val + + @property + def setter(self): + return self._setter + + @setter.setter + def setter(self, val): + self._setter = val + + @property + def dtype(self): + return self._dtype + + @property + def default(self): + return self._default + + @property + def allow_none(self): + return self._allow_none + + @property + def desc(self): + return self._desc + + @property + def from_string(self): + return self._from_string + + @property + def to_string(self): + return self._to_string + + @property + def enum(self): + return self._enum + + @property + def unmapped(self): + return self._unmapped + + @property + def indirected(self): + return self._indirected + + @indirected.setter + def indirected(self, val): + self._indirected = val + + +############################################################################### +# Decorator for objects with properties +############################################################################### + + +def _property_generator(instance): + for name, prop in type(instance).__properties__.items(): + yield prop, getattr(instance, name) + + +def make_properties(cls): + """ A decorator for objects that adds support and checks for strongly-typed + properties (which use the Property class). + """ + + # Extract all Property members of the class + properties = OrderedDict([(name, prop) + for name, prop in cls.__dict__.items() + if isinstance(prop, Property)]) + # Set the property name to its field name in the class + for name, prop in properties.items(): + prop.attr_name = name + # Grab properties from baseclass(es) + own_properties = copy.copy(properties) + for base in cls.__bases__: + if hasattr(base, "__properties__"): + duplicates = base.__properties__.keys() & own_properties.keys() + if len(duplicates) != 0: + raise AttributeError( + "Duplicate properties in class {} deriving from {}: {}". + format(cls.__name__, base.__name__, duplicates)) + properties.update(base.__properties__) + # Add the list of properties to the class + cls.__properties__ = properties + # Add an iterator to pairs of property names and their values + cls.properties = _property_generator + + # Grab old init. This will be brought into the closure in the below + init = cls.__init__ + + def initialize_properties(obj, *args, **kwargs): + # Set default values. 
If we don't do this, properties that depend on + # other might fail because the others rely on being set by a default + # value + for name, prop in own_properties.items(): + # Only assign our own properties, so we don't overwrite what's been + # set by the base class + if hasattr(obj, name): + raise PropertyError( + "Property {} already assigned in {}".format( + name, + type(obj).__name__)) + if not prop.indirected and prop.default is not None: + setattr(obj, name, prop.default) + # Now call vanilla __init__, which can initialize members + init(obj, *args, **kwargs) + # Assert that all properties have been set + for name, prop in properties.items(): + try: + getattr(obj, name) + except AttributeError: + if not prop.unmapped: + raise PropertyError( + "Property {} is unassigned in __init__ for {}".format( + name, cls.__name__)) + # Assert that there are no fields in the object not captured by + # properties, unless they are prefixed with "_" + for name, prop in obj.__dict__.items(): + if name not in properties and not name.startswith("_"): + raise PropertyError( + "{} : Variable {} is neither a Property nor " + "an internal variable (prefixed with \"_\")".format( + str(type(obj)), name)) + + # Replace the __init__ method + cls.__init__ = initialize_properties + + return cls + + +def indirect_property(cls, f, prop, override): + + # Make a copy of the original property, but override its getter and setter + prop_name = prop.attr_name + prop_indirect = copy.copy(prop) + prop_indirect.indirected = True + + # Because this is a separate function, prop_name is caught in the closure + def indirect_getter(obj): + return getattr(f(obj), prop_name) + + def indirect_setter(obj, val): + return setattr(f(obj), prop_name, val) + + prop_indirect.getter = indirect_getter + prop_indirect.setter = indirect_setter + + # Add the property to the class + if not override and hasattr(cls, prop_name): + raise TypeError( + "Property \"{}\" already exists in class \"{}\"".format( + prop_name, cls.__name__)) + setattr(cls, prop_name, prop_indirect) + + +def indirect_properties(indirect_class, indirect_function, override=False): + """ A decorator for objects that provides indirect properties defined + in another class. + """ + + def indirection(cls): + # For every property in the class we are indirecting to, create an + # indirection property in this class + for prop in indirect_class.__properties__.values(): + indirect_property(cls, indirect_function, prop, override) + return make_properties(cls) + + return indirection + + +############################################################################### +# Custom properties +############################################################################### + + +# TODO: does not currently work because of how enums work +class OrderProperty(Property): + """ Custom property class that handles the mapping between the order + property and the actual class fields (range and parameters). """ + + # This is implemented in the context of dace.nodes.Map, but could in + # principle be reused for other objects, assuming they set the internal + # fields "_range" and "_params". + + def __get__(self, obj, objtype=None): + # Copy to avoid changes in the list at callee to be reflected in + # the map directly + return list(obj._params) + + def __set__(self, obj, val): + """ Update both params and ranges based on the new order. 
""" + # Make this more lenient to the input by comparing strings, and + # using the new order to shuffle the original lists + param_strings = list(map(str, obj._params)) + update_strings = list(map(str, val)) + if len(update_strings) != len(param_strings): + raise ValueError( + "Wrong length of new order: {} (found {}, expected {})".format( + str(val), len(update_strings), len(param_strings))) + # The below will throw a ValueError if a parameter doesn't exist + # We assume that no parameter will be present twice... + indices = [param_strings.index(x) for x in update_strings] + obj._params = [obj._params[i] for i in indices] + obj._range.reorder(indices) + + @staticmethod + def to_string(val): + return "({})".format(", ".join(map(str, val))) + + @staticmethod + def from_string(s): + """Create a list of symbols from a list of strings.""" + return [sp.Symbol(i) for i in re.sub("[\(\)\[\]]", "", s).split(",")] + + @staticmethod + def enum(obj): + """Implement enum to populate e.g. dropdown.""" + return list(itertools.permutations(obj)) + + +class RangeProperty(Property): + """ Custom Property type for `dace.graph.subset.Range` members. """ + + def __set__(self, obj, value): + if isinstance(value, list): + value = dace.subsets.Range(value) + super(RangeProperty, self).__set__(obj, value) + + @property + def dtype(self): + return sbs.Range + + @staticmethod + def to_string(obj): + return sbs.Range.ndslice_to_string(obj) + + @staticmethod + def from_string(s): + return sbs.Range.from_string(s) + + +class DebugInfoProperty(Property): + """ Custom Property type for DebugInfo members. """ + + @property + def dtype(self): + return DebugInfo + + @property + def allow_none(self): + return True + + @staticmethod + def to_string(di): + if isinstance(di, DebugInfo): + r = "file:" + str(di.filename) + " " + r += "from line: " + str(di.start_line) + " col: " + str( + di.start_column) + " " + r += "to line: " + str(di.end_line) + " col: " + str(di.end_column) + return r + else: + return "None" + + @staticmethod + def from_string(s): + f = None + sl = 0 + el = 0 + sc = 0 + ec = 0 + info_available = False + di = None + + m = re.search("file: (\w+)", s) + if m is not None: + info_available = True + f = sl = m.group(1) + m = re.search("from line: (\d+)", s) + if m is not None: + sl = m.group(1) + el = sl + info_available = True + m = re.search("to line: (\d+)", s) + if m is not None: + el = m.group(1) + info_available = True + m = re.search("from col: (\d+)", s) + if m is not None: + sc = m.group(1) + ec = sc + info_available = True + m = re.search("to col: (\d+)", s) + if m is not None: + ec = m.group(1) + info_available = True + if info_available: + di = DebugInfo(f, sl, sc, el, ec) + return di + + +class ParamsProperty(Property): + """ Property for list of parameters, such as parameters for a Map. """ + + @property + def dtype(self): + return list + + @staticmethod + def to_string(l): + return "[{}]".format(", ".join(map(str, l))) + + @staticmethod + def from_string(s): + return [ + sp.Symbol(m.group(0)) + for m in re.finditer("[a-zA-Z_][a-zA-Z0-9_]*", s) + ] + + +class SetProperty(Property): + """Property for a set of elements of one type, e.g., connectors. 
""" + + def __init__( + self, + element_type, + getter=None, + setter=None, + default=None, + from_string=None, + to_string=None, + unmapped=False, # Don't enforce 1:1 mapping with a member variable + allow_none=False, + desc=""): + super(SetProperty, self).__init__( + getter=getter, + setter=setter, + dtype=set, + default=default, + from_string=from_string, + to_string=to_string, + enum=None, + unmapped=unmapped, + allow_none=allow_none, + desc=desc) + self._element_type = element_type + + @property + def dtype(self): + return set + + @staticmethod + def to_string(l): + return str(l) + + @staticmethod + def from_string(s): + return [eval(i) for i in re.sub("[\{\}\(\)\[\]]", "", s).split(",")] + + def __get__(self, obj, objtype=None): + # Copy to avoid changes in the set at callee to be reflected in + # the node directly + return set(super(SetProperty, self).__get__(obj, objtype)) + + def __set__(self, obj, val): + # Check for uniqueness + if len(val) != len(set(val)): + dups = set([x for x in val if val.count(x) > 1]) + raise ValueError('Duplicates found in set: ' + str(dups)) + # Cast to element type + try: + new_set = set(self._element_type(elem) for elem in val) + except (TypeError, ValueError): + raise ValueError('Some elements could not be converted to %s' % + (str(self._element_type))) + + super(SetProperty, self).__set__(obj, new_set) + + +class LambdaProperty(Property): + """ Custom Property type that accepts a lambda function, with conversions + to and from strings. """ + + @property + def dtype(self): + return str + + @staticmethod + def from_string(s): + return ast.parse(s).body[0].value + + @staticmethod + def to_string(obj): + if obj is None: + return 'lambda: None' + if isinstance(obj, str): + return obj + return unparse(obj) + + def __set__(self, obj, val): + if val is not None: + if isinstance(val, str): + self.from_string(val) # Check that from_string doesn't fail + elif isinstance(val, ast.Lambda): + val = self.to_string(val) # Store as string internally + else: + raise TypeError( + "Lambda property must be either string or ast.Lambda") + super(LambdaProperty, self).__set__(obj, val) + + +class CodeBlock(list): + """ Helper class that represents AST code blocks for `CodeProperty`, + implemented as a list with an extra _as_string property. The object + also stores the original string, allowing us to preserve comments and + formatting from user input. + """ + + def __init__(self, *args, **kwargs): + self._as_string = "" + super().__init__(*args, **kwargs) + + @property + def as_string(self): + return self._as_string + + @as_string.setter + def as_string(self, string): + self._as_string = string + + +class CodeProperty(Property): + """ Custom Property type that accepts code in various languages. """ + + @property + def dtype(self): + return None + + @staticmethod + def from_string(string, language=None): + if language is None: + raise TypeError("Must pass language as second argument to " + "from_string method of CodeProperty") + if language == dace.types.Language.Python: + block = CodeBlock(ast.parse(string).body) + block.as_string = string + return block + else: + # Do nothing for now + return string + + @staticmethod + def to_string(obj): + if isinstance(obj, str): + return obj + # Grab the originally parsed string if any + if obj._as_string is not None and obj._as_string != "": + return obj._as_string + # It's probably good enough to assume that there is an original string + # if the language was not Python, so we just throw the string to the + # astunparser. 
+ return unparse(obj) + + def __set__(self, obj, val): + # Check if the class has a language property + if not hasattr(type(obj), "language"): + raise AttributeError( + "Class \"{}\" with a CodeProperty field must also " + "have a \"language\" attribute.".format(type(obj).__name__)) + # Check if the object has a language attribute + try: + language = obj.language + except AttributeError: + # Language exists as an attribute, but has not yet been set. Accept + # this, because __dict__ is not guaranteed to be in the order that + # the attributes are defined in. + language = None + if val is None: + # Keep as None. The "allow_none" check in the superclass + # ensures that this is legal + pass + elif isinstance(val, str): + if language is not None: + # Store original string + val = self.from_string(val, language) + else: + try: + if language is not dace.types.Language.Python: + raise TypeError("Only strings accepted for other " + "languages than Python.") + except AttributeError: + # Don't check language if it has not been set yet. We will + # assume it's Python AST, since it wasn't a string + pass + if isinstance(val, (ast.FunctionDef, ast.With)): + # TODO: the original parsing should have already stripped this + val = CodeBlock(val.body) + elif isinstance(val, ast.AST): + val = CodeBlock([val]) + else: + try: + iter(val) + except TypeError: + raise TypeError( + "CodeProperty expected an iterable of expressions, " + " got {}".format(type(val).__name__)) + for e in val: + if not isinstance(e, ast.AST): + raise TypeError( + "Found type {} in list of AST expressions: " + "expected ast.AST".format(type(e).__name__)) + super(CodeProperty, self).__set__(obj, val) + + +class SubsetProperty(Property): + """ Custom Property type that accepts any form of subset, and enables + parsing strings into multiple types of subsets. """ + + @property + def dtype(self): + return None + + @property + def allow_none(self): + return True + + def __set__(self, obj, val): + if (val is not None and not isinstance(val, sbs.Range) + and not isinstance(val, sbs.Indices)): + try: + val = self.from_string(val) + except SyntaxError: + raise TypeError( + "Subset property must be either Range or Indices: got {}". + format(type(val).__name__)) + super(SubsetProperty, self).__set__(obj, val) + + @staticmethod + def from_string(s): + if s is None or s == 'None' or len(s) == 0: + return None + ranges = sbs.Range.from_string(s) + if ranges: + return ranges + else: + return sbs.Indices.from_string(s) + + @staticmethod + def to_string(val): + if isinstance(val, sbs.Range): + return sbs.Range.ndslice_to_string(val) + elif isinstance(val, sbs.Indices): + return sbs.Indices.__str__(val) + elif val is None: + return 'None' + raise TypeError + + +class SymbolicProperty(Property): + """ Custom Property type that accepts integers or Sympy expressions. """ + + @property + def dtype(self): + return None + + def __set__(self, obj, val): + if (not isinstance(val, sp.expr.Expr) and not isinstance(val, int) + and not isinstance(val, str)): + raise TypeError( + "Property {} must an int or symbolic expression".format( + self.attr_name)) + super(SymbolicProperty, self).__set__(obj, val) + + @staticmethod + def from_string(s): + return pystr_to_symbolic(s) + + +class DataProperty(Property): + """ Custom Property type that represents a link to a data descriptor. + Needs the SDFG to be passed as an argument to `from_string` and + `enum`. 
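+
+        A minimal sketch (`sdfg` is a hypothetical SDFG that defines an
+        array named 'A'):
+
+            DataProperty.from_string('A', sdfg)   # returns 'A'
+            DataProperty.enum(sdfg)               # list of all array names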
""" + + def __init__(self, desc='', default=None): + # Data can be None when no data is flowing, e.g., on a memlet with a + # map that has no external inputs + return super().__init__( + dtype=str, allow_none=True, desc=desc, default=default) + + @staticmethod + def enum(sdfg=None): + if sdfg is None: + raise TypeError("Must pass SDFG as second argument to " + "enum method of ArrayProperty") + return list(sdfg.arrays.keys()) + + @staticmethod + def from_string(s, sdfg=None): + if sdfg is None: + raise TypeError("Must pass SDFG as second argument to " + "from_string method of ArrayProperty") + if s not in sdfg.arrays: + raise ValueError("No data found in SDFG with name: {}".format(s)) + return s + + @staticmethod + def to_string(obj): + return str(obj) + + +class ReferenceProperty(Property): + """ Custom Property type that represents a link to another SDFG object. + Needs the SDFG to be passed as an argument to `from_string`.""" + + @staticmethod + def from_string(s, sdfg=None): + if sdfg is None: + raise TypeError("Must pass SDFG as second argument to " + "from_string method of ReferenceProperty") + for node in sdfg.states(): + if node.label == s: + return node + for node, _ in sdfg.all_nodes_recursive(): + if node.label == s: + return node + raise ValueError("No node found in SDFG with name: {}".format(s)) + + @staticmethod + def to_string(obj): + return obj.label + + +class ShapeProperty(Property): + """ Custom Property type that defines a shape. """ + + @property + def dtype(self): + return tuple + + @staticmethod + def from_string(s): + if s[0] == "(" and s[-1] == ")": + s = s[1:-1] + return tuple([ + dace.symbolic.pystr_to_symbolic(m.group(0)) + for m in re.finditer("[^,;:]+", s) + ]) + + @staticmethod + def to_string(obj): + return ", ".join(map(str, obj)) + + def __set__(self, obj, val): + if isinstance(val, list): + val = tuple(val) + super(ShapeProperty, self).__set__(obj, val) + + +class TypeProperty(Property): + """ Custom Property type that finds a type according to the input string. + """ + + @property + def dtype(self): + return type + + # TODO: this does not work both ways! If converted to a string we lose the + # location information. + @staticmethod + def from_string(s): + dtype = pydoc.locate(s) + if dtype is None: + raise ValueError("No type \"{}\" found.".format(s)) + if not isinstance(dtype, type): + raise ValueError("Object \"{}\" is not a type.".format(dtype)) + return dtype + + +class TypeClassProperty(Property): + """ Custom property type for memory as defined in dace.types, + e.g. `dace.float32`. 
""" + + @property + def dtype(self): + return dace.types.typeclass + + @staticmethod + def from_string(s): + dtype = pydoc.locate("dace.types.{}".format(s)) + if dtype is None or not isinstance(dtype, dace.types.typeclass): + raise ValueError("Not a valid data type: {}".format(s)) + return dtype + + @staticmethod + def to_string(obj): + return obj.to_string() diff --git a/dace/runtime/include/dace/complex.h b/dace/runtime/include/dace/complex.h new file mode 100644 index 0000000000..f220c347b6 --- /dev/null +++ b/dace/runtime/include/dace/complex.h @@ -0,0 +1,63 @@ +#ifndef __DACE_COMPLEX_H +#define __DACE_COMPLEX_H + +#include + +#ifdef __CUDACC__ + #include + #define dace_conj thrust::conj +#else + #define dace_conj std::conj +#endif + +// Contains a complex-j class to support the native complex type in Python + +namespace dace +{ + struct complexJ + { + int val; + explicit complexJ(int v = 1) : val(v) {} + }; + + static inline int operator*(const complexJ& j1, const complexJ& j2) + { + return -j1.val * j2.val; + } + template + std::complex operator*(const complexJ& j, const T& other) + { + return std::complex(T(0), j.val * other); + } + template + std::complex operator*(const T& other, const complexJ& j) + { + return std::complex(T(0), j.val * other); + } + static inline complexJ operator*(const int& other, const complexJ& j) + { + return complexJ(j.val * other); + } + static inline complexJ operator*(const complexJ& j, const int& other) + { + return complexJ(j.val * other); + } + static inline complexJ operator-(const complexJ& j) + { + return complexJ(-j.val); + } +} + + +// Complex-scalar multiplication functions + +template +std::complex operator*(const std::complex& a, const int& b) { + return std::complex(b*a.real(), b*a.imag()); +} +template +std::complex operator*(const int& a, const std::complex& b) { + return std::complex(a*b.real(), a*b.imag()); +} + +#endif // __DACE_COMPLEX_H diff --git a/dace/runtime/include/dace/copy.h b/dace/runtime/include/dace/copy.h new file mode 100644 index 0000000000..6ae7016361 --- /dev/null +++ b/dace/runtime/include/dace/copy.h @@ -0,0 +1,267 @@ +#ifndef __DACE_COPY_H +#define __DACE_COPY_H + +#include "types.h" +#include "vector.h" + +namespace dace +{ + template + inline void InitArray(T *ptr, const U& value, int size) + { + for (int i = 0; i < size; ++i) + *ptr++ = T(value); + } + + template + struct CopyND; + template + struct CopyNDDynamic; + + template + struct CopyND + { + template + struct ConstSrc + { + template + static DACE_HDFI void Copy(const T *src, T *dst, const int& dst_stride, const Args&... dst_otherdims) + { +#ifndef __CUDA_ARCH__ + // Memcpy specialization + if (sizeof...(OTHER_COPYDIMS) == 0 && SRC_STRIDE == 1 && dst_stride == 1) { + memcpy(dst, src, COPYDIM * sizeof(T) * VECLEN); + return; + } +#endif + + __DACE_UNROLL + for (int i = 0; i < COPYDIM; ++i) + CopyND::template ConstSrc::Copy( + src + i * SRC_STRIDE, dst + i * dst_stride, dst_otherdims...); + } + + template + static DACE_HDFI void Accumulate(const T *src, T *dst, ACCUMULATE acc, const int& dst_stride, const Args&... dst_otherdims) + { + __DACE_UNROLL + for (int i = 0; i < COPYDIM; ++i) + CopyND::template ConstSrc::Accumulate( + src + i * SRC_STRIDE, dst + i * dst_stride, acc, dst_otherdims...); + } + }; + + template + struct ConstDst + { + template + static DACE_HDFI void Copy(const T *src, T *dst, const int& src_stride, const Args&... 
src_otherdims) + { +#ifndef __CUDA_ARCH__ + // Memcpy specialization + if (sizeof...(OTHER_COPYDIMS) == 0 && src_stride == 1 && DST_STRIDE == 1) { + memcpy(dst, src, COPYDIM * sizeof(T) * VECLEN); + return; + } +#endif + + __DACE_UNROLL + for (int i = 0; i < COPYDIM; ++i) + CopyND::template ConstDst::Copy( + src + i * src_stride, dst + i * DST_STRIDE, src_otherdims...); + } + + template + static DACE_HDFI void Accumulate(const T *src, T *dst, ACCUMULATE acc, const int& src_stride, const Args&... src_otherdims) + { + __DACE_UNROLL + for (int i = 0; i < COPYDIM; ++i) + CopyND::template ConstDst::Accumulate( + src + i * src_stride, dst + i * DST_STRIDE, acc, src_otherdims...); + } + }; + }; + + // Specialization for actual copy / accumulation + template + struct CopyND + { + template + struct ConstSrc + { + static DACE_HDFI void Copy(const T *src, T *dst) + { + *(vec *)dst = *(vec *)src; + } + + template + static DACE_HDFI void Accumulate(const T *src, T *dst, ACCUMULATE acc) + { + *(vec *)dst = acc(*(vec *)dst, *(vec *)src); + } + }; + + template + struct ConstDst + { + static DACE_HDFI void Copy(const T *src, T *dst) + { + *(vec *)dst = *(vec *)src; + } + + template + static DACE_HDFI void Accumulate(const T *src, T *dst, ACCUMULATE acc) + { + *(vec *)dst = acc(*(vec *)dst, *(vec *)src); + } + }; + }; + + template + struct CopyNDDynamic + { + template + struct ConstSrc + { + template + static DACE_HDFI void Copy(const T *src, T *dst, const int& copydim, const int& dst_stride, const Args&... otherdims) + { +#ifndef __CUDA_ARCH__ + // Memcpy specialization + if (N == 1 && SRC_STRIDE == 1 && dst_stride == 1) { + memcpy(dst, src, copydim * sizeof(T) * VECLEN); + return; + } +#endif + + __DACE_UNROLL + for (int i = 0; i < copydim; ++i) + CopyNDDynamic::template ConstSrc::Copy( + src + i * SRC_STRIDE, dst + i * dst_stride, otherdims...); + } + + template + static DACE_HDFI void Accumulate(const T *src, T *dst, ACCUMULATE acc, const int& copydim, const int& dst_stride, const Args&... otherdims) + { + __DACE_UNROLL + for (int i = 0; i < copydim; ++i) + CopyNDDynamic::template ConstSrc::Accumulate( + src + i * SRC_STRIDE, dst + i * dst_stride, acc, otherdims...); + } + }; + + template + struct ConstDst + { + template + static DACE_HDFI void Copy(const T *src, T *dst, const int& copydim, const int& src_stride, const Args&... otherdims) + { +#ifndef __CUDA_ARCH__ + // Memcpy specialization + if (N == 1 && src_stride == 1 && DST_STRIDE == 1) { + memcpy(dst, src, copydim * sizeof(T) * VECLEN); + return; + } +#endif + + __DACE_UNROLL + for (int i = 0; i < copydim; ++i) + CopyNDDynamic::template ConstDst::Copy( + src + i * src_stride, dst + i * DST_STRIDE, otherdims...); + } + + template + static DACE_HDFI void Accumulate(const T *src, T *dst, ACCUMULATE acc, const int& copydim, const int& src_stride, const Args&... otherdims) + { + __DACE_UNROLL + for (int i = 0; i < copydim; ++i) + CopyNDDynamic::template ConstDst::Accumulate( + src + i * src_stride, dst + i * DST_STRIDE, acc, otherdims...); + } + }; + + struct Dynamic + { + template + static DACE_HDFI void Copy(const T *src, T *dst, const int& copydim, const int& src_stride, const int& dst_stride, const Args&... 
otherdims) + { + static_assert(sizeof...(otherdims) == (N - 1) * 3, "Dimensionality mismatch in dynamic copy"); + +#ifndef __CUDA_ARCH__ + // Memcpy specialization + if (N == 1 && src_stride == 1 && dst_stride == 1) { + memcpy(dst, src, copydim * sizeof(T) * VECLEN); + return; + } +#endif + + __DACE_UNROLL + for (int i = 0; i < copydim; ++i) + CopyNDDynamic::Dynamic::Copy( + src + i * src_stride, dst + i * dst_stride, otherdims...); + } + + template + static DACE_HDFI void Accumulate(const T *src, T *dst, ACCUMULATE acc, const int& copydim, const int& src_stride, const int& dst_stride, const Args&... otherdims) + { + static_assert(sizeof...(otherdims) == (N - 1) * 3, "Dimensionality mismatch in dynamic copy"); + __DACE_UNROLL + for (int i = 0; i < copydim; ++i) + CopyNDDynamic::Dynamic::Accumulate( + src + i * src_stride, dst + i * dst_stride, acc, otherdims...); + } + }; + }; + + template + struct CopyNDDynamic + { + template + struct ConstSrc + { + static DACE_HDFI void Copy(const T *src, T *dst) + { + *(vec *)dst = *(vec *)src; + } + + template + static DACE_HDFI void Accumulate(const T *src, T *dst, ACCUMULATE acc) + { + *(vec *)dst = acc(*(vec *)dst, *(vec *)src); + } + }; + + template + struct ConstDst + { + static DACE_HDFI void Copy(const T *src, T *dst) + { + *(vec *)dst = *(vec *)src; + } + + template + static DACE_HDFI void Accumulate(const T *src, T *dst, ACCUMULATE acc) + { + *(vec *)dst = acc(*(vec *)dst, *(vec *)src); + } + }; + + struct Dynamic + { + static DACE_HDFI void Copy(const T *src, T *dst) + { + *(vec *)dst = *(vec *)src; + } + + template + static DACE_HDFI void Accumulate(const T *src, T *dst, ACCUMULATE acc) + { + *(vec *)dst = acc(*(vec *)dst, *(vec *)src); + } + }; + }; + +} // namespace dace + +#endif // __DACE_COPY_H diff --git a/dace/runtime/include/dace/cuda/copy.cuh b/dace/runtime/include/dace/cuda/copy.cuh new file mode 100644 index 0000000000..be989bb66e --- /dev/null +++ b/dace/runtime/include/dace/cuda/copy.cuh @@ -0,0 +1,819 @@ +// Redistribution and use in source and binary forms, with or without +// modification, are permitted provided that the following conditions are met: +// +// * Redistributions of source code must retain the above copyright notice, +// this list of conditions and the following disclaimer. +// * Redistributions in binary form must reproduce the above copyright notice, +// this list of conditions and the following disclaimer in the documentation +// and/or other materials provided with the distribution. +// * Neither the names of the copyright holders nor the names of its +// contributors may be used to endorse or promote products derived from this +// software without specific prior written permission. +// +// THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" +// AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE +// IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE +// ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT OWNER OR CONTRIBUTORS BE +// LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR +// CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF +// SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS +// INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN +// CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) +// ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE +// POSSIBILITY OF SUCH DAMAGE. 
+#ifndef __DACE_CUDACOPY_CUH +#define __DACE_CUDACOPY_CUH + +#include +#include "../types.h" +#include "../vector.h" +#include "../reduction.h" + +namespace dace +{ + // Adapted from "MAPS: GPU Optimization and Memory Abstraction Framework" + // https://github.com/maps-gpu/MAPS + + // Converts from an integral amount of bytes to a type. + template + struct BytesToType + { + typedef void type; + }; + + #ifdef __DACE_BYTES_TO_TYPE + #error Using disallowed macro name __DACE_BYTES_TO_TYPE + #endif + + #define __DACE_BYTES_TO_TYPE(bytes, t) \ + template<> \ + struct BytesToType \ + { \ + typedef t type; \ + } + + __DACE_BYTES_TO_TYPE(16, float4); + __DACE_BYTES_TO_TYPE(8, uint64_t); + __DACE_BYTES_TO_TYPE(4, uint32_t); + __DACE_BYTES_TO_TYPE(2, uint16_t); + __DACE_BYTES_TO_TYPE(1, uint8_t); + + # undef __DACE_BYTES_TO_TYPE + + template + struct LinearizeTID + { + static DACE_DFI unsigned int get() + { + return threadIdx.x + threadIdx.y * BLOCK_WIDTH + + threadIdx.z * BLOCK_WIDTH * BLOCK_HEIGHT; + } + }; + + template + struct LinearizeTID + { + static DACE_DFI unsigned int get() + { + return threadIdx.x + threadIdx.y * BLOCK_WIDTH; + } + }; + + template + struct LinearizeTID + { + static DACE_DFI unsigned int get() + { + return threadIdx.x; + } + }; + + template + static DACE_DFI unsigned int GetLinearTID() { + return LinearizeTID::get(); + } + + //////////////////////////////////////////////////////////////////////// + // Detect optimal bit read preference + + enum + { + #if __CUDA_ARCH__ >= 500 + PREFERRED_GREAD_SIZE = 128 / 8, // 128-bit + PREFERRED_SWRITE_SIZE = 128 / 8, // 128-bit + #elif __CUDA_ARCH__ >= 300 + PREFERRED_GREAD_SIZE = 128 / 8, // 128-bit + PREFERRED_SWRITE_SIZE = 64 / 8, // 64-bit + #elif __CUDA_ARCH__ >= 130 + PREFERRED_GREAD_SIZE = 64 / 8, // 64-bit + PREFERRED_SWRITE_SIZE = 32 / 8, // 32-bit + #else + PREFERRED_GREAD_SIZE = 32 / 8, // Default to 32-bit loads + PREFERRED_SWRITE_SIZE = 32 / 8, // 32-bit + #endif + }; + + #define DEBUG_PRINT(...) do {} while(0) + #define BLOCK_PRINT(...) do {} while(0) + + //#define DEBUG_PRINT(...) do { if(threadIdx.x + threadIdx.y == 0 && blockIdx.x + blockIdx.y + blockIdx.z == 0 && threadIdx.z == 1) printf(__VA_ARGS__); } while(0) + //#define BLOCK_PRINT(...) 
do { if(blockIdx.x + blockIdx.y + blockIdx.z == 0) printf(__VA_ARGS__); } while(0) + + template + static DACE_DFI void GlobalToShared3D( + const T *ptr, int src_zstride, + int src_ystride, int src_xstride, T *smem) + { + // Linear thread ID + int ltid = GetLinearTID(); + + constexpr int BLOCK_SIZE = BLOCK_WIDTH * BLOCK_HEIGHT * BLOCK_DEPTH; + constexpr int TOTAL_XYZ = COPY_XLEN * COPY_YLEN * COPY_ZLEN; + constexpr int TOTAL_XY = COPY_XLEN * COPY_YLEN; + constexpr int XY_SLICES = BLOCK_SIZE / TOTAL_XY; + constexpr int XY_REM = BLOCK_SIZE % TOTAL_XY; + constexpr int X_SLICES = BLOCK_SIZE / COPY_XLEN; + constexpr int X_REM = BLOCK_SIZE % COPY_XLEN; + + ////////////////////////////////////////////////////////////////////// + // Block size larger than number of elements, one read + if ((BLOCK_SIZE / TOTAL_XYZ) > 0) + { + DEBUG_PRINT("Chose path XYZ\n"); + + // De-linearize + int ltidx = ltid % COPY_XLEN; + int ltidy = (ltid / COPY_XLEN) % COPY_YLEN; + int ltidz = (ltid / COPY_XLEN) / COPY_YLEN; + + if (ltid < TOTAL_XYZ) + { + smem[ltidx*DST_XSTRIDE + ltidy * DST_YSTRIDE + ltidz * DST_ZSTRIDE] = + *(ptr + ltidx * src_xstride + + src_ystride * ltidy + + src_zstride * ltidz); + } + } + + ////////////////////////////////////////////////////////////////////// + // More than one XY slice + else if ((BLOCK_SIZE / TOTAL_XYZ) == 0 && XY_SLICES > 0 && XY_REM > 0) + { + DEBUG_PRINT("Chose path XY.1\n"); + + // Currently, only use threads in slice + // TODO(later): If contiguous (DST_YSTRIDE == COPY_XLEN), use the rest + constexpr int SLICES_PER_ITER = (XY_SLICES == 0 ? 1 : // Compilers are annoying + COPY_ZLEN / XY_SLICES); + constexpr int REMAINDER = (XY_SLICES == 0 ? 1 : // Compilers are annoying + COPY_ZLEN % XY_SLICES); + constexpr int REMOFF = SLICES_PER_ITER * XY_SLICES; + + if (ltid < (BLOCK_SIZE - XY_REM)) + { + // De-linearize + int ltidx = ltid % COPY_XLEN; + int ltidy = (ltid / COPY_XLEN) % COPY_YLEN; + int ltidz = (ltid / COPY_XLEN) / COPY_YLEN; + + #pragma unroll + for (int i = 0; i < SLICES_PER_ITER; ++i) + { + smem[ltidx * DST_XSTRIDE + ltidy * DST_YSTRIDE + (i*XY_SLICES + ltidz) * DST_ZSTRIDE] = + *(ptr + + src_xstride * ltidx + + src_ystride * ltidy + + src_zstride * (i * XY_SLICES + ltidz)); + } + + if (ltidz < REMAINDER) + { + // Read remainder + smem[ltidx * DST_XSTRIDE + ltidy * DST_YSTRIDE + (REMOFF + ltidz) * DST_ZSTRIDE] = + *(ptr + + src_xstride * ltidx + + src_ystride * ltidy + + src_zstride * (REMOFF + ltidz)); + } + } + } + + ////////////////////////////////////////////////////////////////////// + // Exactly n*XY slices + else if ((BLOCK_SIZE / TOTAL_XYZ) == 0 && XY_SLICES > 0 && XY_REM == 0) + { + DEBUG_PRINT("Chose path XY.2\n"); + + constexpr int SLICES_PER_ITER = (XY_SLICES == 0 ? 1 : // Compilers are annoying + COPY_ZLEN / XY_SLICES); + constexpr int REMAINDER = (XY_SLICES == 0 ? 
1 : // Compilers are annoying + COPY_ZLEN % XY_SLICES); + constexpr int REMOFF = SLICES_PER_ITER * XY_SLICES; + + // De-linearize + int ltidx = ltid % COPY_XLEN; + int ltidy = (ltid / COPY_XLEN) % COPY_YLEN; + int ltidz = (ltid / COPY_XLEN) / COPY_YLEN; + + #pragma unroll + for (int i = 0; i < SLICES_PER_ITER; ++i) + { + smem[ltidx * DST_XSTRIDE + ltidy * DST_YSTRIDE + (i*XY_SLICES + ltidz) * DST_ZSTRIDE] = + *(ptr + + src_xstride * ltidx + + src_ystride * ltidy + + src_zstride * (i * XY_SLICES + ltidz)); + } + + if (ltidz < REMAINDER) + { + // Read remainder + smem[ltidx * DST_XSTRIDE + ltidy * DST_YSTRIDE + (REMOFF + ltidz) * DST_ZSTRIDE] = + *(ptr + + src_xstride * ltidx + + src_ystride * ltidy + + src_zstride * (REMOFF + ltidz)); + } + } + + ////////////////////////////////////////////////////////////////////// + // More than X row + else if (XY_SLICES == 0 && X_SLICES > 0 && X_REM > 0) + { + DEBUG_PRINT("Chose path X.1\n"); + + // Currently, only use threads in row + // TODO(later): If contiguous (DST_YSTRIDE == COPY_XLEN), use the rest + constexpr int ROWS_PER_XY_SLICE = (X_SLICES == 0 ? 1 : // Compilers are annoying + COPY_YLEN / X_SLICES); + constexpr int REMAINDER = (X_SLICES == 0 ? 1 : // Compilers are annoying + COPY_YLEN % X_SLICES); + constexpr int REMOFF = ROWS_PER_XY_SLICE * X_SLICES; + + if (ltid < (BLOCK_SIZE - X_REM)) + { + // De-linearize + int ltidx = ltid % COPY_XLEN; + int ltidy = ltid / COPY_XLEN; + + #pragma unroll + for (int i = 0; i < COPY_ZLEN; ++i) + { + #pragma unroll + for (int j = 0; j < ROWS_PER_XY_SLICE; ++j) + { + smem[ltidx * DST_XSTRIDE + (j*X_SLICES + ltidy) * DST_YSTRIDE + i * DST_ZSTRIDE] = + *(ptr + + src_xstride * ltidx + + src_ystride * (j * X_SLICES + ltidy) + + src_zstride * i); + } + + if (ltidy < REMAINDER) + { + // Read remainder + smem[ltidx * DST_XSTRIDE + (REMOFF + ltidy)* DST_YSTRIDE + i * DST_ZSTRIDE] = + *(ptr + + src_xstride * ltidx + + src_ystride * (REMOFF + ltidy) + + src_zstride * i); + } + + } + } + } + + ////////////////////////////////////////////////////////////////////// + // Exactly n*X rows + else if (XY_SLICES == 0 && X_SLICES > 0 && X_REM == 0) + { + DEBUG_PRINT("Chose path X.2\n"); + + constexpr int ROWS_PER_XY_SLICE = (X_SLICES == 0 ? 1 : // Compilers are annoying + COPY_YLEN / X_SLICES); + constexpr int REMAINDER = (X_SLICES == 0 ? 
1 : // Compilers are annoying + COPY_YLEN % X_SLICES); + constexpr int REMOFF = ROWS_PER_XY_SLICE * X_SLICES; + + // De-linearize + int ltidx = ltid % COPY_XLEN; + int ltidy = ltid / COPY_XLEN; + + #pragma unroll + for (int i = 0; i < COPY_ZLEN; ++i) + { + #pragma unroll + for (int j = 0; j < ROWS_PER_XY_SLICE; ++j) + { + smem[ltidx * DST_XSTRIDE + (j*X_SLICES + ltidy) * DST_YSTRIDE + i * DST_ZSTRIDE] = + *(ptr + + src_xstride * ltidx + + src_ystride * (j * X_SLICES + ltidy) + + src_zstride * i); + } + + if (ltidy < REMAINDER) + { + // Read remainder + smem[ltidx * DST_XSTRIDE + (REMOFF + ltidy) * DST_YSTRIDE + i * DST_ZSTRIDE] = + *(ptr + + src_xstride * ltidx + + src_ystride * (REMOFF + ltidy) + + src_zstride * i); + } + + } + } + + ////////////////////////////////////////////////////////////////////// + // Less than one X row + else if (X_SLICES == 0) + { + DEBUG_PRINT("Chose path X.3\n"); + + + constexpr int ITERATIONS_PER_ROW = COPY_XLEN / BLOCK_SIZE; + constexpr int REMAINDER = COPY_XLEN % BLOCK_SIZE; + constexpr int REMOFF = ITERATIONS_PER_ROW * BLOCK_SIZE; + + #pragma unroll + for (int i = 0; i < COPY_ZLEN; ++i) + { + #pragma unroll + for (int j = 0; j < COPY_YLEN; ++j) + { + #pragma unroll + for (int k = 0; k < ITERATIONS_PER_ROW; ++k) + { + smem[(k * BLOCK_SIZE + ltid) * DST_XSTRIDE + j * DST_YSTRIDE + i * DST_ZSTRIDE] = + *(ptr + + src_xstride * (k * BLOCK_SIZE + ltid) + + src_ystride * j + + src_zstride * i); + } + + if (ltid < REMAINDER) + { + // Read remainder + smem[(REMOFF + ltid) * DST_ZSTRIDE + j * DST_YSTRIDE + i * DST_ZSTRIDE] = + *(ptr + + src_xstride * (REMOFF + ltid) + + src_ystride * j + + src_zstride * i); + } + } + } + } + + ////////////////////////////////////////////////////////////////////// + ////////////////////////////////////////////////////////////////////// + + + if (!ASYNC) + __syncthreads(); + } + + template + static DACE_DFI void GlobalToShared1D( + const T *ptr, int src_xstride, T *smem) + { + GlobalToShared3D( + ptr, 1, 1, src_xstride, smem); + } + + template + static DACE_DFI void GlobalToShared2D( + const T *ptr, int src_ystride, int src_xstride, + T *smem) + { + GlobalToShared3D( + ptr, 1, src_ystride, src_xstride, smem); + } + + template + static DACE_DFI void GlobalToShared3DDynamic( + const T *ptr, int src_zstride, + int src_ystride, int src_xstride, T *smem, + int COPY_ZLEN, int COPY_YLEN, int COPY_XLEN) + { + // Linear thread ID + int ltid = GetLinearTID(); + + int BLOCK_SIZE = BLOCK_WIDTH * BLOCK_HEIGHT * BLOCK_DEPTH; + int TOTAL_XYZ = COPY_XLEN * COPY_YLEN * COPY_ZLEN; + int TOTAL_XY = COPY_XLEN * COPY_YLEN; + int XY_SLICES = BLOCK_SIZE / TOTAL_XY; + int XY_REM = BLOCK_SIZE % TOTAL_XY; + int X_SLICES = BLOCK_SIZE / COPY_XLEN; + int X_REM = BLOCK_SIZE % COPY_XLEN; + + ////////////////////////////////////////////////////////////////////// + // Block size larger than number of elements, one read + if ((BLOCK_SIZE / TOTAL_XYZ) > 0) + { + DEBUG_PRINT("Chose path XYZ\n"); + + // De-linearize + int ltidx = ltid % COPY_XLEN; + int ltidy = (ltid / COPY_XLEN) % COPY_YLEN; + int ltidz = (ltid / COPY_XLEN) / COPY_YLEN; + + if (ltid < TOTAL_XYZ) + { + smem[ltidx*DST_XSTRIDE + ltidy * DST_YSTRIDE + ltidz * DST_ZSTRIDE] = + *(ptr + ltidx * src_xstride + + src_ystride * ltidy + + src_zstride * ltidz); + } + } + + ////////////////////////////////////////////////////////////////////// + // More than one XY slice + else if ((BLOCK_SIZE / TOTAL_XYZ) == 0 && XY_SLICES > 0 && XY_REM > 0) + { + DEBUG_PRINT("Chose path XY.1\n"); + + // Currently, only use threads 
in slice + // TODO(later): If contiguous (DST_YSTRIDE == COPY_XLEN), use the rest + int SLICES_PER_ITER = (XY_SLICES == 0 ? 1 : // Compilers are annoying + COPY_ZLEN / XY_SLICES); + int REMAINDER = (XY_SLICES == 0 ? 1 : // Compilers are annoying + COPY_ZLEN % XY_SLICES); + int REMOFF = SLICES_PER_ITER * XY_SLICES; + + if (ltid < (BLOCK_SIZE - XY_REM)) + { + // De-linearize + int ltidx = ltid % COPY_XLEN; + int ltidy = (ltid / COPY_XLEN) % COPY_YLEN; + int ltidz = (ltid / COPY_XLEN) / COPY_YLEN; + + #pragma unroll + for (int i = 0; i < SLICES_PER_ITER; ++i) + { + smem[ltidx * DST_XSTRIDE + ltidy * DST_YSTRIDE + (i*XY_SLICES + ltidz) * DST_ZSTRIDE] = + *(ptr + + src_xstride * ltidx + + src_ystride * ltidy + + src_zstride * (i * XY_SLICES + ltidz)); + } + + if (ltidz < REMAINDER) + { + // Read remainder + smem[ltidx * DST_XSTRIDE + ltidy * DST_YSTRIDE + (REMOFF + ltidz) * DST_ZSTRIDE] = + *(ptr + + src_xstride * ltidx + + src_ystride * ltidy + + src_zstride * (REMOFF + ltidz)); + } + } + } + + ////////////////////////////////////////////////////////////////////// + // Exactly n*XY slices + else if ((BLOCK_SIZE / TOTAL_XYZ) == 0 && XY_SLICES > 0 && XY_REM == 0) + { + DEBUG_PRINT("Chose path XY.2\n"); + + int SLICES_PER_ITER = (XY_SLICES == 0 ? 1 : // Compilers are annoying + COPY_ZLEN / XY_SLICES); + int REMAINDER = (XY_SLICES == 0 ? 1 : // Compilers are annoying + COPY_ZLEN % XY_SLICES); + int REMOFF = SLICES_PER_ITER * XY_SLICES; + + // De-linearize + int ltidx = ltid % COPY_XLEN; + int ltidy = (ltid / COPY_XLEN) % COPY_YLEN; + int ltidz = (ltid / COPY_XLEN) / COPY_YLEN; + + #pragma unroll + for (int i = 0; i < SLICES_PER_ITER; ++i) + { + smem[ltidx * DST_XSTRIDE + ltidy * DST_YSTRIDE + (i*XY_SLICES + ltidz) * DST_ZSTRIDE] = + *(ptr + + src_xstride * ltidx + + src_ystride * ltidy + + src_zstride * (i * XY_SLICES + ltidz)); + } + + if (ltidz < REMAINDER) + { + // Read remainder + smem[ltidx * DST_XSTRIDE + ltidy * DST_YSTRIDE + (REMOFF + ltidz) * DST_ZSTRIDE] = + *(ptr + + src_xstride * ltidx + + src_ystride * ltidy + + src_zstride * (REMOFF + ltidz)); + } + } + + ////////////////////////////////////////////////////////////////////// + // More than X row + else if (XY_SLICES == 0 && X_SLICES > 0 && X_REM > 0) + { + DEBUG_PRINT("Chose path X.1\n"); + + // Currently, only use threads in row + // TODO(later): If contiguous (DST_YSTRIDE == COPY_XLEN), use the rest + int ROWS_PER_XY_SLICE = (X_SLICES == 0 ? 1 : // Compilers are annoying + COPY_YLEN / X_SLICES); + int REMAINDER = (X_SLICES == 0 ? 1 : // Compilers are annoying + COPY_YLEN % X_SLICES); + int REMOFF = ROWS_PER_XY_SLICE * X_SLICES; + + if (ltid < (BLOCK_SIZE - X_REM)) + { + // De-linearize + int ltidx = ltid % COPY_XLEN; + int ltidy = ltid / COPY_XLEN; + + #pragma unroll + for (int i = 0; i < COPY_ZLEN; ++i) + { + #pragma unroll + for (int j = 0; j < ROWS_PER_XY_SLICE; ++j) + { + smem[ltidx * DST_XSTRIDE + (j*X_SLICES + ltidy) * DST_YSTRIDE + i * DST_ZSTRIDE] = + *(ptr + + src_xstride * ltidx + + src_ystride * (j * X_SLICES + ltidy) + + src_zstride * i); + } + + if (ltidy < REMAINDER) + { + // Read remainder + smem[ltidx * DST_XSTRIDE + (REMOFF + ltidy)* DST_YSTRIDE + i * DST_ZSTRIDE] = + *(ptr + + src_xstride * ltidx + + src_ystride * (REMOFF + ltidy) + + src_zstride * i); + } + + } + } + } + + ////////////////////////////////////////////////////////////////////// + // Exactly n*X rows + else if (XY_SLICES == 0 && X_SLICES > 0 && X_REM == 0) + { + DEBUG_PRINT("Chose path X.2\n"); + + int ROWS_PER_XY_SLICE = (X_SLICES == 0 ? 
1 : // Compilers are annoying + COPY_YLEN / X_SLICES); + int REMAINDER = (X_SLICES == 0 ? 1 : // Compilers are annoying + COPY_YLEN % X_SLICES); + int REMOFF = ROWS_PER_XY_SLICE * X_SLICES; + + // De-linearize + int ltidx = ltid % COPY_XLEN; + int ltidy = ltid / COPY_XLEN; + + #pragma unroll + for (int i = 0; i < COPY_ZLEN; ++i) + { + #pragma unroll + for (int j = 0; j < ROWS_PER_XY_SLICE; ++j) + { + smem[ltidx * DST_XSTRIDE + (j*X_SLICES + ltidy) * DST_YSTRIDE + i * DST_ZSTRIDE] = + *(ptr + + src_xstride * ltidx + + src_ystride * (j * X_SLICES + ltidy) + + src_zstride * i); + } + + if (ltidy < REMAINDER) + { + // Read remainder + smem[ltidx * DST_XSTRIDE + (REMOFF + ltidy) * DST_YSTRIDE + i * DST_ZSTRIDE] = + *(ptr + + src_xstride * ltidx + + src_ystride * (REMOFF + ltidy) + + src_zstride * i); + } + + } + } + + ////////////////////////////////////////////////////////////////////// + // Less than one X row + else if (X_SLICES == 0) + { + DEBUG_PRINT("Chose path X.3\n"); + + + int ITERATIONS_PER_ROW = COPY_XLEN / BLOCK_SIZE; + int REMAINDER = COPY_XLEN % BLOCK_SIZE; + int REMOFF = ITERATIONS_PER_ROW * BLOCK_SIZE; + + #pragma unroll + for (int i = 0; i < COPY_ZLEN; ++i) + { + #pragma unroll + for (int j = 0; j < COPY_YLEN; ++j) + { + #pragma unroll + for (int k = 0; k < ITERATIONS_PER_ROW; ++k) + { + smem[(k * BLOCK_SIZE + ltid) * DST_XSTRIDE + j * DST_YSTRIDE + i * DST_ZSTRIDE] = + *(ptr + + src_xstride * (k * BLOCK_SIZE + ltid) + + src_ystride * j + + src_zstride * i); + } + + if (ltid < REMAINDER) + { + // Read remainder + smem[(REMOFF + ltid) * DST_ZSTRIDE + j * DST_YSTRIDE + i * DST_ZSTRIDE] = + *(ptr + + src_xstride * (REMOFF + ltid) + + src_ystride * j + + src_zstride * i); + } + } + } + } + + ////////////////////////////////////////////////////////////////////// + ////////////////////////////////////////////////////////////////////// + + + if (!ASYNC) + __syncthreads(); + } + + template + static DACE_DFI void GlobalToShared1DDynamic( + const T *ptr, int src_xstride, T *smem, int COPY_XLEN) + { + GlobalToShared3DDynamic( + ptr, 1, 1, src_xstride, smem, 1, 1, COPY_XLEN); + } + + template + static DACE_DFI void GlobalToShared2DDynamic( + const T *ptr, int src_ystride, int src_xstride, + T *smem, int COPY_YLEN, int COPY_XLEN) + { + GlobalToShared3DDynamic( + ptr, 1, src_ystride, src_xstride, smem, 1, COPY_YLEN, COPY_XLEN); + } + + + /* + template + static DACE_DFI void SharedToGlobal1D( + const T *smem, int src_xstride, T *ptr) + { + GlobalToShared3D( + smem, 1, 1, src_xstride, ptr); + } + */ + + template + struct ResetShared + { + static DACE_DFI void Reset(T *smem) { + // Linear thread ID + int ltid = GetLinearTID(); + constexpr int BLOCK_SIZE = BLOCK_WIDTH * BLOCK_HEIGHT * BLOCK_DEPTH; + constexpr int TOTAL = SMEM_TOTAL_ELEMENTS; + constexpr int WRITES = TOTAL / BLOCK_SIZE; + constexpr int REM_WRITES = TOTAL % BLOCK_SIZE; + + #pragma unroll + for (int i = 0; i < WRITES; ++i) { + *(smem + (ltid + i * BLOCK_SIZE) * DST_XSTRIDE) = T(0); + } + + if (REM_WRITES != 0) { + if (ltid < REM_WRITES) + *(smem + (ltid + WRITES * BLOCK_SIZE) * DST_XSTRIDE) = T(0); + } + + if (!ASYNC) + __syncthreads(); + } + }; + + template + struct SharedToGlobal1D + { + template + static DACE_DFI void Accum(const T *smem, int src_xstride, T *ptr, WCR wcr) + { + if (!ASYNC) + __syncthreads(); + + // Linear thread ID + int ltid = GetLinearTID(); + constexpr int BLOCK_SIZE = BLOCK_WIDTH * BLOCK_HEIGHT * BLOCK_DEPTH; + constexpr int TOTAL = COPY_XLEN; + constexpr int WRITES = TOTAL / BLOCK_SIZE; + constexpr int 
REM_WRITES = TOTAL % BLOCK_SIZE; + + #pragma unroll + for (int i = 0; i < WRITES; ++i) { + wcr_custom::template reduce( + wcr, ptr + (ltid + i * BLOCK_SIZE) * DST_XSTRIDE, + *(smem + (ltid + i * BLOCK_SIZE) * src_xstride)); + } + + if (REM_WRITES != 0) { + if (ltid < REM_WRITES) + wcr_custom::template reduce( + ptr + (ltid + WRITES * BLOCK_SIZE)* DST_XSTRIDE, + *(smem + (ltid + WRITES * BLOCK_SIZE) * src_xstride)); + } + } + + template + static DACE_DFI void Accum(const T *smem, int src_xstride, T *ptr) + { + if (!ASYNC) + __syncthreads(); + + // Linear thread ID + int ltid = GetLinearTID(); + constexpr int BLOCK_SIZE = BLOCK_WIDTH * BLOCK_HEIGHT * BLOCK_DEPTH; + constexpr int TOTAL = COPY_XLEN; + constexpr int WRITES = TOTAL / BLOCK_SIZE; + constexpr int REM_WRITES = TOTAL % BLOCK_SIZE; + + #pragma unroll + for (int i = 0; i < WRITES; ++i) { + wcr_fixed::template reduce_atomic( + ptr + (ltid + i * BLOCK_SIZE) * DST_XSTRIDE, + *(smem + (ltid + i * BLOCK_SIZE) * src_xstride)); + } + + if (REM_WRITES != 0) { + if (ltid < REM_WRITES) + wcr_fixed::template reduce_atomic( + ptr + (ltid + WRITES*BLOCK_SIZE)* DST_XSTRIDE, + *(smem + (ltid + WRITES * BLOCK_SIZE) * src_xstride)); + } + } + }; + + // TODO: Make like SharedToGlobal1D + template + static DACE_DFI void SharedToGlobal2D( + const T *ptr, int src_ystride, int src_xstride, + T *smem) + { + GlobalToShared3D( + ptr, 1, src_ystride, src_xstride, smem); + } + template + static DACE_DFI void SharedToGlobal2DDynamic( + const T *ptr, int src_ystride, int src_xstride, + T *smem, int COPY_YLEN, int COPY_XLEN) + { + GlobalToShared3DDynamic( + ptr, 1, src_ystride, src_xstride, smem, 1, COPY_YLEN, COPY_XLEN); + } + +} // namespace dace + + + + +#endif // __DACE_CUDACOPY_CUH diff --git a/dace/runtime/include/dace/cuda/cudacommon.cuh b/dace/runtime/include/dace/cuda/cudacommon.cuh new file mode 100644 index 0000000000..61aa4623df --- /dev/null +++ b/dace/runtime/include/dace/cuda/cudacommon.cuh @@ -0,0 +1,19 @@ +#ifndef __DACE_CUDACOMMON_CUH +#define __DACE_CUDACOMMON_CUH + +#define DACE_CUDA_CHECK(err) do { \ + cudaError_t errr = (err); \ + if(errr != (cudaError_t)0) \ + { \ + printf("CUDA ERROR at %s:%d, code: %d\n", __FILE__, __LINE__, errr); \ + } \ +} while(0) + +namespace dace { + namespace cuda { + extern cudaStream_t __streams[]; + extern cudaEvent_t __events[]; + } // namespace cuda +} // namespace dace + +#endif // __DACE_CUDACOMMON_CUH diff --git a/dace/runtime/include/dace/cuda/dynmap.cuh b/dace/runtime/include/dace/cuda/dynmap.cuh new file mode 100644 index 0000000000..14330f1425 --- /dev/null +++ b/dace/runtime/include/dace/cuda/dynmap.cuh @@ -0,0 +1,249 @@ +// Redistribution and use in source and binary forms, with or without +// modification, are permitted provided that the following conditions are met: +// +// * Redistributions of source code must retain the above copyright notice, +// this list of conditions and the following disclaimer. +// * Redistributions in binary form must reproduce the above copyright notice, +// this list of conditions and the following disclaimer in the documentation +// and/or other materials provided with the distribution. +// * Neither the names of the copyright holders nor the names of its +// contributors may be used to endorse or promote products derived from this +// software without specific prior written permission. 
+// +// THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" +// AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE +// IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE +// ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT OWNER OR CONTRIBUTORS BE +// LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR +// CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF +// SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS +// INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN +// CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) +// ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE +// POSSIBILITY OF SUCH DAMAGE. + +// Adapted from "Groute: An Asynchronous Multi-GPU Programming Framework" +// http://www.github.com/groute/groute + + +#ifndef __DACE_DYNMAP_CUH +#define __DACE_DYNMAP_CUH + +#include +#include +#include +#include +#include +#include + +#include "../../../../external/cub/cub/util_ptx.cuh" + +#define __FULL_MASK 0xffffffff + +namespace dace { + /** + * A map (usually dynamically sized) that can be rescheduled across a + * threadblock + **/ + template + struct DynamicMap + { + template struct warp_np { + volatile index_type owner[WARPS_PER_TB]; + volatile index_type start[WARPS_PER_TB]; + volatile index_type size[WARPS_PER_TB]; + volatile index_type src[WARPS_PER_TB]; + }; + + struct tb_np { + index_type owner; + index_type start; + index_type size; + index_type src; + }; + + struct empty_np { + }; + + + template + union np_shared { + // for scans + ts_type temp_storage; + + // for tb-level np + TTB tb; + + // for warp-level np + TWP warp; + + // fine-grained schedule (unused) + //TFG fg; + }; + + /* + * @brief A structure representing a scheduled chunk of work + */ + struct np_local + { + index_type size; // work size + index_type start; // work start + index_type src; // work source thread / metadata + }; + + + template + __device__ __forceinline__ static void schedule(index_type local_start, index_type local_end, index_type local_src, Functor&& work) + { + const int WP_SIZE = CUB_PTX_WARP_THREADS; + const int TB_SIZE = BLOCK_SIZE; + + const int NP_WP_CROSSOVER = CUB_PTX_WARP_THREADS; + const int NP_TB_CROSSOVER = blockDim.x; + + typedef union std::conditional, + np_shared>>::type np_shared_type; + + __shared__ np_shared_type np_shared; + + index_type local_size = local_end - local_start; + + if (threadIdx.x == 0) + { + np_shared.tb.owner = TB_SIZE + 1; + } + + __syncthreads(); + + // + // First scheduler: processing high-degree work items using the entire block + // + while (true) + { + if (local_size >= NP_TB_CROSSOVER) + { + // 'Elect' one owner for the entire thread block + np_shared.tb.owner = threadIdx.x; + } + + __syncthreads(); + + if (np_shared.tb.owner == TB_SIZE + 1) + { + // No owner was elected, i.e. 
no high-degree work items remain + + // No need to sync threads before moving on to WP scheduler + // because it does not use shared memory + if (!WARP_INTRINSICS) + __syncthreads(); // Necessary do to the shared memory union used by both TB and WP schedulers + break; + } + + if (np_shared.tb.owner == threadIdx.x) + { + // This thread is the owner + np_shared.tb.start = local_start; + np_shared.tb.size = local_size; + np_shared.tb.src = local_src; + + // Mark this work-item as processed for future schedulers + local_start = 0; + local_size = 0; + } + + __syncthreads(); + + index_type start = np_shared.tb.start; + index_type size = np_shared.tb.size; + index_type src = np_shared.tb.src; + + if (np_shared.tb.owner == threadIdx.x) + { + np_shared.tb.owner = TB_SIZE + 1; + } + + // Use all threads in thread block to execute individual work + for (int ii = threadIdx.x; ii < size; ii += TB_SIZE) + { + work(start + ii, src); + } + + __syncthreads(); + } + + // + // Second scheduler: tackle medium-degree work items using the warp + // + const int warp_id = cub::WarpId(); + const int lane_id = cub::LaneId(); + + while (__any_sync(__FULL_MASK, local_size >= NP_WP_CROSSOVER)) + { + index_type start, size, src; + if (WARP_INTRINSICS) + { + // Compete for work scheduling + unsigned int mask = __ballot_sync(__FULL_MASK, local_size >= NP_WP_CROSSOVER ? 1 : 0); + // Select a deterministic winner + int leader = __ffs(mask) - 1; + + // Broadcast data from the leader + start = cub::ShuffleIndex(local_start, leader, mask); + size = cub::ShuffleIndex(local_size, leader, mask); + src = cub::ShuffleIndex(local_src, leader, mask); + + if (leader == lane_id) + { + // Mark this work-item as processed + local_start = 0; + local_size = 0; + } + } + else + { + // In order for this to compile, it should be refactored to another function + /* + if (local_size >= NP_WP_CROSSOVER) + { + // Again, race to select an owner for warp + np_shared.warp.owner[warp_id] = lane_id; + } + if (np_shared.warp.owner[warp_id] == lane_id) + { + // This thread is owner + np_shared.warp.start[warp_id] = local_start; + np_shared.warp.size[warp_id] = local_size; + + // Mark this work-item as processed + local_start = 0; + local_size = 0; + } + start = np_shared.warp.start[warp_id]; + size = np_shared.warp.size[warp_id]; + */ + } + + for (int ii = lane_id; ii < size; ii += WP_SIZE) + { + work(start + ii, src); + } + } + + __syncthreads(); + + // + // Third scheduler: tackle all work-items with size < 32 serially + // + // It is possible to disable this scheduler by setting NP_WP_CROSSOVER to 0 + + for (int ii = 0; ii < local_size; ii++) + { + work(local_start + ii, local_src); + } + } + }; + +} // namespace dace + +#endif // __DACE_DYNMAP_CUH diff --git a/dace/runtime/include/dace/cuda/stream.cuh b/dace/runtime/include/dace/cuda/stream.cuh new file mode 100644 index 0000000000..84314ecf9a --- /dev/null +++ b/dace/runtime/include/dace/cuda/stream.cuh @@ -0,0 +1,245 @@ +#ifndef __DACE_STREAM_CUH +#define __DACE_STREAM_CUH + +#include +#include +#include +#include +#include +#include // Used for the in-memory ctor call in the move assignment operator below + +#include +#include + +#include "../../../../external/cub/cub/util_ptx.cuh" +#include "../../../../external/cub/cub/warp/warp_reduce.cuh" +#include "../../../../external/cub/cub/warp/warp_scan.cuh" + +#include "cudacommon.cuh" + +namespace dace { + // Adapted from https://devblogs.nvidia.com/cuda-pro-tip-optimized-filtering-warp-aggregated-atomics/ + __inline__ __device__ uint32_t 
atomicAggInc(uint32_t *ctr) { + auto g = cooperative_groups::coalesced_threads(); + uint32_t warp_res; + int rank = g.thread_rank(); + if (rank == 0) + warp_res = atomicAdd(ctr, g.size()); + return g.shfl(warp_res, 0) + rank; + } + + __inline__ __device__ uint32_t atomicAggDec(uint32_t *ctr) { + auto g = cooperative_groups::coalesced_threads(); + uint32_t warp_res; + int rank = g.thread_rank(); + if (rank == 0) + warp_res = atomicAdd(ctr, -g.size()); + return g.shfl(warp_res, 0) + rank; + } + + /* + __inline__ __device__ uint32_t warpReduceSum(uint32_t val) { + for (int offset = CUB_PTX_WARP_THREADS / 2; offset > 0; offset /= 2) + val += __shfl_down(val, offset); + return val; + } + */ + + // + // Queue classes (device): + // + + /* + * @brief A device-level MPMC Queue + */ + template + class GPUStream + { + public: + T* m_data; + uint32_t *m_start, *m_end, *m_pending; + uint32_t m_capacity_mask; + + __host__ GPUStream() : m_data(nullptr), m_start(nullptr), m_end(nullptr), + m_pending(nullptr), m_capacity_mask(0) {} + __host__ __device__ GPUStream(T* data, uint32_t capacity, + uint32_t *start, uint32_t *end, + uint32_t *pending) : + m_data(data), m_start(start), m_end(end), m_pending(pending), + m_capacity_mask(IS_POWEROFTWO ? (capacity - 1) : capacity) + { + if (IS_POWEROFTWO) { + assert((capacity - 1 & capacity) == 0); // Must be a power of two for handling circular overflow correctly + } + } + + __device__ __forceinline__ void reset() const + { + *m_start = 0; + *m_end = 0; + *m_pending = 0; + } + + __device__ __forceinline__ T pop() + { + uint32_t allocation = atomicAggInc(m_start); + return m_data[get_addr(allocation)]; + } + + __device__ __forceinline__ T *leader_pop(uint32_t count) { + uint32_t current = *m_start; + T *result = m_data + get_addr(current); + *m_start += count; + return result; + } + + + __device__ __forceinline__ uint32_t get_addr(const uint32_t& i) const { + if (IS_POWEROFTWO) + return i & m_capacity_mask; + else + return i % m_capacity_mask; + } + + __device__ __forceinline__ void push(const T& item) + { + uint32_t allocation = atomicAggInc(m_pending); + m_data[get_addr(allocation)] = item; + } + + /* + __device__ __forceinline__ void push(T *items, int count) + { + // Perform a warp-wide scan to get thread offsets + typedef cub::WarpScan WarpScan; + __shared__ typename WarpScan::TempStorage temp_storage[4]; + int offset; + int warp_id = threadIdx.x / 32; + WarpScan(temp_storage[warp_id]).ExclusiveSum(count, offset); + + // Atomic-add the total count once per warp + uint32_t addr; + if (threadIdx.x & 31 == 31) // Last thread + addr = atomicAdd(m_pending, offset + count); + // Broadcast starting address + addr = cub::ShuffleIndex(addr, 31, 0xffffffff); + + // Copy data from each thread + for(int i = 0; i < count; ++i) + m_data[get_addr(addr + offset + i)] = items[i]; + } + */ + + __device__ __forceinline__ void prepend(const T& item) + { + uint32_t allocation = atomicAggDec(m_start) - 1; + m_data[get_addr(allocation)] = item; + } + + __device__ __forceinline__ T read(uint32_t i) const + { + return m_data[get_addr(*m_start + i)]; + } + + __device__ __forceinline__ uint32_t count() const + { + return *m_end - *m_start; + } + + // Returns the 'count' of pending items and commits + __device__ __forceinline__ uint32_t commit_pending() const + { + uint32_t count = *m_pending - *m_end; + + // Sync end with pending, this makes the pushed items visible to the consumer + *m_end = *m_pending; + return count; + } + + __device__ __forceinline__ uint32_t get_start() const + { + 
return *m_start; + } + + __device__ __forceinline__ uint32_t get_start_delta(uint32_t prev_start) const + { + return prev_start - *m_start; + } + }; + + //////////////////////////////////////////////////////////// + // Host controllers for GPU streams + + template + __global__ void ResetGPUStream_kernel(GPUStream stream) + { + stream.reset(); + } + + template + void ResetGPUStream(GPUStream& stream) + { + void *args_reset[1] = { &stream }; + DACE_CUDA_CHECK(cudaLaunchKernel((void *)&ResetGPUStream_kernel, + dim3(1, 1, 1), dim3(1, 1, 1), + args_reset, 0, (cudaStream_t)0)); + } + + template + __global__ void PushToGPUStream_kernel(GPUStream stream, T item) + { + stream.push(item); + stream.commit_pending(); + } + + template + void PushToGPUStream(GPUStream& stream, const T& item) + { + void *args_push[2] = { &stream, &item }; + DACE_CUDA_CHECK(cudaLaunchKernel((void *)&PushToGPUStream_kernel, + dim3(1, 1, 1), dim3(1, 1, 1), + args_push, 0, (cudaStream_t)0)); + } + + //////////////////////////////////////////////////////////// + // Host memory management for GPU streams + + + template + GPUStream AllocGPUArrayStreamView(T *ptr, uint32_t capacity) + { + uint32_t *gStart, *gEnd, *gPending; + DACE_CUDA_CHECK(cudaMalloc(&gStart, sizeof(uint32_t))); + DACE_CUDA_CHECK(cudaMalloc(&gEnd, sizeof(uint32_t))); + DACE_CUDA_CHECK(cudaMalloc(&gPending, sizeof(uint32_t))); + DACE_CUDA_CHECK(cudaMemsetAsync(gStart, 0, sizeof(uint32_t))); + DACE_CUDA_CHECK(cudaMemsetAsync(gEnd, 0, sizeof(uint32_t))); + DACE_CUDA_CHECK(cudaMemsetAsync(gPending, 0, sizeof(uint32_t))); + return GPUStream(ptr, capacity, gStart, gEnd, gPending); + } + + template + GPUStream AllocGPUStream(uint32_t capacity) + { + T *gData; + DACE_CUDA_CHECK(cudaMalloc(&gData, capacity * sizeof(T))); + return AllocGPUArrayStreamView(gData, capacity); + } + + template + void FreeGPUArrayStreamView(GPUStream& stream) + { + DACE_CUDA_CHECK(cudaFree(stream.m_start)); + DACE_CUDA_CHECK(cudaFree(stream.m_end)); + DACE_CUDA_CHECK(cudaFree(stream.m_pending)); + } + + template + void FreeGPUStream(GPUStream& stream) + { + FreeGPUArrayStreamView(stream); + DACE_CUDA_CHECK(cudaFree(stream.m_data)); + } + +} // namespace dace +#endif // __DACE_STREAM_CUH \ No newline at end of file diff --git a/dace/runtime/include/dace/cuda/vectype.cuh b/dace/runtime/include/dace/cuda/vectype.cuh new file mode 100644 index 0000000000..f8acaa82e1 --- /dev/null +++ b/dace/runtime/include/dace/cuda/vectype.cuh @@ -0,0 +1,326 @@ +//////////////////////////////////////////////////////////////////////// +// Define some operators on vector types + +#define DEFINE_EXTTYPE1(T, NAME) \ + struct exttype_##T##_##1 : NAME##1 { \ + DACE_HDFI exttype_##T##_##1 operator*(const exttype_##T##_##1 &other) const { \ + exttype_##T##_##1 result; \ + result.x = other.x * x; \ + return result; \ + } \ + DACE_HDFI exttype_##T##_##1 operator+(const exttype_##T##_##1 &other) const { \ + exttype_##T##_##1 result; \ + result.x = other.x + x; \ + return result; \ + } \ + DACE_HDFI exttype_##T##_##1 operator-(const exttype_##T##_##1 &other) const { \ + exttype_##T##_##1 result; \ + result.x = x - other.x; \ + return result; \ + } \ + DACE_HDFI exttype_##T##_##1 operator/(const exttype_##T##_##1 &other) const { \ + exttype_##T##_##1 result; \ + result.x = x / other.x; \ + return result; \ + } \ + template \ + DACE_HDFI exttype_##T##_##1 operator*(const U &other) const { \ + exttype_##T##_##1 result; \ + result.x = other * x; \ + return result; \ + } \ + template \ + DACE_HDFI exttype_##T##_##1 
operator+(const U &other) const { \ + exttype_##T##_##1 result; \ + result.x = other + x; \ + return result; \ + } \ + template \ + DACE_HDFI exttype_##T##_##1 operator-(const U &other) const { \ + exttype_##T##_##1 result; \ + result.x = x - other; \ + return result; \ + } \ + template \ + DACE_HDFI exttype_##T##_##1 operator/(const U &other) const { \ + exttype_##T##_##1 result; \ + result.x = x / other; \ + return result; \ + } \ + template \ + DACE_HDFI T operator[](const U &index) const { \ + return x; \ + } \ + }; +#define DEFINE_EXTTYPE2(T, NAME) \ + struct exttype_##T##_##2 : NAME##2 { \ + DACE_HDFI exttype_##T##_##2 operator*(const exttype_##T##_##2 &other) const { \ + exttype_##T##_##2 result; \ + result.x = other.x * x; \ + result.y = other.y * y; \ + return result; \ + } \ + DACE_HDFI exttype_##T##_##2 operator+(const exttype_##T##_##2 &other) const { \ + exttype_##T##_##2 result; \ + result.x = other.x + x; \ + result.y = other.y + y; \ + return result; \ + } \ + DACE_HDFI exttype_##T##_##2 operator-(const exttype_##T##_##2 &other) const { \ + exttype_##T##_##2 result; \ + result.x = x - other.x; \ + result.y = y - other.y; \ + return result; \ + } \ + DACE_HDFI exttype_##T##_##2 operator/(const exttype_##T##_##2 &other) const { \ + exttype_##T##_##2 result; \ + result.x = x / other.x; \ + result.y = y / other.y; \ + return result; \ + } \ + template \ + DACE_HDFI exttype_##T##_##2 operator*(const U &other) const { \ + exttype_##T##_##2 result; \ + result.x = other * x; \ + result.y = other * y; \ + return result; \ + } \ + template \ + DACE_HDFI exttype_##T##_##2 operator+(const U &other) const { \ + exttype_##T##_##2 result; \ + result.x = other + x; \ + result.y = other + y; \ + return result; \ + } \ + template \ + DACE_HDFI exttype_##T##_##2 operator-(const U &other) const { \ + exttype_##T##_##2 result; \ + result.x = x - other; \ + result.y = y - other; \ + return result; \ + } \ + template \ + DACE_HDFI exttype_##T##_##2 operator/(const U &other) const { \ + exttype_##T##_##2 result; \ + result.x = x / other; \ + result.y = y / other; \ + return result; \ + } \ + template \ + DACE_HDFI T operator[](const U &index) const { \ + if (index == U(0)) return x; \ + return y; \ + } \ + }; +#define DEFINE_EXTTYPE3(T, NAME) \ + struct exttype_##T##_##3 : NAME##3 { \ + DACE_HDFI exttype_##T##_##3 operator*(const exttype_##T##_##3 &other) const { \ + exttype_##T##_##3 result; \ + result.x = other.x * x; \ + result.y = other.y * y; \ + result.z = other.z * z; \ + return result; \ + } \ + DACE_HDFI exttype_##T##_##3 operator+(const exttype_##T##_##3 &other) const { \ + exttype_##T##_##3 result; \ + result.x = other.x + x; \ + result.y = other.y + y; \ + result.z = other.z + z; \ + return result; \ + } \ + DACE_HDFI exttype_##T##_##3 operator-(const exttype_##T##_##3 &other) const { \ + exttype_##T##_##3 result; \ + result.x = x - other.x; \ + result.y = y - other.y; \ + result.z = z - other.z; \ + return result; \ + } \ + DACE_HDFI exttype_##T##_##3 operator/(const exttype_##T##_##3 &other) const { \ + exttype_##T##_##3 result; \ + result.x = x / other.x; \ + result.y = y / other.y; \ + result.z = z / other.z; \ + return result; \ + } \ + template \ + DACE_HDFI exttype_##T##_##3 operator*(const U &other) const { \ + exttype_##T##_##3 result; \ + result.x = other * x; \ + result.y = other * y; \ + result.z = other * z; \ + return result; \ + } \ + template \ + DACE_HDFI exttype_##T##_##3 operator+(const U &other) const { \ + exttype_##T##_##3 result; \ + result.x = other + x; \ + 
result.y = other + y; \ + result.z = other + z; \ + return result; \ + } \ + template \ + DACE_HDFI exttype_##T##_##3 operator-(const U &other) const { \ + exttype_##T##_##3 result; \ + result.x = x - other; \ + result.y = y - other; \ + result.z = z - other; \ + return result; \ + } \ + template \ + DACE_HDFI exttype_##T##_##3 operator/(const U &other) const { \ + exttype_##T##_##3 result; \ + result.x = x / other; \ + result.y = y / other; \ + result.z = z / other; \ + return result; \ + } \ + template \ + DACE_HDFI T operator[](const U &index) const { \ + if (index == U(0)) return x; \ + else if (index == U(1)) return y; \ + return z; \ + } \ + }; +#define DEFINE_EXTTYPE4(T, NAME) \ + struct exttype_##T##_##4 : NAME##4 { \ + DACE_HDFI exttype_##T##_##4 operator*(const exttype_##T##_##4 &other) const { \ + exttype_##T##_##4 result; \ + result.x = other.x * x; \ + result.y = other.y * y; \ + result.z = other.z * z; \ + result.w = other.w * w; \ + return result; \ + } \ + DACE_HDFI exttype_##T##_##4 operator+(const exttype_##T##_##4 &other) const { \ + exttype_##T##_##4 result; \ + result.x = other.x + x; \ + result.y = other.y + y; \ + result.z = other.z + z; \ + result.w = other.w + w; \ + return result; \ + } \ + DACE_HDFI exttype_##T##_##4 operator-(const exttype_##T##_##4 &other) const { \ + exttype_##T##_##4 result; \ + result.x = x - other.x; \ + result.y = y - other.y; \ + result.z = z - other.z; \ + result.w = w - other.w; \ + return result; \ + } \ + DACE_HDFI exttype_##T##_##4 operator/(const exttype_##T##_##4 &other) const { \ + exttype_##T##_##4 result; \ + result.x = x / other.x; \ + result.y = y / other.y; \ + result.z = z / other.z; \ + result.w = w / other.w; \ + return result; \ + } \ + template \ + DACE_HDFI exttype_##T##_##4 operator*(const U &other) const { \ + exttype_##T##_##4 result; \ + result.x = other * x; \ + result.y = other * y; \ + result.z = other * z; \ + result.w = other * w; \ + return result; \ + } \ + template \ + DACE_HDFI exttype_##T##_##4 operator+(const U &other) const { \ + exttype_##T##_##4 result; \ + result.x = other + x; \ + result.y = other + y; \ + result.z = other + z; \ + result.w = other + w; \ + return result; \ + } \ + template \ + DACE_HDFI exttype_##T##_##4 operator-(const U &other) const { \ + exttype_##T##_##4 result; \ + result.x = x - other; \ + result.y = y - other; \ + result.z = z - other; \ + result.w = w - other; \ + return result; \ + } \ + template \ + DACE_HDFI exttype_##T##_##4 operator/(const U &other) const { \ + exttype_##T##_##4 result; \ + result.x = x / other; \ + result.y = y / other; \ + result.z = z / other; \ + result.w = w / other; \ + return result; \ + } \ + template \ + DACE_HDFI T operator[](const U &index) const { \ + if (index == U(0)) return x; \ + else if (index == U(1)) return y; \ + else if (index == U(2)) return z; \ + return w; \ + } \ + }; + +#define DEFINE_ALL_EXT_TYPES(T, NAME) \ + DEFINE_EXTTYPE1(T, NAME); \ + DEFINE_EXTTYPE2(T, NAME); \ + DEFINE_EXTTYPE3(T, NAME); \ + DEFINE_EXTTYPE4(T, NAME); +#define DEFINE_TWO_EXT_TYPES(T, NAME) \ + DEFINE_EXTTYPE1(T, NAME); \ + DEFINE_EXTTYPE2(T, NAME); + +DEFINE_ALL_EXT_TYPES(int8, char); +DEFINE_ALL_EXT_TYPES(uint8, uchar); +DEFINE_ALL_EXT_TYPES(int16, short); +DEFINE_ALL_EXT_TYPES(uint16, ushort); +DEFINE_ALL_EXT_TYPES(int32, int); +DEFINE_ALL_EXT_TYPES(uint32, uint); +DEFINE_ALL_EXT_TYPES(int64, longlong); +DEFINE_TWO_EXT_TYPES(uint64, ulonglong); +DEFINE_ALL_EXT_TYPES(float32,float); +DEFINE_ALL_EXT_TYPES(float64,double); + 
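+
+// Illustrative usage sketch (hypothetical, not exercised by the runtime itself;
+// assumes the float32 typedef from types.h): the extended types behave like the
+// underlying CUDA vector types, but additionally support elementwise arithmetic
+// and component indexing via the operators defined above:
+//
+//   exttype_float32_2 a, b;
+//   a.x = 1.0f; a.y = 2.0f;
+//   b.x = 3.0f; b.y = 4.0f;
+//   exttype_float32_2 c = a + b;   // c.x == 4.0f, c.y == 6.0f
+//   float32 first = c[0];          // operator[] returns the x component for index 0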
+///////////////////////////////////////////////////////////////////////////// + +#define DEFINE_VECTYPE(T, N) \ + template<> \ + struct _vtype \ + { \ + typedef exttype_##T##_##N aligned; \ + typedef aligned unaligned; \ + }; +#define DEFINE_ARRVECTYPE(T, N) \ + template<> \ + struct _vtype \ + { \ + typedef T aligned[N]; \ + typedef aligned unaligned; \ + }; + + + DEFINE_VECTYPE(int8, 2); + DEFINE_VECTYPE(int8, 3); + DEFINE_VECTYPE(int8, 4); + DEFINE_VECTYPE(uint8, 2); + DEFINE_VECTYPE(uint8, 3); + DEFINE_VECTYPE(uint8, 4); + DEFINE_VECTYPE(int16, 2); + DEFINE_VECTYPE(int16, 3); + DEFINE_VECTYPE(int16, 4); + DEFINE_VECTYPE(uint16, 2); + DEFINE_VECTYPE(uint16, 3); + DEFINE_VECTYPE(uint16, 4); + DEFINE_VECTYPE(int32, 2); + DEFINE_VECTYPE(int32, 3); + DEFINE_VECTYPE(int32, 4); + DEFINE_ARRVECTYPE(int32, 8); + DEFINE_VECTYPE(uint32, 2); + DEFINE_VECTYPE(uint32, 3); + DEFINE_VECTYPE(uint32, 4); + DEFINE_ARRVECTYPE(uint32, 8); + DEFINE_VECTYPE(int64, 2); + DEFINE_VECTYPE(uint64, 2); + DEFINE_VECTYPE(float32, 2); + DEFINE_VECTYPE(float32, 3); + DEFINE_VECTYPE(float32, 4); + DEFINE_VECTYPE(float64, 2); diff --git a/dace/runtime/include/dace/cudainterop.h b/dace/runtime/include/dace/cudainterop.h new file mode 100644 index 0000000000..0221e9d724 --- /dev/null +++ b/dace/runtime/include/dace/cudainterop.h @@ -0,0 +1,109 @@ +#ifndef __DACE_CUDAINTEROP_H +#define __DACE_CUDAINTEROP_H + +#ifdef WITH_CUDA +#include +#else + +// CUDA interoperability (defining external functions without having to include +// cuda_runtime.h) +typedef int cudaError_t; +typedef void *cudaStream_t; +typedef void *cudaEvent_t; +enum cudaMemcpyKind { + cudaMemcpyHostToHost = 0, + cudaMemcpyHostToDevice = 1, + cudaMemcpyDeviceToHost = 2, + cudaMemcpyDeviceToDevice = 3, + cudaMemcpyDefault = 4 +}; + +#include "cuda/cudacommon.cuh" + +extern "C" +{ + cudaError_t cudaMalloc(void **devPtr, size_t size); + cudaError_t cudaFree(void *devPtr); + cudaError_t cudaMallocHost(void **devPtr, size_t size); + cudaError_t cudaFreeHost(void *devPtr); + cudaError_t cudaMemcpy(void *dst, const void *src, + size_t count, + enum cudaMemcpyKind kind); + cudaError_t cudaMemcpyAsync(void *dst, const void *src, + size_t count, + enum cudaMemcpyKind kind, + cudaStream_t stream = 0); + cudaError_t cudaMemcpy2D(void * dst, + size_t dpitch, + const void * src, + size_t spitch, + size_t width, + size_t height, + enum cudaMemcpyKind kind); + cudaError_t cudaMemcpy2DAsync(void * dst, + size_t dpitch, + const void * src, + size_t spitch, + size_t width, + size_t height, + enum cudaMemcpyKind kind, + cudaStream_t stream = 0); + cudaError_t cudaMemsetAsync(void *dst, int value, + size_t count, + cudaStream_t stream = 0); +} + + +template cudaError_t cudaMalloc(T **devPtr, size_t size) { + return cudaMalloc((void **)devPtr, size); +} +template cudaError_t cudaFree(T *devPtr) { + return cudaFree((void *)devPtr); +} +template cudaError_t cudaMallocHost(T **devPtr, size_t size) { + return cudaMallocHost((void **)devPtr, size); +} +template cudaError_t cudaFreeHost(T *devPtr) { + return cudaFreeHost((void *)devPtr); +} + +template cudaError_t cudaMemcpy(T *dst, const void *src, + size_t count, + enum cudaMemcpyKind kind) { + return cudaMemcpy((void *)dst, src, count, kind); +} +template cudaError_t cudaMemcpyAsync(T *dst, const void *src, + size_t count, + enum cudaMemcpyKind kind, + cudaStream_t stream = 0) { + return cudaMemcpyAsync((void *)dst, src, count, kind, stream); +} + +template cudaError_t cudaMemcpy2D(T * dst, + size_t dpitch, + const void * src, + 
size_t spitch, + size_t width, + size_t height, + enum cudaMemcpyKind kind) { + return cudaMemcpy2D((void*)dst, dpitch, src, spitch, width, height, kind); +} +template cudaError_t cudaMemcpy2DAsync(T * dst, + size_t dpitch, + const void * src, + size_t spitch, + size_t width, + size_t height, + enum cudaMemcpyKind kind, + cudaStream_t stream = 0) { + return cudaMemcpy2DAsync((void*)dst, dpitch, src, spitch, width, height, kind, stream); +} +template cudaError_t cudaMemsetAsync(T *dst, int value, + size_t count, + cudaStream_t stream = 0) { + return cudaMemsetAsync((void *)dst, value, count, stream); +} + +#endif // WITH_CUDA + +#endif // __DACE_CUDAINTEROP_H diff --git a/dace/runtime/include/dace/dace.h b/dace/runtime/include/dace/dace.h new file mode 100644 index 0000000000..dd79a98eaf --- /dev/null +++ b/dace/runtime/include/dace/dace.h @@ -0,0 +1,36 @@ +#ifndef __DACE_RUNTIME_H +#define __DACE_RUNTIME_H + +// Necessary headers +#include +#include +#include +#include +#include + +// The order in which these are included matters - sorting them +// alphabetically causes compilation to fail. +#include "types.h" +#include "vector.h" +#include "intset.h" +#include "math.h" +#include "complex.h" +#include "pyinterop.h" +#include "copy.h" +#include "view.h" +#include "stream.h" +#include "os.h" + +#ifdef __CUDACC__ +#include "cuda/copy.cuh" +#include "cuda/dynmap.cuh" +#else +#include "cudainterop.h" +#endif + +#ifdef DACE_XILINX +#include "xilinx/host.h" +#endif + + +#endif // __DACE_RUNTIME_H diff --git a/dace/runtime/include/dace/intset.h b/dace/runtime/include/dace/intset.h new file mode 100644 index 0000000000..a00eb50ffc --- /dev/null +++ b/dace/runtime/include/dace/intset.h @@ -0,0 +1,138 @@ +#ifndef __DACE_INTSET_H +#define __DACE_INTSET_H + +// Iterable integer sets for compiler inference and automatic unrolling + +#include +#include + +#include "types.h" + +namespace dace +{ + + template + struct const_int_range; + + template + struct const_int_range { + static constexpr size_t dims = 1; + static constexpr int size = (RangeEnd - RangeBegin + RangeSkip - 1) / RangeSkip; + + static DACE_CONSTEXPR DACE_HDFI size_t len(size_t dim) { + return (RangeEnd - RangeBegin + RangeSkip - 1) / RangeSkip; + } + static DACE_CONSTEXPR DACE_HDFI int index_value(const size_t range_value, + const size_t /*dimension*/) { + return RangeBegin + (range_value % size) * RangeSkip; + } + }; + + template + struct const_int_range { + static constexpr size_t dims = sizeof...(ArgRanges) / 3 + 1; + static constexpr int size = const_int_range::size * + const_int_range::size; + + static DACE_CONSTEXPR DACE_HDFI size_t len(size_t dim) { + const int _ranges[] = { RangeBegin, RangeEnd, RangeSkip, ArgRanges... }; + return (_ranges[3 * dim + 1] - _ranges[3 * dim] + _ranges[3 * dim + 2] - 1) / _ranges[3 * dim + 2]; + } + + static DACE_CONSTEXPR DACE_HDFI int index_value(const size_t range_value, const size_t dimension) { + const int _ranges[] = { RangeBegin, RangeEnd, RangeSkip, ArgRanges... }; + if (dimension == dims - 1) + return _ranges[3 * dimension] + + (range_value % len(dimension)) * _ranges[3 * dimension + 2]; + auto value = range_value; + for (auto dim = dimension + 1; dim < dims; ++dim) { + value /= len(dim); + } + return _ranges[3 * dimension] + + (value % len(dimension)) * _ranges[3 * dimension + 2]; + } + + static DACE_CONSTEXPR DACE_HDFI std::array index_values(const size_t range_value) { + std::array values{}; + const int _ranges[] = { RangeBegin, RangeEnd, RangeSkip, ArgRanges... 
}; + auto value = range_value; + for (int dim = dims - 1; dim >= 0; --dim) { + values[dim] = _ranges[3 * dim] + + (value % len(dim)) * _ranges[3 * dim + 2]; + value /= len(dim); + } + return values; + } + }; + + template + class int_range { + static constexpr size_t kDims = sizeof...(ArgRanges); + + private: + const std::array, kDims> _ranges; + std::array _range_lengths; + const int _total_length; + + // For some reason constexpr works even when passing a runtime size + DACE_HDFI int _calc_length() { + // Hopefully the compiler vectorizes this + size_t total_length = 1; + for (size_t i = 0; i < kDims; ++i) { + auto length = (std::get<1>(_ranges[i]) - std::get<0>(_ranges[i]) + + std::get<2>(_ranges[i]) - 1) / + std::get<2>(_ranges[i]); + _range_lengths[i] = length; + total_length *= length; + } + return total_length; + } + + public: + DACE_HDFI int_range(ArgRanges &&... ranges) + : _ranges({ ranges... }), _total_length(_calc_length()) { + // -std=c++1z + // (_ranges.push_back(ranges), ...); + } + + DACE_HDFI int size() const { return _total_length; } + + DACE_HDFI int index_value(const size_t range_value, + const size_t dimension) const { + if (dimension == kDims - 1) + return std::get<0>(_ranges[dimension]) + + (range_value % _range_lengths[dimension]) * + std::get<2>(_ranges[dimension]); + auto value = range_value; + for (auto dim = dimension + 1; dim < kDims; ++dim) { + value /= _range_lengths[dim]; + } + return std::get<0>(_ranges[dimension]) + + (value % _range_lengths[dimension]) * + std::get<2>(_ranges[dimension]); + } + DACE_HDFI std::array index_values( + const size_t range_value) const { + std::array values; + auto value = range_value; + for (int dim = kDims - 1; dim >= 0; --dim) { + values[dim] = std::get<0>(_ranges[dim]) + + (value % _range_lengths[dim]) * + std::get<2>(_ranges[dim]); + value /= _range_lengths[dim]; + } + return values; + } + + }; + + template + DACE_HDFI int_range make_range(ArgRanges &&... ranges) { + return int_range(std::forward(ranges)...); + } + + + +} // namespace dace + +#endif // __DACE_INTSET_H diff --git a/dace/runtime/include/dace/math.h b/dace/runtime/include/dace/math.h new file mode 100644 index 0000000000..8c5729ba4c --- /dev/null +++ b/dace/runtime/include/dace/math.h @@ -0,0 +1,196 @@ +#ifndef __DACE_MATH_H +#define __DACE_MATH_H + +#include "pi.h" +#include "types.h" + +#include +#include +#include + + +// dace::math: A namespace that contains typeless math functions + +// Math functions that are Python/sympy built-ins and must reside outside +// of the DaCe namespace for ease of code generation + +// Math and python builtins +using std::abs; + +// Ternary workarounds so that vector types work +// template +// DACE_CONSTEXPR DACE_HDFI T min(const T& a, const T& b) { +// return (a < b) ? a : b; +// } +// template +// DACE_CONSTEXPR DACE_HDFI T max(const T& a, const T& b) { +// return (a > b) ? a : b; +// } + +template +DACE_CONSTEXPR DACE_HDFI T min(const T& val) +{ + return val; +} +template +DACE_CONSTEXPR DACE_HDFI T min(const T& a, const T& b, const Ts&... c) +{ + return (a < b) ? min(a, c...) : min(b, c...); +} + +template +DACE_CONSTEXPR DACE_HDFI T max(const T& val) +{ + return val; +} +template +DACE_CONSTEXPR DACE_HDFI T max(const T& a, const T& b, const Ts&... c) +{ + return (a > b) ? max(a, c...) 
: max(b, c...); +} + +template +static DACE_CONSTEXPR DACE_HDFI T Mod(const T& value, const T& modulus) { + return value % modulus; +} + +template +static DACE_CONSTEXPR DACE_HDFI T int_ceil(const T& numerator, const T& denominator) { + return (numerator + denominator - 1) / denominator; +} + +static DACE_CONSTEXPR DACE_HDFI int ceiling(int arg) { + return arg; +} + +static DACE_HDFI float ceiling(float /*arg*/) { + return FLT_MAX; +} + +static DACE_HDFI double ceiling(double /*arg*/) { + return DBL_MAX; +} + +template +static DACE_CONSTEXPR DACE_HDFI T int_floor(const T& numerator, const T& denominator) { + return numerator / denominator; +} + +template +static DACE_CONSTEXPR DACE_HDFI int sgn(T val) { + return (T(0) < val) - (val < T(0)); +} + +#ifndef DACE_SYNTHESIS + +// Computes integer floor, rounding the remainder towards negative infinity. +// Assuming inputs are of integer type. +template +static DACE_CONSTEXPR DACE_HDFI T int_floor_ni(const T& numerator, const T& denominator) { + T quotient = numerator / denominator; + T remainder = numerator % denominator; + // This doesn't work properly if both numbers have sign 0. + // However, in this case we crash due to division by 0. + if (sgn(numerator) + sgn(denominator) == 0 && remainder > 0) + return quotient - 1; + return quotient; +} + +#endif + +namespace dace +{ + namespace math + { + static DACE_CONSTEXPR typeless_pi pi{}; + ////////////////////////////////////////////////////// + template + DACE_CONSTEXPR DACE_HDFI T exp(const T& a) + { + return (T)std::exp(a); + } + + template + DACE_CONSTEXPR DACE_HDFI T pow(const T& a, const T& b) + { + return (T)std::pow(a, b); + } + +#ifndef DACE_XILINX + static DACE_CONSTEXPR DACE_HDFI int pow(const int& a, const int& b) + { +/*#ifndef __CUDA_ARCH__ + return std::pow(a, b); +#else*/ + if (b < 0) return 0; + int result = 1; + for (int i = 0; i < b; ++i) + result *= a; + return result; +//#endif + } + static DACE_CONSTEXPR DACE_HDFI unsigned int pow(const unsigned int& a, + const unsigned int& b) + { +/*#ifndef __CUDA_ARCH__ + return std::pow(a, b); +#else*/ + unsigned int result = 1; + for (unsigned int i = 0; i < b; ++i) + result *= a; + return result; +//#endif + } +#endif + template + DACE_CONSTEXPR DACE_HDFI T pow(const T& a, const int& b) + { + return (T)std::pow(a, (T)b); + } + template + DACE_CONSTEXPR DACE_HDFI T pow(const T& a, const unsigned int& b) + { + return (T)std::pow(a, (T)b); + } + + template + DACE_CONSTEXPR DACE_HDFI int ifloor(const T& a) + { + return (int)std::floor(a); + } + + template + DACE_CONSTEXPR DACE_HDFI T sin(const T& a) + { + return std::sin(a); + } + template + DACE_CONSTEXPR DACE_HDFI T cos(const T& a) + { + return std::cos(a); + } + template + DACE_CONSTEXPR DACE_HDFI T sqrt(const T& a) + { + return std::sqrt(a); + } + template + DACE_CONSTEXPR DACE_HDFI T log(const T& a) + { + return std::log(a); + } + } + + namespace cmath + { + template + DACE_CONSTEXPR std::complex exp(const std::complex& a) + { + return std::exp(a); + } + } + +} + + +#endif // __DACE_MATH_H diff --git a/dace/runtime/include/dace/os.h b/dace/runtime/include/dace/os.h new file mode 100644 index 0000000000..725a7cec60 --- /dev/null +++ b/dace/runtime/include/dace/os.h @@ -0,0 +1,40 @@ +#pragma once + +#include +#include +#include + +#ifdef _MSC_VER +inline int setenv(const char *name, const char *value, int overwrite) +{ + int errcode = 0; + if (!overwrite) { + size_t envsize = 0; + errcode = getenv_s(&envsize, NULL, 0, name); + if (errcode || envsize) return errcode; + } + return 
_putenv_s(name, value); +} +inline int unsetenv(const char *name) +{ + return _putenv_s(name, ""); +} +#endif // _MSC_VER + +namespace dace { + + + +inline void set_environment_variable(std::string const &key, + std::string const &val) { + const auto ret = setenv(key.c_str(), val.c_str(), 1); + if (ret != 0) { + throw std::runtime_error("Failed to set environment variable " + key); + } +} + +inline void unset_environment_variable(std::string const &key) { + unsetenv(key.c_str()); +} + +} // End namespace dace diff --git a/dace/runtime/include/dace/perf/instrumentation.h b/dace/runtime/include/dace/perf/instrumentation.h new file mode 100644 index 0000000000..ec1e811f7e --- /dev/null +++ b/dace/runtime/include/dace/perf/instrumentation.h @@ -0,0 +1,844 @@ +#pragma once + + +#include +#include +#include +#include +#include +#include +#include +#include +#include +#include +#include + + +#ifdef __x86_64__ // We don't support i386 (macro: __i386__) +#ifdef __GNUC__ +#include +#define DACE_PERF_mfence _mm_mfence() +#else +// Left #TODO for other compilers +#define DACE_PERF_mfence /* Default: NO FENCE AVAILABLE*/ + +#endif +#else +#define DACE_PERF_mfence /* Default: NO FENCE AVAILABLE*/ +#endif + +#ifndef DACE_INSTRUMENTATION_FAST_AND_DANGEROUS +#define TEST_ALIGNMENT +#endif +//#define PAPI_EXPLICIT_THREADS // Define to use explicit thread assignment. Docs say it's not necessary. + +#ifdef DACE_INSTRUMENTATION_FAST_AND_DANGEROUS +#define SKIP_RETVAL_CHECKS +#endif +#ifndef DACE_INSTRUMENTATION_FAST_AND_DANGEROUS +#define CHECK_BOUNDS +#endif +//#define LOG_ERRORS // define to create errors.log +#define FAST_ASSERTS // disable some of the slower asserts. + +#define NO_RUNTIME_BYTEMOVEMENT_ACCUMULATION // Define to disable byte movement recording. Defining this can reduce cache line ping pong + +//#define ASSIGN_COMPONENT // Assigns the component explicitly. This should not be enabled for 2 reasons: 1) PAPI_start() already does this, and 2) there might be a tiny to medium overhead when enabling twice +namespace dace_perf +{ + + constexpr uint32_t invalid_node_id = std::numeric_limits::max(); + +void logError(const std::string& str) +{ + #ifdef LOG_ERRORS + FILE* f = fopen("errors.log", "a"); + if(f) { + fprintf(f, "%s\n", str.c_str()); + fclose(f); + } + #endif +} + +template class PAPIPerfLowLevel; + +constexpr size_t CACHE_LINE_SIZE = 64; + +template +class AlignedElement +{ +public: + + static constexpr size_t alignment_padding = (sizeof(T) == Alignment) ? (0) : (Alignment - (sizeof(T) & (Alignment - 1))); + + AlignedElement() = default; + AlignedElement(const T& x) : m_elem(x) {} + AlignedElement(T&& x) : m_elem(x) {} + + operator T&() { + static_assert(sizeof(*this) % Alignment == 0, "Aligned Element is bugged"); + return m_elem; + } + + ~AlignedElement() = default; + +private: + T m_elem; + uint8_t m_padding[alignment_padding]; +}; + +template +class AlignedContainer +{ +public: + static constexpr auto realsize = (sizeof(T) == Alignment) ? 
(sizeof(T)) : ((sizeof(T) + Alignment) & ~(Alignment - 1)); + static_assert(realsize >= sizeof(T), "Realsize is less than an object!"); + static_assert(realsize <= sizeof(T) + Alignment, "Realsize larger than necessary"); + static_assert(realsize == sizeof(AlignedElement), "realsize should be identical to aligned element size"); + + AlignedContainer() + : m_rawdat(nullptr), align_offset(std::numeric_limits::max()), m_size(0), m_alloc_size(0) + { + + } + + ~AlignedContainer() + { + clear(); + } + + void resize(size_t n) + { + logError("Buffer resized"); + clear(); + if(n == m_size) + { + initialize_elements(); // Reset the elements + return; + } + m_alloc_size = (n + 1) * realsize * sizeof(*m_rawdat.get()); + m_rawdat.reset(new uint8_t[m_alloc_size]); + if(m_rawdat == nullptr) + { + logError("Failed to allocate buffer"); + } + + align_offset = Alignment - (reinterpret_cast(m_rawdat.get()) & (Alignment - 1)); + + m_size = n; + initialize_elements(); + } + + void initialize_elements() + { + auto* data = elementArray(); + assert(data != nullptr); + for(size_t i = 0; i < m_size; ++i) + { + data[i] = AlignedElement(); + + #ifdef CHECK_BOUNDS + assert((uint8_t*)&data[i] <= m_rawdat.get() + m_alloc_size); + #endif + } + } + + void clear() + { + if(align_offset == std::numeric_limits::max()) + return; + for(size_t i = 0; i < m_size; ++i) + { + (*this)[i].~T(); + } + } + + size_t size() const { + return m_size; + } + + AlignedElement* elementArray() const + { + #ifdef CHECK_BOUNDS + assert(align_offset != std::numeric_limits::max()); + assert(m_rawdat.get() != nullptr); + #endif + auto* ptr = reinterpret_cast*>(m_rawdat.get() + align_offset); + #ifdef CHECK_BOUNDS + assert((uint8_t*)ptr > m_rawdat.get() && (uint8_t*)ptr < m_rawdat.get() + m_alloc_size && "out of bounds"); + #endif + return ptr; + } + + T& operator[](size_t index) + { + auto* ptr = &(elementArray()[index]); + #ifdef CHECK_BOUNDS + assert((uint8_t*)ptr > m_rawdat.get() && (uint8_t*)ptr + 1 < m_rawdat.get() + m_alloc_size && "out of bounds"); + #endif + return *ptr; + } + +private: + //uint8_t* m_rawdat; + std::unique_ptr m_rawdat; + size_t align_offset; + size_t m_size; + size_t m_alloc_size; +}; + + +class PAPI +{ +public: + + static void init() + { + init_library(); + init_threads(); + + logError("Papi initialized"); + } + + static void init_library() + { + const auto r_init = PAPI_library_init(PAPI_VER_CURRENT); + #ifndef SKIP_RETVAL_CHECKS + if(r_init != PAPI_VER_CURRENT && r_init != PAPI_OK) + { + std::cerr << "Failed to init PAPI" << std::endl; + PAPI_perror("Error: "); + } + #endif + } + + static void init_threads() + { + const auto r_init = ::PAPI_thread_init((long unsigned int (*)())omp_get_thread_num); + #ifndef SKIP_RETVAL_CHECKS + if(r_init != PAPI_VER_CURRENT && r_init != PAPI_OK) + { + std::cerr << "Failed to init PAPI threads code " << r_init << std::endl; + PAPI_perror("Error: "); + } + #endif + } + + static void init_multiplexing() + { + const auto r_init = ::PAPI_multiplex_init(); + #ifndef SKIP_RETVAL_CHECKS + if(r_init != PAPI_VER_CURRENT && r_init != PAPI_OK) + { + std::cerr << "Failed to init PAPI multiplexing, code " << r_init << std::endl; + PAPI_perror("Error: "); + } + #endif + } + + static double getTimePerCycle(); +private: +}; + + +enum class ValueSetType + : uint32_t +{ + Default = 0, + Raw, + OMP_marker_parfor_start, + OMP_marker_parfor_end, + + marker_section_start, + marker_supersection_start, + + Copy, + CounterOverride, +}; + +template +class PAPIValueSetInternal +{ +public: + + 
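+    // One measurement record: a long long slot per PAPI event plus the
+    // (node, core, iteration, flags) metadata identifying where it was taken.
+    // As a rough illustration (hypothetical node/core values), toStoreFormat()
+    // renders a default entry as:
+    //
+    //   # entry (42, 3, 0, 0)
+    //   <event code>: <counter value>
+    //   ...
+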
PAPIValueSetInternal() + : m_flags(ValueSetType::Default) + { + + } + + PAPIValueSetInternal(uint32_t nodeid, uint32_t coreid, uint32_t iteration, ValueSetType flags = ValueSetType::Default) + : m_nodeid(nodeid), m_coreid(coreid), m_iteration(iteration), m_flags(flags) + { + + } + + ~PAPIValueSetInternal() + { + if(standalone) + { + // report in destructor + std::cout << "Value set destroyed" << std::endl; + size_t index = 0; + PAPI_event_info_t info; + for(const auto& e : {events...}) + { + PAPI_get_event_info(e, &info); + std::cout << info.symbol << ": " << m_values[index] << std::endl; + ++index; + } + } + } + + std::string toStoreFormat(int* event_override = nullptr) const + { + int event_tags[] = {events...}; + if(event_override) + { + std::copy(event_override, event_override + sizeof...(events), event_tags); + } + std::string ret = "# entry"; + + if(m_nodeid != invalid_node_id && (m_flags == ValueSetType::Default || m_flags == ValueSetType::Copy)) + { + ret += " (" + std::to_string(m_nodeid) + ", " + std::to_string(m_coreid) + ", " + std::to_string(m_iteration) + ", " + std::to_string((int)m_flags) + ")\n"; + } + else if(m_flags == ValueSetType::OMP_marker_parfor_start) + { + ret = "# LOOP START\n"; + return ret; + } + else if(m_flags == ValueSetType::marker_section_start) + { + ret = "# Section start (node " + std::to_string(m_nodeid) + ", core " + std::to_string(m_coreid) + ")\n"; + ret += "bytes: " + std::to_string(m_values[0]) + "\n"; + return ret; + } + else if(m_flags == ValueSetType::marker_supersection_start) + { + ret = "# Supersection start (node " + std::to_string(m_nodeid) + ")\n"; + return ret; + } + else + ret += "\n"; + + //constexpr auto evcount = sizeof...(events); + size_t i = 0; + + for(const auto& e : event_tags) + { + if(e == 0) continue; // Skip unnecessary/invalid entries + ret += std::to_string(e) + ": " + std::to_string(m_values[i]) + "\n"; + ++i; + } + return ret; + } + + // Return a reference to the value-array + long long (&store())[sizeof...(events)] + { + return m_values; + } + + const long long (&cstore() const)[sizeof...(events)] + { + return m_values; + } + + const ValueSetType& flags() const { return m_flags; } + +private: + // Note: If we keep adding variables we will at some point run into issues. Right now, we should probably aim for 64 bytes, so we have enough room for 6 PMCs. + alignas(CACHE_LINE_SIZE) long long m_values[sizeof...(events)]; + uint32_t m_nodeid; // The node in the graph + uint32_t m_coreid; // The ID of the core. + uint32_t m_iteration; // The iteration (in a given loop) + ValueSetType m_flags; // The flags (mode) for this value set. 
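+
+    // Note: m_values above is alignas(CACHE_LINE_SIZE) and value sets are kept
+    // in an AlignedContainer, so each record occupies its own cache line(s);
+    // this keeps concurrent per-thread counter writes from false sharing.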
+}; + +template +using PAPIValueSet = PAPIValueSetInternal; + + + constexpr auto store_path = "instrumentation_results.txt"; +// Class to store the value sets during execution and writing it to disk after execution +// For now, the store is dynamic +template +class PAPIValueStore +{ + using byte_counter_size_t = uint64_t; +public: + static constexpr size_t store_reserve_size = 4096 * 1024; + PAPIValueStore() + { + assert(m_moved_bytes.is_lock_free() && "Moved byte counter is not lockfree!"); + // Skip first few growth operations + //m_values.reserve(store_reserve_size); + m_values.resize(store_reserve_size); + + // Remove previous instrumentation data + //m_store_file = fopen(store_path, "wb"); + m_store_file = fopen(store_path, "ab"); + if(!m_store_file) + { + std::cerr << "Failed to open result file" << std::endl; + } + + m_insertion_position = 0; + m_moved_bytes = 0; + m_contention_value = 0; + + + logError("Value store created"); + } + + ~PAPIValueStore() + { + + flush(); + fclose(m_store_file); + } + + // Provides a thread-safe implementation to increase a counter representing the bytes moved + inline void addBytesMoved(byte_counter_size_t size) + { + #ifndef NO_RUNTIME_BYTEMOVEMENT_ACCUMULATION + byte_counter_size_t oldval = m_moved_bytes; + byte_counter_size_t newval; + do { + newval = oldval + size; + } while(!m_moved_bytes.compare_exchange_strong(oldval, newval)); + #endif + } + + inline byte_counter_size_t collectBytesMoved() + { + byte_counter_size_t oldval = m_moved_bytes; + byte_counter_size_t newval = 0; + do { + // Nothing + } while(!m_moved_bytes.compare_exchange_strong(oldval, newval)); + return oldval; + } + + void flush() + { +#ifdef LOG_ERRORS + FILE* f = fopen("errors.log", "a"); + if(f) { + fprintf(f, "Flushing with pos %ld\n", size_t(this->m_insertion_position)); + fclose(f); + } +#endif + if(!m_store_file) + return; + // We want to store information about what we actually measured + int event_override[sizeof...(events)]; + bool override_events = false; + for(size_t i = 0; i < m_insertion_position; ++i) + { + const auto& e = m_values[i]; + if(e.flags() == ValueSetType::CounterOverride) + { + // We have to override the counter type + for(size_t i = 0; i < sizeof...(events); ++i) + { + event_override[i] = e.cstore()[i]; + } + override_events = true; + } + else + { + if(override_events) + { + fprintf(m_store_file, "%s", e.toStoreFormat(event_override).c_str()); + override_events = false; + } + else + { + fprintf(m_store_file, "%s", e.toStoreFormat().c_str()); + } + } + } + if(m_insertion_position > 0) + { +#ifndef NO_RUNTIME_BYTEMOVEMENT_ACCUMULATION + // Quite expensive using std::to_string, but it adapts to different types... + auto bm = collectBytesMoved(); + fprintf(m_store_file, "# moved_bytes: %s\n", std::to_string(bm).c_str()); +#endif + // Also store contention + uint64_t cont = 0; + cont = m_contention_value.exchange(cont); + + if(cont != 0) + fprintf(m_store_file, "# contention: %s\n", std::to_string(cont).c_str()); + } + m_values.clear(); + //m_values.reserve(store_reserve_size); + m_values.resize(store_reserve_size); + m_insertion_position = 0; + } + + // This is to provide a default sync point. Its effect on the output can be disregarded + void markSuperSectionStart(uint32_t nodeid, ValueSetType flags = ValueSetType::marker_supersection_start) + { + flush(); + PAPIValueSetInternal set(nodeid, 0, 0, flags); + addEntry(set); + } + + // This marks sections in a threadsafe way. 
In principle, instead of being a "barrier" syncing threads, it now just guarantees serial properties per thread (instead of as before, over all threads) + void markSectionStart(uint32_t nodeid, long long SizeInBytes, uint32_t threadid, uint32_t iteration = 0, ValueSetType flags = ValueSetType::marker_section_start) + { + PAPIValueSetInternal set(nodeid, threadid, iteration, flags); + set.store()[0] = SizeInBytes; // Use the first slot for memory information + addEntry(set); + } + + // # TODO: Maybe with forward args? + template + size_t getNewSlotPosition() + { + size_t pos = 0; + bool r_ex = false; + // Lock-free + do { + size_t oldpos = m_insertion_position; + pos = oldpos + slots; + assert(pos <= m_values.size()); + if(pos >= m_values.size()) + { + std::cerr << "Array not big enough!" << std::endl; + + // To keep it running, we just have to flush. For this, we should lock + + // Ideally, we only let the main thread flush (to avoid excess memory movements, especially with NUMA) + // However, it's hard to guarantee termination like this. (What if the main thread is done and will never call getNewSlotPosition()) + if(m_vec_mutex.try_lock()) { + // We got the lock, so we can flush + flush(); + m_vec_mutex.unlock(); + } + else { + // We didn't get the lock, which means that somebody else is already flushing. + // Since we lost a lot of time already, we can just use the lock to wait instead of spinlocking. + m_vec_mutex.lock(); + m_vec_mutex.unlock(); + } + + // We always have to try again to get the new (correct) position + continue; + } + // We use strong exchange here because we don't want to be spinning for a long time. + r_ex = m_insertion_position.compare_exchange_weak(oldpos, pos); + pos = oldpos; + if(!r_ex) + { + ++m_contention_value; + } + } while(!r_ex); + + return pos; + } + + PAPIValueSetInternal& addEntry(const PAPIValueSetInternal& val) + { + auto pos = getNewSlotPosition(); + + m_values[pos] = val; + + return m_values[pos]; + } + + template + PAPIValueSetInternal& getNewValueSet([[maybe_unused]] const PAPIPerfLowLevel& perf, uint32_t nodeid, uint32_t coreid, uint32_t iteration, ValueSetType type = ValueSetType::Default) + { + // If counterevents and store events are not the same, we have an issue. + // The value set must have the same arguments as the store, so it will always print the same counter ids. + // But in this case, the counterids are not the same. We therefore have to mark the entries as invalid so the store can deal with it. + + static_assert(sizeof...(counterevents) <= sizeof...(events), "Counter event size must not exceed store size"); + + int codes[] = {counterevents...}; + int storecodes[] = {events...}; + bool correct_subset = true; + for(size_t i = 0; i < sizeof...(counterevents); ++i) + { + if(codes[i] != storecodes[i]) + { + correct_subset = false; + break; + } + } + if(correct_subset) + { + // If we have a good subset (same order, same start, same entries), we can skip this expensive operation. + return __impl_getNewValueSet(nodeid, coreid, iteration, type); + } + + // Get 2 slots + auto pos = getNewSlotPosition<2>(); + + // Mark the first one to override the counters. + m_values[pos] = PAPIValueSetInternal(nodeid, coreid, iteration, ValueSetType::CounterOverride); + + // Store the events in the "value"-fields. 
Write 0 to unused fields + for(size_t i = 0; i < sizeof...(counterevents); ++i) + { + m_values[pos].store()[i] = codes[i]; + } + for(size_t i = sizeof...(counterevents); i < sizeof...(events); ++i) + { + m_values[pos].store()[i] = 0; + } + } + + inline PAPIValueSetInternal& getNewValueSet([[maybe_unused]] const PAPIPerfLowLevel& perf, uint32_t nodeid, uint32_t coreid, uint32_t iteration, ValueSetType type = ValueSetType::Default) + { + return __impl_getNewValueSet(nodeid, coreid, iteration, type); + } + + PAPIValueSetInternal& __impl_getNewValueSet(uint32_t nodeid, uint32_t coreid, uint32_t iteration, ValueSetType type = ValueSetType::Default) + { + auto& retval = addEntry(PAPIValueSetInternal(nodeid, coreid, iteration, type)); +#ifdef TEST_ALIGNMENT + uintptr_t val = (uintptr_t)&retval; + auto lower_bits = val & (CACHE_LINE_SIZE - 1); + if(lower_bits != 0) + { + std::cerr << "ERROR: LOWER BITS EXPECTED 0, got " << lower_bits << ". This means the values are not aligned!" << std::endl; + } + assert(lower_bits == 0); +#endif + return retval; + } + + PAPIValueSetInternal& getNewValueSet() + { + return getNewValueSet(0, 0, 0); + } +private: + std::recursive_mutex m_vec_mutex; + //aligned_vector> m_values; + //std::vector> m_values; + AlignedContainer> m_values; + + std::atomic m_insertion_position; + std::atomic m_contention_value; + std::atomic m_moved_bytes; + + FILE* m_store_file; +}; + + +// Class similar to PAPIPerf, but allows for much finer grained control +template +class PAPIPerfLowLevel +{ +public: + + PAPIPerfLowLevel(const bool multiplexing = false) + : m_event_set(PAPI_NULL) + { + // Need to synchronize accesses because papi is not threadsafe... + //std::lock_guard guard(papi_mutex); + auto r_create_eventset = PAPI_create_eventset(&m_event_set); + #ifndef SKIP_RETVAL_CHECKS + if(r_create_eventset != PAPI_OK) + { + std::cerr << "Failed to create event set, code " << r_create_eventset << std::endl; + #ifdef LOG_ERRORS + FILE* f = fopen("errors.log", "a"); + if(f) { + fprintf(f, "Failed to create event set with code %d\n", r_create_eventset); + fclose(f); + } + #endif + } + #endif + + #ifdef ASSIGN_COMPONENT + // We need this because multiplexing will otherwise act up. + // Issue is that if we don't do it in the general case as well, the starting will take longer. It's not really a good solution to put it here, though. + PAPI_assign_eventset_component(m_event_set, 0); + #endif + if(multiplexing) + { + #ifndef ASSIGN_COMPONENT + // We need this because multiplexing will otherwise act up. 
+ PAPI_assign_eventset_component(m_event_set, 0); + #endif + this->enable_multiplexing(); + } + int evarr[] = {events...}; + auto r_add_events = PAPI_add_events(m_event_set, evarr, sizeof...(events)); + #ifndef SKIP_RETVAL_CHECKS + if(r_add_events != PAPI_OK) + { + PAPI_cleanup_eventset(m_event_set); + std::cerr << "Failed to add events to event set, code " << r_add_events << std::endl; + #ifdef LOG_ERRORS + FILE* f = fopen("errors.log", "a"); + if(f) { + fprintf(f, "Failed to add events to event set with code %d\n", r_add_events); + fclose(f); + } + #endif + } + #endif +#ifdef PAPI_EXPLICIT_THREADS + const auto r_reg_thread = PAPI_register_thread(); + #ifndef SKIP_RETVAL_CHECKS + if(r_reg_thread != PAPI_OK) + { + std::cerr << "Failed to register thread, code " << r_reg_thread << std::endl; + } + #endif +#endif + } + + + + ~PAPIPerfLowLevel() + { + if(m_event_set != PAPI_NULL) + { + PAPI_cleanup_eventset(m_event_set); + PAPI_destroy_eventset(&m_event_set); +#ifdef PAPI_EXPLICIT_THREADS + PAPI_unregister_thread(); +#endif + } + + } + +protected: + void enable_multiplexing() + { + const auto r_multiplex = PAPI_set_multiplex(m_event_set); + if(r_multiplex != PAPI_OK) + { + std::cerr << "Failed to enable multiplexing, code " << r_multiplex << std::endl; + exit(-1); + } + } +public: + + static PAPIValueSet ValueSet() + { + return PAPIValueSet(); + } + + void enterCritical() + { + //constexpr auto num_events = sizeof...(events); + DACE_PERF_mfence; // Fence before starting to keep misses outside + const auto r_start = PAPI_start(m_event_set); + // #TODO: Check if we can just omit checking on the return value... + #ifndef SKIP_RETVAL_CHECKS + if(r_start != PAPI_OK) + { + std::cerr << "Failed to start counters with code " << r_start << std::endl; + #ifdef LOG_ERRORS + FILE* f = fopen("errors.log", "a"); + if(f) { + fprintf(f, "Failed to start counters with code %d\n", r_start); + fclose(f); + } + #endif + } + #endif + } + + template + void leaveCritical(PAPIValueSetInternal& values) + { + constexpr auto num_events = sizeof...(events); + + // Make sure we have the correct sizes + static_assert(sizeof...(e) >= num_events); + + // Fence before stopping to keep misses inside + DACE_PERF_mfence; + const auto r_stop = PAPI_stop(m_event_set, values.store()); + #ifndef SKIP_RETVAL_CHECKS + // #TODO: Check if we can just omit checking on the return value... + if(r_stop != PAPI_OK) + { + std::cerr << "Failed to stop counters with code " << r_stop << std::endl; + #ifdef LOG_ERRORS + FILE* f = fopen("errors.log", "a"); + if(f) { + fprintf(f, "Failed to stop counters with code %d\n", r_stop); + fclose(f); + } + #endif + } + #endif + // Since we allow storing less, we have to make sure there's no stale data in the value store. Set to an invalid value, i.e. max. 
+ for(auto i = num_events; i < sizeof...(e); ++i) + { + values.store()[i] = std::numeric_limits::max(); + } + } + + std::string listEvents() const + { + std::string ret = "===== Events =====\n"; + PAPI_event_info_t info; + int evlist[128]; + int evsize = sizeof(evlist) / sizeof(*evlist); + const auto r_list_events = PAPI_list_events(m_event_set, evlist, &evsize); + if(r_list_events != PAPI_OK) + { + std::cerr << "Failed in list events, code " << r_list_events << std::endl; + return ret; + } + for(size_t i = 0; i < evsize; ++i) + { + const auto& e = evlist[i]; + const auto r = PAPI_get_event_info(e, &info); + if(r != PAPI_OK) + ret += ""; + else + ret += info.symbol; + ret += "\n"; + } + + return ret; + } +private: + int m_event_set; +}; + +template +using PAPIPerf = PAPIPerfLowLevel; + +// Convenience typedef +typedef PAPIPerfLowLevel PAPIPerfAllMisses; +typedef PAPIPerfLowLevel PAPIPerfDataMisses; + +typedef PAPIValueStore PAPIPerfStoreAllMisses; + + + +inline double PAPI::getTimePerCycle() +{ + using std::chrono::high_resolution_clock; + using std::chrono::duration_cast; + + PAPIPerfLowLevel perf; + auto vs = perf.ValueSet(); + + perf.enterCritical(); + auto start = high_resolution_clock::now(); + for(size_t i = 0; i < 10000000l; ++i) + __asm__ __volatile__ (""); + auto stop = high_resolution_clock::now(); + perf.leaveCritical(vs); + + return double(duration_cast(std::chrono::duration(stop - start)).count()) / double(vs.store()[0]) / 1e6; +} + +}; \ No newline at end of file diff --git a/dace/runtime/include/dace/pi.h b/dace/runtime/include/dace/pi.h new file mode 100644 index 0000000000..0e7464b017 --- /dev/null +++ b/dace/runtime/include/dace/pi.h @@ -0,0 +1,249 @@ +#ifndef __DACE_PI_H +#define __DACE_PI_H + +// Classes that are used to define a typeless Pi + +//#define _USE_MATH_DEFINES +//#include +#ifndef M_PI +#define M_PI 3.14159265358979323846 +#endif + +namespace dace +{ + namespace math + { + ////////////////////////////////////////////////////// + // Defines a typeless Pi + struct typeless_pi + { + double value() const { return M_PI; } + operator int() const + { + return int(this->value()); + } + operator float() const + { + return float(this->value()); + } + operator double() const + { + return double(this->value()); + } + }; + struct typeless_pi_mult : typeless_pi + { + int mult; typeless_pi_mult(int m = 1) : mult(m) {} + double value() const { return mult * M_PI; } + + operator int() const + { + return int(this->value()); + } + operator float() const + { + return float(this->value()); + } + operator double() const + { + return double(this->value()); + } + }; + struct typeless_pi_exp : typeless_pi_mult + { + int mult, exp; typeless_pi_exp(int m = 1, int e = 1) : mult(m), exp(e) {} + double value() const { return mult * std::pow(M_PI, exp); } + operator int() const + { + return int(this->value()); + } + operator float() const + { + return float(this->value()); + } + operator double() const + { + return double(this->value()); + } + }; + inline typeless_pi_mult operator*(const typeless_pi&, const int& num) + { + return typeless_pi_mult(num); + } + inline typeless_pi_mult operator*(const typeless_pi_mult& p, const int& num) + { + return typeless_pi_mult(p.mult * num); + } + inline typeless_pi_exp operator*(const typeless_pi_exp& p, const int& num) + { + return typeless_pi_exp(p.mult * num, p.exp); + } + inline typeless_pi_mult operator*(const int& num, const typeless_pi&) + { + return typeless_pi_mult(num); + } + inline typeless_pi_mult operator*(const int& num, const 
typeless_pi_mult& p) + { + return typeless_pi_mult(num * p.mult); + } + inline typeless_pi_exp operator*(const int& num, const typeless_pi_exp& p) + { + return typeless_pi_exp(num * p.mult, p.exp); + } + template + T operator+(const typeless_pi& p, const T& num) + { + return T(p.value()) + num; + } + template + T operator-(const typeless_pi& p, const T& num) + { + return T(p.value()) - num; + } + + template + T operator*(const typeless_pi& p, const T& num) + { + return T(p.value()) * num; + } + template + T operator/(const typeless_pi& p, const T& num) + { + return T(p.value()) / num; + } + template + T operator+(const T& num, const typeless_pi& p) + { + return num + T(p.value()); + } + template + T operator-(const T& num, const typeless_pi& p) + { + return num - T(p.value()); + } + template + T operator*(const T& num, const typeless_pi& p) + { + return num * T(p.value()); + } + template + T operator/(const T& num, const typeless_pi& p) + { + return num / T(p.value()); + } + template + T operator+(const typeless_pi_mult& p, const T& num) + { + return T(p.value()) + num; + } + template + T operator-(const typeless_pi_mult& p, const T& num) + { + return T(p.value()) - num; + } + + template + T operator*(const typeless_pi_mult& p, const T& num) + { + return T(p.value()) * num; + } + template + T operator/(const typeless_pi_mult& p, const T& num) + { + return T(p.value()) / num; + } + template + T operator+(const T& num, const typeless_pi_mult& p) + { + return num + T(p.value()); + } + template + T operator-(const T& num, const typeless_pi_mult& p) + { + return num - T(p.value()); + } + template + T operator*(const T& num, const typeless_pi_mult& p) + { + return num * T(p.value()); + } + template + T operator/(const T& num, const typeless_pi_mult& p) + { + return num / T(p.value()); + } + template + T operator+(const typeless_pi_exp& p, const T& num) + { + return T(p.value()) + num; + } + template + T operator-(const typeless_pi_exp& p, const T& num) + { + return T(p.value()) - num; + } + + template + T operator*(const typeless_pi_exp& p, const T& num) + { + return T(p.value()) * num; + } + template + T operator/(const typeless_pi_exp& p, const T& num) + { + return T(p.value()) / num; + } + template + T operator+(const T& num, const typeless_pi_exp& p) + { + return num + T(p.value()); + } + template + T operator-(const T& num, const typeless_pi_exp& p) + { + return num - T(p.value()); + } + template + T operator*(const T& num, const typeless_pi_exp& p) + { + return num * T(p.value()); + } + template + T operator/(const T& num, const typeless_pi_exp& p) + { + return num / T(p.value()); + } + inline typeless_pi_mult operator-(const typeless_pi&) + { + return typeless_pi_mult(-1); + } + template + typeless_pi_mult operator+(const typeless_pi&, const typeless_pi&) + { + return typeless_pi_mult(2); + } + template + typeless_pi_mult operator+(const typeless_pi_mult& p1, const typeless_pi_mult& p2) + { + return typeless_pi_mult(p1.mult + p2.mult); + } + template + typeless_pi_exp operator*(const typeless_pi_mult& p1, const typeless_pi_mult& p2) + { + return typeless_pi_exp(p1.mult * p2.mult, 2); + } + template + typeless_pi_exp operator*(const typeless_pi&, const typeless_pi&) + { + return typeless_pi_exp(1, 2); + } + template + typeless_pi_exp operator*(const typeless_pi_exp& p1, const typeless_pi_exp& p2) + { + return typeless_pi_exp(p1.mult * p2.mult, p1.exp + p2.exp); + } + } +} + + +#endif // __DACE_PI_H diff --git a/dace/runtime/include/dace/pyinterop.h b/dace/runtime/include/dace/pyinterop.h 
new file mode 100644 index 0000000000..b6366cc417 --- /dev/null +++ b/dace/runtime/include/dace/pyinterop.h @@ -0,0 +1,39 @@ +#ifndef __DACE_INTEROP_H +#define __DACE_INTEROP_H + +#include "types.h" + +// Various classes to simplify interoperability with python in code converted to C++ + +class range +{ +public: + class iterator + { + friend class range; + public: + DACE_HDFI int operator *() const { return i_; } + DACE_HDFI const iterator &operator ++() { i_ += s_; return *this; } + DACE_HDFI iterator operator ++(int) { iterator copy(*this); i_ += s_; return copy; } + + DACE_HDFI bool operator ==(const iterator &other) const { return i_ == other.i_; } + DACE_HDFI bool operator !=(const iterator &other) const { return i_ != other.i_; } + + protected: + DACE_HDFI iterator(int start, int skip = 1) : i_(start), s_(skip) { } + + private: + int i_, s_; + }; + + DACE_HDFI iterator begin() const { return begin_; } + DACE_HDFI iterator end() const { return end_; } + DACE_HDFI range(int end) : begin_(0), end_(end) {} + DACE_HDFI range(int begin, int end) : begin_(begin), end_(end) {} + DACE_HDFI range(int begin, int end, int skip) : begin_(begin, skip), end_(end, skip) {} +private: + iterator begin_; + iterator end_; +}; + +#endif // __DACE_INTEROP_H diff --git a/dace/runtime/include/dace/reduction.h b/dace/runtime/include/dace/reduction.h new file mode 100644 index 0000000000..a2aa99f3aa --- /dev/null +++ b/dace/runtime/include/dace/reduction.h @@ -0,0 +1,296 @@ +#ifndef __DACE_REDUCTION_H +#define __DACE_REDUCTION_H + +#include + +#include "types.h" +#include "math.h" // for ::min, ::max + +#ifdef __CUDACC__ + #include "../../../external/cub/cub/device/device_reduce.cuh" + #include "../../../external/cub/cub/block/block_reduce.cuh" +#endif + +// Specializations for reductions implemented in frameworks like OpenMP, MPI + +namespace dace { + + // Internal type. See below for wcr_fixed external type, which selects + // the implementation according to T's properties. 
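Before the specializations themselves, a usage sketch may help. These write-conflict-resolution (WCR) helpers are called by generated code; the call sites are not shown in this patch, and the reduction-type enumerator name (`ReductionType::Sum`) is inferred here, since the template arguments are not visible in this hunk. A hedged example:

```cpp
// Hypothetical call sites for the WCR helpers defined below (include path assumed).
#include "dace/reduction.h"

void accumulate(double *result, const double *data, int n) {
    #pragma omp parallel for
    for (int i = 0; i < n; ++i) {
        // Potentially conflicting writes from several threads: atomic variant.
        dace::wcr_fixed<dace::ReductionType::Sum, double>::reduce_atomic(result, data[i]);
    }
    // No conflicts possible here: the plain variant avoids atomic overhead.
    dace::wcr_fixed<dace::ReductionType::Sum, double>::reduce(result, 1.0);
}
```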
+ template + struct _wcr_fixed + { + static DACE_HDFI void reduce(T *ptr, const T& value); + + static DACE_HDFI void reduce_atomic(T *ptr, const T& value); + + DACE_HDFI T operator()(const T &a, const T &b) const; + }; + + + // Custom reduction with a lambda function + template + struct wcr_custom { + template + static DACE_HDFI void reduce_atomic(WCR wcr, T *ptr, const T& value) { + // The slowest kind of atomic operations (locked/compare-and-swap), + // this should only happen in case of unrecognized lambdas + #if defined(__CUDA_ARCH__) && __CUDA_ARCH__ >= 300 + // Adapted from CUDA's pre-v8.0 double atomicAdd implementation + T old = *ptr, assumed; + do { + assumed = old; + old = atomicCAS(ptr, assumed, wcr(assumed, value)); + } while (assumed != old); + #else + #pragma omp critical + *ptr = wcr(*ptr, value); + #endif + } + + // Non-conflicting version --> no critical section + template + static DACE_HDFI void reduce(WCR wcr, T *ptr, const T& value) { + *ptr = wcr(*ptr, value); + } + }; + + template + struct _wcr_fixed { + static DACE_HDFI void reduce(T *ptr, const T& value) { *ptr += value; } + + static DACE_HDFI void reduce_atomic(T *ptr, const T& value) { + #if defined(__CUDA_ARCH__) && __CUDA_ARCH__ >= 300 + atomicAdd(ptr, value); + #else + #pragma omp atomic + *ptr += value; + #endif + } + + DACE_HDFI T operator()(const T &a, const T &b) const { return a + b; } + }; + + template + struct _wcr_fixed { + + static DACE_HDFI void reduce(T *ptr, const T& value) { *ptr *= value; } + + + static DACE_HDFI void reduce_atomic(T *ptr, const T& value) { + #if defined(__CUDA_ARCH__) && __CUDA_ARCH__ >= 300 + wcr_custom::reduce( + _wcr_fixed(), ptr, value); + #else + #pragma omp atomic + *ptr *= value; + #endif + } + + + DACE_HDFI T operator()(const T &a, const T &b) const { return a * b; } + }; + + template + struct _wcr_fixed { + + static DACE_HDFI void reduce(T *ptr, const T& value) { *ptr = ::min(*ptr, value); } + + + static DACE_HDFI void reduce_atomic(T *ptr, const T& value) { + #if defined(__CUDA_ARCH__) && __CUDA_ARCH__ >= 300 + atomicMin(ptr, value); + #else + wcr_custom::reduce( + _wcr_fixed(), ptr, value); + #endif + } + + + DACE_HDFI T operator()(const T &a, const T &b) const { return ::min(a, b); } + }; + + template + struct _wcr_fixed { + + static DACE_HDFI void reduce(T *ptr, const T& value) { *ptr = ::max(*ptr, value); } + + + static DACE_HDFI void reduce_atomic(T *ptr, const T& value) { + #if defined(__CUDA_ARCH__) && __CUDA_ARCH__ >= 300 + atomicMax(ptr, value); + #else + wcr_custom::reduce( + _wcr_fixed(), ptr, value); + #endif + } + + + DACE_HDFI T operator()(const T &a, const T &b) const { return ::max(a, b); } + }; + + template + struct _wcr_fixed { + + static DACE_HDFI void reduce(T *ptr, const T& value) { *ptr = (*ptr && value); } + + + static DACE_HDFI void reduce_atomic(T *ptr, const T& value) { + #if defined(__CUDA_ARCH__) && __CUDA_ARCH__ >= 300 + atomicAnd(ptr, value ? T(1) : T(0)); + #else + T val = (value ? 
T(1) : T(0)); + #pragma omp atomic + *ptr &= val; + #endif + } + + + DACE_HDFI T operator()(const T &a, const T &b) const { return a && b; } + }; + + template + struct _wcr_fixed { + + static DACE_HDFI void reduce(T *ptr, const T& value) { *ptr &= value; } + + + static DACE_HDFI void reduce_atomic(T *ptr, const T& value) { + #if defined(__CUDA_ARCH__) && __CUDA_ARCH__ >= 300 + atomicAnd(ptr, value); + #else + #pragma omp atomic + *ptr &= value; + #endif + } + + + DACE_HDFI T operator()(const T &a, const T &b) const { return a & b; } + }; + + template + struct _wcr_fixed { + + static DACE_HDFI void reduce(T *ptr, const T& value) { *ptr = (*ptr || value); } + + + static DACE_HDFI void reduce_atomic(T *ptr, const T& value) { + #if defined(__CUDA_ARCH__) && __CUDA_ARCH__ >= 300 + atomicOr(ptr, value ? T(1) : T(0)); + #else + T val = (value ? T(1) : T(0)); + #pragma omp atomic + *ptr |= val; + #endif + } + + + DACE_HDFI T operator()(const T &a, const T &b) const { return a || b; } + }; + + template + struct _wcr_fixed { + + static DACE_HDFI void reduce(T *ptr, const T& value) { *ptr |= value; } + + + static DACE_HDFI void reduce_atomic(T *ptr, const T& value) { + #if defined(__CUDA_ARCH__) && __CUDA_ARCH__ >= 300 + atomicOr(ptr, value); + #else + #pragma omp atomic + *ptr |= value; + #endif + } + + + DACE_HDFI T operator()(const T &a, const T &b) const { return a | b; } + }; + + template + struct _wcr_fixed { + + static DACE_HDFI void reduce(T *ptr, const T& value) { *ptr = (*ptr != value); } + + + static DACE_HDFI void reduce_atomic(T *ptr, const T& value) { + #if defined(__CUDA_ARCH__) && __CUDA_ARCH__ >= 300 + atomicXor(ptr, value ? T(1) : T(0)); + #else + T val = (value ? T(1) : T(0)); + #pragma omp atomic + *ptr ^= val; + #endif + } + + + DACE_HDFI T operator()(const T &a, const T &b) const { return a != b; } + }; + + template + struct _wcr_fixed { + + static DACE_HDFI void reduce(T *ptr, const T& value) { *ptr ^= value; } + + + static DACE_HDFI void reduce_atomic(T *ptr, const T& value) { + #if defined(__CUDA_ARCH__) && __CUDA_ARCH__ >= 300 + atomicXor(ptr, value); + #else + #pragma omp atomic + *ptr ^= value; + #endif + } + + + DACE_HDFI T operator()(const T &a, const T &b) const { return a ^ b; } + }; + + ////////////////////////////////////////////////////////////////////////// + + // Specialization that regresses to critical section / locked update for + // unsupported types + template + using EnableIfScalar = typename std::enable_if::value>::type; + + // Any vector type that is not of length 1, or struct/complex types + // do not support atomics. In these cases, we regress to locked updates. 
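The dispatch that follows is plain SFINAE: `EnableIfScalar<T>` is only well-formed when `T` is a scalar type, so the second `wcr_fixed` partial specialization is selected for scalars (which can use hardware atomics), while the primary template falls back to the locked/compare-and-swap path for vectors, structs, and complex types. A standalone illustration of the same selection pattern (the names below are hypothetical, not taken from this header):

```cpp
#include <type_traits>
#include <iostream>

// Scalars pick the "atomic" policy; any other type falls back to locking.
template <typename T, typename Enable = void>
struct update_policy {
    static const char *name() { return "locked / compare-and-swap fallback"; }
};

template <typename T>
struct update_policy<T, typename std::enable_if<std::is_scalar<T>::value>::type> {
    static const char *name() { return "hardware atomic"; }
};

int main() {
    struct Vec2 { double x, y; };
    std::cout << update_policy<int>::name()  << "\n";  // hardware atomic
    std::cout << update_policy<Vec2>::name() << "\n";  // locked fallback
}
```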
+ template + struct wcr_fixed + { + static DACE_HDFI void reduce(T *ptr, const T& value) + { + _wcr_fixed::reduce(ptr, value); + } + + static DACE_HDFI void reduce_atomic(T *ptr, const T& value) + { + wcr_custom::template reduce_atomic( + _wcr_fixed(), ptr, value); + } + }; + + // When atomics are supported, use _wcr_fixed normally + template + struct wcr_fixed > + { + static DACE_HDFI void reduce(T *ptr, const T& value) + { + _wcr_fixed::reduce(ptr, value); + } + + static DACE_HDFI void reduce_atomic(T *ptr, const T& value) + { + _wcr_fixed::reduce_atomic(ptr, value); + } + + DACE_HDFI T operator()(const T &a, const T &b) const + { + return _wcr_fixed()(a, b); + } + }; + +} // namespace dace + + +#endif // __DACE_REDUCTION_H diff --git a/dace/runtime/include/dace/stream.h b/dace/runtime/include/dace/stream.h new file mode 100644 index 0000000000..9ab5864f83 --- /dev/null +++ b/dace/runtime/include/dace/stream.h @@ -0,0 +1,513 @@ +#ifndef __DACE_STREAM_H +#define __DACE_STREAM_H + +#include "../../../external/moodycamel/blockingconcurrentqueue.h" + +// Consume +#include +#include +#include + +#include "vector.h" + +#ifdef __CUDACC__ +#include "cuda/stream.cuh" +#else +#include "cudainterop.h" +#include "cuda/cudacommon.cuh" + +namespace dace { + // Forward/mirror declaration of GPU classes + template + class GPUStream + { + public: + T* m_data; + uint32_t *m_start, *m_end, *m_pending; + uint32_t m_capacity_mask; + + GPUStream() : m_data(nullptr), m_start(nullptr), m_end(nullptr), + m_pending(nullptr), m_capacity_mask(0) {} + GPUStream(T* data, uint32_t capacity, + uint32_t *start, uint32_t *end, + uint32_t *pending) : + m_data(data), m_start(start), m_end(end), m_pending(pending), + m_capacity_mask(IS_POWEROFTWO ? (capacity - 1) : capacity) + { } + }; + + template + void FreeGPUArrayStreamView(GPUStream& stream) + { + DACE_CUDA_CHECK(cudaFree(stream.m_start)); + DACE_CUDA_CHECK(cudaFree(stream.m_end)); + DACE_CUDA_CHECK(cudaFree(stream.m_pending)); + } + + template + void FreeGPUStream(GPUStream& stream) + { + FreeGPUArrayStreamView(stream); + DACE_CUDA_CHECK(cudaFree(stream.m_data)); + } +} // namespace dace +#endif + +namespace dace { + + using moodycamel::BlockingConcurrentQueue; + using moodycamel::ConcurrentQueueDefaultTraits; + + + // Stream implementation with a direct array connection + template + class ArrayStreamView; + template + class ArrayStreamViewThreadlocal; + + + template + class Stream; + + template + class StreamArray : public ArrayViewOut, DIMS, 1, 0, ALIGNED> { + template + explicit DACE_HDFI StreamArray(const Dim&... strides) { + static_assert(sizeof...(strides) == static_cast(DIMS), + "Dimension mismatch"); + int stridearr[] = { static_cast(strides)... }; + for (int i = 0; i < DIMS; ++i) + this->m_stride[i] = stridearr[i]; + this->m_ptr = new Stream[this->m_stride[DIMS - 1]]; + } + + virtual ~StreamArray() { + delete[] this->m_ptr; + } + }; + + + // Performance can be increased by removing qsize, but this is necessary for + // consume to work for now. 
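The comment above refers to the element counter kept next to the queue: `m_elements` exists so that consume-style code can poll how much work is still pending, at the cost of one extra atomic update per push/pop. A hedged producer/consumer sketch using the class defined below (the include path and the surrounding `main` are assumptions, not part of the patch):

```cpp
#include "dace/stream.h"   // path assumed
#include <iostream>
#include <thread>

int main() {
    dace::Stream<int> stream;  // default capacity

    // Producer: push 100 integers into the stream.
    std::thread producer([&stream]() {
        for (int i = 0; i < 100; ++i)
            stream.push(i);
    });

    // Consumer: pop() blocks until an element is available.
    long long sum = 0;
    for (int i = 0; i < 100; ++i) {
        int value;
        stream.pop(value);
        sum += value;
    }
    producer.join();

    std::cout << "sum = " << sum << "\n";  // 0 + 1 + ... + 99 = 4950
}
```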
+ template + class Stream { + protected: + BlockingConcurrentQueue m_queue; + public: + std::atomic m_elements; + + Stream(size_t capacity = 6 * ConcurrentQueueDefaultTraits::BLOCK_SIZE) : + m_queue(BlockingConcurrentQueue(capacity)), m_elements(0) {} + + inline void pop(T& item, bool noupdate = false) { + m_queue.wait_dequeue(item); + if (!noupdate) + m_elements--; + } + inline size_t pop(T *valarr, int max_size, bool noupdate = false) { + size_t result = m_queue.wait_dequeue_bulk(valarr, max_size); + if (result > 0 && !noupdate) + m_elements -= result; + return result; + } + inline bool pop_try(T& output, bool noupdate = false) { + bool result = m_queue.try_dequeue(output); + if (result && !noupdate) + m_elements--; + return result; + } + inline size_t pop_try(T *valarr, int max_size, bool noupdate = false) { + size_t result = m_queue.try_dequeue_bulk(valarr, max_size); + if (result > 0 && !noupdate) + m_elements -= result; + return result; + } + + inline void push(T const& val) { + m_queue.enqueue(val); + m_elements++; + } + inline void push(T&& val) { + m_queue.enqueue(val); + m_elements++; + } + inline void push(const T* valarr, int size) { + m_queue.enqueue_bulk(valarr, size); + m_elements += size; + } + + template + void push(const ArrayStreamView& s) { + m_queue.enqueue_bulk(s.m_array, s.m_elements); + m_elements += s.m_elements; + } + + template + void push(const ArrayStreamViewThreadlocal& s) { + m_queue.enqueue_bulk(s.m_array, s.m_elements); + m_elements += s.m_elements; + } + + inline bool push_try(T const& val) { + bool result = m_queue.try_enqueue(val); + if (result) + m_elements++; + return result; + } + inline bool push_try(T&& val) { + bool result = m_queue.try_enqueue(val); + if (result) + m_elements++; + return result; + } + inline bool push_try(const T* valarr, int size) { + bool result = m_queue.try_enqueue_bulk(valarr, size); + if (result) + m_elements += size; + return result; + } + + }; + + // DataView interface for streams + template