CME 211: Lecture 8

SciPy

SciPy (pronounced "Sigh Pie") is a Python-based ecosystem of open-source software for mathematics, science, and engineering. In particular, these are some of the core packages:

Actively developed:

SciPy Subpackages

See: http://docs.scipy.org/doc/scipy/reference/

Clustering package (scipy.cluster)
Constants (scipy.constants)
Discrete Fourier transforms (scipy.fftpack)
Integration and ODEs (scipy.integrate)
Interpolation (scipy.interpolate)
Input and output (scipy.io)
Linear algebra (scipy.linalg)
Miscellaneous routines (scipy.misc)
Multi-dimensional image processing (scipy.ndimage)
Orthogonal distance regression (scipy.odr)
Optimization and root finding (scipy.optimize)
Signal processing (scipy.signal)
Sparse matrices (scipy.sparse)
Sparse linear algebra (scipy.sparse.linalg)
Compressed Sparse Graph Routines (scipy.sparse.csgraph)
Spatial algorithms and data structures (scipy.spatial)
Special functions (scipy.special)
Statistical functions (scipy.stats)
Statistical functions for masked arrays (scipy.stats.mstats)
C/C++ integration (scipy.weave)

Loading and saving MATLAB files

Let's create a MATLAB data file in Octave. Octave is an open source MATLAB clone:

octave:1> a = [42, 17, -19; 9, 3, 0];
octave:2> a
a =

   42   17  -19
    9    3    0

octave:3> save -6 matlabfile.mat a
octave:4>

Let's use SciPy to open the matlab data file:

import scipy.io
data = scipy.io.loadmat('matlabfile.mat')
print(data)
print(data['a'])

Note: SciPy submodules must be specifically imported. For example the io module is not imported with import scipy. Need to import scipy.io.

>>> import scipy
>>> data = scipy.io.loadmat('matlabfile.mat')
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
AttributeError: 'module' object has no attribute 'io'

More I/O

Image support
Little bit of audio support
Support for [NetCDF][https://en.wikipedia.org/wiki/NetCDF] (Network Common Data Form), a file format used in scientific computing for handling large arrays of data
Support for [HDF][https://en.wikipedia.org/wiki/Hierarchical_Data_Format]. (NetCDF 4 is based on HDF5)

Image Processing

import numpy
import scipy.misc, scipy.signal
im = scipy.misc.imread('fig/memchu.jpg',flatten=True).astype(numpy.float32)
laplacian = numpy.array([[0,1,0],[1,-4,1],[0,1,0]], dtype=numpy.float32)
ck = scipy.signal.cspline2d(im, 8.0)
edge = scipy.signal.convolve2d(ck, laplacian,mode='same',boundary='symm')
scipy.misc.imsave('fig/edge.jpg', edge)

Note: scipy.misc.imread() requires the Python Imaging Library (PIL). This will be installed with Anaconda Python. It is also possible to install via $ pip3 isntall pillow.

Input image:

Output image:

Quadrature

Quadrature = numerical integration. We want an approximation to:

$$ \int_a^b f(x) dx $$

import scipy.integrate

# define function to integrate
def line(x):
    return x

print(scipy.integrate.quad(line, 0., 1.))
print(scipy.integrate.quad(line, -1., 1.))

Optimization

Numerical optimization methods attempt to solve the problem:

$$ \begin{array}{ll} \mbox{minimize} & f_0(x)\\ \mbox{subject to} & f_i(x) \leq 0, \quad i=1,\ldots,m \end{array} $$

Let's use scipy.optimize to solve an unconstrained optimization problem:

import numpy
import scipy.optimize
import math
def rosenbrock(xy):
    x = xy[0]
    y = xy[1]
    f = math.pow(1.-x,2.) + 100*math.pow(y-x*x,2.)
    return f

xy0 = numpy.array([0., 0.])
min = scipy.optimize.fmin(rosenbrock, xy0)
print(min)
print(rosenbrock(min))

Linear algebra

Matrix inversion
Linear solver (direct)
Least squares
Eigenvalues / eigenvectors
Singular Value Decomposition (SVD)
LU, QR, Cholesky, Schur matrix factorizations

Solver example

Here is a $3 \times 3$ system of linear equations:

\begin{align} y + 2x - z &= 8 \ 2z - y - 3x &= -11 \ -2x + 2z + y &= -3 \ \end{align}

Convert to matrix-vector form:

$$ \begin{pmatrix} 1 & 2 & -1 \ 2 & -1 & -3 \ -2 & 2 & 1 \end{pmatrix} \begin{pmatrix} x \ y \ z \end{pmatrix}

\begin{pmatrix} 8 \ -11 \ -3 \end{pmatrix} $$

Solve with Python:

import numpy
import scipy.linalg
A = numpy.array(([2., 1., -1.],[-3., -1., 2.],[-2., 1., 2.]))
b = numpy.array([8., -11., -3.])
x = scipy.linalg.solve(A, b)
# print the solution
print(x)
# check the solution
print(scipy.linalg.norm(numpy.dot(A,x)-b))

Sparse matrices

Many problems lead to matrices with large numbers of zeros (like > 99% zeros)
Discretization of Partial Differential Equations (PDEs), social graphs, etc.
Can store these efficiently using a sparse matrix storage format such as Compressed Row Storage (CRS), sometimes also called Compressed Sparse Row (CSR)
One typical format for file exchange is Matrix Market (.mtx)
See: http://math.nist.gov/MatrixMarket/
Another source for sparse matrices: https://www.cise.ufl.edu/research/sparse/matrices/

Compressed Row Storage

Dense matrix :

$$ \begin{bmatrix} 10 & 0 & 0 & 0 \\ 3 & 9 & 0 & 0 \\ 0 & 7 & 8 & 7 \\ 0 & 0 & 3 & 7 \end{bmatrix} $$

Sparse matrix storage with 0-based indexing:

val: 10 3 9 7 8 7 3 7
col_ind: 0 0 1 1 2 3 2 3
row_ptr: 0 1 3 6 8

Question: what is the storage complexity for these different storage methods?

Matrix Market files

!head -n 20 data/LFAT5.mtx

Reading / writing sparse matrices

import scipy.io
A = scipy.io.mmio.mmread('data/LFAT5.mtx')
print(A)

Sparse matrix applications

See https://www.cise.ufl.edu/research/sparse/matrices/.

Sparse linear solvers

In the scipy.sparse.linalg subpackage
Direct solver
Iterative solvers
- Conjugate Gradient (CG)
- BiConjugate Gradient (BiCG)
- BiConjugate Gradient STABilized (BiCGSTAB)
- MINimum RESidual (MINRES)
- Generalized Minimum RESidual (GMRES)
Factorizations

Sparse solver examples

import numpy
import scipy.io
import scipy.linalg
import scipy.sparse.linalg
import time

K = scipy.io.mmio.mmread('data/nasa1824.mtx').tocsr()
f = numpy.squeeze(scipy.io.mmio.mmread('data/nasa1824_b.mtx'))

# Direct solve
t0 = time.time()
u = scipy.sparse.linalg.spsolve(K, f)
t1 = time.time()
err = scipy.linalg.norm(K*u - f)
print("spsolve() time = %f seconds" % (t1-t0))
print("err = %e" % err)

# Iterative solve (Conjugate Gradient)
t0 = time.time()
u, info = scipy.sparse.linalg.cg(K, f, tol=1.e-7)
t1 = time.time()
err = scipy.linalg.norm(K*u - f)
print("cg() time = %f seconds" % (t1-t0))
print("err = %e" % err)
print("info = %d" % info)

Sparse matrix eigenvalues

import scipy.io
import scipy.sparse.linalg
A = scipy.io.mmio.mmread('data/LFAT5.mtx')
scipy.sparse.linalg.eigsh(A,return_eigenvectors=False)

SciPy summary

Take a look at what is available so when the time comes you will know not to write those things from scratch
Some of these even allow you to utilize parallel computing because the implementations are multithreaded and will use multiple CPU cores

matplotlib

Examples from: http://matplotlib.org/gallery.html

Getting started

Plotting functionality can be access through either of these two module names:
- matplotlib.pyplot
- pylab
pylab module combines the namespaces of the matplotlib.pyplot module plus numpy and is intended for quick, interactive use
matplotlib.pyplot is intended for use when writing code to be executed out of a .py file
Conventions:
- import matplotlib.pyplot as plt
- import numpy as np

`pylab` includes `numpy`

import pylab
a = pylab.arange(9, dtype=pylab.float64)
print(a)
a = pylab.arange(9, dtype=pylab.float64).reshape(3,3)
print(a)

`matplotlib.pyplot`

Doesn't include numpy functionality:

import matplotlib.pyplot
a = matplotlib.pyplot.arange(9)

Your first plot

Some good Jupyter Settings:

import matplotlib.pyplot as plt
import numpy as np
%matplotlib inline
%config InlineBackend.figure_format = 'svg'

A simple plot:

plt.plot(2*np.arange(5))

Saving to a file

The savefig function saves the most recent plot to disk. The file type is determined by the extension:

plt.plot(2*np.arange(5))
plt.savefig('figure1.jpg')
plt.savefig('figure1.png')
plt.savefig('figure1.pdf')

Let's look at the files!

%ls

Start a new figure

plt.figure(2)
plt.plot(2*np.arange(5), 3*np.arange(5))
plt.show(block=False)

Controlling the line type

plt.figure(3)
time = np.arange(0,1.01,0.01)
signal = np.sin(2*np.pi*time)
plt.plot(time, signal, 'bx')
plt.show(block=False)

Text

plt.figure(4)
plt.plot(time, signal, 'r-')
plt.xlabel('Time')
plt.ylabel('Signal')
plt.title('Signal Strength')
plt.show(block=False)

Axis

plt.figure(5)
plt.plot(time,signal,'g-')
plt.axis([0, 1, -1.5, 1.5])
plt.show(block=False)

Plotting multiple data

plt.figure(6)
signal2 = np.cos(2*np.pi*time)
plt.plot(time,signal,'r-',time,signal2,'b--')
plt.show(block=False)

Legend

plt.figure(7)
p1, = plt.plot(time,signal,'r-')
p2, = plt.plot(time,signal2,'b--')
plt.legend([p1, p2], ["sin", "cos"])
plt.show(block=False)

Subplots

plt.figure(8)
plt.subplot(211)
plt.plot(time,signal,'r-')
plt.subplot(212)
plt.plot(time,signal2,'b--')
plt.show(block=False)

Pseudocolor plot

delta = 0.05
X = np.arange(-2., 2.+delta, delta)
Y = np.arange(-1., 3.+delta, delta)
X, Y = np.meshgrid(X, Y)
Z = (1.-X)**2 + 100.*(Y-X*X)**2
plt.figure(9)
plt.pcolor(X, Y, Z)
plt.colorbar()
plt.axis([-2, 2, -1, 3])
plt.show(block=False)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

0-scipy.md

0-scipy.md

CME 211: Lecture 8

SciPy

SciPy Subpackages

Loading and saving MATLAB files

More I/O

Image Processing

Quadrature

Optimization

Linear algebra

Solver example

$$ \begin{pmatrix} 1 & 2 & -1 \ 2 & -1 & -3 \ -2 & 2 & 1 \end{pmatrix} \begin{pmatrix} x \ y \ z \end{pmatrix}

Sparse matrices

Compressed Row Storage

Matrix Market files

Reading / writing sparse matrices

Sparse matrix applications

Sparse linear solvers

Sparse solver examples

Sparse matrix eigenvalues

SciPy summary

matplotlib

Getting started

`pylab` includes `numpy`

`matplotlib.pyplot`

Your first plot

Saving to a file

Start a new figure

Controlling the line type

Text

Axis

Plotting multiple data

Legend

Subplots

Pseudocolor plot

Files

0-scipy.md

Latest commit

History

0-scipy.md

File metadata and controls

CME 211: Lecture 8

SciPy

SciPy Subpackages

Loading and saving MATLAB files

More I/O

Image Processing

Quadrature

Optimization

Linear algebra

Solver example

$$ \begin{pmatrix} 1 & 2 & -1 \ 2 & -1 & -3 \ -2 & 2 & 1 \end{pmatrix} \begin{pmatrix} x \ y \ z \end{pmatrix}

Sparse matrices

Compressed Row Storage

Matrix Market files

Reading / writing sparse matrices

Sparse matrix applications

Sparse linear solvers

Sparse solver examples

Sparse matrix eigenvalues

SciPy summary

matplotlib

Getting started

pylab includes numpy

matplotlib.pyplot

Your first plot

Saving to a file

Start a new figure

Controlling the line type

Text

Axis

Plotting multiple data

Legend

Subplots

Pseudocolor plot

`pylab` includes `numpy`

`matplotlib.pyplot`