From 734b24d6461b0c54f5fa0e020b70a663e4138252 Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Tue, 26 Feb 2019 09:44:26 +0800 Subject: [PATCH 01/46] Update README.md --- keras/README.md | 64 +++++++++++++++++++++++++++++++++++++++++++++++++ 1 file changed, 64 insertions(+) diff --git a/keras/README.md b/keras/README.md index 34c4df3..67cc57a 100644 --- a/keras/README.md +++ b/keras/README.md @@ -1 +1,65 @@ +# Tutorials for running distributed Keras (v1.2.2) on Analytics Zoo Tutorials for running _**distributed Keras (v1.2.2) on Analytics Zoo**_. These tutorials are ported from François Chollet's [Jupyter notebooks](https://github.com/fchollet/deep-learning-with-python-notebooks) for the book [Deep Learning with Python (Manning Publications)](https://www.manning.com/books/deep-learning-with-python?a_aid=keras&a_bid=76564dff) + +This repository describes how to write Keras-style code and run it directly in analytics-zoo, so that the original Keras code can run in distributed mode with limited modification. It is based on the notebooks at https://github.com/fchollet/deep-learning-with-python-notebooks#companion-jupyter-notebooks-for-the-book-deep-learning-with-python, the sample code repository for the book "Deep Learning with Python", which is based on Keras 2.0.8; we pick some basic chapters from it. + +Currently analytics-zoo builds its Keras-style API on Keras 1.2.2. Thus, this repository contains the original Keras 2.0.8 code reimplemented for Keras 1.2.2, and shows how to modify that code in order to run it in analytics-zoo. + +To keep it simple, we omit the description of many concepts here; you can find them in the original notebooks above [(link)](https://github.com/fchollet/deep-learning-with-python-notebooks#companion-jupyter-notebooks-for-the-book-deep-learning-with-python). In addition, we post the Keras 1.2.2 code directly, and the replacements needed to go from Keras 2.0.8 to Keras 1.2.2 are noted in `Keras_2-to-1.md`. 
+ +This repository uses Python 3.5, Keras 1.2.2 (Keras code), Analytics-zoo 0.4.0 (zoo code). We post the summary of Keras-to-zoo code conversion as well as the table of contents. + +## Summary of Keras-to-zoo code conversion +We summarize the modifications we need to make here, each with a link to the example code chapter. + +#### First of all +Make sure you have analytics-zoo installed; see the install guide [here](https://analytics-zoo.github.io/master/#PythonUserGuide/install/). Then set the environment variables as follows: + + PYSPARK_PYTHON=/path_to_your_python + PYSPARK_DRIVER_PYTHON=/path_to_your_python + # To avoid version conflicts; on my Ubuntu machine, this path is /usr/bin/python3.5 + + SPARK_DRIVER_MEMORY=4g + # If you encounter a heap space exception, you can simply increase this variable to 8g, 16g, etc. + # If no more memory is available on your machine, consider reducing the data size. + +Then, make sure you have the following code at the beginning of your zoo code. + + from zoo.common.nncontext import * + sc = init_nncontext(init_spark_conf().setMaster("local[4]")) + +We set the core number to 4 above; you can also set it to another number. We still recommend 4 because analytics-zoo requires the batch size to be divisible by the core number, and batch sizes are normally set to multiples of 4, e.g. 16, 32, 128. An error is raised if this requirement is not satisfied. + +#### Accuracy checkout +Currently in analytics-zoo, the `fit` method does not return anything. 
Results can only be checked via TensorBoard; see [Chapter 3.5]() + +#### Evaluate return +The `evaluate` method returns an `EvaluationResult` object, which differs from Keras; see [Chapter 2.1]() + +#### Predict result +The `predict` method returns an RDD, so you need to call the `collect` method to collect the results; see [Chapter 3.5]() + +#### Parameters not supported +Analytics-zoo currently does not support the following parameters: + +* validation_split: +* verbose: + +## Table of contents + +The main purpose of this repository is to describe the conversion from Keras to analytics-zoo, so we rename the notebooks to make them more oriented toward experienced users. We keep the chapter index so you can still refer to the original notebooks [here](https://github.com/fchollet/deep-learning-with-python-notebooks#companion-jupyter-notebooks-for-the-book-deep-learning-with-python). We keep only some key descriptions from the original notebooks. + +* Chapter 2: + * [2.1: MNIST]() +* Chapter 3: + * [3.5: Binary classification]() + * [3.6: Multi-class classification]() + * [3.7: Regression]() +* Chapter 4: + * [4.4: Regularization and Dropout]() +* Chapter 5: + * [5.1: CNN]() +* Chapter 6: + * [6.2: RNN]() + + From 9006edb606d9171c23622bda286d214b2a080cef Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Tue, 26 Feb 2019 09:51:16 +0800 Subject: [PATCH 02/46] Add files via upload --- keras/2.1-mnist.ipynb | 144 ++++++++++++++++ keras/3.5-binary-classification.ipynb | 190 +++++++++++++++++++++ keras/3.6-multi-class-classification.ipynb | 130 ++++++++++++++ keras/3.7-regression.ipynb | 174 +++++++++++++++++++ keras/4.4-regularization-and-dropout.ipynb | 53 ++++++ keras/5.1-cnn.ipynb | 126 ++++++++++++++ keras/6.2-rnn.ipynb | 171 +++++++++++++++++++ 7 files changed, 988 insertions(+) create mode 100644 keras/2.1-mnist.ipynb create mode 100644 keras/3.5-binary-classification.ipynb create mode 100644 keras/3.6-multi-class-classification.ipynb create mode 100644 
keras/3.7-regression.ipynb create mode 100644 keras/4.4-regularization-and-dropout.ipynb create mode 100644 keras/5.1-cnn.ipynb create mode 100644 keras/6.2-rnn.ipynb diff --git a/keras/2.1-mnist.ipynb b/keras/2.1-mnist.ipynb new file mode 100644 index 0000000..d971f5d --- /dev/null +++ b/keras/2.1-mnist.ipynb @@ -0,0 +1,144 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "First of all, set environment variables and initialize spark context:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "%env SPARK_DRIVER_MEMORY=8g\n", + "%env PYSPARK_PYTHON=/usr/bin/python3.5\n", + "%env PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n", + "\n", + "from zoo.common.nncontext import *\n", + "sc = init_nncontext(init_spark_conf().setMaster(\"local[4]\"))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "# MNIST\n", + "\n", + "The problem we are trying to solve here is to classify grayscale images of handwritten digits (28 pixels by 28 pixels), into their 10 categories (0 to 9). The dataset we will use is the MNIST dataset.\n", + "\n", + "The MNIST dataset comes pre-loaded in Keras, in the form of a set of four Numpy arrays." + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "from keras.datasets import mnist\n", + "(train_images, train_labels), (test_images, test_labels) = mnist.load_data()" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Import the modules we need to build the network. 
In Keras it is:\n", + "\n", + " from keras import models\n", + " from keras import layers\n", + "Just replace it with following in order to use analytics-zoo:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "from zoo.pipeline.api.keras import models\n", + "from zoo.pipeline.api.keras import layers" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Build the network, compile and fit:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "network = models.Sequential()\n", + "network.add(layers.Dense(512, activation='relu', input_shape=(28 * 28,)))\n", + "network.add(layers.Dense(10, activation='softmax'))\n", + "\n", + "network.compile(optimizer='rmsprop',\n", + " loss='categorical_crossentropy',\n", + " metrics=['accuracy'])\n", + "\n", + "train_images = train_images.reshape((60000, 28 * 28))\n", + "train_images = train_images.astype('float32') / 255\n", + "\n", + "test_images = test_images.reshape((10000, 28 * 28))\n", + "test_images = test_images.astype('float32') / 255\n", + "\n", + "\n", + "from keras.utils.np_utils import to_categorical\n", + "train_labels = to_categorical(train_labels)\n", + "#test_labels = to_categorical(test_labels)\n", + "\n", + "network.fit(train_images, train_labels, nb_epoch=5, batch_size=128)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "#### Evaluate return\n", + "Check our result on test set. In Keras it is:\n", + "\n", + " test_loss, test_acc = network.evaluate(test_images, test_labels)\n", + "In analytics-zoo, the return of `evaluate` method is an `EvaluationResult` object, which is different from Keras. 
We use following code to check:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "test_result = network.evaluate(test_images, test_labels, batch_size=32)\n", + "print('test_acc:', test_result[0].result)" + ] + } + ], + "metadata": { + "kernelspec": { + "display_name": "Python 3", + "language": "python", + "name": "python3" + }, + "language_info": { + "codemirror_mode": { + "name": "ipython", + "version": 3 + }, + "file_extension": ".py", + "mimetype": "text/x-python", + "name": "python", + "nbconvert_exporter": "python", + "pygments_lexer": "ipython3", + "version": "3.5.2" + } + }, + "nbformat": 4, + "nbformat_minor": 2 +} diff --git a/keras/3.5-binary-classification.ipynb b/keras/3.5-binary-classification.ipynb new file mode 100644 index 0000000..929a731 --- /dev/null +++ b/keras/3.5-binary-classification.ipynb @@ -0,0 +1,190 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "First of all, set environment variables and initialize spark context:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "%env SPARK_DRIVER_MEMORY=8g\n", + "%env PYSPARK_PYTHON=/usr/bin/python3.5\n", + "%env PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n", + "\n", + "from zoo.common.nncontext import *\n", + "sc = init_nncontext(init_spark_conf().setMaster(\"local[4]\"))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "# Binary classification\n", + "We'll be working with \"IMDB dataset\", a set of 50,000 highly-polarized reviews from the Internet Movie Database. They are split into 25,000 reviews for training and 25,000 reviews for testing, each set consisting in 50% negative and 50% positive reviews.\n", + "\n", + "Just like the MNIST dataset, the IMDB dataset comes packaged with Keras. 
It has already been preprocessed: the reviews (sequences of words) have been turned into sequences of integers, where each integer stands for a specific word in a dictionary.\n", + "\n", + "The following code will load the dataset (when you run it for the first time, about 80MB of data will be downloaded to your machine).\n", + "\n", + "Then we vectorize the sequences to prepare the data we are going to feed to the model. We also separate part of the data for validation." + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "from keras.datasets import imdb\n", + "(train_data, train_labels), (test_data, test_labels) = imdb.load_data(nb_words=10000)\n", + "\n", + "import numpy as np\n", + "def vectorize_sequences(sequences, dimension=10000):\n", + " # Create an all-zero matrix of shape (len(sequences), dimension)\n", + " results = np.zeros((len(sequences), dimension))\n", + " for i, sequence in enumerate(sequences):\n", + " results[i, sequence] = 1. 
# set specific indices of results[i] to 1s\n", + " return results\n", + "\n", + "# Vectorize the training and test data\n", + "x_train = vectorize_sequences(train_data)\n", + "x_test = vectorize_sequences(test_data)\n", + "\n", + "y_train = np.asarray(train_labels).astype('float32')\n", + "y_test = np.asarray(test_labels).astype('float32')\n", + "\n", + "x_val = x_train[:10000]\n", + "partial_x_train = x_train[10000:]\n", + "y_val = y_train[:10000]\n", + "partial_y_train = y_train[10000:]" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Build the network, then compile:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "from zoo.pipeline.api.keras import models\n", + "from zoo.pipeline.api.keras import layers\n", + "\n", + "model = models.Sequential()\n", + "model.add(layers.Dense(16, activation='relu', input_shape=(10000,)))\n", + "model.add(layers.Dense(16, activation='relu'))\n", + "model.add(layers.Dense(1, activation='sigmoid'))\n", + "\n", + "model.compile(optimizer='rmsprop',\n", + " loss='binary_crossentropy',\n", + " metrics=['accuracy'])" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "#### Accuracy checkout\n", + "To check the behavior of this model in Keras, the original code uses the `matplotlib` library to plot the following `history` object\n", + " \n", + " history = model.fit(partial_x_train,\n", + " partial_y_train,\n", + " nb_epoch=5,\n", + " batch_size=512,\n", + " validation_data=(x_val, y_val)\n", + " )\n", + "After the `fit` method finishes, the results are stored in `history` and can thus be visualized.\n", + "\n", + "Currently in analytics-zoo, the `fit` method does not return anything. Results can only be checked via TensorBoard. 
Code above need to be replaced with following:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "model.set_tensorboard('./', '3-5_summary')\n", + "model.fit(partial_x_train,\n", + " partial_y_train,\n", + " nb_epoch=5,\n", + " batch_size=512,\n", + " validation_data=(x_val, y_val)\n", + " )" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Then you could see result in tensorboard by command in terminal `tensorboard --logdir ./`" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Check the result on test data:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "results = model.evaluate(x_test, y_test)\n", + "print('test_acc:', results[0].result)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "#### Predict result\n", + "Predict the result on test data, in Keras, it is easy to just call following code to get the result\n", + "\n", + " model.predict(x_test)\n", + "In analytics-zoo, the return of `predict` is RDD, so you need to call `collect` method to get the result:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "prediction = model.predict(x_test)\n", + "result = prediction.collect()" + ] + } + ], + "metadata": { + "kernelspec": { + "display_name": "Python 3", + "language": "python", + "name": "python3" + }, + "language_info": { + "codemirror_mode": { + "name": "ipython", + "version": 3 + }, + "file_extension": ".py", + "mimetype": "text/x-python", + "name": "python", + "nbconvert_exporter": "python", + "pygments_lexer": "ipython3", + "version": "3.5.2" + } + }, + "nbformat": 4, + "nbformat_minor": 2 +} diff --git a/keras/3.6-multi-class-classification.ipynb b/keras/3.6-multi-class-classification.ipynb new file mode 100644 index 0000000..7e8c9ff --- /dev/null +++ 
b/keras/3.6-multi-class-classification.ipynb @@ -0,0 +1,130 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "First of all, set environment variables and initialize spark context:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "%env SPARK_DRIVER_MEMORY=8g\n", + "%env PYSPARK_PYTHON=/usr/bin/python3.5\n", + "%env PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n", + "\n", + "from zoo.common.nncontext import *\n", + "sc = init_nncontext(init_spark_conf().setMaster(\"local[4]\"))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "# Multi-class classification\n", + "In this section, we will build a network to classify Reuters newswires into 46 different mutually-exclusive topics. Since we have many classes, this problem is an instance of \"multi-class classification\", and since each data point should be classified into only one category, the problem is more specifically an instance of \"single-label, multi-class classification\".\n", + "\n", + "We will be working with the Reuters dataset, a set of short newswires and their topics, published by Reuters in 1986. It's a very simple, widely used toy dataset for text classification. There are 46 different topics; some topics are more represented than others, but each topic has at least 10 examples in the training set.\n", + "\n", + "Like IMDB and MNIST, the Reuters dataset comes packaged as part of Keras. 
Let's take a look right away:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "from keras.datasets import reuters\n", + "(train_data, train_labels), (test_data, test_labels) = reuters.load_data(nb_words=10000)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Then we do some preprocessing like Chapter 3.5" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "word_index = reuters.get_word_index()\n", + "reverse_word_index = dict([(value, key) for (key, value) in word_index.items()])\n", + "\n", + "import numpy as np\n", + "def vectorize_sequences(sequences, dimension=10000):\n", + " results = np.zeros((len(sequences), dimension))\n", + " for i, sequence in enumerate(sequences):\n", + " results[i, sequence] = 1.\n", + " return results\n", + "\n", + "x_train = vectorize_sequences(train_data)\n", + "x_test = vectorize_sequences(test_data)\n", + "# this part pending to modify, one-hot or integer issue\n", + "# Assumption: one-hot labels over the 46 topics, as in the original Keras notebook\n", + "from keras.utils.np_utils import to_categorical\n", + "one_hot_train_labels = to_categorical(train_labels, 46)\n", + "one_hot_test_labels = to_categorical(test_labels, 46)\n", + "\n", + "x_val = x_train[:1000]\n", + "partial_x_train = x_train[1000:]\n", + "y_val = one_hot_train_labels[:1000]\n", + "partial_y_train = one_hot_train_labels[1000:]" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Build the model, compile, train and evaluate" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "from zoo.pipeline.api.keras import models\n", + "from zoo.pipeline.api.keras import layers\n", + "\n", + "model = models.Sequential()\n", + "model.add(layers.Dense(64, activation='relu', input_shape=(10000,)))\n", + "model.add(layers.Dense(64, activation='relu'))\n", + "model.add(layers.Dense(46, activation='softmax'))\n", + "\n", + "model.compile(optimizer='rmsprop',\n", + " loss='categorical_crossentropy',\n", + " metrics=['accuracy'])\n", + "model.set_tensorboard('./', '3-6_summary')\n", + "model.fit(partial_x_train,\n", + " partial_y_train,\n", + " nb_epoch=20,\n", + " batch_size=512,\n", + " validation_data=(x_val, y_val))\n", + "\n", + "results = model.evaluate(x_test, one_hot_test_labels)" + ] + } + ], + 
"metadata": { + "kernelspec": { + "display_name": "Python 3", + "language": "python", + "name": "python3" + }, + "language_info": { + "codemirror_mode": { + "name": "ipython", + "version": 3 + }, + "file_extension": ".py", + "mimetype": "text/x-python", + "name": "python", + "nbconvert_exporter": "python", + "pygments_lexer": "ipython3", + "version": "3.5.2" + } + }, + "nbformat": 4, + "nbformat_minor": 2 +} diff --git a/keras/3.7-regression.ipynb b/keras/3.7-regression.ipynb new file mode 100644 index 0000000..b2547a2 --- /dev/null +++ b/keras/3.7-regression.ipynb @@ -0,0 +1,174 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "First of all, set environment variables and initialize spark context:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "%env SPARK_DRIVER_MEMORY=8g\n", + "%env PYSPARK_PYTHON=/usr/bin/python3.5\n", + "%env PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n", + "\n", + "from zoo.common.nncontext import *\n", + "sc = init_nncontext(init_spark_conf().setMaster(\"local[4]\"))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "The Boston Housing dataset is packaged in Keras 2.0.8 but not in Keras 1.2.2, so we need the following code to get the data; we then normalize the data:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "import numpy as np\n", + "from keras.utils.data_utils import get_file\n", + "def load_data(path='boston_housing.npz', test_split=0.2, seed=113):\n", + " \"\"\"Loads the Boston Housing dataset.\n", + " # Arguments\n", + " path: path where to cache the dataset locally\n", + " (relative to ~/.keras/datasets).\n", + " test_split: fraction of the data to reserve as test set.\n", + " seed: Random seed for shuffling the data\n", + " before computing the test split.\n", + " # Returns\n", + " Tuple of Numpy arrays: `(x_train, y_train), (x_test, 
y_test)`.\n", + " \"\"\"\n", + " assert 0 <= test_split < 1\n", + " path = get_file(\n", + " path,\n", + " origin='https://s3.amazonaws.com/keras-datasets/boston_housing.npz'\n", + " )\n", + " with np.load(path) as f:\n", + " x = f['x']\n", + " y = f['y']\n", + "\n", + " np.random.seed(seed)\n", + " indices = np.arange(len(x))\n", + " np.random.shuffle(indices)\n", + " x = x[indices]\n", + " y = y[indices]\n", + "\n", + " x_train = np.array(x[:int(len(x) * (1 - test_split))])\n", + " y_train = np.array(y[:int(len(x) * (1 - test_split))])\n", + " x_test = np.array(x[int(len(x) * (1 - test_split)):])\n", + " y_test = np.array(y[int(len(x) * (1 - test_split)):])\n", + " return (x_train, y_train), (x_test, y_test)\n", + "\n", + "(train_data, train_targets), (test_data, test_targets) = load_data()\n", + "\n", + "mean = train_data.mean(axis=0)\n", + "train_data -= mean\n", + "std = train_data.std(axis=0)\n", + "train_data /= std\n", + "\n", + "test_data -= mean\n", + "test_data /= std" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "import numpy as np\n", + "\n", + "k = 4\n", + "num_val_samples = len(train_data) // k\n", + "num_nb_epoch = 50\n", + "all_scores = []\n", + "for i in range(k):\n", + " print('processing fold #', i)\n", + " # Prepare the validation data: data from partition # k\n", + " val_data = train_data[i * num_val_samples: (i + 1) * num_val_samples]\n", + " val_targets = train_targets[i * num_val_samples: (i + 1) * num_val_samples]\n", + "\n", + " # Prepare the training data: data from all other partitions\n", + " partial_train_data = np.concatenate(\n", + " [train_data[:i * num_val_samples],\n", + " train_data[(i + 1) * num_val_samples:]],\n", + " axis=0)\n", + " partial_train_targets = np.concatenate(\n", + " [train_targets[:i * num_val_samples],\n", + " 
train_targets[(i + 1) * num_val_samples:]],\n", + " axis=0)\n", + "\n", + " # Build the Keras model (already compiled)\n", + " model = build_model()\n", + " # Train the model (in silent mode, verbose=0)\n", + " #model.fit(partial_train_data, partial_train_targets,\n", + " # nb_epoch=num_nb_epoch, batch_size=1, verbose=0)\n", + " model.fit(partial_train_data, partial_train_targets,\n", + " nb_epoch=num_nb_epoch, batch_size=16)\n", + "\n", + " # Evaluate the model on the validation data\n", + " #val_mse, val_mae = model.evaluate(val_data, val_targets, verbose=0)\n", + " val_mae = model.evaluate(val_data, val_targets)\n", + " all_scores.append(val_mae[0].result)" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "model = build_model()\n", + "# Train it on the entirety of the data.\n", + "model.fit(train_data, train_targets,\n", + " nb_epoch=80, batch_size=16)\n", + "test_result = model.evaluate(test_data, test_targets)\n", + "\n", + "print('test result:', test_result[0].result)" + ] + } + ], + "metadata": { + "kernelspec": { + "display_name": "Python 3", + "language": "python", + "name": "python3" + }, + "language_info": { + "codemirror_mode": { + "name": "ipython", + "version": 3 + }, + "file_extension": ".py", + "mimetype": "text/x-python", + "name": "python", + "nbconvert_exporter": "python", + "pygments_lexer": "ipython3", + "version": "3.5.2" + } + }, + "nbformat": 4, + "nbformat_minor": 2 +} diff --git a/keras/4.4-regularization-and-dropout.ipynb b/keras/4.4-regularization-and-dropout.ipynb new file mode 100644 index 0000000..3c926d5 --- /dev/null +++ b/keras/4.4-regularization-and-dropout.ipynb @@ -0,0 +1,53 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "First of all, set environment variables and initialize spark context:" + ] + }, + { + "cell_type": 
"code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "%env SPARK_DRIVER_MEMORY=8g\n", + "%env PYSPARK_PYTHON=/usr/bin/python3.5\n", + "%env PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n", + "\n", + "from zoo.common.nncontext import *\n", + "sc = init_nncontext(init_spark_conf().setMaster(\"local[4]\"))" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [] + } + ], + "metadata": { + "kernelspec": { + "display_name": "Python 3", + "language": "python", + "name": "python3" + }, + "language_info": { + "codemirror_mode": { + "name": "ipython", + "version": 3 + }, + "file_extension": ".py", + "mimetype": "text/x-python", + "name": "python", + "nbconvert_exporter": "python", + "pygments_lexer": "ipython3", + "version": "3.5.2" + } + }, + "nbformat": 4, + "nbformat_minor": 2 +} diff --git a/keras/5.1-cnn.ipynb b/keras/5.1-cnn.ipynb new file mode 100644 index 0000000..d36c4aa --- /dev/null +++ b/keras/5.1-cnn.ipynb @@ -0,0 +1,126 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "First of all, set environment variables and initialize spark context:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "%env SPARK_DRIVER_MEMORY=8g\n", + "%env PYSPARK_PYTHON=/usr/bin/python3.5\n", + "%env PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n", + "\n", + "from zoo.common.nncontext import *\n", + "sc = init_nncontext(init_spark_conf().setMaster(\"local[4]\"))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "First, let's take a practical look at a very simple convnet example. 
We will use our convnet to classify MNIST digits, a task that you've already been through in Chapter 2.\n", + "\n", + "First load the MNIST dataset:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "from keras.datasets import mnist\n", + "(train_images, train_labels), (test_images, test_labels) = mnist.load_data()" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "#### CNN input shape\n", + "Once we get the dataset, we need to reshape the images. In keras the shape of the dataset is `(sample_size, height, width, channel)`, like the Keras code below:\n", + " \n", + " train_images = train_images.reshape((60000, 28, 28, 1))\n", + "In analytics-zoo, you need to use a little different order, `(sample_size, channel, height, width)`, like following:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "train_images = train_images.reshape((60000, 1, 28, 28))\n", + "train_images = train_images.astype('float32') / 255\n", + "\n", + "test_images = test_images.reshape((10000, 1, 28, 28))\n", + "test_images = test_images.astype('float32') / 255\n", + "\n", + "from keras.utils.np_utils import to_categorical\n", + "train_labels = to_categorical(train_labels)\n", + "test_labels = to_categorical(test_labels)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Then build the model, compile, train and evaluate:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "from zoo.pipeline.api.keras import layers\n", + "from zoo.pipeline.api.keras import models\n", + "\n", + "model = models.Sequential()\n", + "model.add(layers.Conv2D(32, nb_col=3, nb_row=3, activation='relu', input_shape=(1,28,28)))\n", + "model.add(layers.MaxPooling2D((2, 2)))\n", + "model.add(layers.Conv2D(64, nb_col=3, nb_row=3, activation='relu'))\n", + 
"model.add(layers.MaxPooling2D((2, 2)))\n", + "model.add(layers.Conv2D(64, nb_col=3, nb_row=3, activation='relu'))\n", + "\n", + "model.summary()\n", + "\n", + "model.add(layers.Flatten())\n", + "model.add(layers.Dense(64, activation='relu'))\n", + "model.add(layers.Dense(10, activation='softmax'))\n", + "\n", + "# Compile and train before evaluating (settings mirror the 2.1 MNIST notebook; batch_size=64 is an assumption)\n", + "model.compile(optimizer='rmsprop',\n", + " loss='categorical_crossentropy',\n", + " metrics=['accuracy'])\n", + "model.fit(train_images, train_labels, nb_epoch=5, batch_size=64)\n", + "\n", + "test_result = model.evaluate(test_images, test_labels)\n", + "print('test_acc:', test_result[0].result)" + ] + } + ], + "metadata": { + "kernelspec": { + "display_name": "Python 3", + "language": "python", + "name": "python3" + }, + "language_info": { + "codemirror_mode": { + "name": "ipython", + "version": 3 + }, + "file_extension": ".py", + "mimetype": "text/x-python", + "name": "python", + "nbconvert_exporter": "python", + "pygments_lexer": "ipython3", + "version": "3.5.2" + } + }, + "nbformat": 4, + "nbformat_minor": 2 +} diff --git a/keras/6.2-rnn.ipynb b/keras/6.2-rnn.ipynb new file mode 100644 index 0000000..16817b9 --- /dev/null +++ b/keras/6.2-rnn.ipynb @@ -0,0 +1,171 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "First of all, set environment variables and initialize spark context:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "%env SPARK_DRIVER_MEMORY=8g\n", + "%env PYSPARK_PYTHON=/usr/bin/python3.5\n", + "%env PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n", + "\n", + "from zoo.common.nncontext import *\n", + "sc = init_nncontext(init_spark_conf().setMaster(\"local[4]\"))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "# RNN\n" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "from zoo.pipeline.api.keras.models import Sequential\n", + "from zoo.pipeline.api.keras.layers import Embedding, SimpleRNN" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "#### Specify input shape" + ] + }, + { + "cell_type": "code", + 
"execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "model = Sequential()\n", + "model.add(Embedding(10000, 32, input_shape=(500,)))\n", + "model.add(SimpleRNN(32, return_sequences=True))\n", + "model.add(SimpleRNN(32, return_sequences=True))\n", + "model.add(SimpleRNN(32, return_sequences=True))\n", + "model.add(SimpleRNN(32)) # This last layer only returns the last outputs.\n", + "model.summary()" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "from keras.datasets import imdb\n", + "from keras.preprocessing import sequence\n", + "\n", + "max_features = 10000 # number of words to consider as features\n", + "maxlen = 500 # cut texts after this number of words (among top max_features most common words)\n", + "batch_size = 32\n", + "\n", + "(input_train, y_train), (input_test, y_test) = imdb.load_data(nb_words=max_features)\n", + "input_train = sequence.pad_sequences(input_train, maxlen=maxlen)\n", + "input_test = sequence.pad_sequences(input_test, maxlen=maxlen)\n", + "print('input_train shape:', input_train.shape)\n", + "print('input_test shape:', input_test.shape)" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "from zoo.pipeline.api.keras.layers import Dense\n", + "\n", + "model = Sequential()\n", + "model.add(Embedding(max_features, 32, input_shape=(500,)))\n", + "model.add(SimpleRNN(32))\n", + "model.add(Dense(1, activation='sigmoid'))\n", + "\n", + "model.compile(optimizer='rmsprop', loss='binary_crossentropy', metrics=['acc'])\n", + "history = model.fit(input_train, y_train,\n", + " nb_epoch=10,\n", + " batch_size=128,\n", + " #validation_split=0.2\n", + " )" + ] + }, + { + "cell_type": 
"code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "from zoo.pipeline.api.keras.layers import LSTM\n", + "\n", + "model = Sequential()\n", + "model.add(Embedding(max_features, 32, input_shape=(500,)))\n", + "model.add(LSTM(32))\n", + "model.add(Dense(1, activation='sigmoid'))\n", + "\n", + "model.compile(optimizer='rmsprop',\n", + " loss='binary_crossentropy',\n", + " metrics=['acc'])\n", + "model.set_tensorboard('./', '6-2_summary')\n", + "history = model.fit(input_train, y_train,\n", + " nb_epoch=10,\n", + " batch_size=128,\n", + " #validation_split=0.2\n", + " )" + ] + } + ], + "metadata": { + "kernelspec": { + "display_name": "Python 3", + "language": "python", + "name": "python3" + }, + "language_info": { + "codemirror_mode": { + "name": "ipython", + "version": 3 + }, + "file_extension": ".py", + "mimetype": "text/x-python", + "name": "python", + "nbconvert_exporter": "python", + "pygments_lexer": "ipython3", + "version": "3.5.2" + } + }, + "nbformat": 4, + "nbformat_minor": 2 +} From e4549ba85b51ee31ebfb934061425ff536da7abf Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Tue, 26 Feb 2019 09:55:16 +0800 Subject: [PATCH 03/46] Update README.md --- keras/README.md | 6 ++---- 1 file changed, 2 insertions(+), 4 deletions(-) diff --git a/keras/README.md b/keras/README.md index 67cc57a..b6a6da1 100644 --- a/keras/README.md +++ b/keras/README.md @@ -1,11 +1,9 @@ # Tutorials for running distribued Keras (v1.2.2) on Analytics Zoo Tutorials for running _**distribued Keras (v1.2.2) on Analytics Zoo**_. 
These tutorials are ported from François Chollet's [Jupyter notebooks](https://github.com/fchollet/deep-learning-with-python-notebooks) for the book [Deep Learning with Python (Manning Publications)](https://www.manning.com/books/deep-learning-with-python?a_aid=keras&a_bid=76564dff) -This repository is built to describe how to write Keras-style code and directly run it in analytics-zoo so that we could run the original Keras code in distributed mode via limited modification. We make this repository based on notebook https://github.com/fchollet/deep-learning-with-python-notebooks#companion-jupyter-notebooks-for-the-book-deep-learning-with-python, which is the sample code repository of book "Deep Learning with Python" based on Keras 2.0.8 and we pick some basic chapters from it. +This repository is built to describe how to write Keras-style code and directly run it in analytics-zoo so that we could run the original Keras code in distributed mode via limited modification. The original notebook is based on Keras 2.0.8. Currently analytics-zoo builds its Keras-style code based on Keras 1.2.2. Thus, this repository contains the code of implementation of original Keras 2.0.8 code based on Keras 1.2.2, and how to modify the code in order to run it in analytics-zoo. -Currently analytics-zoo builds its Keras-style code based on Keras 1.2.2. Thus, this repository contains the code of implementation of original Keras 2.0.8 code based on Keras 1.2.2, and how to modify the code in order to run it in analytics-zoo. - -To make it simple, we omit the description of plenty concepts here and you could find them in original notebook above [(link)](https://github.com/fchollet/deep-learning-with-python-notebooks#companion-jupyter-notebooks-for-the-book-deep-learning-with-python). Besides, we directly post Keras 1.2.2 code here and the replacements needed from Keras 2.0.8 to Keras 1.2.2 are noted in `Keras_2-to-1.md`. 
+To make it simple, we omit the description of plenty concepts here and you could find them in original notebook above [(link)](https://github.com/fchollet/deep-learning-with-python-notebooks). Besides, we directly post Keras 1.2.2 code here and the replacements needed from Keras 2.0.8 to Keras 1.2.2 are noted in `Keras_2-to-1.md`. This repository use Python 3.5, Keras 1.2.2 (Keras code), Analytics-zoo 0.4.0 (zoo code). We post the summary of Keras-to-zoo code convertion as well as the table of contents. From f229f22f1f4b5f0c8075001e65ef279f796031f8 Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Tue, 26 Feb 2019 15:56:10 +0800 Subject: [PATCH 04/46] Update README.md --- keras/README.md | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/keras/README.md b/keras/README.md index b6a6da1..0ca5e8a 100644 --- a/keras/README.md +++ b/keras/README.md @@ -38,10 +38,10 @@ The return of `evaluate` method is an `EvaluationResult` object, which is differ The return of `predict` method is RDD, so you need to call `collect` method to collect them, see [Chapter 3.5]() #### Parameters not supported -Analytics-zoo does not support following parameters currently +Analytics-zoo does not support following parameters currently, so we just comment these code in this notebook. 
-* validation_split: -* verbose: +* validation_split: To split the training data into two parts, another part is used for validation +* verbose: To control the information showed during training ## Table of contents From 29897ec0374bd3073bf40df630f168faaa78c695 Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Tue, 26 Feb 2019 15:58:00 +0800 Subject: [PATCH 05/46] Update README.md --- keras/README.md | 14 +++++++------- 1 file changed, 7 insertions(+), 7 deletions(-) diff --git a/keras/README.md b/keras/README.md index 0ca5e8a..a70481b 100644 --- a/keras/README.md +++ b/keras/README.md @@ -48,16 +48,16 @@ Analytics-zoo does not support following parameters currently, so we just commen The main purpose of this repository is to decribe the convertion from Keras to analytics-zoo so that we rename the notebooks to make it more experienced-user oriented. We keep the chapter index so you could still make a reference to the original notebook [here](https://github.com/fchollet/deep-learning-with-python-notebooks#companion-jupyter-notebooks-for-the-book-deep-learning-with-python). We only keep some key description of the original notebook. 
* Chapter 2: - * [2.1: MNIST]() + * [2.1: MNIST](https://github.com/Litchilitchy/zoo-tutorials/blob/master/keras/2.1-mnist.ipynb) * Chapter 3: - * [3.5: Binary classification]() - * [3.6: Multi-class classification]() - * [3.7: Regression]() + * [3.5: Binary classification](https://github.com/Litchilitchy/zoo-tutorials/blob/master/keras/3.5-binary-classification.ipynb) + * [3.6: Multi-class classification](https://github.com/Litchilitchy/zoo-tutorials/blob/master/keras/3.6-multi-class-classification.ipynb) + * [3.7: Regression](https://github.com/Litchilitchy/zoo-tutorials/blob/master/keras/3.7-regression.ipynb) * Chapter 4: - * [4.4: Regularization and Dropout]() + * [4.4: Regularization and Dropout](https://github.com/Litchilitchy/zoo-tutorials/blob/master/keras/4.4-regularization-and-dropout.ipynb) * Chapter 5: - * [5.1: CNN]() + * [5.1: CNN](https://github.com/Litchilitchy/zoo-tutorials/blob/master/keras/5.1-cnn.ipynb) * Chapter 6: - * [6.2: RNN]() + * [6.2: RNN](https://github.com/Litchilitchy/zoo-tutorials/blob/master/keras/6.2-rnn.ipynb) From 6fad60123452328cc61b02cc147690adb8735c77 Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Tue, 26 Feb 2019 15:59:19 +0800 Subject: [PATCH 06/46] Add files via upload --- keras/3.7-regression.ipynb | 36 +++++---- keras/4.4-regularization-and-dropout.ipynb | 92 +++++++++++++++++++++- keras/5.1-cnn.ipynb | 5 +- keras/6.2-rnn.ipynb | 31 +++++--- 4 files changed, 134 insertions(+), 30 deletions(-) diff --git a/keras/3.7-regression.ipynb b/keras/3.7-regression.ipynb index b2547a2..98c69d9 100644 --- a/keras/3.7-regression.ipynb +++ b/keras/3.7-regression.ipynb @@ -21,6 +21,16 @@ "sc = init_nncontext(init_spark_conf().setMaster(\"local[4]\"))" ] }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "# Regression\n", + "We will be attempting to predict the median price of homes in a given Boston suburb in the mid-1970s, given a few data points about the suburb at the time, such as the crime rate, the local 
property tax rate, etc.\n", + "\n", + "The dataset we will be using has another interesting difference from our two previous examples: it has very few data points, only 506 in total, split between 404 training samples and 102 test samples, and each \"feature\" in the input data (e.g. the crime rate is a feature) has a different scale. For instance some values are proportions, which take a values between 0 and 1, others take values between 1 and 12, others between 0 and 100..." + ] + }, { "cell_type": "markdown", "metadata": {}, @@ -79,11 +89,13 @@ ] }, { - "cell_type": "code", - "execution_count": null, + "cell_type": "markdown", "metadata": {}, - "outputs": [], - "source": [] + "source": [ + "In this example we have so few data points, the validation set would end up being very small (e.g. about 100 examples). A consequence is that our validation scores may change a lot depending on which data points we choose to use for validation and which we choose for training, i.e. the validation scores may have a high variance with regard to the validation split. This would prevent us from reliably evaluating our model.\n", + "\n", + "The best practice in such situations is to use K-fold cross-validation. It consists of splitting the available data into K partitions (typically K=4 or 5), then instantiating K identical models, and training each one on K-1 partitions while evaluating on the remaining partition. The validation score for the model used would then be the average of the K validation scores obtained." 
+ ] }, { "cell_type": "code", @@ -128,11 +140,11 @@ ] }, { - "cell_type": "code", - "execution_count": null, + "cell_type": "markdown", "metadata": {}, - "outputs": [], - "source": [] + "source": [ + "Then we could check our K-fold training result:" + ] }, { "cell_type": "code", @@ -140,13 +152,7 @@ "metadata": {}, "outputs": [], "source": [ - "model = build_model()\n", - "# Train it on the entirety of the data.\n", - "model.fit(train_data, train_targets,\n", - " nb_epoch=80, batch_size=16)\n", - "test_result = model.evaluate(test_data, test_targets)\n", - "\n", - "print('test result:', test_result[0].result)" + "all_scores" ] } ], diff --git a/keras/4.4-regularization-and-dropout.ipynb b/keras/4.4-regularization-and-dropout.ipynb index 3c926d5..c79d8e1 100644 --- a/keras/4.4-regularization-and-dropout.ipynb +++ b/keras/4.4-regularization-and-dropout.ipynb @@ -21,12 +21,102 @@ "sc = init_nncontext(init_spark_conf().setMaster(\"local[4]\"))" ] }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "# Regularization and dropout\n", + "Let's review some of the most common techniques to prevent overfitting, and let's apply them in practice to improve our movie classification model from the previous chapter. First let's prepare the data using the code from Chapter 3.5 and preprocess the data" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "from keras.datasets import imdb\n", + "import numpy as np\n", + "(train_data, train_labels), (test_data, test_labels) = imdb.load_data(nb_words=10000)\n", + "\n", + "def vectorize_sequences(sequences, dimension=10000):\n", + " # Create an all-zero matrix of shape (len(sequences), dimension)\n", + " results = np.zeros((len(sequences), dimension))\n", + " for i, sequence in enumerate(sequences):\n", + " results[i, sequence] = 1. 
# set specific indices of results[i] to 1s\n", + " return results\n", + "\n", + "x_train = vectorize_sequences(train_data)\n", + "x_test = vectorize_sequences(test_data)\n", + "\n", + "y_train = np.asarray(train_labels).astype('float32')\n", + "y_test = np.asarray(test_labels).astype('float32')" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Let's add L2 weight regularization to our movie review classification network:" + ] + }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], - "source": [] + "source": [ + "from zoo.pipeline.api.keras import models\n", + "from zoo.pipeline.api.keras import layers\n", + "\n", + "from zoo.pipeline.api.keras import regularizers\n", + "\n", + "l2_model = models.Sequential()\n", + "l2_model.add(layers.Dense(16, W_regularizer=regularizers.l2(0.001),\n", + " activation='relu', input_shape=(10000,)))\n", + "l2_model.add(layers.Dense(16, W_regularizer=regularizers.l2(0.001),\n", + " activation='relu'))\n", + "l2_model.add(layers.Dense(1, activation='sigmoid'))\n", + "\n", + "l2_model.compile(optimizer='rmsprop',\n", + " loss='binary_crossentropy',\n", + " metrics=['acc'])\n", + "\n", + "l2_model_hist = l2_model.fit(x_train, y_train,\n", + " nb_epoch=20,\n", + " batch_size=512,\n", + " validation_data=(x_test, y_test))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Let's add two Dropout layers in our IMDB network to see how well they do at reducing overfitting:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "dpt_model = models.Sequential()\n", + "dpt_model.add(layers.Dense(16, activation='relu', input_shape=(10000,)))\n", + "dpt_model.add(layers.Dropout(0.5))\n", + "dpt_model.add(layers.Dense(16, activation='relu'))\n", + "dpt_model.add(layers.Dropout(0.5))\n", + "dpt_model.add(layers.Dense(1, activation='sigmoid'))\n", + "\n", + "dpt_model.compile(optimizer='rmsprop',\n", + " 
loss='binary_crossentropy',\n", + " metrics=['acc'])\n", + "\n", + "dpt_model_hist = dpt_model.fit(x_train, y_train,\n", + " nb_epoch=20,\n", + " batch_size=512,\n", + " validation_data=(x_test, y_test))" + ] } ], "metadata": { diff --git a/keras/5.1-cnn.ipynb b/keras/5.1-cnn.ipynb index d36c4aa..161499e 100644 --- a/keras/5.1-cnn.ipynb +++ b/keras/5.1-cnn.ipynb @@ -25,6 +25,7 @@ "cell_type": "markdown", "metadata": {}, "source": [ + "# CNN\n", "First, let's take a practical look at a very simple convnet example. We will use our convnet to classify MNIST digits, a task that you've already been through in Chapter 2.\n", "\n", "First load the MNIST dataset:" @@ -48,7 +49,9 @@ "Once we get the dataset, we need to reshape the images. In keras the shape of the dataset is `(sample_size, height, width, channel)`, like the Keras code below:\n", " \n", " train_images = train_images.reshape((60000, 28, 28, 1))\n", - "In analytics-zoo, you need to use a little different order, `(sample_size, channel, height, width)`, like following:" + "In analytics-zoo, the default order is theano-style NCHW `(sample_size, channel, height, width)`, so you can process data like following:\n", + "\n", + "Alternatively, you can also use tensorflow-style NHWC as Keras default just by setting `Convolution2D(dim_ordering=\"tf\")`" ] }, { diff --git a/keras/6.2-rnn.ipynb b/keras/6.2-rnn.ipynb index 16817b9..5c68e6d 100644 --- a/keras/6.2-rnn.ipynb +++ b/keras/6.2-rnn.ipynb @@ -42,7 +42,12 @@ "cell_type": "markdown", "metadata": {}, "source": [ - "#### Specify input shape" + "#### Specify input shape\n", + "We could add an embedding layer as our first layer in Keras as following:\n", + " \n", + " model = Sequential()\n", + " model.add(Embedding(10000, 32))\n", + "In analytics-zoo, you need to specify the input shape of first layer, in this example, the sequence length is 500, so we could build our model as following:" ] }, { @@ -61,11 +66,11 @@ ] }, { - "cell_type": "code", - "execution_count": 
null, + "cell_type": "markdown", "metadata": {}, - "outputs": [], - "source": [] + "source": [ + "Now let's try to use such a model on the IMDB movie review classification problem. First, let's preprocess the data:" + ] }, { "cell_type": "code", @@ -88,11 +93,11 @@ ] }, { - "cell_type": "code", - "execution_count": null, + "cell_type": "markdown", "metadata": {}, - "outputs": [], - "source": [] + "source": [ + "Let's train a simple recurrent network using an `Embedding` layer and a `SimpleRNN` layer:" + ] }, { "cell_type": "code", @@ -116,11 +121,11 @@ ] }, { - "cell_type": "code", - "execution_count": null, + "cell_type": "markdown", "metadata": {}, - "outputs": [], - "source": [] + "source": [ + "Now let's switch to more practical concerns: we will set up a model using a LSTM layer and train it on the IMDB data." + ] }, { "cell_type": "code", From cdcf30b5a97eb0ec76e57ad738a8d0a117047d28 Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Thu, 28 Feb 2019 15:01:23 +0800 Subject: [PATCH 07/46] Update README.md --- keras/README.md | 8 ++++++++ 1 file changed, 8 insertions(+) diff --git a/keras/README.md b/keras/README.md index a70481b..99282e6 100644 --- a/keras/README.md +++ b/keras/README.md @@ -5,6 +5,8 @@ This repository is built to describe how to write Keras-style code and directly To make it simple, we omit the description of plenty concepts here and you could find them in original notebook above [(link)](https://github.com/fchollet/deep-learning-with-python-notebooks). Besides, we directly post Keras 1.2.2 code here and the replacements needed from Keras 2.0.8 to Keras 1.2.2 are noted in `Keras_2-to-1.md`. +The training log of each epoch is written at INFO level; you can see this information by configuring the Python log level, or store it in a file and check it afterwards. It may include many lines of information. 
For readability, we run the code in the PyCharm IDE and **we only paste the last log line of the last epoch of each training call in this notebook**. + This repository use Python 3.5, Keras 1.2.2 (Keras code), Analytics-zoo 0.4.0 (zoo code). We post the summary of Keras-to-zoo code convertion as well as the table of contents. ## Summary of Keras-to-zoo code convertion @@ -37,6 +39,12 @@ The return of `evaluate` method is an `EvaluationResult` object, which is differ #### Predict result The return of `predict` method is RDD, so you need to call `collect` method to collect them, see [Chapter 3.5]() +#### CNN input shape +In analytics-zoo, the default order is theano-style NCHW, you can also use tensorflow-style NHWC as Keras default just by setting `Convolution2D(dim_ordering="tf")`, see [Chapter 5.1]() + +#### Specify input shape +In analytics-zoo, you need to specify the input shape of first layer, see [Chapter 6.2]() + #### Parameters not supported Analytics-zoo does not support following parameters currently, so we just comment these code in this notebook. From a0b235b52207b3f118e9a5b4d2c69bb733ef5d9e Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Thu, 28 Feb 2019 15:03:11 +0800 Subject: [PATCH 08/46] Update README.md --- keras/README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/keras/README.md b/keras/README.md index 99282e6..73d00d8 100644 --- a/keras/README.md +++ b/keras/README.md @@ -46,7 +46,7 @@ In analytics-zoo, the default order is theano-style NCHW, you can also use tenso In analytics-zoo, you need to specify the input shape of first layer, see [Chapter 6.2]() #### Parameters not supported -Analytics-zoo does not support following parameters currently, so we just comment these code in this notebook. +Analytics-zoo does not currently support the following parameters, so we do not use them in this notebook even where they appear in the original notebook. 
* validation_split: To split the training data into two parts, another part is used for validation * verbose: To control the information showed during training From 6bfa0ab7359b854621308e0e48115c16d230497a Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Thu, 28 Feb 2019 15:04:12 +0800 Subject: [PATCH 09/46] Add files via upload --- keras/2.1-mnist.ipynb | 21 +++-- keras/3.5-binary-classification.ipynb | 14 +++- keras/3.6-multi-class-classification.ipynb | 39 +++++++-- keras/3.7-regression.ipynb | 56 ++++++++++++- keras/4.4-regularization-and-dropout.ipynb | 70 +++++++++++++--- keras/5.1-cnn.ipynb | 33 ++++++-- keras/6.2-rnn.ipynb | 92 +++++++++++++--------- 7 files changed, 258 insertions(+), 67 deletions(-) diff --git a/keras/2.1-mnist.ipynb b/keras/2.1-mnist.ipynb index d971f5d..cbf64b7 100644 --- a/keras/2.1-mnist.ipynb +++ b/keras/2.1-mnist.ipynb @@ -81,7 +81,7 @@ "network.add(layers.Dense(10, activation='softmax'))\n", "\n", "network.compile(optimizer='rmsprop',\n", - " loss='categorical_crossentropy',\n", + " loss='sparse_categorical_crossentropy',\n", " metrics=['accuracy'])\n", "\n", "train_images = train_images.reshape((60000, 28 * 28))\n", @@ -90,14 +90,16 @@ "test_images = test_images.reshape((10000, 28 * 28))\n", "test_images = test_images.astype('float32') / 255\n", "\n", - "\n", - "from keras.utils.np_utils import to_categorical\n", - "train_labels = to_categorical(train_labels)\n", - "#test_labels = to_categorical(test_labels)\n", - "\n", "network.fit(train_images, train_labels, nb_epoch=5, batch_size=128)" ] }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Trained 128 records in 0.018066358 seconds. Throughput is 7084.992 records/second. Loss is 0.012087556." 
+ ] + }, { "cell_type": "markdown", "metadata": {}, @@ -118,6 +120,13 @@ "test_result = network.evaluate(test_images, test_labels, batch_size=32)\n", "print('test_acc:', test_result[0].result)" ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "test_acc: 0.9783999919891357" + ] } ], "metadata": { diff --git a/keras/3.5-binary-classification.ipynb b/keras/3.5-binary-classification.ipynb index 929a731..79a17cc 100644 --- a/keras/3.5-binary-classification.ipynb +++ b/keras/3.5-binary-classification.ipynb @@ -102,7 +102,9 @@ " )\n", "After `fit` method finishes, the results are stored in `history` and thus could be visualized.\n", "\n", - "Currently in analytics-zoo, `fit` method does not have any return. Results can only be checked via tensorboard. Code above need to be replaced with following:" + "Currently in analytics-zoo, `fit` method does not have any return. Results can only be checked via tensorboard. Code above need to be replaced with following:\n", + "\n", + "Then you could see result in tensorboard by command in terminal `tensorboard --logdir ./` starting tensorboard service and go to `localhost:port_number` as shown in your terminal." ] }, { @@ -124,7 +126,8 @@ "cell_type": "markdown", "metadata": {}, "source": [ - "Then you could see result in tensorboard by command in terminal `tensorboard --logdir ./`" + "Trained 512 records in 0.022409147 seconds. Throughput is 22847.812 records/second. 
Loss is 0.21351601.\n", + "Top1Accuracy is Accuracy(correct: 8606, count: 10000, accuracy: 0.8606)" ] }, { @@ -144,6 +147,13 @@ "print('test_acc:', results[0].result)" ] }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "test_acc: 0.8624799847602844" + ] + }, { "cell_type": "markdown", "metadata": {}, diff --git a/keras/3.6-multi-class-classification.ipynb b/keras/3.6-multi-class-classification.ipynb index 7e8c9ff..0655565 100644 --- a/keras/3.6-multi-class-classification.ipynb +++ b/keras/3.6-multi-class-classification.ipynb @@ -71,7 +71,11 @@ "# this part pending to modify, one-hot or integer issue\n", "\n", "x_val = x_train[:1000]\n", - "partial_x_train = x_train[1000:]" + "partial_x_train = x_train[1000:]\n", + "\n", + "y_val = train_labels[:1000]\n", + "partial_y_train = train_labels[1000:] # this line would return list\n", + "partial_y_train = np.array(partial_y_train) # convert list to ndarray" ] }, { @@ -93,16 +97,39 @@ "model.add(layers.Dense(46, activation='softmax'))\n", "\n", "model.compile(optimizer='rmsprop',\n", - " loss='categorical_crossentropy',\n", + " loss='sparse_categorical_crossentropy',\n", " metrics=['accuracy'])\n", - "model.set_tensorboard('./', '3-6_summary')\n", + "\n", "model.fit(partial_x_train,\n", " partial_y_train,\n", " nb_epoch=20,\n", " batch_size=512,\n", - " validation_data=(x_val, y_val))\n", - "\n", - "results = model.evaluate(x_test, one_hot_test_labels)" + " validation_data=(x_val, y_val))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Trained 512 records in 0.03322949 seconds. Throughput is 15408.001 records/second. 
Loss is 0.36856997.\n", + "Top1Accuracy is Accuracy(correct: 808, count: 1000, accuracy: 0.808)" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "test_results = model.evaluate(x_test, test_labels)\n", + "print (test_results[0].result)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "test_acc: 0.7885128855705261" ] } ], diff --git a/keras/3.7-regression.ipynb b/keras/3.7-regression.ipynb index 98c69d9..769e343 100644 --- a/keras/3.7-regression.ipynb +++ b/keras/3.7-regression.ipynb @@ -94,7 +94,38 @@ "source": [ "In this example we have so few data points, the validation set would end up being very small (e.g. about 100 examples). A consequence is that our validation scores may change a lot depending on which data points we choose to use for validation and which we choose for training, i.e. the validation scores may have a high variance with regard to the validation split. This would prevent us from reliably evaluating our model.\n", "\n", - "The best practice in such situations is to use K-fold cross-validation. It consists of splitting the available data into K partitions (typically K=4 or 5), then instantiating K identical models, and training each one on K-1 partitions while evaluating on the remaining partition. The validation score for the model used would then be the average of the K validation scores obtained." + "The best practice in such situations is to use K-fold cross-validation. It consists of splitting the available data into K partitions (typically K=4 or 5), then instantiating K identical models, and training each one on K-1 partitions while evaluating on the remaining partition. 
The validation score for the model used would then be the average of the K validation scores obtained.\n", + "\n", + "Since we are using K-fold so that we have to build the model multiple times, we use following function to build our model:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "from zoo.pipeline.api.keras import models\n", + "from zoo.pipeline.api.keras import layers\n", + "\n", + "def build_model():\n", + " # Because we will need to instantiate\n", + " # the same model multiple times,\n", + " # we use a function to construct it.\n", + " model = models.Sequential()\n", + " model.add(layers.Dense(64, activation='relu',\n", + " input_shape=(train_data.shape[1],)))\n", + " model.add(layers.Dense(64, activation='relu'))\n", + " model.add(layers.Dense(1))\n", + " model.compile(optimizer='rmsprop', loss='mse', metrics=['mae'])\n", + " return model" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Then let's start our training:" ] }, { @@ -139,6 +170,20 @@ " all_scores.append(val_mae[0].result)" ] }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "processing fold # 0\n", + "Trained 16 records in 0.011235845 seconds. Throughput is 1424.0139 records/second. Loss is 8.708786.\n", + "processing fold # 1\n", + "Trained 16 records in 0.009535034 seconds. Throughput is 1678.0223 records/second. Loss is 5.3613434.\n", + "processing fold # 2\n", + "Trained 16 records in 0.008636178 seconds. Throughput is 1852.6713 records/second. Loss is 18.106756.\n", + "processing fold # 3\n", + "Trained 16 records in 0.009207628 seconds. Throughput is 1737.6897 records/second. Loss is 7.0931993." 
+ ] + }, { "cell_type": "markdown", "metadata": {}, @@ -152,7 +197,14 @@ "metadata": {}, "outputs": [], "source": [ - "all_scores" + "print (all_scores)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "[20.572654724121094, 19.606250762939453, 21.224998474121094, 22.60078239440918]" ] } ], diff --git a/keras/4.4-regularization-and-dropout.ipynb b/keras/4.4-regularization-and-dropout.ipynb index c79d8e1..5ed40ab 100644 --- a/keras/4.4-regularization-and-dropout.ipynb +++ b/keras/4.4-regularization-and-dropout.ipynb @@ -57,7 +57,7 @@ "cell_type": "markdown", "metadata": {}, "source": [ - "Let's add L2 weight regularization to our movie review classification network:" + "Let's first try an easy model and try to optimize it afterwards:" ] }, { @@ -69,6 +69,42 @@ "from zoo.pipeline.api.keras import models\n", "from zoo.pipeline.api.keras import layers\n", "\n", + "original_model = models.Sequential()\n", + "original_model.add(layers.Dense(16, activation='relu', input_shape=(10000,)))\n", + "original_model.add(layers.Dense(16, activation='relu'))\n", + "original_model.add(layers.Dense(1, activation='sigmoid'))\n", + "\n", + "original_model.compile(optimizer='rmsprop',\n", + " loss='binary_crossentropy',\n", + " metrics=['acc'])\n", + "\n", + "original_model.fit(x_train, y_train,\n", + " epochs=20,\n", + " batch_size=512,\n", + " validation_data=(x_test, y_test))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Trained 512 records in 0.024455326 seconds. Throughput is 20936.135 records/second. 
Loss is 0.01585226.\n", + "Top1Accuracy is Accuracy(correct: 21341, count: 25000, accuracy: 0.85364)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Let's add L2 weight regularization to our movie review classification network:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ "from zoo.pipeline.api.keras import regularizers\n", "\n", "l2_model = models.Sequential()\n", @@ -82,10 +118,18 @@ " loss='binary_crossentropy',\n", " metrics=['acc'])\n", "\n", - "l2_model_hist = l2_model.fit(x_train, y_train,\n", - " nb_epoch=20,\n", - " batch_size=512,\n", - " validation_data=(x_test, y_test))" + "l2_model.fit(x_train, y_train,\n", + " nb_epoch=20,\n", + " batch_size=512,\n", + " validation_data=(x_test, y_test))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Trained 512 records in 0.024366594 seconds. Throughput is 21012.373 records/second. Loss is 0.13651785.\n", + "Top1Accuracy is Accuracy(correct: 21684, count: 25000, accuracy: 0.86736)" ] }, { @@ -112,10 +156,18 @@ " loss='binary_crossentropy',\n", " metrics=['acc'])\n", "\n", - "dpt_model_hist = dpt_model.fit(x_train, y_train,\n", - " nb_epoch=20,\n", - " batch_size=512,\n", - " validation_data=(x_test, y_test))" + "dpt_model.fit(x_train, y_train,\n", + " nb_epoch=20,\n", + " batch_size=512,\n", + " validation_data=(x_test, y_test))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Trained 512 records in 0.017992654 seconds. Throughput is 28456.057 records/second. Loss is 0.112769656. 
\n", + "Top1Accuracy is Accuracy(correct: 21871, count: 25000, accuracy: 0.87484)" ] } ], diff --git a/keras/5.1-cnn.ipynb b/keras/5.1-cnn.ipynb index 161499e..72786d2 100644 --- a/keras/5.1-cnn.ipynb +++ b/keras/5.1-cnn.ipynb @@ -64,11 +64,7 @@ "train_images = train_images.astype('float32') / 255\n", "\n", "test_images = test_images.reshape((10000, 1, 28, 28))\n", - "test_images = test_images.astype('float32') / 255\n", - "\n", - "from keras.utils.np_utils import to_categorical\n", - "train_labels = to_categorical(train_labels)\n", - "test_labels = to_categorical(test_labels)" + "test_images = test_images.astype('float32') / 255" ] }, { @@ -100,9 +96,36 @@ "model.add(layers.Dense(64, activation='relu'))\n", "model.add(layers.Dense(10, activation='softmax'))\n", "\n", + "model.compile(optimizer='rmsprop',\n", + " loss='sparse_categorical_crossentropy',\n", + " metrics=['acc'])\n", + "\n", + "model.fit(train_images, train_labels, nb_epoch=5, batch_size=64)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Trained 64 records in 0.03212866 seconds. Throughput is 1991.9911 records/second. Loss is 0.0023578003." 
+ ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ "test_result = model.evaluate(test_images, test_labels)\n", "print('test_acc:', test_result[0].result)" ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "test_acc: 0.9915000200271606" + ] } ], "metadata": { diff --git a/keras/6.2-rnn.ipynb b/keras/6.2-rnn.ipynb index 5c68e6d..ef1146f 100644 --- a/keras/6.2-rnn.ipynb +++ b/keras/6.2-rnn.ipynb @@ -42,12 +42,7 @@ "cell_type": "markdown", "metadata": {}, "source": [ - "#### Specify input shape\n", - "We could add an embedding layer as our first layer in Keras as following:\n", - " \n", - " model = Sequential()\n", - " model.add(Embedding(10000, 32))\n", - "In analytics-zoo, you need to specify the input shape of first layer, in this example, the sequence length is 500, so we could build our model as following:" + "Now let's try to use such a model on the IMDB movie review classification problem. First, let's preprocess the data:" ] }, { @@ -56,20 +51,38 @@ "metadata": {}, "outputs": [], "source": [ - "model = Sequential()\n", - "model.add(Embedding(10000, 32, input_shape=(500,)))\n", - "model.add(SimpleRNN(32, return_sequences=True))\n", - "model.add(SimpleRNN(32, return_sequences=True))\n", - "model.add(SimpleRNN(32, return_sequences=True))\n", - "model.add(SimpleRNN(32)) # This last layer only returns the last outputs.\n", - "model.summary()" + "from keras.datasets import imdb\n", + "from keras.preprocessing import sequence\n", + "\n", + "max_features = 10000 # number of words to consider as features\n", + "maxlen = 500 # cut texts after this number of words (among top max_features most common words)\n", + "batch_size = 32\n", + "\n", + "(input_train, y_train), (input_test, y_test) = imdb.load_data(nb_words=max_features)\n", + "input_train = sequence.pad_sequences(input_train, maxlen=maxlen)\n", + "input_test = sequence.pad_sequences(input_test, maxlen=maxlen)\n", + 
"print('input_train shape:', input_train.shape)\n", + "print('input_test shape:', input_test.shape)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ - "Now let's try to use such a model on the IMDB movie review classification problem. First, let's preprocess the data:" + "input_train shape: (25000, 500)\n", + "input_test shape: (25000, 500)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "#### Specify input shape\n", + "We could add an embedding layer as our first layer in Keras as following:\n", + " \n", + " model = Sequential()\n", + " model.add(Embedding(10000, 32))\n", + "In analytics-zoo, you need to specify the input shape of first layer, in this example, the sequence length is 500, so we could build our model as following:" ] }, { @@ -78,18 +91,13 @@ "metadata": {}, "outputs": [], "source": [ - "from keras.datasets import imdb\n", - "from keras.preprocessing import sequence\n", - "\n", - "max_features = 10000 # number of words to consider as features\n", - "maxlen = 500 # cut texts after this number of words (among top max_features most common words)\n", - "batch_size = 32\n", - "\n", - "(input_train, y_train), (input_test, y_test) = imdb.load_data(nb_words=max_features)\n", - "input_train = sequence.pad_sequences(input_train, maxlen=maxlen)\n", - "input_test = sequence.pad_sequences(input_test, maxlen=maxlen)\n", - "print('input_train shape:', input_train.shape)\n", - "print('input_test shape:', input_test.shape)" + "model = Sequential()\n", + "model.add(Embedding(10000, 32, input_shape=(500,)))\n", + "model.add(SimpleRNN(32, return_sequences=True))\n", + "model.add(SimpleRNN(32, return_sequences=True))\n", + "model.add(SimpleRNN(32, return_sequences=True))\n", + "model.add(SimpleRNN(32)) # This last layer only returns the last outputs.\n", + "model.summary()" ] }, { @@ -113,11 +121,16 @@ "model.add(Dense(1, activation='sigmoid'))\n", "\n", "model.compile(optimizer='rmsprop', loss='binary_crossentropy', 
metrics=['acc'])\n", - "history = model.fit(input_train, y_train,\n", - " nb_epoch=10,\n", - " batch_size=128,\n", - " #validation_split=0.2\n", - " )" + "model.fit(input_train, y_train,\n", + " nb_epoch=10,\n", + " batch_size=128)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Trained 128 records in 0.046239497 seconds. Throughput is 2768.1963 records/second. Loss is 0.16970885." ] }, { @@ -143,12 +156,17 @@ "model.compile(optimizer='rmsprop',\n", " loss='binary_crossentropy',\n", " metrics=['acc'])\n", - "model.set_tensorboard('./', '6-2_summary')\n", - "history = model.fit(input_train, y_train,\n", - " nb_epoch=10,\n", - " batch_size=128,\n", - " #validation_split=0.2\n", - " )" + "\n", + "model.fit(input_train, y_train,\n", + " nb_epoch=10,\n", + " batch_size=128)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Trained 128 records in 0.335889472 seconds. Throughput is 381.07776 records/second. Loss is 0.14791179." ] } ], From a6abad4cff2d7110a3fc53450f3b4aa864a978e4 Mon Sep 17 00:00:00 2001 From: Jiaming Date: Thu, 28 Feb 2019 17:06:10 +0800 Subject: [PATCH 10/46] delete --- keras/2.1-mnist.ipynb | 153 -------------- keras/3.5-binary-classification.ipynb | 200 ------------------ keras/3.6-multi-class-classification.ipynb | 157 -------------- keras/3.7-regression.ipynb | 232 --------------------- keras/4.4-regularization-and-dropout.ipynb | 195 ----------------- keras/5.1-cnn.ipynb | 152 -------------- keras/6.2-rnn.ipynb | 194 ----------------- 7 files changed, 1283 deletions(-) delete mode 100644 keras/2.1-mnist.ipynb delete mode 100644 keras/3.5-binary-classification.ipynb delete mode 100644 keras/3.6-multi-class-classification.ipynb delete mode 100644 keras/3.7-regression.ipynb delete mode 100644 keras/4.4-regularization-and-dropout.ipynb delete mode 100644 keras/5.1-cnn.ipynb delete mode 100644 keras/6.2-rnn.ipynb diff --git a/keras/2.1-mnist.ipynb b/keras/2.1-mnist.ipynb deleted file mode 100644 
index cbf64b7..0000000 --- a/keras/2.1-mnist.ipynb +++ /dev/null @@ -1,153 +0,0 @@ -{ - "cells": [ - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "First of all, set environment variables and initialize spark context:" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "%env SPARK_DRIVER_MEMORY=8g\n", - "%env PYSPARK_PYTHON=/usr/bin/python3.5\n", - "%env PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n", - "\n", - "from zoo.common.nncontext import *\n", - "sc = init_nncontext(init_spark_conf().setMaster(\"local[4]\"))" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "# MNIST\n", - "\n", - "The problem we are trying to solve here is to classify grayscale images of handwritten digits (28 pixels by 28 pixels), into their 10 categories (0 to 9). The dataset we will use is the MNIST dataset.\n", - "\n", - "The MNIST dataset comes pre-loaded in Keras, in the form of a set of four Numpy arrays." - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "from keras.datasets import mnist\n", - "(train_images, train_labels), (test_images, test_labels) = mnist.load_data()" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Import the modules we need to build the network. 
In Keras it is:\n", - "\n", - " from keras import models\n", - " from keras import layers\n", - "Just replace it with following in order to use analytics-zoo:" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "from zoo.pipeline.api.keras import models\n", - "from zoo.pipeline.api.keras import layers" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Build the network, compile and fit:" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "network = models.Sequential()\n", - "network.add(layers.Dense(512, activation='relu', input_shape=(28 * 28,)))\n", - "network.add(layers.Dense(10, activation='softmax'))\n", - "\n", - "network.compile(optimizer='rmsprop',\n", - " loss='sparse_categorical_crossentropy',\n", - " metrics=['accuracy'])\n", - "\n", - "train_images = train_images.reshape((60000, 28 * 28))\n", - "train_images = train_images.astype('float32') / 255\n", - "\n", - "test_images = test_images.reshape((10000, 28 * 28))\n", - "test_images = test_images.astype('float32') / 255\n", - "\n", - "network.fit(train_images, train_labels, nb_epoch=5, batch_size=128)" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Trained 128 records in 0.018066358 seconds. Throughput is 7084.992 records/second. Loss is 0.012087556." - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "#### Evaluate return\n", - "Check our result on test set. In Keras it is:\n", - "\n", - " test_loss, test_acc = network.evaluate(test_images, test_labels)\n", - "In analytics-zoo, the return of `evaluate` method is an `EvaluationResult` object, which is different from Keras. 
We use following code to check:" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "test_result = network.evaluate(test_images, test_labels, batch_size=32)\n", - "print('test_acc:', test_result[0].result)" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "test_acc: 0.9783999919891357" - ] - } - ], - "metadata": { - "kernelspec": { - "display_name": "Python 3", - "language": "python", - "name": "python3" - }, - "language_info": { - "codemirror_mode": { - "name": "ipython", - "version": 3 - }, - "file_extension": ".py", - "mimetype": "text/x-python", - "name": "python", - "nbconvert_exporter": "python", - "pygments_lexer": "ipython3", - "version": "3.5.2" - } - }, - "nbformat": 4, - "nbformat_minor": 2 -} diff --git a/keras/3.5-binary-classification.ipynb b/keras/3.5-binary-classification.ipynb deleted file mode 100644 index 79a17cc..0000000 --- a/keras/3.5-binary-classification.ipynb +++ /dev/null @@ -1,200 +0,0 @@ -{ - "cells": [ - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "First of all, set environment variables and initialize spark context:" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "%env SPARK_DRIVER_MEMORY=8g\n", - "%env PYSPARK_PYTHON=/usr/bin/python3.5\n", - "%env PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n", - "\n", - "from zoo.common.nncontext import *\n", - "sc = init_nncontext(init_spark_conf().setMaster(\"local[4]\"))" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "# Binary classification\n", - "We'll be working with \"IMDB dataset\", a set of 50,000 highly-polarized reviews from the Internet Movie Database. They are split into 25,000 reviews for training and 25,000 reviews for testing, each set consisting in 50% negative and 50% positive reviews.\n", - "\n", - "Just like the MNIST dataset, the IMDB dataset comes packaged with Keras. 
It has already been preprocessed: the reviews (sequences of words) have been turned into sequences of integers, where each integer stands for a specific word in a dictionary.\n", - "\n", - "The following code will load the dataset (when you run it for the first time, about 80MB of data will be downloaded to your machine).\n", - "\n", - "Then we vectorize the sequences to prepare the data we are going to feed to the model. We also separate part of the data for validation." - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "from keras.datasets import imdb\n", - "(train_data, train_labels), (test_data, test_labels) = imdb.load_data(nb_words=10000)\n", - "\n", - "import numpy as np\n", - "def vectorize_sequences(sequences, dimension=10000):\n", - " # Create an all-zero matrix of shape (len(sequences), dimension)\n", - " results = np.zeros((len(sequences), dimension))\n", - " for i, sequence in enumerate(sequences):\n", - " results[i, sequence] = 1. 
# set specific indices of results[i] to 1s\n", - " return results\n", - "\n", - "y_train = np.asarray(train_labels).astype('float32')\n", - "y_test = np.asarray(test_labels).astype('float32')\n", - "\n", - "x_val = x_train[:10000]\n", - "partial_x_train = x_train[10000:]\n", - "y_val = y_train[:10000]\n", - "partial_y_train = y_train[10000:]" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Build the network, then compile:" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "from zoo.pipeline.api.keras import models\n", - "from zoo.pipeline.api.keras import layers\n", - "\n", - "model = models.Sequential()\n", - "model.add(layers.Dense(16, activation='relu', input_shape=(10000,)))\n", - "model.add(layers.Dense(16, activation='relu'))\n", - "model.add(layers.Dense(1, activation='sigmoid'))\n", - "\n", - "model.compile(optimizer='rmsprop',\n", - " loss='binary_crossentropy',\n", - " metrics=['accuracy'])" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "#### Accuracy checkout\n", - "To checkout the behavior of this model in Keras, the original code use `matplotlib` library to draw the following `history` object\n", - " \n", - " history = model.fit(partial_x_train,\n", - " partial_y_train,\n", - " nb_epoch=5,\n", - " batch_size=512,\n", - " validation_data=(x_val, y_val)\n", - " )\n", - "After `fit` method finishes, the results are stored in `history` and thus could be visualized.\n", - "\n", - "Currently in analytics-zoo, `fit` method does not have any return. Results can only be checked via tensorboard. Code above need to be replaced with following:\n", - "\n", - "Then you could see result in tensorboard by command in terminal `tensorboard --logdir ./` starting tensorboard service and go to `localhost:port_number` as shown in your terminal." 
- ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "model.set_tensorboard('./', '3-5_summary')\n", - "model.fit(partial_x_train,\n", - " partial_y_train,\n", - " nb_epoch=5,\n", - " batch_size=512,\n", - " validation_data=(x_val, y_val)\n", - " )" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Trained 512 records in 0.022409147 seconds. Throughput is 22847.812 records/second. Loss is 0.21351601.\n", - "Top1Accuracy is Accuracy(correct: 8606, count: 10000, accuracy: 0.8606)" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Check the result on test data:" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "results = model.evaluate(x_test, y_test)\n", - "print('test_acc:', results[0].result)" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "test_acc: 0.8624799847602844" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "#### Predict result\n", - "Predict the result on test data, in Keras, it is easy to just call following code to get the result\n", - "\n", - " model.predict(x_test)\n", - "In analytics-zoo, the return of `predict` is RDD, so you need to call `collect` method to get the result:" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "prediction = model.predict(x_test)\n", - "result = prediction.collect()" - ] - } - ], - "metadata": { - "kernelspec": { - "display_name": "Python 3", - "language": "python", - "name": "python3" - }, - "language_info": { - "codemirror_mode": { - "name": "ipython", - "version": 3 - }, - "file_extension": ".py", - "mimetype": "text/x-python", - "name": "python", - "nbconvert_exporter": "python", - "pygments_lexer": "ipython3", - "version": "3.5.2" - } - }, - "nbformat": 4, - "nbformat_minor": 2 -} diff --git 
a/keras/3.6-multi-class-classification.ipynb b/keras/3.6-multi-class-classification.ipynb deleted file mode 100644 index 0655565..0000000 --- a/keras/3.6-multi-class-classification.ipynb +++ /dev/null @@ -1,157 +0,0 @@ -{ - "cells": [ - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "First of all, set environment variables and initialize spark context:" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "%env SPARK_DRIVER_MEMORY=8g\n", - "%env PYSPARK_PYTHON=/usr/bin/python3.5\n", - "%env PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n", - "\n", - "from zoo.common.nncontext import *\n", - "sc = init_nncontext(init_spark_conf().setMaster(\"local[4]\"))" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "# Multi-class classification\n", - "In this section, we will build a network to classify Reuters newswires into 46 different mutually-exclusive topics. Since we have many classes, this problem is an instance of \"multi-class classification\", and since each data point should be classified into only one category, the problem is more specifically an instance of \"single-label, multi-class classification\".\n", - "\n", - "We will be working with the Reuters dataset, a set of short newswires and their topics, published by Reuters in 1986. It's a very simple, widely used toy dataset for text classification. There are 46 different topics; some topics are more represented than others, but each topic has at least 10 examples in the training set.\n", - "\n", - "Like IMDB and MNIST, the Reuters dataset comes packaged as part of Keras. 
Let's take a look right away:" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "from keras.datasets import reuters\n", - "(train_data, train_labels), (test_data, test_labels) = reuters.load_data(nb_words=10000)" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Then we do some preprocessing like Chapter 3.5" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "word_index = reuters.get_word_index()\n", - "reverse_word_index = dict([(value, key) for (key, value) in word_index.items()])\n", - "\n", - "import numpy as np\n", - "def vectorize_sequences(sequences, dimension=10000):\n", - " results = np.zeros((len(sequences), dimension))\n", - " for i, sequence in enumerate(sequences):\n", - " results[i, sequence] = 1.\n", - " return results\n", - "\n", - "x_train = vectorize_sequences(train_data)\n", - "x_test = vectorize_sequences(test_data)\n", - "# this part pending to modify, one-hot or integer issue\n", - "\n", - "x_val = x_train[:1000]\n", - "partial_x_train = x_train[1000:]\n", - "\n", - "y_val = train_labels[:1000]\n", - "partial_y_train = train_labels[1000:] # this line would return list\n", - "partial_y_train = np.array(partial_y_train) # convert list to ndarray" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Build the model, compile, train and evaluate" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "model = models.Sequential()\n", - "model.add(layers.Dense(64, activation='relu', input_shape=(10000,)))\n", - "model.add(layers.Dense(64, activation='relu'))\n", - "model.add(layers.Dense(46, activation='softmax'))\n", - "\n", - "model.compile(optimizer='rmsprop',\n", - " loss='sparse_categorical_crossentropy',\n", - " metrics=['accuracy'])\n", - "\n", - "model.fit(partial_x_train,\n", - " partial_y_train,\n", - " 
nb_epoch=20,\n", - " batch_size=512,\n", - " validation_data=(x_val, y_val))" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Trained 512 records in 0.03322949 seconds. Throughput is 15408.001 records/second. Loss is 0.36856997.\n", - "Top1Accuracy is Accuracy(correct: 808, count: 1000, accuracy: 0.808)" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "test_results = model.evaluate(x_test, test_labels)\n", - "print (test_results[0].result)" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "test_acc: 0.7885128855705261" - ] - } - ], - "metadata": { - "kernelspec": { - "display_name": "Python 3", - "language": "python", - "name": "python3" - }, - "language_info": { - "codemirror_mode": { - "name": "ipython", - "version": 3 - }, - "file_extension": ".py", - "mimetype": "text/x-python", - "name": "python", - "nbconvert_exporter": "python", - "pygments_lexer": "ipython3", - "version": "3.5.2" - } - }, - "nbformat": 4, - "nbformat_minor": 2 -} diff --git a/keras/3.7-regression.ipynb b/keras/3.7-regression.ipynb deleted file mode 100644 index 769e343..0000000 --- a/keras/3.7-regression.ipynb +++ /dev/null @@ -1,232 +0,0 @@ -{ - "cells": [ - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "First of all, set environment variables and initialize spark context:" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "%env SPARK_DRIVER_MEMORY=8g\n", - "%env PYSPARK_PYTHON=/usr/bin/python3.5\n", - "%env PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n", - "\n", - "from zoo.common.nncontext import *\n", - "sc = init_nncontext(init_spark_conf().setMaster(\"local[4]\"))" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "# Regression\n", - "We will be attempting to predict the median price of homes in a given Boston suburb in the mid-1970s, given a few data points about the 
suburb at the time, such as the crime rate, the local property tax rate, etc.\n", - "\n", - "The dataset we will be using has another interesting difference from our two previous examples: it has very few data points, only 506 in total, split between 404 training samples and 102 test samples, and each \"feature\" in the input data (e.g. the crime rate is a feature) has a different scale. For instance some values are proportions, which take a values between 0 and 1, others take values between 1 and 12, others between 0 and 100..." - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "This dataset is packaged in Keras 2.0.8 but not in Keras 1.2.2, so that we need to use following code to get the data, then we also apply normalization on these data:" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "from keras.utils.data_utils import get_file\n", - "def load_data(path='boston_housing.npz', test_split=0.2, seed=113):\n", - " \"\"\"Loads the Boston Housing dataset.\n", - " # Arguments\n", - " path: path where to cache the dataset locally\n", - " (relative to ~/.zoo.pipeline.api.keras/datasets).\n", - " test_split: fraction of the data to reserve as test set.\n", - " seed: Random seed for shuffling the data\n", - " before computing the test split.\n", - " # Returns\n", - " Tuple of Numpy arrays: `(x_train, y_train), (x_test, y_test)`.\n", - " \"\"\"\n", - " assert 0 <= test_split < 1\n", - " path = get_file(\n", - " path,\n", - " origin='https://s3.amazonaws.com/zoo.pipeline.api.keras-datasets/boston_housing.npz'\n", - " )\n", - " with np.load(path) as f:\n", - " x = f['x']\n", - " y = f['y']\n", - "\n", - " np.random.seed(seed)\n", - " indices = np.arange(len(x))\n", - " np.random.shuffle(indices)\n", - " x = x[indices]\n", - " y = y[indices]\n", - "\n", - " x_train = np.array(x[:int(len(x) * (1 - test_split))])\n", - " y_train = np.array(y[:int(len(x) * (1 - test_split))])\n", - " 
x_test = np.array(x[int(len(x) * (1 - test_split)):])\n", - " y_test = np.array(y[int(len(x) * (1 - test_split)):])\n", - " return (x_train, y_train), (x_test, y_test)\n", - "\n", - "(train_data, train_targets), (test_data, test_targets) = load_data()\n", - "\n", - "mean = train_data.mean(axis=0)\n", - "train_data -= mean\n", - "std = train_data.std(axis=0)\n", - "train_data /= std\n", - "\n", - "test_data -= mean\n", - "test_data /= std" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "In this example we have so few data points, the validation set would end up being very small (e.g. about 100 examples). A consequence is that our validation scores may change a lot depending on which data points we choose to use for validation and which we choose for training, i.e. the validation scores may have a high variance with regard to the validation split. This would prevent us from reliably evaluating our model.\n", - "\n", - "The best practice in such situations is to use K-fold cross-validation. It consists of splitting the available data into K partitions (typically K=4 or 5), then instantiating K identical models, and training each one on K-1 partitions while evaluating on the remaining partition. 
The validation score for the model used would then be the average of the K validation scores obtained.\n", - "\n", - "Since we are using K-fold so that we have to build the model multiple times, we use following function to build our model:" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "from zoo.pipeline.api.keras import models\n", - "from zoo.pipeline.api.keras import layers\n", - "\n", - "def build_model():\n", - " # Because we will need to instantiate\n", - " # the same model multiple times,\n", - " # we use a function to construct it.\n", - " model = models.Sequential()\n", - " model.add(layers.Dense(64, activation='relu',\n", - " input_shape=(train_data.shape[1],)))\n", - " model.add(layers.Dense(64, activation='relu'))\n", - " model.add(layers.Dense(1))\n", - " model.compile(optimizer='rmsprop', loss='mse', metrics=['mae'])\n", - " return model" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Then let's start our training:" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "import numpy as np\n", - "\n", - "k = 4\n", - "num_val_samples = len(train_data) // k\n", - "num_nb_epoch = 50\n", - "all_scores = []\n", - "for i in range(k):\n", - " print('processing fold #', i)\n", - " # Prepare the validation data: data from partition # k\n", - " val_data = train_data[i * num_val_samples: (i + 1) * num_val_samples]\n", - " val_targets = train_targets[i * num_val_samples: (i + 1) * num_val_samples]\n", - "\n", - " # Prepare the training data: data from all other partitions\n", - " partial_train_data = np.concatenate(\n", - " [train_data[:i * num_val_samples],\n", - " train_data[(i + 1) * num_val_samples:]],\n", - " axis=0)\n", - " partial_train_targets = np.concatenate(\n", - " [train_targets[:i * num_val_samples],\n", - " train_targets[(i + 1) * num_val_samples:]],\n", - " axis=0)\n", - "\n", - " # Build the 
Keras model (already compiled)\n", - " model = build_model()\n", - " # Train the model (in silent mode, verbose=0)\n", - " #model.fit(partial_train_data, partial_train_targets,\n", - " # nb_epoch=num_nb_epoch, batch_size=1, verbose=0)\n", - " model.fit(partial_train_data, partial_train_targets,\n", - " nb_epoch=num_nb_epoch, batch_size=16)\n", - "\n", - " # Evaluate the model on the validation data\n", - " #val_mse, val_mae = model.evaluate(val_data, val_targets, verbose=0)\n", - " val_mae = model.evaluate(val_data, val_targets)\n", - " all_scores.append(val_mae[0].result)" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "processing fold # 0\n", - "Trained 16 records in 0.011235845 seconds. Throughput is 1424.0139 records/second. Loss is 8.708786.\n", - "processing fold # 1\n", - "Trained 16 records in 0.009535034 seconds. Throughput is 1678.0223 records/second. Loss is 5.3613434.\n", - "processing fold # 2\n", - "Trained 16 records in 0.008636178 seconds. Throughput is 1852.6713 records/second. Loss is 18.106756.\n", - "processing fold # 3\n", - "Trained 16 records in 0.009207628 seconds. Throughput is 1737.6897 records/second. Loss is 7.0931993." 
- ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Then we could check our K-fold training result:" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "print (all_scores)" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "[20.572654724121094, 19.606250762939453, 21.224998474121094, 22.60078239440918]" - ] - } - ], - "metadata": { - "kernelspec": { - "display_name": "Python 3", - "language": "python", - "name": "python3" - }, - "language_info": { - "codemirror_mode": { - "name": "ipython", - "version": 3 - }, - "file_extension": ".py", - "mimetype": "text/x-python", - "name": "python", - "nbconvert_exporter": "python", - "pygments_lexer": "ipython3", - "version": "3.5.2" - } - }, - "nbformat": 4, - "nbformat_minor": 2 -} diff --git a/keras/4.4-regularization-and-dropout.ipynb b/keras/4.4-regularization-and-dropout.ipynb deleted file mode 100644 index 5ed40ab..0000000 --- a/keras/4.4-regularization-and-dropout.ipynb +++ /dev/null @@ -1,195 +0,0 @@ -{ - "cells": [ - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "First of all, set environment variables and initialize spark context:" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "%env SPARK_DRIVER_MEMORY=8g\n", - "%env PYSPARK_PYTHON=/usr/bin/python3.5\n", - "%env PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n", - "\n", - "from zoo.common.nncontext import *\n", - "sc = init_nncontext(init_spark_conf().setMaster(\"local[4]\"))" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "# Regularization and dropout\n", - "Let's review some of the most common techniques to prevent overfitting, and let's apply them in practice to improve our movie classification model from the previous chapter. 
First let's prepare the data using the code from Chapter 3.5 and preprocess the data" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "from keras.datasets import imdb\n", - "import numpy as np\n", - "(train_data, train_labels), (test_data, test_labels) = imdb.load_data(nb_words=10000)\n", - "\n", - "def vectorize_sequences(sequences, dimension=10000):\n", - " # Create an all-zero matrix of shape (len(sequences), dimension)\n", - " results = np.zeros((len(sequences), dimension))\n", - " for i, sequence in enumerate(sequences):\n", - " results[i, sequence] = 1. # set specific indices of results[i] to 1s\n", - " return results\n", - "\n", - "x_train = vectorize_sequences(train_data)\n", - "x_test = vectorize_sequences(test_data)\n", - "\n", - "y_train = np.asarray(train_labels).astype('float32')\n", - "y_test = np.asarray(test_labels).astype('float32')" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Let's first try an easy model and try to optimize it afterwards:" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "from zoo.pipeline.api.keras import models\n", - "from zoo.pipeline.api.keras import layers\n", - "\n", - "original_model = models.Sequential()\n", - "original_model.add(layers.Dense(16, activation='relu', input_shape=(10000,)))\n", - "original_model.add(layers.Dense(16, activation='relu'))\n", - "original_model.add(layers.Dense(1, activation='sigmoid'))\n", - "\n", - "original_model.compile(optimizer='rmsprop',\n", - " loss='binary_crossentropy',\n", - " metrics=['acc'])\n", - "\n", - "original_model.fit(x_train, y_train,\n", - " epochs=20,\n", - " batch_size=512,\n", - " validation_data=(x_test, y_test))" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Trained 512 records in 0.024455326 seconds. Throughput is 20936.135 records/second. 
Loss is 0.01585226.\n", - "Top1Accuracy is Accuracy(correct: 21341, count: 25000, accuracy: 0.85364)" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Let's add L2 weight regularization to our movie review classification network:" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "from zoo.pipeline.api.keras import regularizers\n", - "\n", - "l2_model = models.Sequential()\n", - "l2_model.add(layers.Dense(16, W_regularizer=regularizers.l2(0.001),\n", - " activation='relu', input_shape=(10000,)))\n", - "l2_model.add(layers.Dense(16, W_regularizer=regularizers.l2(0.001),\n", - " activation='relu'))\n", - "l2_model.add(layers.Dense(1, activation='sigmoid'))\n", - "\n", - "l2_model.compile(optimizer='rmsprop',\n", - " loss='binary_crossentropy',\n", - " metrics=['acc'])\n", - "\n", - "l2_model.fit(x_train, y_train,\n", - " nb_epoch=20,\n", - " batch_size=512,\n", - " validation_data=(x_test, y_test))" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Trained 512 records in 0.024366594 seconds. Throughput is 21012.373 records/second. 
Loss is 0.13651785.\n", - "Top1Accuracy is Accuracy(correct: 21684, count: 25000, accuracy: 0.86736)" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Let's add two Dropout layers in our IMDB network to see how well they do at reducing overfitting:" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "dpt_model = models.Sequential()\n", - "dpt_model.add(layers.Dense(16, activation='relu', input_shape=(10000,)))\n", - "dpt_model.add(layers.Dropout(0.5))\n", - "dpt_model.add(layers.Dense(16, activation='relu'))\n", - "dpt_model.add(layers.Dropout(0.5))\n", - "dpt_model.add(layers.Dense(1, activation='sigmoid'))\n", - "\n", - "dpt_model.compile(optimizer='rmsprop',\n", - " loss='binary_crossentropy',\n", - " metrics=['acc'])\n", - "\n", - "dpt_model.fit(x_train, y_train,\n", - " nb_epoch=20,\n", - " batch_size=512,\n", - " validation_data=(x_test, y_test))" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Trained 512 records in 0.017992654 seconds. Throughput is 28456.057 records/second. Loss is 0.112769656. 
\n", - "Top1Accuracy is Accuracy(correct: 21871, count: 25000, accuracy: 0.87484)" - ] - } - ], - "metadata": { - "kernelspec": { - "display_name": "Python 3", - "language": "python", - "name": "python3" - }, - "language_info": { - "codemirror_mode": { - "name": "ipython", - "version": 3 - }, - "file_extension": ".py", - "mimetype": "text/x-python", - "name": "python", - "nbconvert_exporter": "python", - "pygments_lexer": "ipython3", - "version": "3.5.2" - } - }, - "nbformat": 4, - "nbformat_minor": 2 -} diff --git a/keras/5.1-cnn.ipynb b/keras/5.1-cnn.ipynb deleted file mode 100644 index 72786d2..0000000 --- a/keras/5.1-cnn.ipynb +++ /dev/null @@ -1,152 +0,0 @@ -{ - "cells": [ - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "First of all, set environment variables and initialize spark context:" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "%env SPARK_DRIVER_MEMORY=8g\n", - "%env PYSPARK_PYTHON=/usr/bin/python3.5\n", - "%env PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n", - "\n", - "from zoo.common.nncontext import *\n", - "sc = init_nncontext(init_spark_conf().setMaster(\"local[4]\"))" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "# CNN\n", - "First, let's take a practical look at a very simple convnet example. We will use our convnet to classify MNIST digits, a task that you've already been through in Chapter 2.\n", - "\n", - "First load the MNIST dataset:" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "from keras.datasets import mnist\n", - "(train_images, train_labels), (test_images, test_labels) = mnist.load_data()" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "#### CNN input shape\n", - "Once we get the dataset, we need to reshape the images. 
In keras the shape of the dataset is `(sample_size, height, width, channel)`, like the Keras code below:\n", - " \n", - " train_images = train_images.reshape((60000, 28, 28, 1))\n", - "In analytics-zoo, the default order is theano-style NCHW `(sample_size, channel, height, width)`, so you can process data like following:\n", - "\n", - "Alternatively, you can also use tensorflow-style NHWC as Keras default just by setting `Convolution2D(dim_ordering=\"tf\")`" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "train_images = train_images.reshape((60000, 1, 28, 28))\n", - "train_images = train_images.astype('float32') / 255\n", - "\n", - "test_images = test_images.reshape((10000, 1, 28, 28))\n", - "test_images = test_images.astype('float32') / 255" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Then build the model, compile, train and evaluate:" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "from zoo.pipeline.api.keras import layers\n", - "from zoo.pipeline.api.keras import models\n", - "\n", - "model = models.Sequential()\n", - "model.add(layers.Conv2D(32, nb_col=3, nb_row=3, activation='relu', input_shape=(1,28,28)))\n", - "model.add(layers.MaxPooling2D((2, 2)))\n", - "model.add(layers.Conv2D(64, nb_col=3, nb_row=3, activation='relu'))\n", - "model.add(layers.MaxPooling2D((2, 2)))\n", - "model.add(layers.Conv2D(64, nb_col=3, nb_row=3, activation='relu'))\n", - "\n", - "model.summary()\n", - "\n", - "model.add(layers.Flatten())\n", - "model.add(layers.Dense(64, activation='relu'))\n", - "model.add(layers.Dense(10, activation='softmax'))\n", - "\n", - "model.compile(optimizer='rmsprop',\n", - " loss='sparse_categorical_crossentropy',\n", - " metrics=['acc'])\n", - "\n", - "model.fit(train_images, train_labels, nb_epoch=5, batch_size=64)" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - 
"Trained 64 records in 0.03212866 seconds. Throughput is 1991.9911 records/second. Loss is 0.0023578003." - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "test_result = model.evaluate(test_images, test_labels)\n", - "print('test_acc:', test_result[0].result)" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "test_acc: 0.9915000200271606" - ] - } - ], - "metadata": { - "kernelspec": { - "display_name": "Python 3", - "language": "python", - "name": "python3" - }, - "language_info": { - "codemirror_mode": { - "name": "ipython", - "version": 3 - }, - "file_extension": ".py", - "mimetype": "text/x-python", - "name": "python", - "nbconvert_exporter": "python", - "pygments_lexer": "ipython3", - "version": "3.5.2" - } - }, - "nbformat": 4, - "nbformat_minor": 2 -} diff --git a/keras/6.2-rnn.ipynb b/keras/6.2-rnn.ipynb deleted file mode 100644 index ef1146f..0000000 --- a/keras/6.2-rnn.ipynb +++ /dev/null @@ -1,194 +0,0 @@ -{ - "cells": [ - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "First of all, set environment variables and initialize spark context:" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "%env SPARK_DRIVER_MEMORY=8g\n", - "%env PYSPARK_PYTHON=/usr/bin/python3.5\n", - "%env PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n", - "\n", - "from zoo.common.nncontext import *\n", - "sc = init_nncontext(init_spark_conf().setMaster(\"local[4]\"))" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "# RNN\n" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "from zoo.pipeline.api.keras.models import Sequential\n", - "from zoo.pipeline.api.keras.layers import Embedding, SimpleRNN" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Now let's try to use such a model on the IMDB movie review 
classification problem. First, let's preprocess the data:" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "from keras.datasets import imdb\n", - "from keras.preprocessing import sequence\n", - "\n", - "max_features = 10000 # number of words to consider as features\n", - "maxlen = 500 # cut texts after this number of words (among top max_features most common words)\n", - "batch_size = 32\n", - "\n", - "(input_train, y_train), (input_test, y_test) = imdb.load_data(nb_words=max_features)\n", - "input_train = sequence.pad_sequences(input_train, maxlen=maxlen)\n", - "input_test = sequence.pad_sequences(input_test, maxlen=maxlen)\n", - "print('input_train shape:', input_train.shape)\n", - "print('input_test shape:', input_test.shape)" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "input_train shape: (25000, 500)\n", - "input_test shape: (25000, 500)" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "#### Specify input shape\n", - "We could add an embedding layer as our first layer in Keras as following:\n", - " \n", - " model = Sequential()\n", - " model.add(Embedding(10000, 32))\n", - "In analytics-zoo, you need to specify the input shape of first layer, in this example, the sequence length is 500, so we could build our model as following:" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "model = Sequential()\n", - "model.add(Embedding(10000, 32, input_shape=(500,)))\n", - "model.add(SimpleRNN(32, return_sequences=True))\n", - "model.add(SimpleRNN(32, return_sequences=True))\n", - "model.add(SimpleRNN(32, return_sequences=True))\n", - "model.add(SimpleRNN(32)) # This last layer only returns the last outputs.\n", - "model.summary()" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Let's train a simple recurrent network using an `Embedding` layer and a `SimpleRNN` 
layer:" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "from zoo.pipeline.api.keras.layers import Dense\n", - "\n", - "model = Sequential()\n", - "model.add(Embedding(max_features, 32, input_shape=(500,)))\n", - "model.add(SimpleRNN(32))\n", - "model.add(Dense(1, activation='sigmoid'))\n", - "\n", - "model.compile(optimizer='rmsprop', loss='binary_crossentropy', metrics=['acc'])\n", - "model.fit(input_train, y_train,\n", - " nb_epoch=10,\n", - " batch_size=128)" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Trained 128 records in 0.046239497 seconds. Throughput is 2768.1963 records/second. Loss is 0.16970885." - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Now let's switch to more practical concerns: we will set up a model using a LSTM layer and train it on the IMDB data." - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "from zoo.pipeline.api.keras.layers import LSTM\n", - "\n", - "model = Sequential()\n", - "model.add(Embedding(max_features, 32, input_shape=(500,)))\n", - "model.add(LSTM(32))\n", - "model.add(Dense(1, activation='sigmoid'))\n", - "\n", - "model.compile(optimizer='rmsprop',\n", - " loss='binary_crossentropy',\n", - " metrics=['acc'])\n", - "\n", - "model.fit(input_train, y_train,\n", - " nb_epoch=10,\n", - " batch_size=128)" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Trained 128 records in 0.335889472 seconds. Throughput is 381.07776 records/second. Loss is 0.14791179." 
- ] - } - ], - "metadata": { - "kernelspec": { - "display_name": "Python 3", - "language": "python", - "name": "python3" - }, - "language_info": { - "codemirror_mode": { - "name": "ipython", - "version": 3 - }, - "file_extension": ".py", - "mimetype": "text/x-python", - "name": "python", - "nbconvert_exporter": "python", - "pygments_lexer": "ipython3", - "version": "3.5.2" - } - }, - "nbformat": 4, - "nbformat_minor": 2 -} From bf37983161a5a028552c6477003885099b426d95 Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Thu, 28 Feb 2019 17:11:13 +0800 Subject: [PATCH 11/46] Update README.md --- keras/README.md | 70 ------------------------------------------------- 1 file changed, 70 deletions(-) diff --git a/keras/README.md b/keras/README.md index 73d00d8..34c4df3 100644 --- a/keras/README.md +++ b/keras/README.md @@ -1,71 +1 @@ -# Tutorials for running distribued Keras (v1.2.2) on Analytics Zoo Tutorials for running _**distribued Keras (v1.2.2) on Analytics Zoo**_. These tutorials are ported from François Chollet's [Jupyter notebooks](https://github.com/fchollet/deep-learning-with-python-notebooks) for the book [Deep Learning with Python (Manning Publications)](https://www.manning.com/books/deep-learning-with-python?a_aid=keras&a_bid=76564dff) - -This repository is built to describe how to write Keras-style code and directly run it in analytics-zoo so that we could run the original Keras code in distributed mode via limited modification. The original notebook is based on Keras 2.0.8. Currently analytics-zoo builds its Keras-style code based on Keras 1.2.2. Thus, this repository contains the code of implementation of original Keras 2.0.8 code based on Keras 1.2.2, and how to modify the code in order to run it in analytics-zoo. - -To make it simple, we omit the description of plenty concepts here and you could find them in original notebook above [(link)](https://github.com/fchollet/deep-learning-with-python-notebooks). 
Besides, we directly post Keras 1.2.2 code here and the replacements needed from Keras 2.0.8 to Keras 1.2.2 are noted in `Keras_2-to-1.md`. - -The training log of each epoch is stored in INFO, you could see these information though configuring python output level, or store these information in a file and check afterwards, these may includes plenty lines of information. For better expression here, we run the code in Pycharm IDE and **we only paste the last info of last epoch of each training function in this notebook**. - -This repository use Python 3.5, Keras 1.2.2 (Keras code), Analytics-zoo 0.4.0 (zoo code). We post the summary of Keras-to-zoo code convertion as well as the table of contents. - -## Summary of Keras-to-zoo code convertion -We summarize the modification we need to make here, attached with the link to the example code chapter. - -#### First of all -Make sure you have analytics-zoo installed, see install guide [here](https://analytics-zoo.github.io/master/#PythonUserGuide/install/). Then set the environment variables like following - - PYSPARK_PYTHON=/path_to_your_python - PYSPARK_DRIVER_PYTHON=/path_to_your_python - # To avoid version conflict, in my ubuntu, this path is /usr/bin/python3.5 - - SPARK_DRIVER_MEMORY=4g - # If you encounter heap space exception, you could simply increase this variable to 8g, 16g, etc. - # If no more memory is available on your machine, you could consider reduce the data size. - -Then, make sure you have following code at the beginning of your zoo code. - - from zoo.common.nncontext import * - sc = init_nncontext(init_spark_conf().setMaster("local[4]")) - -we set core number to 4 above, you can also set it with another number. But we still recommend 4 because analytics-zoo need the core number could divide the batch size of learning, which is normal set as powers of 4, e.g. 16, 32, 128. Error would be raised if this requirement is not satisfied. 
- -#### Accuracy checkout -Currently in analytics-zoo, `fit` method does not have any return. Results can only be checked via tensorboard, see [Chapter 3.5]() - -#### Evaluate return -The return of `evaluate` method is an `EvaluationResult` object, which is different from Keras, see [Chapter 2.1]() - -#### Predict result -The return of `predict` method is RDD, so you need to call `collect` method to collect them, see [Chapter 3.5]() - -#### CNN input shape -In analytics-zoo, the default order is theano-style NCHW, you can also use tensorflow-style NHWC as Keras default just by setting `Convolution2D(dim_ordering="tf")`, see [Chapter 5.1]() - -#### Specify input shape -In analytics-zoo, you need to specify the input shape of first layer, see [Chapter 6.2]() - -#### Parameters not supported -Analytics-zoo does not support following parameters currently, so we do not use these code in this notebook if they exist in original notebook. - -* validation_split: To split the training data into two parts, another part is used for validation -* verbose: To control the information showed during training - -## Table of contents - -The main purpose of this repository is to decribe the convertion from Keras to analytics-zoo so that we rename the notebooks to make it more experienced-user oriented. We keep the chapter index so you could still make a reference to the original notebook [here](https://github.com/fchollet/deep-learning-with-python-notebooks#companion-jupyter-notebooks-for-the-book-deep-learning-with-python). We only keep some key description of the original notebook. 
- -* Chapter 2: - * [2.1: MNIST](https://github.com/Litchilitchy/zoo-tutorials/blob/master/keras/2.1-mnist.ipynb) -* Chapter 3: - * [3.5: Binary classification](https://github.com/Litchilitchy/zoo-tutorials/blob/master/keras/3.5-binary-classification.ipynb) - * [3.6: Multi-class classification](https://github.com/Litchilitchy/zoo-tutorials/blob/master/keras/3.6-multi-class-classification.ipynb) - * [3.7: Regression](https://github.com/Litchilitchy/zoo-tutorials/blob/master/keras/3.7-regression.ipynb) -* Chapter 4: - * [4.4: Regularization and Dropout](https://github.com/Litchilitchy/zoo-tutorials/blob/master/keras/4.4-regularization-and-dropout.ipynb) -* Chapter 5: - * [5.1: CNN](https://github.com/Litchilitchy/zoo-tutorials/blob/master/keras/5.1-cnn.ipynb) -* Chapter 6: - * [6.2: RNN](https://github.com/Litchilitchy/zoo-tutorials/blob/master/keras/6.2-rnn.ipynb) - - From d201267f4758803ccb70cbc0fce7ea24ffd8adf4 Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Thu, 28 Feb 2019 18:17:12 +0800 Subject: [PATCH 12/46] Add files via upload --- 2.1-mnist.ipynb | 239 ++++++++++++++++++++++++++++++++++++++++++++++++ 1 file changed, 239 insertions(+) create mode 100644 2.1-mnist.ipynb diff --git a/2.1-mnist.ipynb b/2.1-mnist.ipynb new file mode 100644 index 0000000..80dd3df --- /dev/null +++ b/2.1-mnist.ipynb @@ -0,0 +1,239 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "**First of all, set environment variables and initialize spark context:**" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "%env SPARK_DRIVER_MEMORY=8g\n", + "%env PYSPARK_PYTHON=/usr/bin/python3.5\n", + "%env PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n", + "\n", + "from zoo.common.nncontext import *\n", + "sc = init_nncontext(init_spark_conf().setMaster(\"local[4]\"))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "# A first look at a neural network\n", + "This notebook is 
imported from Chapter 2, Section 1 of [Deep Learning with Python Notebook]().\n", + "\n", + "We will now take a look at a first concrete example of a neural network, which makes use of analytics-zoo Keras module to learn to classify hand-written digits. Unless you already have experience with Keras or similar libraries, you will not understand everything about this first example right away. You probably haven't even installed analytics-zoo yet. Don't worry, that is perfectly fine. In the next chapter, we will review each element in our example and explain them in detail. So don't worry if some steps seem arbitrary or look like magic to you! We've got to start somewhere.\n", + "\n", + "The problem we are trying to solve here is to classify grayscale images of handwritten digits (28 pixels by 28 pixels), into their 10 categories (0 to 9). The dataset we will use is the MNIST dataset, a classic dataset in the machine learning community, which has been around for almost as long as the field itself and has been very intensively studied. It's a set of 60,000 training images, plus 10,000 test images, assembled by the National Institute of Standards and Technology (the NIST in MNIST) in the 1980s. You can think of \"solving\" MNIST as the \"Hello World\" of deep learning -- it's what you do to verify that your algorithms are working as expected. 
As you become a machine learning practitioner, you will see MNIST come up over and over again, in scientific papers, blog posts, and so on.\n", + "\n", + "The MNIST dataset comes pre-loaded in Keras, in the form of a set of four Numpy arrays:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "from keras.datasets import mnist\n", + "(train_images, train_labels), (test_images, test_labels) = mnist.load_data()" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "train_images and train_labels form the \"training set\", the data that the model will learn from. The model will then be tested on the \"test set\", test_images and test_labels. Our images are encoded as Numpy arrays, and the labels are simply an array of digits, ranging from 0 to 9. There is a one-to-one correspondence between the images and the labels." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Our workflow will be as follows: first we will present our neural network with the training data, train_images and train_labels. The network will then learn to associate images and labels. Finally, we will ask the network to produce predictions for test_images, and we will verify if these predictions match the labels from test_labels.\n", + "\n", + "Let's build our network -- again, remember that you aren't supposed to understand everything about this example just yet." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "#### Module import (for previous Keras user only)\n", + "Import the modules we need to build the network. 
In Keras it is:\n", + "\n", + " from keras import models\n", + " from keras import layers\n", + "Just replace it with following in order to use analytics-zoo:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "from zoo.pipeline.api.keras import models\n", + "from zoo.pipeline.api.keras import layers" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "network = models.Sequential()\n", + "network.add(layers.Dense(512, activation='relu', input_shape=(28 * 28,)))\n", + "network.add(layers.Dense(10, activation='softmax'))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "The core building block of neural networks is the \"layer\", a data-processing module which you can conceive as a \"filter\" for data. Some \n", + "data comes in, and comes out in a more useful form. Precisely, layers extract _representations_ out of the data fed into them -- hopefully \n", + "representations that are more meaningful for the problem at hand. Most of deep learning really consists of chaining together simple layers \n", + "which will implement a form of progressive \"data distillation\". A deep learning model is like a sieve for data processing, made of a \n", + "succession of increasingly refined data filters -- the \"layers\".\n", + "\n", + "Here our network consists of a sequence of two `Dense` layers, which are densely-connected (also called \"fully-connected\") neural layers. \n", + "The second (and last) layer is a 10-way \"softmax\" layer, which means it will return an array of 10 probability scores (summing to 1). 
Each \n", + "score will be the probability that the current digit image belongs to one of our 10 digit classes.\n", + "\n", + "To make our network ready for training, we need to pick three more things, as part of the \"compilation\" step:\n", + "\n", + "* A loss function: this is how the network will be able to measure how good a job it is doing on its training data, and thus how it will be \n", + "able to steer itself in the right direction.\n", + "* An optimizer: this is the mechanism through which the network will update itself based on the data it sees and its loss function.\n", + "* Metrics to monitor during training and testing. Here we will only care about accuracy (the fraction of the images that were correctly \n", + "classified).\n", + "\n", + "The exact purpose of the loss function and the optimizer will be made clear throughout the next two chapters." + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "network.compile(optimizer='rmsprop',\n", + " loss='sparse_categorical_crossentropy',\n", + " metrics=['accuracy'])" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Before training, we will preprocess our data by reshaping it into the shape that the network expects, and scaling it so that all values are in \n", + "the `[0, 1]` interval. Previously, our training images for instance were stored in an array of shape `(60000, 28, 28)` of type `uint8` with \n", + "values in the `[0, 255]` interval. We transform it into a `float32` array of shape `(60000, 28 * 28)` with values between 0 and 1." 
+ ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "train_images = train_images.reshape((60000, 28 * 28))\n", + "train_images = train_images.astype('float32') / 255\n", + "\n", + "test_images = test_images.reshape((10000, 28 * 28))\n", + "test_images = test_images.astype('float32') / 255" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "We are now ready to train our network, which in Keras is done via a call to the `fit` method of the network: \n", + "we \"fit\" the model to its training data." + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "network.fit(train_images, train_labels, nb_epoch=5, batch_size=128)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "INFO - Trained 128 records in 0.018066358 seconds. Throughput is 7084.992 records/second. Loss is 0.012087556." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "#### Evaluate return (for previous Keras user only)\n", + "Check our result on the test set. In Keras it is:\n", + "\n", + " test_loss, test_acc = network.evaluate(test_images, test_labels)\n", + "In analytics-zoo, the `evaluate` method returns an `EvaluationResult` object, which is different from Keras. We use the following code to check:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "test_result = network.evaluate(test_images, test_labels, batch_size=32)\n", + "print('test_acc:', test_result[0].result)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "test_acc: 0.9783999919891357" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "This concludes our very first example -- you just saw how we could build and train a neural network to classify handwritten digits, in \n", + "less than 20 lines of Python code. 
In the next chapter, we will go in detail over every moving piece we just previewed, and clarify what is really \n", + "going on behind the scenes. You will learn about \"tensors\", the data-storing objects going into the network, about tensor operations, which \n", + "layers are made of, and about gradient descent, which allows our network to learn from its training examples." + ] + } + ], + "metadata": { + "kernelspec": { + "display_name": "Python 3", + "language": "python", + "name": "python3" + }, + "language_info": { + "codemirror_mode": { + "name": "ipython", + "version": 3 + }, + "file_extension": ".py", + "mimetype": "text/x-python", + "name": "python", + "nbconvert_exporter": "python", + "pygments_lexer": "ipython3", + "version": "3.5.2" + } + }, + "nbformat": 4, + "nbformat_minor": 2 +} From 8c1f67e080002037d231456213a4e3cd16bfe01c Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Thu, 28 Feb 2019 18:17:55 +0800 Subject: [PATCH 13/46] Rename 2.1-mnist.ipynb to 2.1-a-first-look-at-a-neural-network.ipynb --- 2.1-mnist.ipynb => 2.1-a-first-look-at-a-neural-network.ipynb | 0 1 file changed, 0 insertions(+), 0 deletions(-) rename 2.1-mnist.ipynb => 2.1-a-first-look-at-a-neural-network.ipynb (100%) diff --git a/2.1-mnist.ipynb b/2.1-a-first-look-at-a-neural-network.ipynb similarity index 100% rename from 2.1-mnist.ipynb rename to 2.1-a-first-look-at-a-neural-network.ipynb From bf9ca63189d0dbcaa9a7f8491bfdad1066c5a243 Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Fri, 1 Mar 2019 15:12:00 +0800 Subject: [PATCH 14/46] Add files via upload --- keras/2.1-mnist.ipynb | 241 ++++++++++++++++++++++++++++++++++++++++++ 1 file changed, 241 insertions(+) create mode 100644 keras/2.1-mnist.ipynb diff --git a/keras/2.1-mnist.ipynb b/keras/2.1-mnist.ipynb new file mode 100644 index 0000000..073b04a --- /dev/null +++ b/keras/2.1-mnist.ipynb @@ -0,0 +1,241 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "**First of all, set environment 
variables and initialize spark context:**" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "%env SPARK_DRIVER_MEMORY=8g\n", + "%env PYSPARK_PYTHON=/usr/bin/python3.5\n", + "%env PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n", + "\n", + "from zoo.common.nncontext import *\n", + "sc = init_nncontext(init_spark_conf().setMaster(\"local[4]\"))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "# A first look at a neural network\n", + "This notebook is imported from Chapter 2, Section 1 of [Deep Learning with Python Notebook]().\n", + "\n", + "----\n", + "\n", + "We will now take a look at a first concrete example of a neural network, which makes use of analytics-zoo Keras module to learn to classify hand-written digits. Unless you already have experience with Keras or similar libraries, you will not understand everything about this first example right away. You probably haven't even installed analytics-zoo yet. Don't worry, that is perfectly fine. In the next chapter, we will review each element in our example and explain them in detail. So don't worry if some steps seem arbitrary or look like magic to you! We've got to start somewhere.\n", + "\n", + "The problem we are trying to solve here is to classify grayscale images of handwritten digits (28 pixels by 28 pixels), into their 10 categories (0 to 9). The dataset we will use is the MNIST dataset, a classic dataset in the machine learning community, which has been around for almost as long as the field itself and has been very intensively studied. It's a set of 60,000 training images, plus 10,000 test images, assembled by the National Institute of Standards and Technology (the NIST in MNIST) in the 1980s. You can think of \"solving\" MNIST as the \"Hello World\" of deep learning -- it's what you do to verify that your algorithms are working as expected. 
As you become a machine learning practitioner, you will see MNIST come up over and over again, in scientific papers, blog posts, and so on.\n", + "\n", + "The MNIST dataset comes pre-loaded in analytics-zoo Keras module, in the form of a set of four Numpy arrays:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "from zoo.pipeline.api.keras.datasets import mnist\n", + "(train_images, train_labels), (test_images, test_labels) = mnist.load_data()" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "train_images and train_labels form the \"training set\", the data that the model will learn from. The model will then be tested on the \"test set\", test_images and test_labels. Our images are encoded as Numpy arrays, and the labels are simply an array of digits, ranging from 0 to 9. There is a one-to-one correspondence between the images and the labels." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Our workflow will be as follows: first we will present our neural network with the training data, train_images and train_labels. The network will then learn to associate images and labels. Finally, we will ask the network to produce predictions for test_images, and we will verify if these predictions match the labels from test_labels.\n", + "\n", + "Let's build our network -- again, remember that you aren't supposed to understand everything about this example just yet." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "#### Module import (for previous Keras user)\n", + "Import the modules we need to build the network. 
In Keras it is:\n", + "\n", + " from keras import models\n", + " from keras import layers\n", + "Just replace it with following in order to use analytics-zoo:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "from zoo.pipeline.api.keras import models\n", + "from zoo.pipeline.api.keras import layers" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "network = models.Sequential()\n", + "network.add(layers.Dense(512, activation='relu', input_shape=(28 * 28,)))\n", + "network.add(layers.Dense(10, activation='softmax'))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "The core building block of neural networks is the \"layer\", a data-processing module which you can conceive as a \"filter\" for data. Some \n", + "data comes in, and comes out in a more useful form. Precisely, layers extract _representations_ out of the data fed into them -- hopefully \n", + "representations that are more meaningful for the problem at hand. Most of deep learning really consists of chaining together simple layers \n", + "which will implement a form of progressive \"data distillation\". A deep learning model is like a sieve for data processing, made of a \n", + "succession of increasingly refined data filters -- the \"layers\".\n", + "\n", + "Here our network consists of a sequence of two `Dense` layers, which are densely-connected (also called \"fully-connected\") neural layers. \n", + "The second (and last) layer is a 10-way \"softmax\" layer, which means it will return an array of 10 probability scores (summing to 1). 
Each \n", + "score will be the probability that the current digit image belongs to one of our 10 digit classes.\n", + "\n", + "To make our network ready for training, we need to pick three more things, as part of the \"compilation\" step:\n", + "\n", + "* A loss function: this is how the network will be able to measure how good a job it is doing on its training data, and thus how it will be \n", + "able to steer itself in the right direction.\n", + "* An optimizer: this is the mechanism through which the network will update itself based on the data it sees and its loss function.\n", + "* Metrics to monitor during training and testing. Here we will only care about accuracy (the fraction of the images that were correctly \n", + "classified).\n", + "\n", + "The exact purpose of the loss function and the optimizer will be made clear throughout the next two chapters." + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "network.compile(optimizer='rmsprop',\n", + " loss='sparse_categorical_crossentropy',\n", + " metrics=['accuracy'])" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Before training, we will preprocess our data by reshaping it into the shape that the network expects, and scaling it so that all values are in \n", + "the `[0, 1]` interval. Previously, our training images for instance were stored in an array of shape `(60000, 28, 28)` of type `uint8` with \n", + "values in the `[0, 255]` interval. We transform it into a `float32` array of shape `(60000, 28 * 28)` with values between 0 and 1." 
+ ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "train_images = train_images.reshape((60000, 28 * 28))\n", + "train_images = train_images.astype('float32') / 255\n", + "\n", + "test_images = test_images.reshape((10000, 28 * 28))\n", + "test_images = test_images.astype('float32') / 255" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "We are now ready to train our network, which in the analytics-zoo Keras module is done via a call to the `fit` method of the network: \n", + "we \"fit\" the model to its training data." + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "network.fit(train_images, train_labels, nb_epoch=5, batch_size=128)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "INFO - Trained 128 records in 0.018066358 seconds. Throughput is 7084.992 records/second. Loss is 0.012087556." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "#### Evaluate return (for previous Keras users)\n", + "Check our result on the test set. In Keras it is:\n", + "\n", + " test_loss, test_acc = network.evaluate(test_images, test_labels)\n", + "In analytics-zoo, the return value of the `evaluate` method is an `EvaluationResult` object, which is different from Keras. We use the following code to check:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "test_result = network.evaluate(test_images, test_labels, batch_size=32)\n", + "print('test_acc:', test_result[0].result)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "test_acc: 0.9783999919891357" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "This concludes our very first example -- you just saw how we could build and train a neural network to classify handwritten digits, in \n", + "less than 20 lines of Python code. 
In the next chapter, we will go in detail over every moving piece we just previewed, and clarify what is really \n", + "going on behind the scenes. You will learn about \"tensors\", the data-storing objects going into the network, about tensor operations, which \n", + "layers are made of, and about gradient descent, which allows our network to learn from its training examples." + ] + } + ], + "metadata": { + "kernelspec": { + "display_name": "Python 3", + "language": "python", + "name": "python3" + }, + "language_info": { + "codemirror_mode": { + "name": "ipython", + "version": 3 + }, + "file_extension": ".py", + "mimetype": "text/x-python", + "name": "python", + "nbconvert_exporter": "python", + "pygments_lexer": "ipython3", + "version": "3.5.2" + } + }, + "nbformat": 4, + "nbformat_minor": 2 +} From fa23ec80ca133abafd61c32dd39ba2e28838c8f5 Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Fri, 1 Mar 2019 15:12:13 +0800 Subject: [PATCH 15/46] Delete 2.1-a-first-look-at-a-neural-network.ipynb --- 2.1-a-first-look-at-a-neural-network.ipynb | 239 --------------------- 1 file changed, 239 deletions(-) delete mode 100644 2.1-a-first-look-at-a-neural-network.ipynb diff --git a/2.1-a-first-look-at-a-neural-network.ipynb b/2.1-a-first-look-at-a-neural-network.ipynb deleted file mode 100644 index 80dd3df..0000000 --- a/2.1-a-first-look-at-a-neural-network.ipynb +++ /dev/null @@ -1,239 +0,0 @@ -{ - "cells": [ - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "**First of all, set environment variables and initialize spark context:**" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "%env SPARK_DRIVER_MEMORY=8g\n", - "%env PYSPARK_PYTHON=/usr/bin/python3.5\n", - "%env PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n", - "\n", - "from zoo.common.nncontext import *\n", - "sc = init_nncontext(init_spark_conf().setMaster(\"local[4]\"))" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ 
- "# A first look at a neural network\n", - "This notebook is imported from Chapter 2, Section 1 of [Deep Learning with Python Notebook]().\n", - "\n", - "We will now take a look at a first concrete example of a neural network, which makes use of analytics-zoo Keras module to learn to classify hand-written digits. Unless you already have experience with Keras or similar libraries, you will not understand everything about this first example right away. You probably haven't even installed analytics-zoo yet. Don't worry, that is perfectly fine. In the next chapter, we will review each element in our example and explain them in detail. So don't worry if some steps seem arbitrary or look like magic to you! We've got to start somewhere.\n", - "\n", - "The problem we are trying to solve here is to classify grayscale images of handwritten digits (28 pixels by 28 pixels), into their 10 categories (0 to 9). The dataset we will use is the MNIST dataset, a classic dataset in the machine learning community, which has been around for almost as long as the field itself and has been very intensively studied. It's a set of 60,000 training images, plus 10,000 test images, assembled by the National Institute of Standards and Technology (the NIST in MNIST) in the 1980s. You can think of \"solving\" MNIST as the \"Hello World\" of deep learning -- it's what you do to verify that your algorithms are working as expected. 
As you become a machine learning practitioner, you will see MNIST come up over and over again, in scientific papers, blog posts, and so on.\n", - "\n", - "The MNIST dataset comes pre-loaded in Keras, in the form of a set of four Numpy arrays:" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "from keras.datasets import mnist\n", - "(train_images, train_labels), (test_images, test_labels) = mnist.load_data()" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "train_images and train_labels form the \"training set\", the data that the model will learn from. The model will then be tested on the \"test set\", test_images and test_labels. Our images are encoded as Numpy arrays, and the labels are simply an array of digits, ranging from 0 to 9. There is a one-to-one correspondence between the images and the labels." - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Our workflow will be as follow: first we will present our neural network with the training data, train_images and train_labels. The network will then learn to associate images and labels. Finally, we will ask the network to produce predictions for test_images, and we will verify if these predictions match the labels from test_labels.\n", - "\n", - "Let's build our network -- again, remember that you aren't supposed to understand everything about this example just yet." - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "#### Module import (for previous Keras user only)\n", - "Import the modules we need to build the network. 
In Keras it is:\n", - "\n", - " from keras import models\n", - " from keras import layers\n", - "Just replace it with following in order to use analytics-zoo:" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "from zoo.pipeline.api.keras import models\n", - "from zoo.pipeline.api.keras import layers" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "network = models.Sequential()\n", - "network.add(layers.Dense(512, activation='relu', input_shape=(28 * 28,)))\n", - "network.add(layers.Dense(10, activation='softmax'))" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "The core building block of neural networks is the \"layer\", a data-processing module which you can conceive as a \"filter\" for data. Some \n", - "data comes in, and comes out in a more useful form. Precisely, layers extract _representations_ out of the data fed into them -- hopefully \n", - "representations that are more meaningful for the problem at hand. Most of deep learning really consists of chaining together simple layers \n", - "which will implement a form of progressive \"data distillation\". A deep learning model is like a sieve for data processing, made of a \n", - "succession of increasingly refined data filters -- the \"layers\".\n", - "\n", - "Here our network consists of a sequence of two `Dense` layers, which are densely-connected (also called \"fully-connected\") neural layers. \n", - "The second (and last) layer is a 10-way \"softmax\" layer, which means it will return an array of 10 probability scores (summing to 1). 
Each \n", - "score will be the probability that the current digit image belongs to one of our 10 digit classes.\n", - "\n", - "To make our network ready for training, we need to pick three more things, as part of \"compilation\" step:\n", - "\n", - "* A loss function: the is how the network will be able to measure how good a job it is doing on its training data, and thus how it will be \n", - "able to steer itself in the right direction.\n", - "* An optimizer: this is the mechanism through which the network will update itself based on the data it sees and its loss function.\n", - "* Metrics to monitor during training and testing. Here we will only care about accuracy (the fraction of the images that were correctly \n", - "classified).\n", - "\n", - "The exact purpose of the loss function and the optimizer will be made clear throughout the next two chapters." - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "network.compile(optimizer='rmsprop',\n", - " loss='sparse_categorical_crossentropy',\n", - " metrics=['accuracy'])" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Before training, we will preprocess our data by reshaping it into the shape that the network expects, and scaling it so that all values are in \n", - "the `[0, 1]` interval. Previously, our training images for instance were stored in an array of shape `(60000, 28, 28)` of type `uint8` with \n", - "values in the `[0, 255]` interval. We transform it into a `float32` array of shape `(60000, 28 * 28)` with values between 0 and 1." 
- ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "train_images = train_images.reshape((60000, 28 * 28))\n", - "train_images = train_images.astype('float32') / 255\n", - "\n", - "test_images = test_images.reshape((10000, 28 * 28))\n", - "test_images = test_images.astype('float32') / 255" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "We are now ready to train our network, which in Keras is done via a call to the `fit` method of the network: \n", - "we \"fit\" the model to its training data." - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "network.fit(train_images, train_labels, nb_epoch=5, batch_size=128)" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "INFO - Trained 128 records in 0.018066358 seconds. Throughput is 7084.992 records/second. Loss is 0.012087556." - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "#### Evaluate return (for previous Keras user only)\n", - "Check our result on test set. In Keras it is:\n", - "\n", - " test_loss, test_acc = network.evaluate(test_images, test_labels)\n", - "In analytics-zoo, the return of `evaluate` method is an `EvaluationResult` object, which is different from Keras. We use following code to check:" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "test_result = network.evaluate(test_images, test_labels, batch_size=32)\n", - "print('test_acc:', test_result[0].result)" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "test_acc: 0.9783999919891357" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "This concludes our very first example -- you just saw how we could build and a train a neural network to classify handwritten digits, in \n", - "less than 20 lines of Python code. 
In the next chapter, we will go in detail over every moving piece we just previewed, and clarify what is really \n", - "going on behind the scenes. You will learn about \"tensors\", the data-storing objects going into the network, about tensor operations, which \n", - "layers are made of, and about gradient descent, which allows our network to learn from its training examples." - ] - } - ], - "metadata": { - "kernelspec": { - "display_name": "Python 3", - "language": "python", - "name": "python3" - }, - "language_info": { - "codemirror_mode": { - "name": "ipython", - "version": 3 - }, - "file_extension": ".py", - "mimetype": "text/x-python", - "name": "python", - "nbconvert_exporter": "python", - "pygments_lexer": "ipython3", - "version": "3.5.2" - } - }, - "nbformat": 4, - "nbformat_minor": 2 -} From b19c4427fe5d743f5493856b51ec01d8b6a2f51b Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Fri, 1 Mar 2019 15:12:54 +0800 Subject: [PATCH 16/46] Add files via upload --- ...2.1-a-first-look-at-a-neural-network.ipynb | 241 ++++++++++++++++++ 1 file changed, 241 insertions(+) create mode 100644 keras/2.1-a-first-look-at-a-neural-network.ipynb diff --git a/keras/2.1-a-first-look-at-a-neural-network.ipynb b/keras/2.1-a-first-look-at-a-neural-network.ipynb new file mode 100644 index 0000000..073b04a --- /dev/null +++ b/keras/2.1-a-first-look-at-a-neural-network.ipynb @@ -0,0 +1,241 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "**First of all, set environment variables and initialize spark context:**" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "%env SPARK_DRIVER_MEMORY=8g\n", + "%env PYSPARK_PYTHON=/usr/bin/python3.5\n", + "%env PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n", + "\n", + "from zoo.common.nncontext import *\n", + "sc = init_nncontext(init_spark_conf().setMaster(\"local[4]\"))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "# A 
first look at a neural network\n", + "This notebook is imported from Chapter 2, Section 1 of [Deep Learning with Python Notebook]().\n", + "\n", + "----\n", + "\n", + "We will now take a look at a first concrete example of a neural network, which makes use of analytics-zoo Keras module to learn to classify hand-written digits. Unless you already have experience with Keras or similar libraries, you will not understand everything about this first example right away. You probably haven't even installed analytics-zoo yet. Don't worry, that is perfectly fine. In the next chapter, we will review each element in our example and explain them in detail. So don't worry if some steps seem arbitrary or look like magic to you! We've got to start somewhere.\n", + "\n", + "The problem we are trying to solve here is to classify grayscale images of handwritten digits (28 pixels by 28 pixels), into their 10 categories (0 to 9). The dataset we will use is the MNIST dataset, a classic dataset in the machine learning community, which has been around for almost as long as the field itself and has been very intensively studied. It's a set of 60,000 training images, plus 10,000 test images, assembled by the National Institute of Standards and Technology (the NIST in MNIST) in the 1980s. You can think of \"solving\" MNIST as the \"Hello World\" of deep learning -- it's what you do to verify that your algorithms are working as expected. 
As you become a machine learning practitioner, you will see MNIST come up over and over again, in scientific papers, blog posts, and so on.\n", + "\n", + "The MNIST dataset comes pre-loaded in the analytics-zoo Keras module, in the form of a set of four Numpy arrays:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "from zoo.pipeline.api.keras.datasets import mnist\n", + "(train_images, train_labels), (test_images, test_labels) = mnist.load_data()" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "train_images and train_labels form the \"training set\", the data that the model will learn from. The model will then be tested on the \"test set\", test_images and test_labels. Our images are encoded as Numpy arrays, and the labels are simply an array of digits, ranging from 0 to 9. There is a one-to-one correspondence between the images and the labels." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Our workflow will be as follows: first we will present our neural network with the training data, train_images and train_labels. The network will then learn to associate images and labels. Finally, we will ask the network to produce predictions for test_images, and we will verify if these predictions match the labels from test_labels.\n", + "\n", + "Let's build our network -- again, remember that you aren't supposed to understand everything about this example just yet." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "#### Module import (for previous Keras users)\n", + "Import the modules we need to build the network. 
In Keras it is:\n", + "\n", + " from keras import models\n", + " from keras import layers\n", + "Just replace it with the following in order to use analytics-zoo:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "from zoo.pipeline.api.keras import models\n", + "from zoo.pipeline.api.keras import layers" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "network = models.Sequential()\n", + "network.add(layers.Dense(512, activation='relu', input_shape=(28 * 28,)))\n", + "network.add(layers.Dense(10, activation='softmax'))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "The core building block of neural networks is the \"layer\", a data-processing module which you can think of as a \"filter\" for data. Some \n", + "data comes in, and comes out in a more useful form. Precisely, layers extract _representations_ out of the data fed into them -- hopefully \n", + "representations that are more meaningful for the problem at hand. Most of deep learning really consists of chaining together simple layers \n", + "which will implement a form of progressive \"data distillation\". A deep learning model is like a sieve for data processing, made of a \n", + "succession of increasingly refined data filters -- the \"layers\".\n", + "\n", + "Here our network consists of a sequence of two `Dense` layers, which are densely-connected (also called \"fully-connected\") neural layers. \n", + "The second (and last) layer is a 10-way \"softmax\" layer, which means it will return an array of 10 probability scores (summing to 1). 
Each \n", + "score will be the probability that the current digit image belongs to one of our 10 digit classes.\n", + "\n", + "To make our network ready for training, we need to pick three more things, as part of the \"compilation\" step:\n", + "\n", + "* A loss function: this is how the network will be able to measure how good a job it is doing on its training data, and thus how it will be \n", + "able to steer itself in the right direction.\n", + "* An optimizer: this is the mechanism through which the network will update itself based on the data it sees and its loss function.\n", + "* Metrics to monitor during training and testing. Here we will only care about accuracy (the fraction of the images that were correctly \n", + "classified).\n", + "\n", + "The exact purpose of the loss function and the optimizer will be made clear throughout the next two chapters." + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "network.compile(optimizer='rmsprop',\n", + " loss='sparse_categorical_crossentropy',\n", + " metrics=['accuracy'])" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Before training, we will preprocess our data by reshaping it into the shape that the network expects, and scaling it so that all values are in \n", + "the `[0, 1]` interval. Previously, our training images for instance were stored in an array of shape `(60000, 28, 28)` of type `uint8` with \n", + "values in the `[0, 255]` interval. We transform it into a `float32` array of shape `(60000, 28 * 28)` with values between 0 and 1." 
+ ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "train_images = train_images.reshape((60000, 28 * 28))\n", + "train_images = train_images.astype('float32') / 255\n", + "\n", + "test_images = test_images.reshape((10000, 28 * 28))\n", + "test_images = test_images.astype('float32') / 255" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "We are now ready to train our network, which in the analytics-zoo Keras module is done via a call to the `fit` method of the network: \n", + "we \"fit\" the model to its training data." + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "network.fit(train_images, train_labels, nb_epoch=5, batch_size=128)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "INFO - Trained 128 records in 0.018066358 seconds. Throughput is 7084.992 records/second. Loss is 0.012087556." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "#### Evaluate return (for previous Keras users)\n", + "Check our result on the test set. In Keras it is:\n", + "\n", + " test_loss, test_acc = network.evaluate(test_images, test_labels)\n", + "In analytics-zoo, the return value of the `evaluate` method is an `EvaluationResult` object, which is different from Keras. We use the following code to check:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "test_result = network.evaluate(test_images, test_labels, batch_size=32)\n", + "print('test_acc:', test_result[0].result)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "test_acc: 0.9783999919891357" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "This concludes our very first example -- you just saw how we could build and train a neural network to classify handwritten digits, in \n", + "less than 20 lines of Python code. 
In the next chapter, we will go in detail over every moving piece we just previewed, and clarify what is really \n", + "going on behind the scenes. You will learn about \"tensors\", the data-storing objects going into the network, about tensor operations, which \n", + "layers are made of, and about gradient descent, which allows our network to learn from its training examples." + ] + } + ], + "metadata": { + "kernelspec": { + "display_name": "Python 3", + "language": "python", + "name": "python3" + }, + "language_info": { + "codemirror_mode": { + "name": "ipython", + "version": 3 + }, + "file_extension": ".py", + "mimetype": "text/x-python", + "name": "python", + "nbconvert_exporter": "python", + "pygments_lexer": "ipython3", + "version": "3.5.2" + } + }, + "nbformat": 4, + "nbformat_minor": 2 +} From 5022ebf2c2cc2e1b7079fb09533d4b15d2622dc3 Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Fri, 1 Mar 2019 15:13:09 +0800 Subject: [PATCH 17/46] Delete 2.1-mnist.ipynb --- keras/2.1-mnist.ipynb | 241 ------------------------------------------ 1 file changed, 241 deletions(-) delete mode 100644 keras/2.1-mnist.ipynb diff --git a/keras/2.1-mnist.ipynb b/keras/2.1-mnist.ipynb deleted file mode 100644 index 073b04a..0000000 --- a/keras/2.1-mnist.ipynb +++ /dev/null @@ -1,241 +0,0 @@ -{ - "cells": [ - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "**First of all, set environment variables and initialize spark context:**" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "%env SPARK_DRIVER_MEMORY=8g\n", - "%env PYSPARK_PYTHON=/usr/bin/python3.5\n", - "%env PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n", - "\n", - "from zoo.common.nncontext import *\n", - "sc = init_nncontext(init_spark_conf().setMaster(\"local[4]\"))" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "# A first look at a neural network\n", - "This notebook is imported from Chapter 2, Section 1 of [Deep 
Learning with Python Notebook]().\n", - "\n", - "----\n", - "\n", - "We will now take a look at a first concrete example of a neural network, which makes use of analytics-zoo Keras module to learn to classify hand-written digits. Unless you already have experience with Keras or similar libraries, you will not understand everything about this first example right away. You probably haven't even installed analytics-zoo yet. Don't worry, that is perfectly fine. In the next chapter, we will review each element in our example and explain them in detail. So don't worry if some steps seem arbitrary or look like magic to you! We've got to start somewhere.\n", - "\n", - "The problem we are trying to solve here is to classify grayscale images of handwritten digits (28 pixels by 28 pixels), into their 10 categories (0 to 9). The dataset we will use is the MNIST dataset, a classic dataset in the machine learning community, which has been around for almost as long as the field itself and has been very intensively studied. It's a set of 60,000 training images, plus 10,000 test images, assembled by the National Institute of Standards and Technology (the NIST in MNIST) in the 1980s. You can think of \"solving\" MNIST as the \"Hello World\" of deep learning -- it's what you do to verify that your algorithms are working as expected. 
As you become a machine learning practitioner, you will see MNIST come up over and over again, in scientific papers, blog posts, and so on.\n", - "\n", - "The MNIST dataset comes pre-loaded in analytics-zoo Keras module, in the form of a set of four Numpy arrays:" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "from zoo.pipeline.api.keras.datasets import mnist\n", - "(train_images, train_labels), (test_images, test_labels) = mnist.load_data()" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "train_images and train_labels form the \"training set\", the data that the model will learn from. The model will then be tested on the \"test set\", test_images and test_labels. Our images are encoded as Numpy arrays, and the labels are simply an array of digits, ranging from 0 to 9. There is a one-to-one correspondence between the images and the labels." - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Our workflow will be as follow: first we will present our neural network with the training data, train_images and train_labels. The network will then learn to associate images and labels. Finally, we will ask the network to produce predictions for test_images, and we will verify if these predictions match the labels from test_labels.\n", - "\n", - "Let's build our network -- again, remember that you aren't supposed to understand everything about this example just yet." - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "#### Module import (for previous Keras user)\n", - "Import the modules we need to build the network. 
In Keras it is:\n", - "\n", - " from keras import models\n", - " from keras import layers\n", - "Just replace it with following in order to use analytics-zoo:" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "from zoo.pipeline.api.keras import models\n", - "from zoo.pipeline.api.keras import layers" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "network = models.Sequential()\n", - "network.add(layers.Dense(512, activation='relu', input_shape=(28 * 28,)))\n", - "network.add(layers.Dense(10, activation='softmax'))" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "The core building block of neural networks is the \"layer\", a data-processing module which you can conceive as a \"filter\" for data. Some \n", - "data comes in, and comes out in a more useful form. Precisely, layers extract _representations_ out of the data fed into them -- hopefully \n", - "representations that are more meaningful for the problem at hand. Most of deep learning really consists of chaining together simple layers \n", - "which will implement a form of progressive \"data distillation\". A deep learning model is like a sieve for data processing, made of a \n", - "succession of increasingly refined data filters -- the \"layers\".\n", - "\n", - "Here our network consists of a sequence of two `Dense` layers, which are densely-connected (also called \"fully-connected\") neural layers. \n", - "The second (and last) layer is a 10-way \"softmax\" layer, which means it will return an array of 10 probability scores (summing to 1). 
Each \n", - "score will be the probability that the current digit image belongs to one of our 10 digit classes.\n", - "\n", - "To make our network ready for training, we need to pick three more things, as part of \"compilation\" step:\n", - "\n", - "* A loss function: the is how the network will be able to measure how good a job it is doing on its training data, and thus how it will be \n", - "able to steer itself in the right direction.\n", - "* An optimizer: this is the mechanism through which the network will update itself based on the data it sees and its loss function.\n", - "* Metrics to monitor during training and testing. Here we will only care about accuracy (the fraction of the images that were correctly \n", - "classified).\n", - "\n", - "The exact purpose of the loss function and the optimizer will be made clear throughout the next two chapters." - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "network.compile(optimizer='rmsprop',\n", - " loss='sparse_categorical_crossentropy',\n", - " metrics=['accuracy'])" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Before training, we will preprocess our data by reshaping it into the shape that the network expects, and scaling it so that all values are in \n", - "the `[0, 1]` interval. Previously, our training images for instance were stored in an array of shape `(60000, 28, 28)` of type `uint8` with \n", - "values in the `[0, 255]` interval. We transform it into a `float32` array of shape `(60000, 28 * 28)` with values between 0 and 1." 
- ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "train_images = train_images.reshape((60000, 28 * 28))\n", - "train_images = train_images.astype('float32') / 255\n", - "\n", - "test_images = test_images.reshape((10000, 28 * 28))\n", - "test_images = test_images.astype('float32') / 255" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "We are now ready to train our network, which in analytics-zoo Keras module is done via a call to the `fit` method of the network: \n", - "we \"fit\" the model to its training data." - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "network.fit(train_images, train_labels, nb_epoch=5, batch_size=128)" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "INFO - Trained 128 records in 0.018066358 seconds. Throughput is 7084.992 records/second. Loss is 0.012087556." - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "#### Evaluate return (for previous Keras user)\n", - "Check our result on test set. In Keras it is:\n", - "\n", - " test_loss, test_acc = network.evaluate(test_images, test_labels)\n", - "In analytics-zoo, the return of `evaluate` method is an `EvaluationResult` object, which is different from Keras. We use following code to check:" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [ - "test_result = network.evaluate(test_images, test_labels, batch_size=32)\n", - "print('test_acc:', test_result[0].result)" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "test_acc: 0.9783999919891357" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "This concludes our very first example -- you just saw how we could build and a train a neural network to classify handwritten digits, in \n", - "less than 20 lines of Python code. 
In the next chapter, we will go in detail over every moving piece we just previewed, and clarify what is really \n", - "going on behind the scenes. You will learn about \"tensors\", the data-storing objects going into the network, about tensor operations, which \n", - "layers are made of, and about gradient descent, which allows our network to learn from its training examples." - ] - } - ], - "metadata": { - "kernelspec": { - "display_name": "Python 3", - "language": "python", - "name": "python3" - }, - "language_info": { - "codemirror_mode": { - "name": "ipython", - "version": 3 - }, - "file_extension": ".py", - "mimetype": "text/x-python", - "name": "python", - "nbconvert_exporter": "python", - "pygments_lexer": "ipython3", - "version": "3.5.2" - } - }, - "nbformat": 4, - "nbformat_minor": 2 -} From 81073f4a0806ec51020d4fd51f138b1fe4116ca0 Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Fri, 8 Mar 2019 15:13:29 +0800 Subject: [PATCH 18/46] Add files via upload --- ...2.1-a-first-look-at-a-neural-network.ipynb | 250 +++++++++++++++--- 1 file changed, 219 insertions(+), 31 deletions(-) diff --git a/keras/2.1-a-first-look-at-a-neural-network.ipynb b/keras/2.1-a-first-look-at-a-neural-network.ipynb index 073b04a..c281ede 100644 --- a/keras/2.1-a-first-look-at-a-neural-network.ipynb +++ b/keras/2.1-a-first-look-at-a-neural-network.ipynb @@ -9,11 +9,19 @@ }, { "cell_type": "code", - "execution_count": null, + "execution_count": 1, "metadata": {}, - "outputs": [], + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "env: PYSPARK_PYTHON=/usr/bin/python3.5\n", + "env: PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n" + ] + } + ], "source": [ - "%env SPARK_DRIVER_MEMORY=8g\n", "%env PYSPARK_PYTHON=/usr/bin/python3.5\n", "%env PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n", "\n", @@ -26,22 +34,43 @@ "metadata": {}, "source": [ "# A first look at a neural network\n", - "This notebook is imported from Chapter 2, Section 1 of [Deep Learning with Python 
Notebook]().\n", "\n", "----\n", "\n", - "We will now take a look at a first concrete example of a neural network, which makes use of analytics-zoo Keras module to learn to classify hand-written digits. Unless you already have experience with Keras or similar libraries, you will not understand everything about this first example right away. You probably haven't even installed analytics-zoo yet. Don't worry, that is perfectly fine. In the next chapter, we will review each element in our example and explain them in detail. So don't worry if some steps seem arbitrary or look like magic to you! We've got to start somewhere.\n", + "We will now take a look at a first concrete example of a neural network, which makes use of Keras (v1.2.2) API in [Analytics Zoo](https://github.com/intel-analytics/analytics-zoo) to learn to classify hand-written digits. Unless you already have experience with Keras or similar libraries, you will not understand everything about this first example right away. You probably haven't even installed analytics-zoo yet. Don't worry, that is perfectly fine. In the next chapter, we will review each element in our example and explain them in detail. So don't worry if some steps seem arbitrary or look like magic to you! We've got to start somewhere.\n", "\n", "The problem we are trying to solve here is to classify grayscale images of handwritten digits (28 pixels by 28 pixels), into their 10 categories (0 to 9). The dataset we will use is the MNIST dataset, a classic dataset in the machine learning community, which has been around for almost as long as the field itself and has been very intensively studied. It's a set of 60,000 training images, plus 10,000 test images, assembled by the National Institute of Standards and Technology (the NIST in MNIST) in the 1980s. You can think of \"solving\" MNIST as the \"Hello World\" of deep learning -- it's what you do to verify that your algorithms are working as expected. 
As you become a machine learning practitioner, you will see MNIST come up over and over again, in scientific papers, blog posts, and so on.\n", "\n", - "The MNIST dataset comes pre-loaded in analytics-zoo Keras module, in the form of a set of four Numpy arrays:" + "The MNIST dataset comes pre-loaded in the Keras API of Analytics Zoo, in the form of a set of four Numpy arrays:" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "#### Datasets import\n", + "In Keras you could use following code to import the datasets:\n", + "\n", + " from keras.datasets import mnist\n", + "Just replace it with following in analytics-zoo:" ] }, { "cell_type": "code", - "execution_count": null, + "execution_count": 2, "metadata": {}, - "outputs": [], + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "Extracting /tmp/.zoo/dataset/mnist/train-images-idx3-ubyte.gz\n", + "Extracting /tmp/.zoo/dataset/mnist/train-labels-idx1-ubyte.gz\n", + "Extracting /tmp/.zoo/dataset/mnist/t10k-images-idx3-ubyte.gz\n", + "Extracting /tmp/.zoo/dataset/mnist/t10k-labels-idx1-ubyte.gz\n" + ] + } + ], "source": [ "from zoo.pipeline.api.keras.datasets import mnist\n", "(train_images, train_labels), (test_images, test_labels) = mnist.load_data()" @@ -51,14 +80,140 @@ "cell_type": "markdown", "metadata": {}, "source": [ - "train_images and train_labels form the \"training set\", the data that the model will learn from. The model will then be tested on the \"test set\", test_images and test_labels. Our images are encoded as Numpy arrays, and the labels are simply an array of digits, ranging from 0 to 9. There is a one-to-one correspondence between the images and the labels." + "`train_images` and `train_labels` form the \"training set\", the data that the model will learn from. The model will then be tested on the \n", + "\"test set\", `test_images` and `test_labels`. 
Our images are encoded as Numpy arrays, and the labels are simply an array of digits, ranging \n", + "from 0 to 9. There is a one-to-one correspondence between the images and the labels.\n", + "\n", + "Let's have a look at the training data:" + ] + }, + { + "cell_type": "code", + "execution_count": 3, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "(60000, 28, 28, 1)" + ] + }, + "execution_count": 3, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "train_images.shape" + ] + }, + { + "cell_type": "code", + "execution_count": 4, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "60000" + ] + }, + "execution_count": 4, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "len(train_labels)" + ] + }, + { + "cell_type": "code", + "execution_count": 5, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "array([5, 0, 4, ..., 5, 6, 8], dtype=uint8)" + ] + }, + "execution_count": 5, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "train_labels" + ] + }, + { + "cell_type": "code", + "execution_count": 6, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "(10000, 28, 28, 1)" + ] + }, + "execution_count": 6, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "test_images.shape" + ] + }, + { + "cell_type": "code", + "execution_count": 7, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "10000" + ] + }, + "execution_count": 7, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "len(test_labels)" + ] + }, + { + "cell_type": "code", + "execution_count": 8, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "array([7, 2, 1, ..., 4, 5, 6], dtype=uint8)" + ] + }, + "execution_count": 8, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "test_labels" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ - 
"Our workflow will be as follow: first we will present our neural network with the training data, train_images and train_labels. The network will then learn to associate images and labels. Finally, we will ask the network to produce predictions for test_images, and we will verify if these predictions match the labels from test_labels.\n", + "Our workflow will be as follow: first we will present our neural network with the training data, `train_images` and `train_labels`. The \n", + "network will then learn to associate images and labels. Finally, we will ask the network to produce predictions for `test_images`, and we \n", + "will verify if these predictions match the labels from `test_labels`.\n", "\n", "Let's build our network -- again, remember that you aren't supposed to understand everything about this example just yet." ] @@ -67,17 +222,17 @@ "cell_type": "markdown", "metadata": {}, "source": [ - "#### Module import (for previous Keras user)\n", - "Import the modules we need to build the network. 
In Keras it is:\n", + "#### Module import\n", + "In Keras you could use following code to import the modules we need to build the network:\n", "\n", " from keras import models\n", " from keras import layers\n", - "Just replace it with following in order to use analytics-zoo:" + "Just replace it with following in analytics-zoo:" ] }, { "cell_type": "code", - "execution_count": null, + "execution_count": 9, "metadata": {}, "outputs": [], "source": [ @@ -87,9 +242,29 @@ }, { "cell_type": "code", - "execution_count": null, + "execution_count": 10, "metadata": {}, - "outputs": [], + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "creating: createZooKerasSequential\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n" + ] + }, + { + "data": { + "text/plain": [ + "" + ] + }, + "execution_count": 10, + "metadata": {}, + "output_type": "execute_result" + } + ], "source": [ "network = models.Sequential()\n", "network.add(layers.Dense(512, activation='relu', input_shape=(28 * 28,)))\n", @@ -123,9 +298,19 @@ }, { "cell_type": "code", - "execution_count": null, + "execution_count": 11, "metadata": {}, - "outputs": [], + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "creating: createRMSprop\n", + "creating: createZooKerasSparseCategoricalCrossEntropy\n", + "creating: createZooKerasAccuracy\n" + ] + } + ], "source": [ "network.compile(optimizer='rmsprop',\n", " loss='sparse_categorical_crossentropy',\n", @@ -143,7 +328,7 @@ }, { "cell_type": "code", - "execution_count": null, + "execution_count": 12, "metadata": {}, "outputs": [], "source": [ @@ -164,7 +349,7 @@ }, { "cell_type": "code", - "execution_count": null, + "execution_count": 13, "metadata": {}, "outputs": [], "source": [ @@ -175,14 +360,16 @@ "cell_type": "markdown", "metadata": {}, "source": [ - "INFO - Trained 128 records in 0.018066358 seconds. Throughput is 7084.992 records/second. Loss is 0.012087556." 
+ "Blue messages below is the last INFO of training, you can find full training process in INFO, which outputs in your terminal or IDE\n", + "\n", + "INFO - Trained 128 records in 0.018066358 seconds. Throughput is 7084.992 records/second. Loss is 0.012087556." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ - "#### Evaluate return (for previous Keras user)\n", + "#### Evaluate return (for previous Keras user) pending for code\n", "Check our result on test set. In Keras it is:\n", "\n", " test_loss, test_acc = network.evaluate(test_images, test_labels)\n", @@ -191,21 +378,22 @@ }, { "cell_type": "code", - "execution_count": null, + "execution_count": 14, "metadata": {}, - "outputs": [], + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "test_acc: 0.9771000146865845\n" + ] + } + ], "source": [ "test_result = network.evaluate(test_images, test_labels, batch_size=32)\n", "print('test_acc:', test_result[0].result)" ] }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "test_acc: 0.9783999919891357" - ] - }, { "cell_type": "markdown", "metadata": {}, From 390d1125c4db4257f0f239e2a6ba7d46fdf0b848 Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Tue, 12 Mar 2019 16:07:17 +0800 Subject: [PATCH 19/46] Add files via upload --- keras/2.1-a-first-look-at-a-neural-network.ipynb | 14 +++++--------- 1 file changed, 5 insertions(+), 9 deletions(-) diff --git a/keras/2.1-a-first-look-at-a-neural-network.ipynb b/keras/2.1-a-first-look-at-a-neural-network.ipynb index c281ede..41bcf51 100644 --- a/keras/2.1-a-first-look-at-a-neural-network.ipynb +++ b/keras/2.1-a-first-look-at-a-neural-network.ipynb @@ -257,7 +257,7 @@ { "data": { "text/plain": [ - "" + "" ] }, "execution_count": 10, @@ -369,11 +369,7 @@ "cell_type": "markdown", "metadata": {}, "source": [ - "#### Evaluate return (for previous Keras user) pending for code\n", - "Check our result on test set. 
In Keras it is:\n", - "\n", - " test_loss, test_acc = network.evaluate(test_images, test_labels)\n", - "In analytics-zoo, the return of `evaluate` method is an `EvaluationResult` object, which is different from Keras. We use following code to check:" + "We quickly reach an accuracy of 0.989 (i.e. 98.9%) on the training data. Now let's check that our model performs well on the test set too:" ] }, { @@ -385,13 +381,13 @@ "name": "stdout", "output_type": "stream", "text": [ - "test_acc: 0.9771000146865845\n" + "test_acc: 0.9797000288963318\n" ] } ], "source": [ - "test_result = network.evaluate(test_images, test_labels, batch_size=32)\n", - "print('test_acc:', test_result[0].result)" + "test_loss, test_acc = network.evaluate(test_images, test_labels, batch_size=32)\n", + "print('test_acc:', test_acc)" ] }, { From e3b1b5cd30dbfd15ae37f4be8394770d9eb91ad2 Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Tue, 12 Mar 2019 16:11:29 +0800 Subject: [PATCH 20/46] Add files via upload --- keras/2.1-a-first-look-at-a-neural-network.ipynb | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/keras/2.1-a-first-look-at-a-neural-network.ipynb b/keras/2.1-a-first-look-at-a-neural-network.ipynb index 41bcf51..81f35ef 100644 --- a/keras/2.1-a-first-look-at-a-neural-network.ipynb +++ b/keras/2.1-a-first-look-at-a-neural-network.ipynb @@ -49,10 +49,10 @@ "metadata": {}, "source": [ "#### Datasets import\n", - "In Keras you could use following code to import the datasets:\n", + "_In Keras one could use following code to import the datasets:_\n", "\n", " from keras.datasets import mnist\n", - "Just replace it with following in analytics-zoo:" + "_Just replace it with following in analytics-zoo:_" ] }, { @@ -223,11 +223,11 @@ "metadata": {}, "source": [ "#### Module import\n", - "In Keras you could use following code to import the modules we need to build the network:\n", + "_In Keras one could use following code to import the modules we need to build the network:_\n", 
"\n", " from keras import models\n", " from keras import layers\n", - "Just replace it with following in analytics-zoo:" + "_Just replace it with following in analytics-zoo:_" ] }, { From 201c4abadf45846673adead985252848355f0513 Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Tue, 12 Mar 2019 16:15:26 +0800 Subject: [PATCH 21/46] Add files via upload --- keras/2.1-a-first-look-at-a-neural-network.ipynb | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/keras/2.1-a-first-look-at-a-neural-network.ipynb b/keras/2.1-a-first-look-at-a-neural-network.ipynb index 81f35ef..d022670 100644 --- a/keras/2.1-a-first-look-at-a-neural-network.ipynb +++ b/keras/2.1-a-first-look-at-a-neural-network.ipynb @@ -360,9 +360,9 @@ "cell_type": "markdown", "metadata": {}, "source": [ - "Blue messages below is the last INFO of training, you can find full training process in INFO, which outputs in your terminal or IDE\n", + "Messages below is the last INFO of training, you can find full training process in INFO, which outputs in your terminal or IDE (not the output of the program)\n", "\n", - "INFO - Trained 128 records in 0.018066358 seconds. Throughput is 7084.992 records/second. Loss is 0.012087556." + "_INFO - Trained 128 records in 0.018066358 seconds. Throughput is 7084.992 records/second. 
Loss is 0.012087556._" ] }, { From 09f6bc7053c5de375d30371443cbb2db48022b64 Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Mon, 18 Mar 2019 10:42:27 +0800 Subject: [PATCH 22/46] Add files via upload --- keras/3.5-binary-classification.ipynb | 1686 +++++++++++++++++++++++++ 1 file changed, 1686 insertions(+) create mode 100644 keras/3.5-binary-classification.ipynb diff --git a/keras/3.5-binary-classification.ipynb b/keras/3.5-binary-classification.ipynb new file mode 100644 index 0000000..93ac869 --- /dev/null +++ b/keras/3.5-binary-classification.ipynb @@ -0,0 +1,1686 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "**First of all, set environment variables and initialize spark context:**" + ] + }, + { + "cell_type": "code", + "execution_count": 1, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "env: SPARK_DRIVER_MEMORY=32g\n", + "env: PYSPARK_PYTHON=/usr/bin/python3.5\n", + "env: PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n" + ] + } + ], + "source": [ + "%env SPARK_DRIVER_MEMORY=32g\n", + "%env PYSPARK_PYTHON=/usr/bin/python3.5\n", + "%env PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n", + "\n", + "from zoo.common.nncontext import *\n", + "sc = init_nncontext(init_spark_conf().setMaster(\"local[4]\"))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "# Classifying movie reviews: a binary classification example\n", + "This notebook is imported from Chapter 3, Section 5 of [Deep Learning with Python Notebook]().\n", + "\n", + "----\n", + "\n", + "Two-class classification, or binary classification, may be the most widely applied kind of machine learning problem. In this example, we \n", + "will learn to classify movie reviews into \"positive\" reviews and \"negative\" reviews, just based on the text content of the reviews." 
+ ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## The IMDB dataset\n", + "We'll be working with \"IMDB dataset\", a set of 50,000 highly-polarized reviews from the Internet Movie Database. They are split into 25,000 \n", + "reviews for training and 25,000 reviews for testing, each set consisting in 50% negative and 50% positive reviews.\n", + "\n", + "Why do we have these two separate training and test sets? You should never test a machine learning model on the same data that you used to \n", + "train it! Just because a model performs well on its training data doesn't mean that it will perform well on data it has never seen, and \n", + "what you actually care about is your model's performance on new data (since you already know the labels of your training data -- obviously \n", + "you don't need your model to predict those). For instance, it is possible that your model could end up merely _memorizing_ a mapping between \n", + "your training samples and their targets -- which would be completely useless for the task of predicting targets for data never seen before. \n", + "We will go over this point in much more detail in the next chapter.\n", + "\n", + "Just like the MNIST dataset, the IMDB dataset comes packaged with Keras. 
It has already been preprocessed: the reviews (sequences of words) \n", + "have been turned into sequences of integers, where each integer stands for a specific word in a dictionary.\n", + "\n", + "The following code will load the dataset (when you run it for the first time, about 80MB of data will be downloaded to your machine):" + ] + }, + { + "cell_type": "code", + "execution_count": 2, + "metadata": {}, + "outputs": [ + { + "name": "stderr", + "output_type": "stream", + "text": [ + "Using TensorFlow backend.\n" + ] + } + ], + "source": [ + "from keras.datasets import imdb\n", + "(train_data, train_labels), (test_data, test_labels) = imdb.load_data(nb_words=10000)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "The argument `nb_words=10000` means that we will only keep the top 10,000 most frequently occurring words in the training data. Rare words \n", + "will be discarded. This allows us to work with vector data of manageable size.\n", + "\n", + "The variables `train_data` and `test_data` are lists of reviews, each review being a list of word indices (encoding a sequence of words). 
\n", + "`train_labels` and `test_labels` are lists of 0s and 1s, where 0 stands for \"negative\" and 1 stands for \"positive\":" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Since we restricted ourselves to the top 10,000 most frequent words, no word index will exceed 10,000:" + ] + }, + { + "cell_type": "code", + "execution_count": 3, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "9999" + ] + }, + "execution_count": 3, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "max([max(sequence) for sequence in train_data])" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "For kicks, here's how you can quickly decode one of these reviews back to English words:" + ] + }, + { + "cell_type": "code", + "execution_count": 4, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "\"? this film was just brilliant casting location scenery story direction everyone's really suited the part they played and you could just imagine being there robert ? is an amazing actor and now the same being director ? father came from the same scottish island as myself so i loved the fact there was a real connection with this film the witty remarks throughout the film were great it was just brilliant so much that i bought the film as soon as it was released for ? and would recommend it to everyone to watch and the fly fishing was amazing really cried at the end it was so sad and you know what they say if you cry at a film it must have been good and this definitely was also ? to the two little boy's that played the ? of norman and paul they were just brilliant children are often left out of the ? 
list i think because the stars that play them all grown up are such a big profile for the whole film but these children are amazing and should be praised for what they have done don't you think the whole story was so lovely because it was true and was someone's life after all that was shared with us all\"" + ] + }, + "execution_count": 4, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "# word_index is a dictionary mapping words to an integer index\n", + "word_index = imdb.get_word_index()\n", + "# We reverse it, mapping integer indices to words\n", + "reverse_word_index = dict([(value, key) for (key, value) in word_index.items()])\n", + "# We decode the review; note that our indices were offset by 3\n", + "# because 0, 1 and 2 are reserved indices for \"padding\", \"start of sequence\", and \"unknown\".\n", + "decoded_review = ' '.join([reverse_word_index.get(i - 3, '?') for i in train_data[0]])\n", + "decoded_review" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "We cannot feed lists of integers into a neural network. We have to turn our lists into tensors. There are two ways we could do that:\n", + "\n", + "* We could pad our lists so that they all have the same length, and turn them into an integer tensor of shape `(samples, word_indices)`, \n", + "then use as first layer in our network a layer capable of handling such integer tensors (the `Embedding` layer, which we will cover in \n", + "detail later in the book).\n", + "* We could one-hot-encode our lists to turn them into vectors of 0s and 1s. Concretely, this would mean for instance turning the sequence \n", + "`[3, 5]` into a 10,000-dimensional vector that would be all-zeros except for indices 3 and 5, which would be ones. Then we could use as \n", + "first layer in our network a `Dense` layer, capable of handling floating point vector data.\n", + "\n", + "We will go with the latter solution. 
Let's vectorize our data, which we will do manually for maximum clarity:" + ] + }, + { + "cell_type": "code", + "execution_count": 5, + "metadata": {}, + "outputs": [], + "source": [ + "import numpy as np\n", + "def vectorize_sequences(sequences, dimension=10000):\n", + " # Create an all-zero matrix of shape (len(sequences), dimension)\n", + " results = np.zeros((len(sequences), dimension))\n", + " for i, sequence in enumerate(sequences):\n", + " results[i, sequence] = 1. # set specific indices of results[i] to 1s\n", + " return results\n", + "\n", + "# Our vectorized training data\n", + "x_train = vectorize_sequences(train_data)\n", + "# Our vectorized test data\n", + "x_test = vectorize_sequences(test_data)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Here's what our samples look like now:" + ] + }, + { + "cell_type": "code", + "execution_count": 6, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "array([0., 1., 1., ..., 0., 0., 0.])" + ] + }, + "execution_count": 6, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "x_train[0]" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "We should also vectorize our labels, which is straightforward:" + ] + }, + { + "cell_type": "code", + "execution_count": 7, + "metadata": {}, + "outputs": [], + "source": [ + "y_train = np.asarray(train_labels).astype('float32')\n", + "y_test = np.asarray(test_labels).astype('float32')" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Now our data is ready to be fed into a neural network." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## Building our network\n", + "\n", + "\n", + "Our input data is simply vectors, and our labels are scalars (1s and 0s): this is the easiest setup you will ever encounter. 
A type of \n", + "network that performs well on such a problem would be a simple stack of fully-connected (`Dense`) layers with `relu` activations: `Dense(16, activation='relu')`\n", + "\n", + "The argument being passed to each `Dense` layer (16) is the number of \"hidden units\" of the layer. What's a hidden unit? It's a dimension \n", + "in the representation space of the layer. You may remember from the previous chapter that each such `Dense` layer with a `relu` activation implements \n", + "the following chain of tensor operations:\n", + "\n", + "`output = relu(dot(W, input) + b)`\n", + "\n", + "Having 16 hidden units means that the weight matrix `W` will have shape `(input_dimension, 16)`, i.e. the dot product with `W` will project the \n", + "input data onto a 16-dimensional representation space (and then we would add the bias vector `b` and apply the `relu` operation). You can \n", + "intuitively understand the dimensionality of your representation space as \"how much freedom you are allowing the network to have when \n", + "learning internal representations\". Having more hidden units (a higher-dimensional representation space) allows your network to learn more \n", + "complex representations, but it makes your network more computationally expensive and may lead to learning unwanted patterns (patterns that \n", + "will improve performance on the training data but not on the test data).\n", + "\n", + "There are two key architecture decisions to be made about such a stack of dense layers:\n", + "\n", + "* How many layers to use.\n", + "* How many \"hidden units\" to choose for each layer.\n", + "\n", + "In the next chapter, you will learn formal principles to guide you in making these choices. 
\n", + "For the time being, you will have to trust us with the following architecture choice: \n", + "two intermediate layers with 16 hidden units each, \n", + "and a third layer which will output the scalar prediction regarding the sentiment of the current review.
\n", + "The intermediate layers will use `relu` as their \"activation function\", \n", + "and the final layer will use a sigmoid activation so as to output a probability \n", + "(a score between 0 and 1, indicating how likely the sample is to have the target \"1\", i.e. how likely the review is to be positive). \n", + "A `relu` (rectified linear unit) is a function meant to zero-out negative values, \n", + "while a sigmoid \"squashes\" arbitrary values into the `[0, 1]` interval, thus outputting something that can be interpreted as a probability." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Here's what our network looks like:\n", + "\n", + "![3-layer network](https://s3.amazonaws.com/book.keras.io/img/ch3/3_layer_network.png)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "And here's the analytics-zoo implementation, very similar to the MNIST example you saw previously:" + ] + }, + { + "cell_type": "code", + "execution_count": 8, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "creating: createZooKerasSequential\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n" + ] + }, + { + "data": { + "text/plain": [ + "" + ] + }, + "execution_count": 8, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "from zoo.pipeline.api.keras import models\n", + "from zoo.pipeline.api.keras import layers\n", + "\n", + "model = models.Sequential()\n", + "model.add(layers.Dense(16, activation='relu', input_shape=(10000,)))\n", + "model.add(layers.Dense(16, activation='relu'))\n", + "model.add(layers.Dense(1, activation='sigmoid'))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Lastly, we need to pick a loss function and an optimizer. 
Since we are facing a binary classification problem and the output of our network \n", + "is a probability (we end our network with a single-unit layer with a sigmoid activation), it is best to use the `binary_crossentropy` loss. \n", + "It isn't the only viable choice: you could use, for instance, `mean_squared_error`. But crossentropy is usually the best choice when you \n", + "are dealing with models that output probabilities. Crossentropy is a quantity from the field of Information Theory, that measures the \"distance\" \n", + "between probability distributions, or in our case, between the ground-truth distribution and our predictions.\n", + "\n", + "Here's the step where we configure our model with the `rmsprop` optimizer and the `binary_crossentropy` loss function. Note that we will \n", + "also monitor accuracy during training." + ] + }, + { + "cell_type": "code", + "execution_count": 9, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "creating: createRMSprop\n", + "creating: createZooKerasBinaryCrossEntropy\n", + "creating: createZooKerasBinaryAccuracy\n" + ] + } + ], + "source": [ + "model.compile(optimizer='rmsprop',\n", + " loss='binary_crossentropy',\n", + " metrics=['accuracy'])" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## Validating our approach\n", + "\n", + "In order to monitor during training the accuracy of the model on data that it has never seen before, we will create a \"validation set\" by \n", + "setting apart 10,000 samples from the original training data:" + ] + }, + { + "cell_type": "code", + "execution_count": 10, + "metadata": {}, + "outputs": [], + "source": [ + "x_val = x_train[:10000]\n", + "partial_x_train = x_train[10000:]\n", + "y_val = y_train[:10000]\n", + "partial_y_train = y_train[10000:]" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "We will now train our model for 20 epochs (20 iterations 
over all samples in the
`x_train` and `y_train` tensors), in mini-batches of 512 \n", + "samples. At the same time we will monitor loss and accuracy on the 10,000 samples that we set apart. This is done by passing the \n", + "validation data as the `validation_data` argument:" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "#### Accuracy checkout\n", + "_To check the behavior of this model in Keras, one could use the following code together with the `matplotlib` library to plot the resulting `history` object_\n", + " \n", + " history = model.fit(partial_x_train,\n", + " partial_y_train,\n", + " nb_epoch=5,\n", + " batch_size=512,\n", + " validation_data=(x_val, y_val)\n", + " )\n", + "_After the `fit` method finishes, the results are stored in `history` and can thus be visualized. Currently in analytics-zoo, the `fit` method does not return anything. Results can only be checked by setting up tensorboard._" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "To do training visualization, you can configure tensorboard in the model. 
The code for setting up tensorboard and training follows; note that `set_tensorboard` needs to be called before the `fit` method:" + ] + }, + { + "cell_type": "code", + "execution_count": 11, + "metadata": {}, + "outputs": [], + "source": [ + "model.set_tensorboard('./', '3-5_summary')\n", + "model.fit(partial_x_train,\n", + " partial_y_train,\n", + " nb_epoch=20,\n", + " batch_size=512,\n", + " validation_data=(x_val, y_val))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "The result can then be visualized in either of the following ways: \n", + "\n", + "* Start the tensorboard web interface in a terminal with `tensorboard --logdir ./` and open the url `localhost:port_number` shown in your terminal in a web browser.\n", + "* Use the analytics-zoo built-in methods `get_train_summary` and `get_validation_summary` with a tag such as `Loss` to get the array of scalars, then visualize it via `matplotlib`.\n", + "\n", + "We use the second approach here in order to directly show the result in this notebook." + ] + }, + { + "cell_type": "code", + "execution_count": 13, + "metadata": {}, + "outputs": [ + { + "data": { + "image/png": 
"iVBORw0KGgoAAAANSUhEUgAAAYUAAAEWCAYAAACJ0YulAAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADl0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uIDMuMC4yLCBodHRwOi8vbWF0cGxvdGxpYi5vcmcvOIA7rQAAIABJREFUeJzs3Xd4VNXWwOHfSiP0jnQCivQeitKbUhRFuqKCCoIitqtiASuC5VOuiihcsaCAgFIkVOkgAqH3HiChhQAhlJC2vz9mMkySSTIJM5mU9T4Pj3PO2efMCpFZs7sYY1BKKaUAvDwdgFJKqexDk4JSSikbTQpKKaVsNCkopZSy0aSglFLKRpOCUkopG00KyqVExFtEropIZVeW9SQRuUtEXD52W0Q6iUiI3fFBEWntTNlMvNf/ROStzN6fxnM/EpGfXP1c5Tk+ng5AeZaIXLU7LADcBOKtx88aY37LyPOMMfFAIVeXzQuMMTVc8RwReQYYaIxpZ/fsZ1zxbJX7aVLI44wxtg9l6zfRZ4wxf6dWXkR8jDFxWRGbUirrafORSpO1eeB3EZkhIlHAQBG5R0T+FZHLInJGRL4SEV9reR8RMSISYD3+1Xp9sYhEichGEama0bLW611F5JCIRIrI1yKyQUQGpRK3MzE+KyJHROSSiHxld6+3iHwpIhEicgzoksbfz9siMjPZuYki8oX19TMist/68xy1fotP7VmhItLO+rqAiEyzxrYXaJKs7Dsicsz63L0i0sN6vh7wDdDa2jR3we7v9j27+4dZf/YIEZknIuWc+btJj4j0tMZzWURWikgNu2tvichpEbkiIgfsftYWIrLNev6ciHzm7PspNzDG6B/9gzEGIATolOzcR0AM8CCWLxH5gaZAcyw1zWrAIWCEtbwPYIAA6/GvwAUgEPAFfgd+zUTZMkAU8JD12itALDAolZ/FmRjnA0WBAOBi4s8OjAD2AhWBksBayz8Vh+9TDbgKFLR79nkg0Hr8oLWMAB2AG0B967VOQIjds0KBdtbXnwOrgeJAFWBfsrJ9gXLW38mj1hjusF57BlidLM5fgfesr++zxtgQ8Ae+BVY683fj4Of/CPjJ+rqWNY4O1t/RW8BB6+s6wAmgrLVsVaCa9fUWYID1dWGguaf/LeTlP1pTUM5Yb4z5yxiTYIy5YYzZYozZZIyJM8YcAyYDbdO4f44xJtgYEwv8huXDKKNlHwB2GGPmW699iSWBOORkjOOMMZHGmBAsH8CJ79UX+NIYE2qMiQDGp/E+x4A9WJIVQGfgkjEm2Hr9L2PMMWOxElgBOOxMTqYv8JEx5pIx5gSWb//27zvLGHPG+juZjiWhBzrxXIDHgP8ZY3YYY6KBUUBbEaloVya1v5u09AcWGGNWWn9H47EkluZAHJYEVMfaBHnc+ncHluReXURKGmOijDGbnPw5lBtoUlDOOGV/ICI1RSRIRM6KyBXgA6BUGveftXt9nbQ7l1MrW94+DmOMwfLN2iEnY3TqvbB8w03LdGCA9fWj1uPEOB4QkU0iclFELmP5lp7W31WicmnFICKDRGSntZnmMlDTyeeC5eezPc8YcwW4BFSwK5OR31lqz03A8juqYIw5CLyK5fdw3tocWdZadDBQGzgoIptFpJuTP4dyA00KyhnJh2N+j+Xb8V3GmCLAGCzNI+50BktzDgAiIiT9EEvudmI8A1SyO05vyOwsoJOIVMBSY5hujTE/MAcYh6VppxiwzMk4zqYWg4hUAyYBw4GS1ucesHtuesNnT2Npkkp8XmEszVRhTsSVked6YfmdhQEYY341xrTE0nTkjeXvBWPMQWNMfyxNhP8H/CEi/rcZi8okTQoqMwoDkcA1EakFPJsF77kQaCwiD4qID/AiUNpNMc4CXhKRCiJSEngjrcLGmLPAeuAn4KAx5rD1Uj7ADwgH4kXkAaBjBmJ4S0SKiWUexwi7a4WwfPCHY8mPQ7DUFBKdAyomdqw
7MAN4WkTqi0g+LB/O64wxqda8MhBzDxFpZ33v17D0A20SkVoi0t76fjesfxKw/ACPi0gpa80i0vqzJdxmLCqTNCmozHgVeBLLP/jvsXQIu5Ux5hzQD/gCiADuBLZjmVfh6hgnYWn7342lE3SOE/dMx9JxbGs6MsZcBl4G5mLprO2NJbk5410sNZYQYDHwi91zdwFfA5utZWoA9u3wy4HDwDkRsW8GSrx/CZZmnLnW+ytj6We4LcaYvVj+zidhSVhdgB7W/oV8wKdY+oHOYqmZvG29tRuwXyyj2z4H+hljYm43HpU5YmmaVSpnERFvLM0VvY0x6zwdj1K5hdYUVI4hIl2szSn5gNFYRq1s9nBYSuUqmhRUTtIKOIalaeJ+oKcxJrXmI6VUJmjzkVJKKRutKSillLLJcQvilSpVygQEBHg6DKWUylG2bt16wRiT1jBuIAcmhYCAAIKDgz0dhlJK5Sgikt7MfECbj5RSStnRpKCUUspGk4JSSimbHNenoHKH2NhYQkNDiY6O9nQoKg3+/v5UrFgRX9/UllFSuY0mBeURoaGhFC5cmICAACwLnqrsxhhDREQEoaGhVK1aNf0bVK6gzUfKI6KjoylZsqQmhGxMRChZsqTW5vIYtyYF61o1B617vY5ycL2yiKwSke0isks318hbNCFkf/o7ynvclhSsq1hOBLpi2VVpgIjUTlbsHWCWMaYRlq38vnVXPEop5UkXb1yk0y+dyO5LC7mzptAMOGLdnzYGmMmtfWwTGaCI9XVRLEshK+V2ly9f5ttvM/cdpFu3bly+fNnp8u+99x6ff/55pt5L5R4jF49kxfEVBB0OytT9a0+s5Wac+9d/dGdSqEDSPWZDSbl94nvAQBEJBRYBLzh6kIgMFZFgEQkODw93R6wqj0krKcTFxaV576JFiyhWrJg7wlK5WHRcdJL/ZsSpyFO0/aktj/75qKvDSsHTHc0DgJ+MMRWx7L40zbqvaxLGmMnGmEBjTGDp0uku3aFUukaNGsXRo0dp2LAhr732GqtXr6Z169b06NGD2rUtrZwPP/wwTZo0oU6dOkyePNl2b0BAABcuXCAkJIRatWoxZMgQ6tSpw3333ceNGzfSfN8dO3bQokUL6tevT8+ePbl06RIAX331FbVr16Z+/fr0798fgDVr1tCwYUMaNmxIo0aNiIqKctPfhsruIm5EAHD04lG3v5c7h6SGkXTjcdsG3naexrJlH8aYjdbNuksB590Yl8pm3v9rL/tOX3HpM2uXL8K7D9ZJ9fr48ePZs2cPO3bsAGD16tVs27aNPXv22IZfTp06lRIlSnDjxg2aNm1Kr169KFmyZJLnHD58mBkzZjBlyhT69u3LH3/8wcCBA1N93yeeeIKvv/6atm3bMmbMGN5//30mTJjA+PHjOX78OPny5bM1TX3++edMnDiRli1bcvXqVfz9dS975X7urClsAaqLSFUR8cPSkbwgWZmTWDcyt26u7o9lAxWlslyzZs2SjMf/6quvaNCgAS1atODUqVMcPnw4xT1Vq1alYcOGADRp0oSQkJBUnx8ZGcnly5dp27YtAE8++SRr164FoH79+jz22GP8+uuv+PhYvqu1bNmSV155ha+++orLly/bzivlTm77v8wYEyciI4ClgDcw1RizV0Q+AIKNMQuwbK4+RURextLpPMhk96555XJpfaPPSgULFrS9Xr16NX///TcbN26kQIECtGvXzuF4/Xz58tlee3t7p9t8lJqgoCDWrl3LX3/9xdixY9m9ezejRo2ie/fuLFq0iJYtW7J06VJq1qyZqecr5Sy3fvUwxizC0oFsf26M3et9QEt3xqCUI4ULF06zjT4yMpLixYtToEABDhw4wL///nvb71m0aFGKFy/OunXraN26NdOmTaNt27YkJCRw6tQp2rdvT6tWrZg5cyZXr14lIiKCevXqUa9ePbZs2cKBAwc0KSi30/qoypNKlixJy5YtqVu3Ll27dqV79+5Jrnfp0oX
vvvuOWrVqUaNGDVq0aOGS9/35558ZNmwY169fp1q1avz444/Ex8czcOBAIiMjMcYwcuRIihUrxujRo1m1ahVeXl7UqVOHrl27uiQGpdKS4/ZoDgwMNLrJTs63f/9+atWq5ekwlBP0d+UavWf15o/9fzC7z2x61+6doXt3nN1Bo+8b0eCOBuwYtiNT7y8iW40xgemV8/SQVKWUUtmIJgWllFI2mhSUUkrZaFJQSillo0lBKaWUjSYFpZRSNpoUlHJSoUKFADh9+jS9ezseUtiuXTvSGzI9YcIErl+/bjvO6FLcqdElupUraFJQKoPKly/PnDlzMn1/8qSgS3Gr7ESTgsqTRo0axcSJE23Hid+yr169SseOHWncuDH16tVj/vz5Ke4NCQmhbt26ANy4cYP+/ftTq1YtevbsmWTto+HDhxMYGEidOnV49913Acsie6dPn6Z9+/a0b98euLUUN8AXX3xB3bp1qVu3LhMmTLC9ny7RrbKKLnOhPO6lJS+x42zmZmmmpmHZhkzoMiHV6/369eOll17i+eefB2DWrFksXboUf39/5s6dS5EiRbhw4QItWrSgR48eqe5VPGnSJAoUKMD+/fvZtWsXjRs3tl0bO3YsJUqUID4+no4dO7Jr1y5GjhzJF198wapVqyhVqlSSZ23dupUff/yRTZs2YYyhefPmtG3bluLFi+sS3SrLaE1B5UmNGjXi/PnznD59mp07d1K8eHEqVaqEMYa33nqL+vXr06lTJ8LCwjh37lyqz1m7dq3tw7l+/frUr1/fdm3WrFk0btyYRo0asXfvXvbt25dmTOvXr6dnz54ULFiQQoUK8cgjj7Bu3TpAl+hWWUd/+8rj0vpG7059+vRhzpw5nD17ln79+gHw22+/ER4eztatW/H19SUgIMDhktnpOX78OJ9//jlbtmyhePHiDBo0KFPPSaRLdKusojUFlWf169ePmTNnMmfOHPr06QNYvmWXKVMGX19fVq1axYkTJ9J8Rps2bZg+fToAe/bsYdeuXQBcuXKFggULUrRoUc6dO8fixYtt96S2bHfr1q2ZN28e169f59q1a8ydO5fWrVtn+OeyX6IbcLhE9yeffEJkZCRXr17l6NGj1KtXjzfeeIOmTZty4MCBDL+ncq/DEZYNnny83P89XmsKKs+qU6cOUVFRVKhQgXLlygHw2GOP8eCDD1KvXj0CAwPT/cY8fPhwBg8eTK1atahVqxZNmjQBoEGDBjRq1IiaNWtSqVIlWra8tW3I0KFD6dKlC+XLl2fVqlW2840bN2bQoEE0a9YMgGeeeYZGjRql2VSUGl2iO/c4E3WG/n9YBgU8XPNht7+fW5fOFpEuwH+x7Lz2P2PM+GTXvwTaWw8LAGWMMWmOzdOls3MHXY4559DflWtkZuns6Lho2v3Ujk1hmwCY1H0SwwKHZer9Pb50toh4AxOBrkBtYICI1LYvY4x52RjT0BjTEPga+NNd8SilVE5ijGHYwmFsCtvExG4T07/BRdzZp9AMOGKMOWaMiQFmAg+lUX4AMMON8SilVI7x5b9f8vPOn3mv7Xs8UuuRLHtfdyaFCsApu+NQ67kURKQKUBVYmcr1oSISLCLB4eHhmQrmw4X7CBgVRGx8QqbuV66X03b9y4v0d+QZS48s5bXlr9GrVi9Gtx2dpe+dXUYf9QfmGGPiHV00xkw2xgQaYwJLly6dqTf4Yf1xAI5fuJbpIJXr+Pv7ExERoR862ZgxhoiICJ3MlsUORRyi35x+1CtTj58f/hkvydqPaXeOPgoDKtkdV7Sec6Q/8LwbY7H5fOlBJj+Rbl+LcrOKFSsSGhpKZmt+Kmv4+/tTsWJFT4eRZ1yOvkyPGT3w9fZlfv/5FPQrmOUxuDMpbAGqi0hVLMmgP/Bo8kIiUhMoDmx0Yywsf7kNnb9cy7J9qc9OVVnH19eXqlWrejoMpbKN+IR4BvwxgKOXjrLiiRVUKVbFI3G4rV5ijIkDRgBLgf3ALGPMXhH5QER
62BXtD8w0bm5HuKtMIXc+XimlbsubK95kyZElTOw2kTZV2ngsDrdOXjPGLAIWJTs3Jtnxe+6MIZH9gmYXr8VQoqBfVrytUkoBcPbqWQASTMrBLtN2TuOzfz7jucDnGNpkaFaHlkR26WjOUjdiHfZnK6WUW4RdCWPDqQ2ApSPZ3uawzQz5awjtAtp5bB0we3kyKbQcv5LAj5Z7OgylVB5wKvIUbX9qazu2rymcjjrNwzMfpnzh8szuMxtfb1+HzyjgW4Dnmz5PndJ13B5vnlr7qHThfIRH3QTgwtUYD0ejlMrtTlw+Qfuf2xNxIyLFtei4aHr+3pOomCiWPb6MUgVKOXiCRZF8Rfim2zfuDNUmT9UUEhOCUkq52/FLx2n7U1su3rjI8seTtkwYYxj611A2h23m156/UrdMXQ9FmVKeqikopVRWOHrxKB1+6UDUzShWPLGCJuWbJLn+fxv/j2m7pvFh+w95qGZaq/9kvTyVFIrm9yXyRqynw1BK5WKHIw7T4ZcOXI+9zoonVtCoXKMk1xcfWcym0E30qd2Ht1u/7aEoU5enmo8CShbwdAhKqVzs4IWDtPu5HdFx0ax8YmWKhADwb+i/NCjbgB8f+jHVvb89KU8lhV+eau7pEJRSudT+8P20+7kdsfGxrHpyFQ3KNki1rKeWsHBGnkoKRQs4Hu6llFK3Y+/5vbT7uR3GGFYPWu2w47hCYcsi0S82f5HKRStndYhOy1NJQSmlbocxhviEpJNfd53bRbuf2+Et3qwetJrapWs7vLdFxRYAtKrcyu1x3g5NCkop5aSSn5bE58Nb43O2n9lO+5/bk887H2sGraFmqbT39M4JNCkopfKUcevGMX339Ezdeyn6ku311tNb6fhLRwr6FmTNoDVUL1ndVSF6VJ4akqqUUm+tfAuAR+ulWMk/TVdjrtpebw7bzH3T7qOYfzFWPbmKqsVzzzLwebqmcD0mztMhKKVygH3h+2g2pZntuPO0zpTIX4LVg1Y7nRA6Vu0IQL0y9dwSo6vk6aRw4GyUp0NQSmVzv+36jaZTmiZZv6h0gdKsHrSagGIBTj9neNPhmHcNNUrVcEOUruPWpCAiXUTkoIgcEZFRqZTpKyL7RGSviGSuoS8DmlUtYXt9+bouiqeUciw6LpphC4cxcO5AmpRrwvZnt9uurR60OlsPK70dbutTEBFvYCLQGQgFtojIAmPMPrsy1YE3gZbGmEsiUsZd8SS6GXdr2drQSzfc/XZKqRzo2KVj9Jndh21ntvFGyzf4qMNH+Hj5ULVYVY5fPk7FIrl332p3djQ3A44YY44BiMhM4CFgn12ZIcBEY8wlAGPMeTfGA0CLqiXYeeoyAFdvap+CUiqp+Qfm8+S8JxER5vefT48at3YPPvbiMQ9GljXc2XxUAThldxxqPWfvbuBuEdkgIv+KSBc3xgNAnQpFba+vaVJQSlnFxsfy+vLXefj3h7mrxF1sG7otSULIKzzd0ewDVAfaAQOAKSJSLHkhERkqIsEiEhweHn5bb1jQz9v2euKqo7f1LKVU7nA66jQdfunAZ/98xvDA4ax/an2uGmaaEe5MCmFAJbvjitZz9kKBBcaYWGPMceAQliSRhDFmsjEm0BgTWLp06dsKqoBf0hazaN2vWak8Y/3J9SnOrTi2gkbfN2LbmW389shvfNv9W/x9/D0QXfbgzqSwBaguIlVFxA/oDyxIVmYelloCIlIKS3OSWxvtCubzTnJcc/QSd76dUiqbmLJ1Ch1+7gBY5gokmAQ+WvsRnad1pmT+kmwZsiXDE9pyI7d1NBtj4kRkBLAU8AamGmP2isgHQLAxZoH12n0isg+IB14zxqTczNSFktcUlFK5W2x8LC8vfZmJWybS5a4u7Dm/h5IFStJ9eneWHFnCY/Ue47sHvqOQXyFPh5otuPUT0hizCFiU7NwYu9cGeMX6J0skrykopXKv8Gvh9JndhzUn1vDava8xruM4Gn7fkNUhq/Hz9mNS90k82+T
ZbLnZjafkua/NWlNQKm/YeXYnD818iLNXzzKt5zQG1h8IQHH/4lQtVpXZfWan2DtZ5cmkoDUFpXK7P/b9wRPznqCYfzHWDV5H0wpNbdcWProQP2+/PN2ZnBZPD0nNcr7eee5HVirPSDAJjFk1ht6ze1P/jvoEDwlOkhAAiuQrogkhDfoJqZTKUYwx7A/fT2x8bJLzUTej6DWrFx+u/ZDBDQez+snVlCtczkNR5lx5MimUL6rfEpTKaUKvhDJ27ViqfVWN2t/WpsfMW7ONj148yj0/3MNfB/9iwv0T+KHHD+TzyefBaHOuPNenALD8lbbUeXep7fh6TJx2QCuVDd2Mu8mCgwuYumMqS48sxWC4o+AdACw5YpljtOLYCvrO6YsxhiUDl9CpWidPhpzj5cmaQsF8SRPAzlORHopEKeXIjrM7GLl4JOW/KE/fOX3Ze34v77R5h6MjjxL0aJCt3FebvuL+X++nXKFybBmyRROCC+jXY6VUtnDxxkWm757O1O1T2X52O37efvSs2ZOnGj1Fx6od8fayjBzcemOr7Z4Xl7zIQzUeYlrPaRTOV9hToecqmhSUUh4TnxDPiuMrmLp9KnMPzCUmPobG5RrzTddvGFBvACXyl0jz/tFtRvNeu/fwkjzZ6OEWmhSUUlnuRuwNxq8fz487fuTUlVOUyF+CYU2GMbjRYBqWbZjmvUX9by1//0H7D9wdap6jSQEYMOVfjo/rplPdlcoiK46v4IO1H9A+oD3/d9//0aNGD6dHCyXWCqoWy5tLW7tbnq1zVSiWP8nxhiMRXLqmezYr5U6HIg7x8bqPeXDGgwC8du9r9KnTJ0PDR8sXLg/AM42fcUuMeV2erSlsGNWBgFG3RjF8t+Yo649cIGhkK+qUL5rGnUqpjNgfvp85++Ywe99sdp/fneTauWvnMvw8fx9/zLvGVeGpZPJsTSG59UcuALDt5GUPR6JU1rkee50rN6+49JnGGPac38N7q9+j7rd1qf1tbd5d/S5F/Ysy4f4JnHzpJE82eNKl76lcJ8/WFFIzet4eHm9RxdNhKJUles3qxYpjK+hUrRN9avfhoZoPpTvixxFjDLvP72b23tnM2T+HAxcOIAhtqrTh665f80itR2zNPip706SgVB4Wfi2ckgVKsv/Cfp5a8BQ+C33oWLUjvWv35uGaD1OqQKlU7zXGsOPsDmbvm82cfXM4fPEwXuJFu4B2jGw2kp61elK2UNks/GmUK7g1KYhIF+C/WHZe+58xZnyy64OAz7i1d/M3xpj/uTMmpVRSTco14a8Bf7H1zFZb2/+Qv4YwbOEw2ldtT5/afehZsyelCybdH33CvxN4ZdkreIs3Hap24D/3/oeHaz5MmYJlPPSTKFdwW1IQEW9gItAZCAW2iMgCY8y+ZEV/N8aMcFccSqn0iQiB5QMJLB/IuI7jbDWA2ftm8+zCZxkeNJx2Ae3oXas3j9R6hDsK3cGpK6fI75Ofky+fTLNGoXIWd9YUmgFHjDHHAERkJvAQkDwpKKWyERGhUblGNCrXiLEdxrLr3C5bDeK5Rc/x/KLnaVOlDRtDNxITH6MJIZdx5+ijCsApu+NQ67nkeonILhGZIyKVHD1IRIaKSLCIBIeHh7sswFnP3uOyZymVG4kIDco24MMOH7L/+f3sHr6b0W1GE349nJh4ndeTG3l6SOpfQIAxpj6wHPjZUSFjzGRjTKAxJrB06dKOimTKnaULuuxZSuV2IkLdMnV5v/377H1ur6fDUW7izqQQBth/86/IrQ5lAIwxEcaYm9bD/wFZuot28iW0lVJZo2WllgDUv6O+hyNRybkzKWwBqotIVRHxA/oDC+wLiIj9Xnk9gP1ujCeFfD6erigplTcNaTKE2NGxNC7X2NOhqGTc9lXZGBMnIiOApViGpE41xuwVkQ+AYGPMAmCkiPQA4oCLwCB3xeOILoCnlOf4eGlNPTty62/FGLMIWJTs3Bi7128Cb7ozhswIGBVEyPjung5DKaW
ynLafKKWUstH6m1J5VMT1CLae2Zp+QZWnaE1BqTzoz/1/UufbOrf1jNfufc1F0ajsJM8nhbvvKOTpEJTKMuevnafv7L70mtXrtlct/bTzp7qvQS6U55uP/HRYqsoDjDHM3DOTFxa/QFRMFB93+Jj/3Psfvvz3S12mQiWR55NCPh9vh+f/3neOTrXvyOJolHK9M1FnGB40nPkH59O8QnOmPjSV2qVrA/B6y9c9HJ3KbvJ8UvDzdlxTeOaXYMoX9Wflf9rh7+s4cSiVnRlj+GXnL7y09CWi46L5rPNnvNziZby99P9nlbo833YysmP1VK+djozm539Csi4YpTIoJj4GeV94fXnSb/ynIk/RfXp3Bs0fRN0yddk5bCf/ufc/mhBUuvJ8UrjnzpJpTlQ7eDYqC6NRKmMuR1v2FP9px0+ApXYwZesU6nxbhzUn1vDfLv9lzaA13F3ybg9GqXKSPJ8U0vPn9rD0CynlIZvDNtteh1wOofO0zgxdOJTA8oHsHr6bkc1H4iX6z1w5z6k+BRG5Ewg1xtwUkXZAfeAXY8xldwanlHJs17ldvLXiLYIOBwFw5eYV6n5bFxFhUvdJDG0yVJOByhRn/6/5A4gXkbuAyViWxJ7utqiymYBRQVy+rhuKKM87dukYA/8cSMPvGrLh1AZGtxkNwM34m7Ss3JK9z+1lWOAwTQgq05wdfZRgXfW0J/C1MeZrEdnuzsCym+/WHGNU15qeDkNlU/MOzGPtibV8cf8Xbnn+2atn+WjtR0zeOhkfLx/eaPkGr7d8nWL+xbgac5V6ZeoxqOEgXflX3TZnk0KsiAwAngQetJ7zdU9I2c95v7F8vy+GUV2DPR2KyoaCDgXRZ3YfvMTL5UkhMjqSz/75jC///ZKbcTcZ0ngIo9uOTjIb2V2JSOVNziaFwcAwYKwx5riIVAWmuS+s7OWG90ZCrno6CpUdrTuxjt6zexOXEIeft5/Lnnsj9gYTt0xk3PpxXLxxkf51+/NBuw+oXjL1IdRKuYJTDY/GmH3GmJHGmBkiUhwobIz5xM2xZTuJw/9embWDluNXejga5WnbzmzjgRkPEFAsgKcbPX1bz4pPiAcgLiGOH7b9wN3f3M1ry1+jWYWRmwhQAAAgAElEQVRmbBu6jRm9ZmhCUFnCqaQgIqtFpIiIlAC2AVNEJN06q4h0EZGDInJEREalUa6XiBgRCXQ+9Kw3bOEwjDH8uS2MsMs3PB2O8qCDFw7S5dcuFPMvxrKByyhdoHSGn3Et5ho/bv+RllNb4vOhD19u/JK639blmb+eoWKRiqx6chWLH1tMo3KN3PATKOWYs81HRY0xV0TkGSxDUd8VkV1p3SAi3sBEoDMQCmwRkQXGmH3JyhUGXgQ2ZTz8rPX73t+57877AF0TKS87GXmSztM6IyIsf3w5lYpWytD9W09v5X/b/sf0PdO5cvOK7fwry16hdunazOs3jx41eminsfIIZ8et+YhIOaAvsNDJe5oBR4wxx4wxMcBM4CEH5T4EPgGinXyuW9WtUCTVax2qduCFxS8QK6FZGJHKTs5fO0/naZ25cvMKSwcutc0U3nx6MzHxqQ9bjoyOZNKWSTT+vjGBUwL5aedPPFzzYdYOWkvv2r0BaFi2IbuG7eKhmg9pQlAe42xS+ABYChw1xmwRkWrA4XTuqQCcsjsOtZ6zEZHGQCVjTFBaDxKRoSISLCLB4eHhToacOSUL5kv12tstJuLr5U+436cYYt0ah8p+IqMj6fJrF05FnmLhowtpWLah7drK4yn7mIwxbDi5gUHzBlHu/8rx3KLnSDAJfNP1G868eoafH/6Z1lVaU65QOQAGNxysaxMpj3Oq+cgYMxuYbXd8DOh1O28sIl7AF8AgJ95/MpZJcwQGBrp1V48hraux5pDjxPPU1KP4eT1PZL4PueT7E/CwO0NR2ciN2Bv0mNmD3ed3s6D/AlpVbpVq2QvXL/DLzl/437b/sf/Cfgr5FeLx+o8zpMkQmpRrorU
Ala05u8xFReBroKX11DrgRWNMWu0oYVhmPieqaD2XqDBQF1ht/UdSFlggIj2MMR6bEBCXkJDm9QIJzSkc9yBRPvNZfHgxXat3zaLIlKfExsfSZ3Yf1p1Yx4xeM9L8nfef05+5B+YSEx9Di4ot+KHHD/St05dCfrrDn8oZnG0++hFYAJS3/vnLei4tW4DqIlJVRPyA/tZnAGCMiTTGlDLGBBhjAoB/AY8lhI41ywAQG59+RaR47GB8EwJ4ct6TnL161t2hKQ9KMAkMmj+IoMNBTOo+iX51+6VZfvmx5QwPHM7u4bvZ+PRGnmr0lCYElaM4mxRKG2N+NMbEWf/8BKQ5Bs8YEweMwNIXsR+YZYzZKyIfiEiP24raDXytm+3ExqddUwAQ/CgV8zpXY67yxNwnSDDp36Pc759T/1Djmxr8se8PlzzPGMMLi15g+u7pjOs4jmcDn033nrBXwpjQZQJ1y9R1SQxKZTVnk0KEiAwUEW/rn4FARHo3GWMWGWPuNsbcaYwZaz03xhizwEHZdp5sNnr8nioANKxULMU1//hG+CXUSHLOz1RmQpcJLD+2nC826jIDnjZ562Ta/dSOQxGH2H9hv0ueOWbVGL4N/pbX732dUa1SnWaThL+Pf4bfJ/GezNyrlKs5mxSewjIc9SxwBuiNEx3EOUnLu0oRMr475YvlT3GteOxgSsQMS3F+SOMh9KrVizdXvMmWsC1ZEWauFxsfy7Kjy5yufd2Mu8nQv4by7MJnaRvQ1mVxfLHxCz5a9xFDGg9hfKfxLnuuI++1e482Vdrc9qxopVzB2WUuThhjehhjShtjyhhjHuY2Rx/lJH6mGvlMyiUGRIQpD06hXKFyDPhjAFE3dZe222GMYdjCYdz/6/0En06/0ng66jTtf27PlG1TeLPVmywc4OwUmrRN3T6VV5e9Sp/afZjUfZLbRwsV8C3AmkFrdDiqyhZuZ9H1V1wWRQ5WPH9xfnvkN45fPs6IxSM8HU6O9vG6j5m6YypgqQGk5Z9T/9BkchN2ndvF7D6z+bjjx7f9oTp562T6zu7LkL+GcP+d9/PrI7/qB7XKc5xd5sIRHWxt1bpKa0a3Gc37a97nvmr38Vj9xzwdUo4zY/cM3ln1DjVL1eTAhQNplp28dTIjFo2gctHKLH98eaY6deMT4jlw4QCbwzZb/pzezLYz2wC4t9K9/NH3jwytejqv3zw+3/h5huNQKru5naTg1klkOc07bd5hxfEVDA8aTouKLbizxJ2eDinHWHdiHYPmD6JNlTa80fINuk/v7rDczbibvLD4BaZsm8L9d97PjF4zKJ6/eLrPN8YQeiU0SQIIPh3M1RjLeuhF8hWhafmmtvILByykoF/BDP0MD9V8iIdqOlrFRamcJc2kICJROP7wFyBlj2weNmjqVo4ffQrvYq/w6J+Psn7weny988w+RJl2KOIQD//+MFWLVWVuv7lsP+N4Q7/TUafpPas3G0M38marN/mw/YcpmnYSl58evWo0LSq2YHPYZjaFbWJz2GbbfBJfL18alm3Ikw2epFmFZjSr0Iy7S96Nl3gh71sqv84kGqVyqzSTgjGmcFYFkhPFxSfgY53fsP7IBXwow5QHp9Bndh/GrBrDuE7jPBxh9nbh+gW6T++Ol3gR9GgQJfKXcFjun1P/0GtWL6JuRjG7z2zbAnLJxSbcWo+q87TOANQoWYPO1TrbEkCDOxqQzyf19a2Uyutup/koz+s16R/mPtcSL69b3Su9a/dmaOOhfLLhEzpV60THah09GGH2FR0XzUMzH+JU5ClWPbkq1ea2jPQf+Hjd+t95+ePLCSwfSDH/lPNOlFKpu53RR7nek9YJbanZGRrJ+3/tTXH+yy5fUrNUTR6f+zjh19y7qmtOlGASGDRvEP+c+odpPadxT6V7UpS5GX9r/kHHah3ZMmRLuh3KXmL537lUgVJ0qtZJE4JSmaBJwYF6FYrSoWYZ/P3SH47488YTBIxKuvJ3Ad8CzOg1g4s
3LjJ4/mCM0T55e2+veJvf9/7OJ50+oU+dPg7LdJ7WOcn8A2fa+f28/Xir1VtseGqDq0NWKs/Q5iMH/nrBsizyH1szv5lOg7IN+KzzZ4xcMpJinxQjclSkq8LL0aZsncL4DeN5tsmzvHbvaymuX7h+wfY6rf6D1IztOPa2Y1QqL9OaQhoeaVwh/UJpGNHMMpntys0rtuGPedmyo8sYHjScLnd14Ztu3zicKbzz3E4AXr3n1QwnhNsV9GgQfz/+d5a+p1LZjSaFNGRmeYOAUUGMnLE9xf0Xb1x0WVw50e5zu+k9qzd1ytTh996/J+kUdiS1kUju1K16Nx0YoPI8TQpusGDnafafuZJ+wTzidNRpuk/vTuF8hQl6NIgi+VLfB1sp5Vnap+AmT0zdzLNtqtmOC/pmbIZsbnE15ioPzniQizcusm7wOioWqejpkJRSaXBrTUFEuojIQRE5IiIpFqQXkWEisltEdojIehGp7c54slJMXALjFt9aw2f0yvfy3PDU+IR4BvwxgB1ndzCrzywalWvk6ZCUUulwW1IQEW9gItAVqA0McPChP90YU88Y0xD4FMg1u9VE3oglPuHWUNRJW7+hyoQqvLDoBUIuh2RJDOmtNOouxy4dY+fZnby05CUWHlrIN12/oVv1bh6JRSmVMe5sPmoGHDHGHAMQkZnAQ8C+xALGGPuG94LkwkX2xPhjJJq1T27kp51T+H7r90wKnsSAegN4/d7XqXdHPZe8T0x8DLvO7WJT6CY2n97MptBNHIw4yOQHJjOkyRCXvEdawq6EMWvvLGbuncnmsM2286/e8yrDmw53+/srpVzDnUmhAnDK7jgUaJ68kIg8j2VvBj+gg6MHichQYChA5cqVXR6osx5qWJ75O05n6J4Ssc8R4fcFjco2oXVAC6rmG8Sn679g7v65/LrrV7pX786oVqNoVbmVw/tPXD5BlWJJZ1YbYzh26ZhtwbdNYZvYfmY7N+MtNYMyBcvQvEJzDkYc5NSVU44e6xLh18L5Y/8fzNgzg3Un1mEwNC7X2Hb9kVqP8GnnTzP0zKFNhjJu/TgGNRzk4miVUs7weEezMWYiMFFEHgXeAZ50UGYyMBkgMDDQY7WJdjVKZzgpFIrvQKEbHWzDU9fuT6BE3DP8/ugElp34hf9u+i+tf2xNy0otGdVqFN2qd7Mt17Dw0EIenPEg33T9hjtL3GmrBWwO22yb5JXfJz9NyjdhRLMRNK/QnGYVmlG5aGVEBK/3M946aIwh5HIIjb5vRONyjVn55Mok1yOjI5l7YC4z98zk72N/E2/iqVmqJu+1e49+dfpRo1QN22qj03pOs/0szgooFoB5N9dVGJXKMdyZFMKASnbHFa3nUjMTmOTGeLKVYv4lGN12NK/c8wpTt0/l842f8+CMB6lbpi5vtHyDfnX68fcxy0SqxB3dBKF26dr0uLsHzSo0o3nF5tQtUzfdMf9pSTAJ7Avfx7oT61h7ci3rTqwjLMrya1oVsgqAazHXWHhoITP3zmTR4UXExMdQtVhVXm/5Ov3r9qdemXoO53QU8C2Q6biUUp7hzqSwBaguIlWxJIP+wKP2BUSkujHmsPWwO3CYPKagX0FeaP4CwwKH8fve3xm/fjyPz32cd1a+w4nIE7ZyK59YSZPyTW57jH9sfCxbz2xl3Yl1rDu5jg2nNtgm1pUrVI7WVVrTpnIb3lzxJlExUTz6x6MsOLiAa7HXKFeoHM8FPkf/uv1pVqGZ2/cuVkplPbclBWNMnIiMAJYC3sBUY8xeEfkACDbGLABGiEgnIBa4hIOmo+yka91yvPz7TsY8UJsPFu5L/wY7xy9c45eNIale9/X2ZWD9gTxa71EWHV7EuPXjbEnh8fqP075q+wzHazBcuH6BFcdWsO6kJQn8G/ov12OvA1C9RHUervEwrau0pnXl1lQrXs32QT/v4Dz+PvY3y44uY2D9gQyoO4BWlVs5tWfx1B5TGbpwaIbjVUp5nuS0FTwDAwNNcHB
wlr3fuSvRNP94BQAh4y3bREZcvUmTjzK3Rk6Jgn5cvBbD0pfaUKNs2nsYrT+5no/XfcycvnMy1RST2LYPlqanBmUb0Lpya9pUaUOryq0oW6hsqvcevHCQk5EnaRfQTneQUyoXEJGtxpjA9Mp5vKM5u7ujiH+KcyUL5ePXp5sz8IdNGX5ebHyC02VbVW7FoscWZfg97BXyK8Ss3rO4t9K9FPUv6vR9NUrVoEapGrf13kqpnEeTQi6mo3iUUhmlC+J5yLPTgvlt04n0CyqlVBbSpOAhIRHXeXvuHk+HoZRSSWhSyGJR0XGeDkEppVKlSSGTapd3zZ4A3685yrK9Z13yLKWUul3a0eyEzW93TLLiKViGlrqC/fLaAK/dX4Pn2t1pmy9w4OwVukxYx8yhLWhRraRL3lMppVKjScEJZQqnHJbqLp8tPcidpQvh7+vFoB+32M4v2n1Gk4JSyu00KWRDw37dmuJcRuY3KKVUZmmfQg4RE6dzDpRS7qdJIYfQmoJSKitoUsghNCkopbKCJoXb8N3AJgxvdydVSrp/34DFe85Se8wSt7+PUipv06RwG7rULcsbXWqy5rWML2udGddj4p0q992aowSMCnJzNEqp3EiTQhYpVcg18xqW7T3LjxuOp1lmfLK5D0op5SwdkppFvn+8CeFRNxn267bbes7QaZbhqoNbVnVFWEoplYRbawoi0kVEDorIEREZ5eD6KyKyT0R2icgKEanizniyUsj47vw1opXtuLC/LwGlCrrs+eeuRBMwKoivVuS5HUyVUm7ktqQgIt7ARKArUBsYICK1kxXbDgQaY+oDc4BP3RWPu33Sq16Kc/Uq3trUplA+H/x90t/K0lnztocBsP7wBZc9Uyml3Nl81Aw4Yow5BiAiM4GHANvmxsaYVXbl/wUGujEet+obWIkzkdE80qgilR2MRirk78MNJzuKnZG4ZtLmkIucvxJNGQc7xCmlVEa5s/moAnDK7jjUei41TwOLHV0QkaEiEiwiweHh4S4M0XVEhJc63e0wIQAU8ffF39d1NQV7zT5eQZMPl2fq3kvXYvh0yQHidB6EUopsMvpIRAYCgcBnjq4bYyYbYwKNMYGlS5fO2uBcKL+bkgJAxLUYZmw+SXRsxmojHwXt59vVR1m275ybIlNK5STuTAphQCW744rWc0mISCfgbaCHMeamG+PxOF9vcevz3/xzNxuPRtiOA0YFsWL/OU5fvpHqPcZY1lRKPgciYFSQznVQKg9yZ5/CFqC6iFTFkgz6A4/aFxCRRsD3QBdjzHk3xpItJO6R4E6Df9qS5Pjpn4OpWqog80e0ZOLKI3y/9hgAL3S4i1fvq0E+X8v3gptxruvvUErlXG6rKRhj4oARwFJgPzDLGLNXRD4QkR7WYp8BhYDZIrJDRBa4K5687PiFawyautmWEAC+XnkEgHzWEVExcdqnoJRy8+Q1Y8wiYFGyc2PsXndy5/tnR8+0qsr/1juekZzPx4ubbvpwPnnRcRNSPp/EmkICe8Ii8ff14q4yhd0Sg1Iq+9MZzW5WoVh+3uha03b8ZrdaPNO6GldvxtLpi7VJyvq5MSk4armKTzC3kkJsAg98vR6Aec+3dEsMSqnsL1uMPsrNNozqQI8G5W3H3l5C2aL+FMnvm6KsfTlXC49K2Yc/+Kct+Pmk7FPYeepyms+6GRfP1Ztxrg1QKZUtaFLwkKJ2SaF7vXIA3HNnSX5+qlmWxbD2UDiXr8cCpFlDiYqOJWBUEMcvXAOgxjtLqPvuUrfEtP/MFbc8VynlHE0KHpIvlSUvAqsUT/febvXKuiyOkxevAzB/x63Rwu8u2JukzOLdZwH4dtWRVJ/z57ZQHrQ2PwWHXOS531LuM52eGZtP0vW/61h5QOdMKOUpmhSymYL5fOgbWDHNMkUdND1lVj7rhLoLV2PSLTt7a2iKc/EJhldm7eCVWTvZHRYJQL/J/7LImkgy4sj5qwAcC7+W4XuVUq6hHc0e9PvQFpQp4m9
b3K5SccsSGekth1HQz3W/tsSO5rQcCb9qe33M7nXAqCCebVuNP7clnZMYn2BcFp9SKmtpUvCg5tVKAvBSp+p0qVuWWuWKAFDNbontWuWKcOVGLGF2s5IL5HPdry296XT9vt/IpuMXbccd/m9NkutL9mS8RqCUyr60+SgbEBFbQgB44p4A2+v7at/BghFJh4gWyue6NZQcNQnZs08IzhgxPfObCP2QyvwNpVTW0aSQDXl5CffXucN2XLJQviTXC7qwpuBqC3edSXIcdvmGS5cMV0q5lyaFbKpm2SJJju3nMBTKxkkhuZbjV/L4D5scXvtxw3ECRgVx1K6fAuCXjScy/X77Tl/RJKTUbdCkkEO0vduyZPgjjSo47GiePqR5VoeUrp//CQEg+MQlwLKF6HdrjhIXn8Dm4xf595hlRdfD56KS3Jc4TDaj4uIT6PbVOnp+uyHzQSuVx+Wcr5x5TK1yhZP8t0Lx/ABULlmAtjVKU6lEfk7ZrWd0752lCChZgJCIzH2guoP9fIeQC9do9/lqABbuOs2esCtUL1PIdv2zpQdu+/3irKOejl3QIa1KZZbWFLKpLnXL8c+oDnSpa5nt3KJaSaY93YwXO1bH19uLFa+045HGSTey+/O57LtmUWJCANgTZpm1fN5u6Y2Jq45mdUhKKQc0KWRj5YvlT3Lcunpp254Mfj5efNG3YZLr7tzZLTUnbqNmEnnDssTGsF8dj1iKvG5ZXuOUk81J7pwfkaBzL1QeoUkhF8nv581fI1qlWy4glX2ks5OEBMM869IbU9YdS3H96s04Gn+4nPWHL9jOvfT7jnSfezMunk3HItItZ+/g2SiqvbWIHzfokFmV+2lSyGUqlUhau/j2sca0q5Hz9rX+ddOtEUgHzkYRMCqIYdNurae0NyySi9dimPD3Idu55U7sMz1u0QH6Tf43QwvvHT5v6QgPDrnE5uMXmRV8yul7lcpp3JoURKSLiBwUkSMiMsrB9TYisk1E4kSktztjya2ebVONKnbf/H28k/5Ku9Urx1cDGjH9mew3OiktXyy/9WG/2TqBbsnes7T9bBUAUdGWpbsdLUGelsRO6LNXogHLUh21xyxx+v6+32/k9Tm7MvSeSuUkbksKIuINTAS6ArWBASJSO1mxk8AgYLq74sjt3uxWizWvtbcdO1q2ooi/L/feVcp2nLi20hd9G6T5bD9vz1UkL1+PTbFaK1j6MBISDBHXLJ3Uhf0tA+iux6S/v8PNuHi2WBPMjZh42wJ813VeQ5aKTzBsCcnYTHmVddz5r74ZcMQYc8wYEwPMBB6yL2CMCTHG7AJ0g2AXcWaBuwJ+lqRQJZ2+hRc63OWSmFyt5pglvPHHbgDm7zjN1hMXCb10a3hu8v2mY+ISOHg2ildm7eRGrCUBvD13N52+SLqO0+0IuXCN6FhNLs74dtUR+ny30TZPRWUv7kwKFQD7xtdQ67kME5GhIhIsIsHh4eEuCS638vH24vi4bk6XDxnfPcW5ne/eB8CglgHMHNoCgGYBJVwToAsk/9DvNWkj93251mHZvacjufudxdw/YW2SHeUuWTcXcoaXdcRXXILj7y6XrsXQ7vPVfLhwn9PPzMsSV90Nu+R433DlWTmio9kYM9kYE2iMCSxdOud1mmY1cbQhcwYUze9LyPjuFPb3pUW1kix7uQ1P3FvFRdFljcTtR4fbDXcNTeNDaNvJS6lee3uupVaS2hak+89aOq0Pno1yeD07237yEgfOpt/pHh0bjzGuGZY7f8dpAGLjtYEgO3JnUggDKtkdV7SeUx7yZb8GLH6xtW2BvSL+KTtpa5YtnOLc3XcUpqt1El1O0XTs39yMi3d67sbR81dTnDt/JZqwyzdstYrEzu3U7r3LboZ2WgJGBREwKsipsu7W89t/6DJhXZplTkRco+boJemuqJtRsTr3I1tyZ1LYAlQXkaoi4gf0Bxa48f2UncQOWHs9G1WkVrkiTOjXkPd71KH6HSkTwIhU+hG8vYQ
fBzd1eZzuVOOdJeT3cy4pbDhyge/WHOVm3K1+gWYfr6Dl+JW2412hkQ7vPX7BMrlu5pZbraWLd5+h5fiVxGXw2/C1m3G2GklcfEKKpjJP2HfaUpNYsd+126TGZoOfTaXktqRgjIkDRgBLgf3ALGPMXhH5QER6AIhIUxEJBfoA34tIyuEmKlOCXmjNO91rObxWMJ8PT94bcKvsyFb8/UpbBjSrRPd6qdcIyhbxB2BU15qplmlRLfv0PQDssOtHSMu8HacZv/gAE1emvg+1vcRv+hNXHWGqg0lt7y7YS9jlG2luc/rstOAU5+q8u5R67y0F4K63F3P3O4udisedoq2JMr0dATMqtT4a5VluXRDPGLMIWJTs3Bi711uwNCspF6tcsgDPtK7mVNk65YsCMO6R+mmWq1WuCKv/044qJQswfnHKBewK+nnToWYZ/j1mGW5Ytoi/bT5ATvHVyiMcvXCNf45cSL8w8NnSgw7PF8rnw/mom1y9GQv4OyyzdG/Sb95rD1kGUTjbdD9ryylmbDnJ3EyueeVMLeZo+FVe/n0nAP4+rk0K127qaK3sKEd0NKvsI6BUQUSEla+25dHmlZNc61jrjiS7xs169h5e7Fg9iyO8fUG7zjg1OslRv8CMzScByzLhAFetH3x/7Tyd7izqJ6ZuzlCcr/+xi+0nU68JnY+KTnOY7JR16S/bYb+sub+vaz8u/rvisEufB5blUXRo8O3RpKAypVrpQnzcs57tePnLbfisT33bhLceDcpTuWQBXu58N+Meqceika3Z98H9HPiwi+2eu+9wrmM2J3nzT8tIpWvWCXET/j5EXHwCL8zYzoNfr3f4gfXegr18szLtD8iEBMPEVUe4dC2GhATD6Hl70o2l2dgV3DNuBQARV28yZe0xTl++QaQ14Z3LYC3O1c1H7vDJkgPUHL1EE8Nt0P0UlEvYd1oHv9OJonbLTwxoVtnRLZQo6Jfq897qVpOPF93+HguetvpgOOusi/bFJRiG/7o1RZmfrJsRpSZgVBA/Dm7KZ0sPcuT8VTYejUjSLHclOtbhSDK4NR/jnXl7WLznLGMX7adM4XxsfruTUx/yy+zWk8qXA5LCzxtDAPeumJvbaU1B3ZbRD9Smld0SGgClCuXD14klMh6022I0NenNus6OkjcrDf5pi+31kWRbj+4JczyiKXkH8+AfLc+Yuz0sRT/Ns7+kTDTJJdh1VCTuY+FMc9Cf226NInd185E7RMda+km8bnOuTl6W/X/LKlt7ulVVfs3gYnvPtq3G5Meb8FjzKux8974kM7CXvdyGFa+2pURBy1yKAc0qp+iXKOpgEbxKJfITMr67U0uHe1LyZPnA1+sdlsvIUNSNxyIIGBVEs7F/O7weG5/A2kMpO87t53A4MzHNmY7mK9GxSTqwZwWfImBUkG2Y7T9HnevAz40uXovhkW83uGwSoLtoUlBZ7s2utbivTlnA8gFvPwP77jsKc2fpQvRqXIHX7q/B0NbVeLnz3fzyVDNbmYaVilG8gC9zn7uX2cPuSfLsehWLZs0PkUnHwt23VWhiDeB6TBy/2S09HvjR37Y1n+zZNx/Fxqf/QbX1xCVuxsVzJTr1Tvj67y1Lsq/Fp0ssTYBR1nt+3BCS7vu4woS/D9Hqk5X8z8FeHJmRkGAyPOckuff/2su2k5dZsf+8S2JyF00KKlsSEZ5vfxdeXpaE0ebu0qx7vb3t9fYx99GocnHuKJxyuGfr6qVSnEvrfG4yfvEBao9Zyttzb3VEJ+5wZ2/+jrAkq9Da70Vx0Lp/xaqDST+8gnafocY7S6j/3jIafbAs1aVBFu46Y3udOE/DxytrP2q+X3uM0Es3+Chov0ueV+2tRdz1tuM5IzM3n0y1GdBeYu0vJpsv76EdzSpbmPZ0MwrmS/t/x0olCqRYwK9iccumQm91vTVRb9rTt5qzEtv35z/fkpj4BE5EXGfRi62p++5SV4WerXy3xrm9rl+cmXSXuqDdp+levxyR12P
ZfNyyeulXaQwZvXQ9lke+/YfvBja27SNu78j5q04v+2Hv9OUbxCcYKpXIPgzGkLcAAA7JSURBVH1JaW3FmpBgGGUdcXbs4262LzE5mdYUVLbQunppGlcunuH7vLyEkPHd6ZrGTGwvgQaVitE0oARrX29PIbvkc2Rs10zF2y+wUvqFcpBFu88yee1RGnywjNHzLTWItOZAJLLfX3vc4lvfyjt9sYaFu06ne3/AqCBb0xLAveNX0vrTVbZjYww/rD/OlehYzkdFs/e042/kw6Ztddt6UsEnUl8s0d71XDIMVpOCytV+GtyUf0Z1TPW6j7dXkiSRXOKSHs+0qpqklvJUq6pJypUt4s/QNs7NIM+ufspke3/iN+nv1yRtvx8xfbvtddOxf5OQYBxumTpvu+N1MietPkp765Lk9d9bRrOxK+j+leOO+aX7zmYqdmck7xg+GXHdYf/C1VQWTMxpNCmoXK1djTKULZqy3+H9HnWY9rSl83rb6M6288m3LW1W1bKWU90KaXdglyjox5tdayaZnGevXoWitmdlV6cjM7ckSbW3Fjn1Lb3aW4tSvTZz88kUz/hkyQFCIq6nKOvMLnuOuGLUz+XrMbT5bBXPT9+W4pplSZNb4uITOHI+ikmrj/L3vnO2pdWX7j3L5eupr4mVXGJtKeLqzdsL3knap6DyJPsFAf18vOherxxBu88Qce3WP9bENuLNb3WkdGHLENmFL7Ti+IVbI4gCShagV+OKPNq8MiKCv683ZQrn43zUTQ6P7croeXuYueUUvw1pzlM/3pqvkNynvepzR1F/nszgUhe5RWK7fKKHJ25ItWztMUtZMKIlc7eH8UD98tQuVyTd9aKqvRlE8QJ+DG1TjYNno/hzexhBI1vR/av1LH+5DdXvKMzVm3HEJ5gUQ57j7PoULlsnAyZftwpSLq3+2pxdzHVQC5q/4zRrD4Wzfcx9aQdttSXkEh8u3Mf2k5f45tHGTt1zOzQpKAU0qFSUoN1nKODnzcyhLUhIMLZOwzJFbtU06lYoSt0KRTl18brt2gvJ5lGsfb09Jy9ex9fbi/G96jO+l2WhwadaVU21fbpvU0sfxYj2d3EtJo53H6yT5rfvaqULunV4a1ZK7MOwl97qtj2+sSSNHzeEOLWXeIKBiGsxjLNbyPH1ObsAmLMtlDe71qLNp6u4eC2Gv19pQ4ViBcjv552iyStxbatEP6y/tX5U8hVxD51LfdOl9NbWen3OToJPXGLlq+04fdmyOdTCXWcY1jYy3Vrr7dKkoBTwTKtq1CxbhNbVSzm1c12lEgV4s2tN+jVN2eHs7+vN3Q72quhWrxwh47sTHHKR3t9tpF2N0oztWS/Jvtr/ub+G7fXIjtWJvB7DzxtvzTloXrUEre4qxQsdqzNu0X6+X2tpx7/3zpJMH9KCu99Z7HDiW05csdZZ6Q3xPHLe8Ydz4of292uOcV/tsly01hI7fWHZ2nXBiJYMmPyvbR0rwPb3nWjsolud60N+CXa4vW1qIm/E2molN2Li8fEWfL29OH35BrOCb21olPgFBOCNP3YRNLK10++RGZLdZ9clFxgYaIKDU65Dr1RuFRxykT+2hVG7fBEeb5F0W9TE2sShj7ri5+PFg1+vZ3dYJEfGdrWNq0/8oHJU8yhewDfFt9YxD9TmA+t+04X/v717j5GqPOM4/v2xC8udBaF0cVGwIEKiyy3CCrFKlaKoiRarpKkYsdSojSaaFqpFa2y1tfHWVNNqrdaopVgvW2K13OxFlJvcQQTrctkqCwiIKLAsT/847x6GYWAX6szs2X0+yWTOec87m/eBs/vM+55z3rd1IZeW9eD5+RuP+Gxzd3JxG6p2Hr7E660X9KV/SUe+/2z9U4+8fOM5DOjRkWkLNzE19JZm/GDkYU+5r713DP3ufD3e71/Skb/dcmJJQdJiMxtaX72s9hQkjQEeAQqAJ83s/rTjRcAfgSHAduAqM6vMZpucS5qhvbowtFfmi9TPXHc2ry6
polXobUy/oZwtn+6lsKAF911xJmWlxUd8pvL+sazbspu2RYV0a1/Ej19ewQPjzmLovdE0GdeN7E1BC3FXxSpuH92PCef0YunGnawOU3/3L+l42DTgc277Ok/868N4aGVASUd+dWUZFz96aJnPc0/vFq8X0VSkJwSAh2c1fDrwyx+bd0RZ+rQnqQkhV7LWU5BUALwPXAhsJlqec7yZrU6pcyNwlpndIOlq4HIzu+pYP9d7Cs6dmLnvVdOzSxv6fOXIoa1MFm/YwaCexfG1lSUbd7Cu+jMuK+vBri9qGPbz2fTu2o65t58Xf6am9iAFEi1aiC/219J/avRH7a3Jo/jzwk3U1B7ksTcPPWD37MSzOfPkThS3bcUz8yq5q2IVhS3EtwaXMm3RJo7l0rIe/HXZoWch2hcVxnMsNVUj+pzEc9cPP6HPNrSnkM2kUA7cbWbfDPtTAMzsvpQ6b4Q6b0sqBD4GutkxGuVJwbnk2LPvALPfq+aytBlxZ63eQuX2PcdcHbDuz8Ce/bXsramla/si9tbUcsZPokRTef9Yag8aNbUH2bp7X3yH2H+27uG0bu3ieq1btohnT63TuW1LDtQay+8ezfLNu/h8fy3jn3inwXFdN6J3xmVYs+3uSwdw7Yje9VfMoDEkhXHAGDO7Pux/FxhmZjen1FkZ6mwO+x+EOtvSftYkYBLAKaecMmTDhg0451wuvbKkisICcUH/7odNJvjZvgO0aVlAC8GG7Z+z5qNP4yfsl23aSU3tQT7Zs58LB3SnaucXdGjdkp2f7+f5BRu5YlAp/b7agZVVu3hw5vuMPbOE26ZHy5/efH4f1lXvjm9/nTd5FCWdWjfoRohMmlRSSOU9BeecO34NTQrZfKK5Cki9X680lGWsE4aPOhFdcHbOOZcH2UwKC4G+knpLagVcDVSk1akAJoTtccCcY11PcM45l11ZuyXVzA5Iuhl4g+iW1KfMbJWke4BFZlYB/B54VtJ64BOixOGccy5Psvqcgpm9BryWVjY1ZXsvcGU22+Ccc67hfJZU55xzMU8KzjnnYp4UnHPOxTwpOOeciyVullRJW4ETfaS5K3DUB+MSpqnE0lTiAI+lsfJYIqeaWbf6KiUuKfw/JC1qyBN9SdBUYmkqcYDH0lh5LMfHh4+cc87FPCk455yLNbek8Lt8N+BL1FRiaSpxgMfSWHksx6FZXVNwzjl3bM2tp+Ccc+4YPCk455yLNYukIGmMpLWS1kuanO/21EfSU5KqwyJEdWVdJM2UtC68dw7lkvRoiG25pMH5a/mRJPWUNFfSakmrJN0SyhMXj6TWkhZIWhZi+Wko7y1pfmjztDBVPJKKwv76cLxXPtufTlKBpCWSZoT9pMZRKWmFpKWSFoWyxJ1fAJKKJb0o6T1JaySV5zqWJp8UJBUAvwEuAgYA4yUNyG+r6vU0MCatbDIw28z6ArPDPkRx9Q2vScDjOWpjQx0AbjOzAcBw4Kbw75/EePYBo8ysDBgIjJE0HPgF8JCZ9QF2ABND/YnAjlD+UKjXmNwCrEnZT2ocAOeb2cCUe/iTeH4BPAK8bmZnAGVE/z+5jcXMmvQLKAfeSNmfAkzJd7sa0O5ewMqU/bVASdguAdaG7d8C4zPVa4wv4FXgwqTHA7QF3gWGET1hWph+vhGtJVIetgtDPeW77aE9pUR/YEYBMwAlMY7Qpkqga1pZ4s4vopUnP0z/t811LE2+pwCcDGxK2d8cypKmu5l9FLY/BrqH7cTEF4YdBgHzSWg8YchlKVANzAQ+AHaa2YFQJbW9cSzh+C7gpNy2+KgeBn4IHAz7J5HMOAAM+LukxZImhbIknl+9ga3AH8Kw3pOS2pHjWJpDUmhyLPpakKh7iSW1B/4C3Gpmn6YeS1I8ZlZrZgOJvmmfDZyR5yYdN0mXANVmtjjfbfmSjDSzwUTDKTdJOjf1YILOr0JgMPC4mQ0C9nBoqAjITSzNISlUAT1T9ktDWdJskVQCEN6rQ3mjj09SS6K
E8JyZvRSKExsPgJntBOYSDbMUS6pbxTC1vXEs4XgnYHuOm5rJCOAySZXAn4iGkB4heXEAYGZV4b0aeJkoWSfx/NoMbDaz+WH/RaIkkdNYmkNSWAj0DXdWtCJaB7oiz206ERXAhLA9gWhsvq78mnAnwnBgV0pXM+8kiWgt7jVm9mDKocTFI6mbpOKw3Ybo2sgaouQwLlRLj6UuxnHAnPBNL6/MbIqZlZpZL6Lfhzlm9h0SFgeApHaSOtRtA6OBlSTw/DKzj4FNkvqFom8Aq8l1LPm+uJKjCzgXA+8Tjf/eke/2NKC9LwAfATVE3x4mEo3hzgbWAbOALqGuiO6u+gBYAQzNd/vTYhlJ1N1dDiwNr4uTGA9wFrAkxLISmBrKTwMWAOuB6UBRKG8d9teH46flO4YMMZ0HzEhqHKHNy8JrVd3vdxLPr9C+gcCicI69AnTOdSw+zYVzzrlYcxg+cs4510CeFJxzzsU8KTjnnIt5UnDOORfzpOCccy7mScG5o5B0R5gNdXmYgXOYpFsltc1325zLFr8l1bkMJJUDDwLnmdk+SV2BVsA8ovvBt+W1gc5lifcUnMusBNhmZvsAQhIYB/QA5kqaCyBptKS3Jb0raXqY46lujv9fhnn+F0jqE8qvlLRS0ZoM/8xPaM4dnfcUnMsg/HH/N9EU2bOAaWb2jzBf0FAz2xZ6Dy8BF5nZHkk/InoK+J5Q7wkz+5mka4Bvm9klklYAY8ysSlKxRXMoOddoeE/BuQzM7DNgCNHiJVuBaZKuTas2nGjhprfCdNoTgFNTjr+Q8l4ett8Cnpb0PaAgO6137sQV1l/FuebJzGqBN4E3wzf8CWlVBMw0s/FH+xHp22Z2g6RhwFhgsaQhZtZoZhx1znsKzmUgqZ+kvilFA4ENwG6gQyh7BxiRcr2gnaTTUz5zVcr726HO18xsvplNJeqBpE597FzeeU/BuczaA78OU2UfIJohdBIwHnhd0n/N7PwwpPSCpKLwuTuJZuQF6CxpOdHaznW9iQdCshHRzJfLchKNcw3kF5qdy4LUC9L5botzx8OHj5xzzsW8p+Cccy7mPQXnnHMxTwrOOedinhScc87FPCk455yLeVJwzjkX+x+DURa1CBCVtgAAAABJRU5ErkJggg==\n", + "text/plain": [ + "
" + ] + }, + "metadata": { + "needs_background": "light" + }, + "output_type": "display_data" + } + ], + "source": [ + "train_loss = np.array(model.get_train_summary('Loss'))\n", + "val_loss = np.array(model.get_validation_summary('Loss'))\n", + "\n", + "import matplotlib.pyplot as plt\n", + "plt.plot(train_loss[:,0],train_loss[:,1],label='train loss')\n", + "plt.plot(val_loss[:,0],val_loss[:,1],label='validation loss',color='green')\n", + "plt.title('Training and validation loss')\n", + "plt.xlabel('Steps')\n", + "plt.ylabel('Loss')\n", + "plt.legend()\n", + "plt.show()" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Trained 512 records in 0.020173091 seconds. Throughput is 25380.344 records/second. Loss is 0.0092472015.\n", + "Top1Accuracy is Accuracy(correct: 8707, count: 10000, accuracy: 0.8707)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "The blue line is the training loss, while the green line is the validation loss. Note that your own results may vary \n", + "slightly due to a different random initialization of your network." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "As you can see, the training loss decreases with every epoch. That's what you would \n", + "expect when running gradient descent optimization -- the quantity you are trying to minimize should get lower with every iteration. But that \n", + "isn't the case for the validation loss: it seems to be optimized at about 1/5 of the total training epochs, which is 20/5 = 4. This is an example of what we were warning \n", + "against earlier: a model that performs better on the training data isn't necessarily a model that will do better on data it has never seen \n", + "before. 
In precise terms, what you are seeing is \"overfitting\": after about the fourth epoch, we are over-optimizing on the training data, and we \n", + "end up learning representations that are specific to the training data and do not generalize to data outside of the training set.\n", + "\n", + "In this case, to prevent overfitting, we could simply stop training after about four epochs. In general, there is a range of techniques you can \n", + "leverage to mitigate overfitting, which we will cover in the next chapter.\n", + "\n", + "Let's train a new network from scratch for 4 epochs, then evaluate it on our test data:" + ] + }, + { + "cell_type": "code", + "execution_count": 14, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "creating: createZooKerasSequential\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createRMSprop\n", + "creating: createZooKerasBinaryCrossEntropy\n", + "creating: createZooKerasBinaryAccuracy\n" + ] + } + ], + "source": [ + "model = models.Sequential()\n", + "model.add(layers.Dense(16, activation='relu', input_shape=(10000,)))\n", + "model.add(layers.Dense(16, activation='relu'))\n", + "model.add(layers.Dense(1, activation='sigmoid'))\n", + "\n", + "model.compile(optimizer='rmsprop',\n", + " loss='binary_crossentropy',\n", + " metrics=['accuracy'])\n", + "\n", + "model.fit(x_train, y_train, nb_epoch=4, batch_size=512)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "_INFO - Trained 512 records in 0.023978868 seconds. Throughput is 21352.133 records/second. 
Loss is 0.108611815._" + ] + }, + { + "cell_type": "code", + "execution_count": 15, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "[0.3026159405708313, 0.8824800252914429]" + ] + }, + "execution_count": 15, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "results = model.evaluate(x_test, y_test)\n", + "results" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Our fairly naive approach achieves an accuracy of 88%. With state-of-the-art approaches, one should be able to get close to 95%." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## Using a trained network to generate predictions on new data\n", + "\n", + "After having trained a network, you will want to use it in a practical setting. You can generate the likelihood of reviews being positive \n", + "by using the `predict` method:" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "#### Predict result\n", + "_In Keras, one could just call the following code to predict on the test data:_\n", + "\n", + " model.predict(x_test)\n", + "_In analytics-zoo, `predict` returns an RDD, so you need to call its `collect` method to get the result:_" + ] + }, + { + "cell_type": "code", + "execution_count": 16, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "[array([0.9291455], dtype=float32),\n", + " array([0.96646625], dtype=float32),\n", + " array([0.9992888], dtype=float32),\n", + " array([0.6787984], dtype=float32),\n", + " array([0.8021081], dtype=float32),\n", + " array([0.21841279], dtype=float32),\n", + " array([0.00925558], dtype=float32),\n", + " array([0.03703438], dtype=float32),\n", + " array([0.9931043], dtype=float32),\n", + " array([0.77527326], dtype=float32),\n", + " array([0.57820714], dtype=float32),\n", + " array([0.9590181], dtype=float32),\n", + " array([0.23954743], dtype=float32),\n", + " array([0.99956185], dtype=float32),\n", + " array([0.99986804], 
dtype=float32),\n", + " array([0.89252], dtype=float32),\n", + " array([0.6362871], dtype=float32),\n", + " array([0.00453862], dtype=float32),\n", + " array([0.00022487], dtype=float32),\n", + " array([0.5878058], dtype=float32),\n", + " array([0.79077387], dtype=float32),\n", + " array([0.951337], dtype=float32),\n", + " array([0.02377308], dtype=float32),\n", + " array([0.01233012], dtype=float32),\n", + " array([2.1225938e-05], dtype=float32),\n", + " array([0.92153585], dtype=float32),\n", + " array([0.27290833], dtype=float32),\n", + " array([0.9678427], dtype=float32),\n", + " array([0.98325783], dtype=float32),\n", + " array([0.01623936], dtype=float32),\n", + " array([0.9855356], dtype=float32),\n", + " array([0.00476582], dtype=float32),\n", + " array([0.99998844], dtype=float32),\n", + " array([0.37596926], dtype=float32),\n", + " array([0.99066436], dtype=float32),\n", + " array([0.995574], dtype=float32),\n", + " array([0.92657614], dtype=float32),\n", + " array([0.00201875], dtype=float32),\n", + " array([0.04801914], dtype=float32),\n", + " array([0.9741373], dtype=float32),\n", + " array([0.11522112], dtype=float32),\n", + " array([0.26702783], dtype=float32),\n", + " array([0.9866962], dtype=float32),\n", + " array([0.9840716], dtype=float32),\n", + " array([0.7866682], dtype=float32),\n", + " array([0.8413062], dtype=float32),\n", + " array([0.5110062], dtype=float32),\n", + " array([0.9998983], dtype=float32),\n", + " array([0.80300266], dtype=float32),\n", + " array([0.914983], dtype=float32),\n", + " array([0.9802167], dtype=float32),\n", + " array([0.99499327], dtype=float32),\n", + " array([0.6802777], dtype=float32),\n", + " array([0.94399655], dtype=float32),\n", + " array([0.00192062], dtype=float32),\n", + " array([0.01228456], dtype=float32),\n", + " array([0.0108898], dtype=float32),\n", + " array([0.08844584], dtype=float32),\n", + " array([0.00843766], dtype=float32),\n", + " array([0.975319], dtype=float32),\n", + " 
array([0.9284045], dtype=float32),\n", + " array([0.28974563], dtype=float32),\n", + " array([0.00184639], dtype=float32),\n", + " array([0.7453014], dtype=float32),\n", + " array([0.08012109], dtype=float32),\n", + " array([0.00030172], dtype=float32),\n", + " array([0.9998647], dtype=float32),\n", + " array([0.9998778], dtype=float32),\n", + " array([0.99604577], dtype=float32),\n", + " array([0.9999883], dtype=float32),\n", + " array([0.03597537], dtype=float32),\n", + " array([0.9160098], dtype=float32),\n", + " array([0.1100359], dtype=float32),\n", + " array([0.21295448], dtype=float32),\n", + " array([0.45663244], dtype=float32),\n", + " array([0.9961888], dtype=float32),\n", + " array([0.3244737], dtype=float32),\n", + " array([0.99997187], dtype=float32),\n", + " array([0.12168489], dtype=float32),\n", + " array([0.04461992], dtype=float32),\n", + " array([0.05755902], dtype=float32),\n", + " array([0.16763222], dtype=float32),\n", + " array([0.87495023], dtype=float32),\n", + " array([0.21862818], dtype=float32),\n", + " array([0.01208456], dtype=float32),\n", + " array([0.10023356], dtype=float32),\n", + " array([0.29649988], dtype=float32),\n", + " array([0.99824023], dtype=float32),\n", + " array([0.39383507], dtype=float32),\n", + " array([0.3296326], dtype=float32),\n", + " array([0.68079424], dtype=float32),\n", + " array([0.7821922], dtype=float32),\n", + " array([0.99843687], dtype=float32),\n", + " array([0.998406], dtype=float32),\n", + " array([0.09952371], dtype=float32),\n", + " array([0.20677786], dtype=float32),\n", + " array([0.07357709], dtype=float32),\n", + " array([0.99855345], dtype=float32),\n", + " array([0.00219191], dtype=float32),\n", + " array([0.9438781], dtype=float32),\n", + " array([0.11632669], dtype=float32),\n", + " array([0.31134847], dtype=float32),\n", + " array([0.12542696], dtype=float32),\n", + " array([0.9763289], dtype=float32),\n", + " array([0.9640528], dtype=float32),\n", + " array([0.6855567], 
dtype=float32),\n", + " array([0.9791134], dtype=float32),\n", + " array([0.47091678], dtype=float32),\n", + " array([0.01279368], dtype=float32),\n", + " array([0.33637416], dtype=float32),\n", + " array([0.92876065], dtype=float32),\n", + " array([0.02191273], dtype=float32),\n", + " array([0.02534392], dtype=float32),\n", + " array([0.27622753], dtype=float32),\n", + " array([0.03425935], dtype=float32),\n", + " array([0.11640935], dtype=float32),\n", + " array([0.99640983], dtype=float32),\n", + " array([0.99434376], dtype=float32),\n", + " array([0.02413097], dtype=float32),\n", + " array([0.36645678], dtype=float32),\n", + " array([0.01748311], dtype=float32),\n", + " array([0.18354651], dtype=float32),\n", + " array([0.06130786], dtype=float32),\n", + " array([0.21773124], dtype=float32),\n", + " array([0.95380867], dtype=float32),\n", + " array([0.5504796], dtype=float32),\n", + " array([0.0219801], dtype=float32),\n", + " array([0.01981366], dtype=float32),\n", + " array([0.00031499], dtype=float32),\n", + " array([0.13779135], dtype=float32),\n", + " array([0.9984407], dtype=float32),\n", + " array([0.40540016], dtype=float32),\n", + " array([0.77313596], dtype=float32),\n", + " array([0.01747493], dtype=float32),\n", + " array([0.0557996], dtype=float32),\n", + " array([0.06081589], dtype=float32),\n", + " array([0.04389222], dtype=float32),\n", + " array([0.9974957], dtype=float32),\n", + " array([0.5977306], dtype=float32),\n", + " array([0.02096312], dtype=float32),\n", + " array([0.8821718], dtype=float32),\n", + " array([0.00831421], dtype=float32),\n", + " array([0.85648173], dtype=float32),\n", + " array([0.61687416], dtype=float32),\n", + " array([0.00555464], dtype=float32),\n", + " array([0.9578813], dtype=float32),\n", + " array([0.16184929], dtype=float32),\n", + " array([0.8980497], dtype=float32),\n", + " array([0.99223125], dtype=float32),\n", + " array([0.0132063], dtype=float32),\n", + " array([0.92253935], dtype=float32),\n", + " 
array([0.06267989], dtype=float32),\n", + " array([0.9891216], dtype=float32),\n", + " array([0.9419726], dtype=float32),\n", + " array([0.00210088], dtype=float32),\n", + " array([0.99873346], dtype=float32),\n", + " array([0.01406829], dtype=float32),\n", + " array([0.08238389], dtype=float32),\n", + " array([0.7304331], dtype=float32),\n", + " array([0.07515999], dtype=float32),\n", + " array([0.00137386], dtype=float32),\n", + " array([0.999446], dtype=float32),\n", + " array([0.06388262], dtype=float32),\n", + " array([0.1658486], dtype=float32),\n", + " array([0.99999976], dtype=float32),\n", + " array([0.9154651], dtype=float32),\n", + " array([0.9564062], dtype=float32),\n", + " array([0.9038421], dtype=float32),\n", + " array([0.9414884], dtype=float32),\n", + " array([0.023891], dtype=float32),\n", + " array([0.27174172], dtype=float32),\n", + " array([0.9541309], dtype=float32),\n", + " array([0.06518997], dtype=float32),\n", + " array([0.01072453], dtype=float32),\n", + " array([0.99960166], dtype=float32),\n", + " array([0.0525161], dtype=float32),\n", + " array([0.00074362], dtype=float32),\n", + " array([0.84470475], dtype=float32),\n", + " array([0.68433523], dtype=float32),\n", + " array([0.73134536], dtype=float32),\n", + " array([0.02615881], dtype=float32),\n", + " array([0.9435008], dtype=float32),\n", + " array([0.9924217], dtype=float32),\n", + " array([0.81417906], dtype=float32),\n", + " array([0.99532396], dtype=float32),\n", + " array([0.11175746], dtype=float32),\n", + " array([0.01164351], dtype=float32),\n", + " array([0.99615234], dtype=float32),\n", + " array([0.99891615], dtype=float32),\n", + " array([0.85309887], dtype=float32),\n", + " array([0.3838549], dtype=float32),\n", + " array([0.08728907], dtype=float32),\n", + " array([0.99386746], dtype=float32),\n", + " array([0.99560165], dtype=float32),\n", + " array([0.01668872], dtype=float32),\n", + " array([0.865859], dtype=float32),\n", + " array([0.00344433], 
dtype=float32),\n", + " array([0.10099232], dtype=float32),\n", + " array([0.18755046], dtype=float32),\n", + " array([0.17657793], dtype=float32),\n", + " array([0.99963737], dtype=float32),\n", + " array([0.1608229], dtype=float32),\n", + " array([0.99993265], dtype=float32),\n", + " array([0.9839023], dtype=float32),\n", + " array([0.8809537], dtype=float32),\n", + " array([0.16208851], dtype=float32),\n", + " array([0.9696871], dtype=float32),\n", + " array([0.9999682], dtype=float32),\n", + " array([0.06924216], dtype=float32),\n", + " array([0.03222934], dtype=float32),\n", + " array([0.01602055], dtype=float32),\n", + " array([0.16013445], dtype=float32),\n", + " array([0.13429435], dtype=float32),\n", + " array([0.9997141], dtype=float32),\n", + " array([0.5197483], dtype=float32),\n", + " array([0.02680757], dtype=float32),\n", + " array([0.9984849], dtype=float32),\n", + " array([0.00670389], dtype=float32),\n", + " array([0.04960306], dtype=float32),\n", + " array([0.05909725], dtype=float32),\n", + " array([0.07385926], dtype=float32),\n", + " array([0.01410465], dtype=float32),\n", + " array([0.4758584], dtype=float32),\n", + " array([0.9994578], dtype=float32),\n", + " array([0.00207514], dtype=float32),\n", + " array([0.98792577], dtype=float32),\n", + " array([0.99999547], dtype=float32),\n", + " array([3.411692e-06], dtype=float32),\n", + " array([0.7747846], dtype=float32),\n", + " array([0.91780055], dtype=float32),\n", + " array([0.9927326], dtype=float32),\n", + " array([0.2352484], dtype=float32),\n", + " array([0.00142602], dtype=float32),\n", + " array([0.32317147], dtype=float32),\n", + " array([0.00565691], dtype=float32),\n", + " array([0.53445995], dtype=float32),\n", + " array([0.8927338], dtype=float32),\n", + " array([0.13075478], dtype=float32),\n", + " array([0.92551], dtype=float32),\n", + " array([0.06454863], dtype=float32),\n", + " array([0.945902], dtype=float32),\n", + " array([0.98765355], dtype=float32),\n", + " 
array([0.00029585], dtype=float32),\n", + " array([0.02011549], dtype=float32),\n", + " array([0.03295863], dtype=float32),\n", + " array([0.00324995], dtype=float32),\n", + " array([0.01008756], dtype=float32),\n", + " array([0.9823131], dtype=float32),\n", + " array([0.27388793], dtype=float32),\n", + " array([0.5470663], dtype=float32),\n", + " array([0.00781587], dtype=float32),\n", + " array([0.005428], dtype=float32),\n", + " array([0.9992046], dtype=float32),\n", + " array([0.11337327], dtype=float32),\n", + " array([0.0242104], dtype=float32),\n", + " array([0.06808829], dtype=float32),\n", + " array([0.9719501], dtype=float32),\n", + " array([0.960842], dtype=float32),\n", + " array([0.05452807], dtype=float32),\n", + " array([0.9993693], dtype=float32),\n", + " array([0.7771042], dtype=float32),\n", + " array([0.99957746], dtype=float32),\n", + " array([0.05997226], dtype=float32),\n", + " array([0.903902], dtype=float32),\n", + " array([0.86632144], dtype=float32),\n", + " array([0.99996936], dtype=float32),\n", + " array([0.69629955], dtype=float32),\n", + " array([0.8225713], dtype=float32),\n", + " array([0.00246663], dtype=float32),\n", + " array([0.0023386], dtype=float32),\n", + " array([0.9990294], dtype=float32),\n", + " array([0.9977755], dtype=float32),\n", + " array([0.9861735], dtype=float32),\n", + " array([0.03454849], dtype=float32),\n", + " array([0.08657297], dtype=float32),\n", + " array([0.9999199], dtype=float32),\n", + " array([0.9969818], dtype=float32),\n", + " array([0.05295636], dtype=float32),\n", + " array([0.99947375], dtype=float32),\n", + " array([0.01684208], dtype=float32),\n", + " array([0.00564773], dtype=float32),\n", + " array([0.00795649], dtype=float32),\n", + " array([0.9999298], dtype=float32),\n", + " array([0.06059966], dtype=float32),\n", + " array([0.9730349], dtype=float32),\n", + " array([0.3703475], dtype=float32),\n", + " array([0.00036947], dtype=float32),\n", + " array([0.3769037], dtype=float32),\n", + " 
array([0.00783887], dtype=float32),\n", + " array([0.0283057], dtype=float32),\n", + " array([0.04452141], dtype=float32),\n", + " array([0.47198933], dtype=float32),\n", + " array([0.99648005], dtype=float32),\n", + " array([0.9991768], dtype=float32),\n", + " array([0.99995506], dtype=float32),\n", + " array([0.1129663], dtype=float32),\n", + " array([0.01932632], dtype=float32),\n", + " array([0.01605185], dtype=float32),\n", + " array([0.9423293], dtype=float32),\n", + " array([0.06175272], dtype=float32),\n", + " array([0.99719644], dtype=float32),\n", + " array([0.10867236], dtype=float32),\n", + " array([0.02944934], dtype=float32),\n", + " array([0.9235288], dtype=float32),\n", + " array([0.47749949], dtype=float32),\n", + " array([0.88871026], dtype=float32),\n", + " array([0.11335868], dtype=float32),\n", + " array([0.9990363], dtype=float32),\n", + " array([0.03595558], dtype=float32),\n", + " array([0.19236687], dtype=float32),\n", + " array([0.99891937], dtype=float32),\n", + " array([0.28199562], dtype=float32),\n", + " array([0.01422782], dtype=float32),\n", + " array([0.0095924], dtype=float32),\n", + " array([0.05586888], dtype=float32),\n", + " array([0.90418166], dtype=float32),\n", + " array([0.06067238], dtype=float32),\n", + " array([0.12407589], dtype=float32),\n", + " array([0.00962292], dtype=float32),\n", + " array([0.01531602], dtype=float32),\n", + " array([0.00537257], dtype=float32),\n", + " array([0.3383139], dtype=float32),\n", + " array([0.9790156], dtype=float32),\n", + " array([0.99629223], dtype=float32),\n", + " array([0.999041], dtype=float32),\n", + " array([0.4543444], dtype=float32),\n", + " array([0.9975923], dtype=float32),\n", + " array([0.994676], dtype=float32),\n", + " array([0.9955705], dtype=float32),\n", + " array([0.00852212], dtype=float32),\n", + " array([0.00965995], dtype=float32),\n", + " array([0.04810032], dtype=float32),\n", + " array([0.03414589], dtype=float32),\n", + " array([0.19549885], 
dtype=float32),\n", + " array([0.04221633], dtype=float32),\n", + " array([0.999081], dtype=float32),\n", + " array([0.95819485], dtype=float32),\n", + " array([0.02422899], dtype=float32),\n", + " array([0.00078607], dtype=float32),\n", + " array([0.01256537], dtype=float32),\n", + " array([0.573112], dtype=float32),\n", + " array([0.97446126], dtype=float32),\n", + " array([0.01481443], dtype=float32),\n", + " array([0.91800165], dtype=float32),\n", + " array([0.04669483], dtype=float32),\n", + " array([0.12667258], dtype=float32),\n", + " array([0.9989146], dtype=float32),\n", + " array([0.11938414], dtype=float32),\n", + " array([0.8915277], dtype=float32),\n", + " array([0.01737923], dtype=float32),\n", + " array([0.9999982], dtype=float32),\n", + " array([0.96499765], dtype=float32),\n", + " array([0.02043628], dtype=float32),\n", + " array([0.6315207], dtype=float32),\n", + " array([0.9999362], dtype=float32),\n", + " array([0.34459296], dtype=float32),\n", + " array([0.98566204], dtype=float32),\n", + " array([0.97014564], dtype=float32),\n", + " array([0.99786866], dtype=float32),\n", + " array([0.01015446], dtype=float32),\n", + " array([0.8746796], dtype=float32),\n", + " array([0.9308818], dtype=float32),\n", + " array([0.00047523], dtype=float32),\n", + " array([0.99945456], dtype=float32),\n", + " array([0.00871587], dtype=float32),\n", + " array([0.87762976], dtype=float32),\n", + " array([0.00176486], dtype=float32),\n", + " array([0.9776403], dtype=float32),\n", + " array([0.00964555], dtype=float32),\n", + " array([0.38256386], dtype=float32),\n", + " array([0.9978903], dtype=float32),\n", + " array([0.48501348], dtype=float32),\n", + " array([0.9758839], dtype=float32),\n", + " array([0.76296157], dtype=float32),\n", + " array([0.00493866], dtype=float32),\n", + " array([0.05346973], dtype=float32),\n", + " array([0.9999949], dtype=float32),\n", + " array([0.00160436], dtype=float32),\n", + " array([0.00270788], dtype=float32),\n", + " 
array([0.2500689], dtype=float32),\n", + " array([0.01582536], dtype=float32),\n", + " array([0.8722655], dtype=float32),\n", + " array([0.95772576], dtype=float32),\n", + " array([0.9999635], dtype=float32),\n", + " array([0.86149096], dtype=float32),\n", + " array([0.12169719], dtype=float32),\n", + " array([0.28068733], dtype=float32),\n", + " array([0.00027394], dtype=float32),\n", + " array([0.00176786], dtype=float32),\n", + " array([0.00266076], dtype=float32),\n", + " array([0.00955428], dtype=float32),\n", + " array([0.06166862], dtype=float32),\n", + " array([0.96516556], dtype=float32),\n", + " array([0.99725515], dtype=float32),\n", + " array([0.86680585], dtype=float32),\n", + " array([0.7473102], dtype=float32),\n", + " array([0.09695191], dtype=float32),\n", + " array([0.00296136], dtype=float32),\n", + " array([0.00260568], dtype=float32),\n", + " array([0.9957995], dtype=float32),\n", + " array([0.99882144], dtype=float32),\n", + " array([0.00024917], dtype=float32),\n", + " array([0.9272133], dtype=float32),\n", + " array([0.00036355], dtype=float32),\n", + " array([0.9843275], dtype=float32),\n", + " array([0.02331734], dtype=float32),\n", + " array([0.10120519], dtype=float32),\n", + " array([0.47101545], dtype=float32),\n", + " array([0.02735368], dtype=float32),\n", + " array([0.9993734], dtype=float32),\n", + " array([0.9975063], dtype=float32),\n", + " array([0.9815989], dtype=float32),\n", + " array([0.9998317], dtype=float32),\n", + " array([0.08344828], dtype=float32),\n", + " array([0.9794001], dtype=float32),\n", + " array([0.9987134], dtype=float32),\n", + " array([0.00018663], dtype=float32),\n", + " array([0.8894404], dtype=float32),\n", + " array([0.04193362], dtype=float32),\n", + " array([0.99497104], dtype=float32),\n", + " array([0.04068065], dtype=float32),\n", + " array([0.00536961], dtype=float32),\n", + " array([0.06312026], dtype=float32),\n", + " array([0.05857188], dtype=float32),\n", + " array([0.68636453], 
dtype=float32),\n", + " array([0.9688917], dtype=float32),\n", + " array([0.19696496], dtype=float32),\n", + " array([0.06663571], dtype=float32),\n", + " array([0.03183109], dtype=float32),\n", + " array([0.96671313], dtype=float32),\n", + " array([0.05208089], dtype=float32),\n", + " array([0.09314187], dtype=float32),\n", + " array([0.9926622], dtype=float32),\n", + " array([0.92366344], dtype=float32),\n", + " array([0.9675213], dtype=float32),\n", + " array([0.01860829], dtype=float32),\n", + " array([0.9165048], dtype=float32),\n", + " array([0.94261616], dtype=float32),\n", + " array([0.0835857], dtype=float32),\n", + " array([0.00041597], dtype=float32),\n", + " array([0.00907566], dtype=float32),\n", + " array([0.94010156], dtype=float32),\n", + " array([0.9840152], dtype=float32),\n", + " array([0.00041544], dtype=float32),\n", + " array([0.8775654], dtype=float32),\n", + " array([0.32026634], dtype=float32),\n", + " array([0.01421119], dtype=float32),\n", + " array([0.0130173], dtype=float32),\n", + " array([0.9884545], dtype=float32),\n", + " array([0.04571154], dtype=float32),\n", + " array([1.6011174e-05], dtype=float32),\n", + " array([0.00115296], dtype=float32),\n", + " array([0.9790429], dtype=float32),\n", + " array([0.64808387], dtype=float32),\n", + " array([4.6414338e-05], dtype=float32),\n", + " array([0.5914479], dtype=float32),\n", + " array([0.00711486], dtype=float32),\n", + " array([0.82269937], dtype=float32),\n", + " array([0.66748506], dtype=float32),\n", + " array([0.9879972], dtype=float32),\n", + " array([6.153964e-05], dtype=float32),\n", + " array([0.99550027], dtype=float32),\n", + " array([0.00570012], dtype=float32),\n", + " array([0.46884078], dtype=float32),\n", + " array([0.00010328], dtype=float32),\n", + " array([0.03197822], dtype=float32),\n", + " array([0.9972145], dtype=float32),\n", + " array([0.05284659], dtype=float32),\n", + " array([0.9037368], dtype=float32),\n", + " array([0.04048614], dtype=float32),\n", + " 
array([0.998744], dtype=float32),\n", + " array([0.99049884], dtype=float32),\n", + " array([0.01359443], dtype=float32),\n", + " array([0.9997179], dtype=float32),\n", + " array([0.99999964], dtype=float32),\n", + " array([0.98721075], dtype=float32),\n", + " array([0.00063402], dtype=float32),\n", + " array([0.820883], dtype=float32),\n", + " array([0.4547376], dtype=float32),\n", + " array([0.891108], dtype=float32),\n", + " array([0.16223074], dtype=float32),\n", + " array([0.9726654], dtype=float32),\n", + " array([0.9003827], dtype=float32),\n", + " array([0.99944574], dtype=float32),\n", + " array([0.7704998], dtype=float32),\n", + " array([0.95534146], dtype=float32),\n", + " array([0.0062368], dtype=float32),\n", + " array([0.9787254], dtype=float32),\n", + " array([0.00126028], dtype=float32),\n", + " array([0.7004171], dtype=float32),\n", + " array([0.09580212], dtype=float32),\n", + " array([0.97376186], dtype=float32),\n", + " array([0.9920665], dtype=float32),\n", + " array([0.12573221], dtype=float32),\n", + " array([0.96389884], dtype=float32),\n", + " array([0.9980578], dtype=float32),\n", + " array([0.00578512], dtype=float32),\n", + " array([0.01206519], dtype=float32),\n", + " array([0.9992673], dtype=float32),\n", + " array([0.07898365], dtype=float32),\n", + " array([0.00214792], dtype=float32),\n", + " array([0.05026222], dtype=float32),\n", + " array([0.99995553], dtype=float32),\n", + " array([0.99840695], dtype=float32),\n", + " array([0.99762005], dtype=float32),\n", + " array([1.], dtype=float32),\n", + " array([0.7520437], dtype=float32),\n", + " array([0.03680907], dtype=float32),\n", + " array([0.5764651], dtype=float32),\n", + " array([0.99977905], dtype=float32),\n", + " array([0.20490777], dtype=float32),\n", + " array([0.00173326], dtype=float32),\n", + " array([0.9939528], dtype=float32),\n", + " array([1.9042971e-05], dtype=float32),\n", + " array([0.95325047], dtype=float32),\n", + " array([0.9958734], dtype=float32),\n", + " 
array([0.03266709], dtype=float32),\n", + " array([0.02785043], dtype=float32),\n", + " array([0.05891791], dtype=float32),\n", + " array([0.985227], dtype=float32),\n", + " array([0.00210726], dtype=float32),\n", + " array([0.9943926], dtype=float32),\n", + " array([0.02131884], dtype=float32),\n", + " array([0.99827754], dtype=float32),\n", + " array([0.8846837], dtype=float32),\n", + " array([0.99997437], dtype=float32),\n", + " array([0.9946067], dtype=float32),\n", + " array([0.99978703], dtype=float32),\n", + " array([0.00300169], dtype=float32),\n", + " array([0.00031111], dtype=float32),\n", + " array([0.9819504], dtype=float32),\n", + " array([0.00375891], dtype=float32),\n", + " array([0.63086605], dtype=float32),\n", + " array([0.83123654], dtype=float32),\n", + " array([0.63774806], dtype=float32),\n", + " array([0.6908987], dtype=float32),\n", + " array([0.15456767], dtype=float32),\n", + " array([0.4055819], dtype=float32),\n", + " array([0.0910763], dtype=float32),\n", + " array([0.24731727], dtype=float32),\n", + " array([0.994842], dtype=float32),\n", + " array([0.38033506], dtype=float32),\n", + " array([0.98958546], dtype=float32),\n", + " array([0.9998734], dtype=float32),\n", + " array([0.99897206], dtype=float32),\n", + " array([0.8665204], dtype=float32),\n", + " array([0.00639246], dtype=float32),\n", + " array([0.9556338], dtype=float32),\n", + " array([0.9666423], dtype=float32),\n", + " array([0.9984849], dtype=float32),\n", + " array([0.02892273], dtype=float32),\n", + " array([0.995031], dtype=float32),\n", + " array([0.72732645], dtype=float32),\n", + " array([0.998869], dtype=float32),\n", + " array([0.01101623], dtype=float32),\n", + " array([0.9236663], dtype=float32),\n", + " array([0.0053416], dtype=float32),\n", + " array([0.9376339], dtype=float32),\n", + " array([0.9097744], dtype=float32),\n", + " array([0.32959577], dtype=float32),\n", + " array([0.69777864], dtype=float32),\n", + " array([0.02821315], dtype=float32),\n", + " 
array([0.8764768], dtype=float32),\n", + " array([0.01698065], dtype=float32),\n", + " array([0.05337453], dtype=float32),\n", + " array([0.00699902], dtype=float32),\n", + " array([0.01098163], dtype=float32),\n", + " array([0.02664173], dtype=float32),\n", + " array([0.19115667], dtype=float32),\n", + " array([0.01039254], dtype=float32),\n", + " array([0.7853336], dtype=float32),\n", + " array([0.13310696], dtype=float32),\n", + " array([0.12221986], dtype=float32),\n", + " array([0.99144626], dtype=float32),\n", + " array([0.12488245], dtype=float32),\n", + " array([0.10422938], dtype=float32),\n", + " array([0.03960704], dtype=float32),\n", + " array([0.96108264], dtype=float32),\n", + " array([0.00321886], dtype=float32),\n", + " array([0.9612626], dtype=float32),\n", + " array([0.77753425], dtype=float32),\n", + " array([0.992634], dtype=float32),\n", + " array([0.08653396], dtype=float32),\n", + " array([0.573064], dtype=float32),\n", + " array([0.994193], dtype=float32),\n", + " array([0.9994568], dtype=float32),\n", + " array([0.00164113], dtype=float32),\n", + " array([0.02974954], dtype=float32),\n", + " array([0.00094306], dtype=float32),\n", + " array([0.01469964], dtype=float32),\n", + " array([0.01007712], dtype=float32),\n", + " array([0.5073839], dtype=float32),\n", + " array([0.00891581], dtype=float32),\n", + " array([0.01340619], dtype=float32),\n", + " array([0.19153547], dtype=float32),\n", + " array([0.00785635], dtype=float32),\n", + " array([0.00529725], dtype=float32),\n", + " array([0.9981645], dtype=float32),\n", + " array([0.89561176], dtype=float32),\n", + " array([0.44029003], dtype=float32),\n", + " array([0.9998586], dtype=float32),\n", + " array([0.09905386], dtype=float32),\n", + " array([0.9964311], dtype=float32),\n", + " array([0.9990119], dtype=float32),\n", + " array([0.85216373], dtype=float32),\n", + " array([0.9999999], dtype=float32),\n", + " array([0.9584791], dtype=float32),\n", + " array([0.00405316], 
dtype=float32),\n", + " array([0.9539604], dtype=float32),\n", + " array([0.9477786], dtype=float32),\n", + " array([0.99440527], dtype=float32),\n", + " array([0.9769205], dtype=float32),\n", + " array([0.89980936], dtype=float32),\n", + " array([0.00249994], dtype=float32),\n", + " array([0.8010222], dtype=float32),\n", + " array([0.20035101], dtype=float32),\n", + " array([0.00624598], dtype=float32),\n", + " array([0.80348605], dtype=float32),\n", + " array([0.19786465], dtype=float32),\n", + " array([0.0849141], dtype=float32),\n", + " array([0.9986082], dtype=float32),\n", + " array([0.13313013], dtype=float32),\n", + " array([0.16162278], dtype=float32),\n", + " array([0.993683], dtype=float32),\n", + " array([0.55489], dtype=float32),\n", + " array([0.07838932], dtype=float32),\n", + " array([0.008188], dtype=float32),\n", + " array([0.00667501], dtype=float32),\n", + " array([0.16437183], dtype=float32),\n", + " array([0.96623826], dtype=float32),\n", + " array([0.8830661], dtype=float32),\n", + " array([0.9948244], dtype=float32),\n", + " array([0.98182225], dtype=float32),\n", + " array([0.98158526], dtype=float32),\n", + " array([0.03434094], dtype=float32),\n", + " array([0.3056716], dtype=float32),\n", + " array([0.98550564], dtype=float32),\n", + " array([0.03481789], dtype=float32),\n", + " array([0.99663454], dtype=float32),\n", + " array([0.9985311], dtype=float32),\n", + " array([0.9939167], dtype=float32),\n", + " array([0.10510859], dtype=float32),\n", + " array([0.00508491], dtype=float32),\n", + " array([0.00165993], dtype=float32),\n", + " array([0.8245149], dtype=float32),\n", + " array([0.9556251], dtype=float32),\n", + " array([0.9887638], dtype=float32),\n", + " array([0.17581154], dtype=float32),\n", + " array([0.00011383], dtype=float32),\n", + " array([0.00106234], dtype=float32),\n", + " array([0.01681961], dtype=float32),\n", + " array([0.00525573], dtype=float32),\n", + " array([0.9957818], dtype=float32),\n", + " 
array([0.8603696], dtype=float32),\n", + " array([0.00199909], dtype=float32),\n", + " array([0.04100141], dtype=float32),\n", + " array([0.00017335], dtype=float32),\n", + " array([0.9992545], dtype=float32),\n", + " array([0.9659639], dtype=float32),\n", + " array([0.8995427], dtype=float32),\n", + " array([0.12103864], dtype=float32),\n", + " array([0.710389], dtype=float32),\n", + " array([0.69011146], dtype=float32),\n", + " array([0.05472863], dtype=float32),\n", + " array([0.01922707], dtype=float32),\n", + " array([0.8451995], dtype=float32),\n", + " array([0.99947244], dtype=float32),\n", + " array([0.09135215], dtype=float32),\n", + " array([0.00159073], dtype=float32),\n", + " array([0.05302845], dtype=float32),\n", + " array([0.98199064], dtype=float32),\n", + " array([0.50679266], dtype=float32),\n", + " array([0.17369047], dtype=float32),\n", + " array([0.9998796], dtype=float32),\n", + " array([0.6899049], dtype=float32),\n", + " array([0.00706529], dtype=float32),\n", + " array([0.00957006], dtype=float32),\n", + " array([0.78653455], dtype=float32),\n", + " array([0.00385365], dtype=float32),\n", + " array([0.3887317], dtype=float32),\n", + " array([0.03332283], dtype=float32),\n", + " array([0.04271935], dtype=float32),\n", + " array([0.00691515], dtype=float32),\n", + " array([0.51850504], dtype=float32),\n", + " array([0.00992748], dtype=float32),\n", + " array([0.29605916], dtype=float32),\n", + " array([0.00028048], dtype=float32),\n", + " array([0.9928925], dtype=float32),\n", + " array([0.00124131], dtype=float32),\n", + " array([0.9764488], dtype=float32),\n", + " array([0.7932969], dtype=float32),\n", + " array([0.97071785], dtype=float32),\n", + " array([0.98584795], dtype=float32),\n", + " array([0.07081781], dtype=float32),\n", + " array([0.04931527], dtype=float32),\n", + " array([0.15695621], dtype=float32),\n", + " array([0.9994659], dtype=float32),\n", + " array([0.5096887], dtype=float32),\n", + " array([0.9917842], 
dtype=float32),\n", + " array([0.9981895], dtype=float32),\n", + " array([0.12094765], dtype=float32),\n", + " array([0.11894753], dtype=float32),\n", + " array([0.90818393], dtype=float32),\n", + " array([0.98540974], dtype=float32),\n", + " array([0.94457304], dtype=float32),\n", + " array([0.9998313], dtype=float32),\n", + " array([0.9907136], dtype=float32),\n", + " array([0.72061336], dtype=float32),\n", + " array([0.99645764], dtype=float32),\n", + " array([0.8892745], dtype=float32),\n", + " array([0.999595], dtype=float32),\n", + " array([0.4836757], dtype=float32),\n", + " array([0.97150517], dtype=float32),\n", + " array([0.95611787], dtype=float32),\n", + " array([0.01308193], dtype=float32),\n", + " array([0.00030093], dtype=float32),\n", + " array([0.99999774], dtype=float32),\n", + " array([0.00015023], dtype=float32),\n", + " array([0.00129508], dtype=float32),\n", + " array([0.02929328], dtype=float32),\n", + " array([0.15967111], dtype=float32),\n", + " array([0.04437197], dtype=float32),\n", + " array([0.00077572], dtype=float32),\n", + " array([0.00117108], dtype=float32),\n", + " array([0.02031642], dtype=float32),\n", + " array([0.78561157], dtype=float32),\n", + " array([0.6640868], dtype=float32),\n", + " array([0.96112865], dtype=float32),\n", + " array([0.9977581], dtype=float32),\n", + " array([0.24247183], dtype=float32),\n", + " array([0.3876404], dtype=float32),\n", + " array([0.00950658], dtype=float32),\n", + " array([0.12216215], dtype=float32),\n", + " array([0.00086611], dtype=float32),\n", + " array([0.99396956], dtype=float32),\n", + " array([0.77168643], dtype=float32),\n", + " array([0.00386494], dtype=float32),\n", + " array([0.9824516], dtype=float32),\n", + " array([0.9933523], dtype=float32),\n", + " array([0.95625204], dtype=float32),\n", + " array([0.00056513], dtype=float32),\n", + " array([0.949406], dtype=float32),\n", + " array([0.91392314], dtype=float32),\n", + " array([0.998437], dtype=float32),\n", + " 
array([0.00321832], dtype=float32),\n", + " array([0.3081614], dtype=float32),\n", + " array([0.18779686], dtype=float32),\n", + " array([0.86671174], dtype=float32),\n", + " array([0.99736756], dtype=float32),\n", + " array([0.37101072], dtype=float32),\n", + " array([0.97278196], dtype=float32),\n", + " array([0.03783982], dtype=float32),\n", + " array([0.98899263], dtype=float32),\n", + " array([0.99997747], dtype=float32),\n", + " array([0.33761773], dtype=float32),\n", + " array([0.9922051], dtype=float32),\n", + " array([0.9986929], dtype=float32),\n", + " array([0.9734451], dtype=float32),\n", + " array([0.00104312], dtype=float32),\n", + " array([0.00905259], dtype=float32),\n", + " array([0.9999858], dtype=float32),\n", + " array([0.2685942], dtype=float32),\n", + " array([8.498155e-07], dtype=float32),\n", + " array([0.00194448], dtype=float32),\n", + " array([0.9610404], dtype=float32),\n", + " array([0.06272461], dtype=float32),\n", + " array([0.9734326], dtype=float32),\n", + " array([0.9998591], dtype=float32),\n", + " array([0.02384088], dtype=float32),\n", + " array([0.00458063], dtype=float32),\n", + " array([0.8619814], dtype=float32),\n", + " array([0.3280481], dtype=float32),\n", + " array([0.58994853], dtype=float32),\n", + " array([0.00738818], dtype=float32),\n", + " array([0.9968213], dtype=float32),\n", + " array([0.94588715], dtype=float32),\n", + " array([0.89741385], dtype=float32),\n", + " array([0.0001792], dtype=float32),\n", + " array([7.942238e-05], dtype=float32),\n", + " array([0.2487981], dtype=float32),\n", + " array([0.99818295], dtype=float32),\n", + " array([0.06495794], dtype=float32),\n", + " array([0.61300725], dtype=float32),\n", + " array([0.00142293], dtype=float32),\n", + " array([0.7782267], dtype=float32),\n", + " array([0.70798534], dtype=float32),\n", + " array([0.15175731], dtype=float32),\n", + " array([0.99284136], dtype=float32),\n", + " array([0.9841339], dtype=float32),\n", + " array([0.00554728], 
dtype=float32),\n", + " array([0.0500682], dtype=float32),\n", + " array([1.7751601e-06], dtype=float32),\n", + " array([0.12731266], dtype=float32),\n", + " array([0.01886535], dtype=float32),\n", + " array([0.9990376], dtype=float32),\n", + " array([0.27495182], dtype=float32),\n", + " array([0.90534323], dtype=float32),\n", + " array([0.8381721], dtype=float32),\n", + " array([0.12258686], dtype=float32),\n", + " array([0.23695664], dtype=float32),\n", + " array([0.04559099], dtype=float32),\n", + " array([0.798738], dtype=float32),\n", + " array([0.9249577], dtype=float32),\n", + " array([0.5790399], dtype=float32),\n", + " array([0.7356898], dtype=float32),\n", + " array([0.9420959], dtype=float32),\n", + " array([0.80315626], dtype=float32),\n", + " array([0.907965], dtype=float32),\n", + " array([0.18890426], dtype=float32),\n", + " array([0.04044292], dtype=float32),\n", + " array([0.00435959], dtype=float32),\n", + " array([0.01255109], dtype=float32),\n", + " array([0.973041], dtype=float32),\n", + " array([0.89595586], dtype=float32),\n", + " array([0.15041849], dtype=float32),\n", + " array([0.7386434], dtype=float32),\n", + " array([0.01395628], dtype=float32),\n", + " array([0.00037464], dtype=float32),\n", + " array([0.30354175], dtype=float32),\n", + " array([0.92193896], dtype=float32),\n", + " array([0.95892274], dtype=float32),\n", + " array([1.066259e-06], dtype=float32),\n", + " array([0.96353555], dtype=float32),\n", + " array([0.14788437], dtype=float32),\n", + " array([0.9997639], dtype=float32),\n", + " array([0.01777537], dtype=float32),\n", + " array([0.9861092], dtype=float32),\n", + " array([0.13082978], dtype=float32),\n", + " array([0.0002504], dtype=float32),\n", + " array([0.8804745], dtype=float32),\n", + " array([0.9967051], dtype=float32),\n", + " array([0.56104803], dtype=float32),\n", + " array([0.36787862], dtype=float32),\n", + " array([0.8360154], dtype=float32),\n", + " array([0.9998766], dtype=float32),\n", + " 
array([0.00568995], dtype=float32),\n", + " array([0.00194393], dtype=float32),\n", + " array([0.00631262], dtype=float32),\n", + " array([0.03533027], dtype=float32),\n", + " array([0.9103368], dtype=float32),\n", + " array([0.9982439], dtype=float32),\n", + " array([0.97841996], dtype=float32),\n", + " array([0.00286406], dtype=float32),\n", + " array([0.0708506], dtype=float32),\n", + " array([0.9432028], dtype=float32),\n", + " array([0.9654381], dtype=float32),\n", + " array([0.05079986], dtype=float32),\n", + " array([0.8743878], dtype=float32),\n", + " array([0.00240675], dtype=float32),\n", + " array([0.98993146], dtype=float32),\n", + " array([0.07532773], dtype=float32),\n", + " array([0.22899462], dtype=float32),\n", + " array([0.00091621], dtype=float32),\n", + " array([0.9989504], dtype=float32),\n", + " array([0.39317238], dtype=float32),\n", + " array([0.3326581], dtype=float32),\n", + " array([0.01213577], dtype=float32),\n", + " array([0.99774724], dtype=float32),\n", + " array([0.9886003], dtype=float32),\n", + " array([0.79621345], dtype=float32),\n", + " array([0.79079646], dtype=float32),\n", + " array([0.93861336], dtype=float32),\n", + " array([0.07021908], dtype=float32),\n", + " array([0.7411332], dtype=float32),\n", + " array([0.969042], dtype=float32),\n", + " array([0.9099184], dtype=float32),\n", + " array([0.02733893], dtype=float32),\n", + " array([0.9999924], dtype=float32),\n", + " array([0.9897418], dtype=float32),\n", + " array([0.03869773], dtype=float32),\n", + " array([0.97638786], dtype=float32),\n", + " array([0.08542448], dtype=float32),\n", + " array([0.05407662], dtype=float32),\n", + " array([0.9999993], dtype=float32),\n", + " array([0.14986295], dtype=float32),\n", + " array([0.999286], dtype=float32),\n", + " array([0.24805169], dtype=float32),\n", + " array([0.01673634], dtype=float32),\n", + " array([0.01463007], dtype=float32),\n", + " array([0.3670616], dtype=float32),\n", + " array([0.9926224], dtype=float32),\n", 
+ " array([0.6253009], dtype=float32),\n", + " array([0.03401245], dtype=float32),\n", + " array([0.00030278], dtype=float32),\n", + " array([0.96080303], dtype=float32),\n", + " array([0.04573576], dtype=float32),\n", + " array([0.04926103], dtype=float32),\n", + " array([0.5770354], dtype=float32),\n", + " array([0.02184597], dtype=float32),\n", + " array([0.9933947], dtype=float32),\n", + " array([0.00422782], dtype=float32),\n", + " array([0.7942074], dtype=float32),\n", + " array([0.14047284], dtype=float32),\n", + " array([0.90892816], dtype=float32),\n", + " array([0.79335517], dtype=float32),\n", + " array([0.02081179], dtype=float32),\n", + " array([0.03224256], dtype=float32),\n", + " array([0.00269347], dtype=float32),\n", + " array([0.7325373], dtype=float32),\n", + " array([0.86657375], dtype=float32),\n", + " array([0.9994041], dtype=float32),\n", + " array([0.99819297], dtype=float32),\n", + " array([0.306308], dtype=float32),\n", + " array([0.9358532], dtype=float32),\n", + " array([0.00968082], dtype=float32),\n", + " array([0.22723815], dtype=float32),\n", + " array([0.88686043], dtype=float32),\n", + " array([0.00376564], dtype=float32),\n", + " array([0.9558993], dtype=float32),\n", + " array([0.03709094], dtype=float32),\n", + " array([0.9284992], dtype=float32),\n", + " array([0.01156035], dtype=float32),\n", + " array([0.6904194], dtype=float32),\n", + " array([0.7789368], dtype=float32),\n", + " array([0.06749155], dtype=float32),\n", + " array([0.83822256], dtype=float32),\n", + " array([0.00499537], dtype=float32),\n", + " array([0.96375054], dtype=float32),\n", + " array([0.99763095], dtype=float32),\n", + " array([0.00083689], dtype=float32),\n", + " array([0.1384925], dtype=float32),\n", + " array([0.99911016], dtype=float32),\n", + " array([0.18213369], dtype=float32),\n", + " array([0.01104294], dtype=float32),\n", + " array([0.9997731], dtype=float32),\n", + " array([0.00157826], dtype=float32),\n", + " array([0.45021382], 
dtype=float32),\n", + " array([0.70889956], dtype=float32),\n", + " array([0.99980146], dtype=float32),\n", + " array([0.14717786], dtype=float32),\n", + " array([0.9981312], dtype=float32),\n", + " array([0.99910754], dtype=float32),\n", + " array([0.00473733], dtype=float32),\n", + " array([0.00330126], dtype=float32),\n", + " array([0.17611578], dtype=float32),\n", + " array([0.69635725], dtype=float32),\n", + " array([0.39411786], dtype=float32),\n", + " array([0.26741236], dtype=float32),\n", + " array([0.56975543], dtype=float32),\n", + " array([0.06516983], dtype=float32),\n", + " array([0.70290774], dtype=float32),\n", + " array([0.1079508], dtype=float32),\n", + " array([0.9905323], dtype=float32),\n", + " array([0.07408904], dtype=float32),\n", + " array([0.99945086], dtype=float32),\n", + " array([0.08830733], dtype=float32),\n", + " array([0.47597456], dtype=float32),\n", + " array([0.08325432], dtype=float32),\n", + " array([0.9963791], dtype=float32),\n", + " array([0.99327046], dtype=float32),\n", + " array([0.987528], dtype=float32),\n", + " array([0.2695155], dtype=float32),\n", + " array([0.01687575], dtype=float32),\n", + " array([0.0887219], dtype=float32),\n", + " array([0.00404755], dtype=float32),\n", + " array([0.8474386], dtype=float32),\n", + " array([0.02510497], dtype=float32),\n", + " array([0.00147101], dtype=float32),\n", + " array([0.00696711], dtype=float32),\n", + " array([0.01805459], dtype=float32),\n", + " array([0.37892923], dtype=float32),\n", + " array([0.32513785], dtype=float32),\n", + " array([0.00713208], dtype=float32),\n", + " array([0.05214171], dtype=float32),\n", + " array([0.9894679], dtype=float32),\n", + " array([0.74764496], dtype=float32),\n", + " array([0.0094498], dtype=float32),\n", + " array([0.05753988], dtype=float32),\n", + " array([0.9815139], dtype=float32),\n", + " array([0.994449], dtype=float32),\n", + " array([0.0733721], dtype=float32),\n", + " array([0.03602724], dtype=float32),\n", + " 
array([0.99997675], dtype=float32),\n", + " array([0.6763087], dtype=float32),\n", + " array([0.9927671], dtype=float32),\n", + " array([0.02451441], dtype=float32),\n", + " array([0.86146873], dtype=float32),\n", + " array([0.04389035], dtype=float32),\n", + " array([0.9999443], dtype=float32),\n", + " array([0.809564], dtype=float32),\n", + " array([0.99578035], dtype=float32),\n", + " array([0.4989446], dtype=float32),\n", + " array([0.02612785], dtype=float32),\n", + " array([0.87981015], dtype=float32),\n", + " array([0.6465501], dtype=float32),\n", + " array([0.576932], dtype=float32),\n", + " array([0.03007537], dtype=float32),\n", + " array([0.00870073], dtype=float32),\n", + " array([0.9998024], dtype=float32),\n", + " array([0.08114275], dtype=float32),\n", + " array([0.68397623], dtype=float32),\n", + " array([0.9999337], dtype=float32),\n", + " array([0.0099621], dtype=float32),\n", + " array([0.99060285], dtype=float32),\n", + " array([0.00027312], dtype=float32),\n", + " array([0.9289166], dtype=float32),\n", + " array([0.9932289], dtype=float32),\n", + " array([0.02628781], dtype=float32),\n", + " array([0.99826354], dtype=float32),\n", + " array([0.6789669], dtype=float32),\n", + " ...]" + ] + }, + "execution_count": 16, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "prediction = model.predict(x_test)\n", + "result = prediction.collect()\n", + "result" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## Further experiments\n", + "\n", + "\n", + "* We were using 2 hidden layers. 
Try to use 1 or 3 hidden layers and see how it affects validation and test accuracy.\n", + "* Try to use layers with more hidden units or less hidden units: 32 units, 64 units...\n", + "* Try to use the `mse` loss function instead of `binary_crossentropy`.\n", + "* Try to use the `tanh` activation (an activation that was popular in the early days of neural networks) instead of `relu`.\n", + "\n", + "These experiments will help convince you that the architecture choices we have made are all fairly reasonable, although they can still be \n", + "improved!" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## Conclusions\n", + "\n", + "\n", + "Here's what you should take away from this example:\n", + "\n", + "* There's usually quite a bit of preprocessing you need to do on your raw data in order to be able to feed it -- as tensors -- into a neural \n", + "network. In the case of sequences of words, they can be encoded as binary vectors -- but there are other encoding options too.\n", + "* Stacks of `Dense` layers with `relu` activations can solve a wide range of problems (including sentiment classification), and you will \n", + "likely use them frequently.\n", + "* In a binary classification problem (two output classes), your network should end with a `Dense` layer with 1 unit and a `sigmoid` activation, \n", + "i.e. the output of your network should be a scalar between 0 and 1, encoding a probability.\n", + "* With such a scalar sigmoid output, on a binary classification problem, the loss function you should use is `binary_crossentropy`.\n", + "* The `rmsprop` optimizer is generally a good enough choice of optimizer, whatever your problem. That's one less thing for you to worry \n", + "about.\n", + "* As they get better on their training data, neural networks eventually start _overfitting_ and end up obtaining increasingly worse results on data \n", + "never-seen-before. 
Make sure to always monitor performance on data that is outside of the training set.\n" + ] + } + ], + "metadata": { + "kernelspec": { + "display_name": "Python 3", + "language": "python", + "name": "python3" + }, + "language_info": { + "codemirror_mode": { + "name": "ipython", + "version": 3 + }, + "file_extension": ".py", + "mimetype": "text/x-python", + "name": "python", + "nbconvert_exporter": "python", + "pygments_lexer": "ipython3", + "version": "3.5.2" + } + }, + "nbformat": 4, + "nbformat_minor": 2 +} From cd838f21e4e47c07ee40b79ff8ec76533599f86f Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Mon, 18 Mar 2019 10:47:47 +0800 Subject: [PATCH 23/46] Rename 3.5-binary-classification.ipynb to 3.5-classifying-movie-reviews.ipynb --- ...y-classification.ipynb => 3.5-classifying-movie-reviews.ipynb} | 0 1 file changed, 0 insertions(+), 0 deletions(-) rename keras/{3.5-binary-classification.ipynb => 3.5-classifying-movie-reviews.ipynb} (100%) diff --git a/keras/3.5-binary-classification.ipynb b/keras/3.5-classifying-movie-reviews.ipynb similarity index 100% rename from keras/3.5-binary-classification.ipynb rename to keras/3.5-classifying-movie-reviews.ipynb From 696832558850d60012e9084f2b1904bc76dcbaba Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Mon, 18 Mar 2019 10:50:49 +0800 Subject: [PATCH 24/46] Add files via upload --- keras/3.5-classifying-movie-reviews.ipynb | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/keras/3.5-classifying-movie-reviews.ipynb b/keras/3.5-classifying-movie-reviews.ipynb index 93ac869..43a3971 100644 --- a/keras/3.5-classifying-movie-reviews.ipynb +++ b/keras/3.5-classifying-movie-reviews.ipynb @@ -59,7 +59,7 @@ "your training samples and their targets -- which would be completely useless for the task of predicting targets for data never seen before. \n", "We will go over this point in much more detail in the next chapter.\n", "\n", - "Just like the MNIST dataset, the IMDB dataset comes packaged with Keras. 
It has already been preprocessed: the reviews (sequences of words) \n", + "Just like the MNIST dataset, the IMDB dataset comes packaged with the Keras API of Analytics Zoo. It has already been preprocessed: the reviews (sequences of words) \n", "have been turned into sequences of integers, where each integer stands for a specific word in a dictionary.\n", "\n", "The following code will load the dataset (when you run it for the first time, about 80MB of data will be downloaded to your machine):" @@ -79,7 +79,7 @@ } ], "source": [ - "from keras.datasets import imdb\n", + "from zoo.pipeline.api.keras.datasets import imdb\n", "(train_data, train_labels), (test_data, test_labels) = imdb.load_data(nb_words=10000)" ] }, From d0ff83eea7ecc41117309ae0d76c00a5d4ad8b29 Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Mon, 18 Mar 2019 11:09:00 +0800 Subject: [PATCH 25/46] Add files via upload --- keras/3.5-classifying-movie-reviews.ipynb | 16 ++++++++-------- 1 file changed, 8 insertions(+), 8 deletions(-) diff --git a/keras/3.5-classifying-movie-reviews.ipynb b/keras/3.5-classifying-movie-reviews.ipynb index 43a3971..0ca5992 100644 --- a/keras/3.5-classifying-movie-reviews.ipynb +++ b/keras/3.5-classifying-movie-reviews.ipynb @@ -436,6 +436,14 @@ " validation_data=(x_val, y_val))" ] }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "_INFO - Trained 512 records in 0.020173091 seconds. Throughput is 25380.344 records/second. Loss is 0.0092472015.\n", + "Top1Accuracy is Accuracy(correct: 8707, count: 10000, accuracy: 0.8707)_" + ] + }, { "cell_type": "markdown", "metadata": {}, @@ -480,14 +488,6 @@ "plt.show()" ] }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Trained 512 records in 0.020173091 seconds. Throughput is 25380.344 records/second. 
Loss is 0.0092472015.\n", - "Top1Accuracy is Accuracy(correct: 8707, count: 10000, accuracy: 0.8707)" - ] - }, { "cell_type": "markdown", "metadata": {}, From 1c25881314d16ba049f7c15604c32454bed869c5 Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Mon, 18 Mar 2019 13:47:32 +0800 Subject: [PATCH 26/46] Add files via upload --- keras/2.1-a-first-look-at-a-neural-network.ipynb | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/keras/2.1-a-first-look-at-a-neural-network.ipynb b/keras/2.1-a-first-look-at-a-neural-network.ipynb index d022670..c211e3f 100644 --- a/keras/2.1-a-first-look-at-a-neural-network.ipynb +++ b/keras/2.1-a-first-look-at-a-neural-network.ipynb @@ -37,7 +37,7 @@ "\n", "----\n", "\n", - "We will now take a look at a first concrete example of a neural network, which makes use of Keras (v1.2.2) API in [Analytics Zoo](https://github.com/intel-analytics/analytics-zoo) to learn to classify hand-written digits. Unless you already have experience with Keras or similar libraries, you will not understand everything about this first example right away. You probably haven't even installed analytics-zoo yet. Don't worry, that is perfectly fine. In the next chapter, we will review each element in our example and explain them in detail. So don't worry if some steps seem arbitrary or look like magic to you! We've got to start somewhere.\n", + "We will now take a look at a first concrete example of a neural network, which makes use of Keras (v1.2.2) API in [Analytics Zoo](https://github.com/intel-analytics/analytics-zoo) to learn to classify hand-written digits. Unless you already have experience with Keras or similar libraries, you will not understand everything about this first example right away. You probably haven't even installed Analytics zoo yet. Don't worry, that is perfectly fine. In the next chapter, we will review each element in our example and explain them in detail. 
So don't worry if some steps seem arbitrary or look like magic to you! We've got to start somewhere.\n", "\n", "The problem we are trying to solve here is to classify grayscale images of handwritten digits (28 pixels by 28 pixels), into their 10 categories (0 to 9). The dataset we will use is the MNIST dataset, a classic dataset in the machine learning community, which has been around for almost as long as the field itself and has been very intensively studied. It's a set of 60,000 training images, plus 10,000 test images, assembled by the National Institute of Standards and Technology (the NIST in MNIST) in the 1980s. You can think of \"solving\" MNIST as the \"Hello World\" of deep learning -- it's what you do to verify that your algorithms are working as expected. As you become a machine learning practitioner, you will see MNIST come up over and over again, in scientific papers, blog posts, and so on.\n", "\n", @@ -52,7 +52,7 @@ "_In Keras one could use following code to import the datasets:_\n", "\n", " from keras.datasets import mnist\n", - "_Just replace it with following in analytics-zoo:_" + "_Just replace it with following in Analytics zoo:_" ] }, { @@ -227,7 +227,7 @@ "\n", " from keras import models\n", " from keras import layers\n", - "_Just replace it with following in analytics-zoo:_" + "_Just replace it with following in Analytics zoo:_" ] }, { @@ -343,7 +343,7 @@ "cell_type": "markdown", "metadata": {}, "source": [ - "We are now ready to train our network, which in analytics-zoo Keras module is done via a call to the `fit` method of the network: \n", + "We are now ready to train our network, which in Keras API of Analytics Zoo is done via a call to the `fit` method of the network: \n", "we \"fit\" the model to its training data." 
] }, From 81d6031ddcfeaf7408d2eff2e7b75a8a8f3182b4 Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Mon, 18 Mar 2019 13:47:50 +0800 Subject: [PATCH 27/46] Add files via upload --- keras/3.5-classifying-movie-reviews.ipynb | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/keras/3.5-classifying-movie-reviews.ipynb b/keras/3.5-classifying-movie-reviews.ipynb index 0ca5992..40822fb 100644 --- a/keras/3.5-classifying-movie-reviews.ipynb +++ b/keras/3.5-classifying-movie-reviews.ipynb @@ -294,7 +294,7 @@ "cell_type": "markdown", "metadata": {}, "source": [ - "And here's the analytics-zoo implementation, very similar to the MNIST example you saw previously:" + "And here's the Analytics zoo implementation, very similar to the MNIST example you saw previously:" ] }, { @@ -412,7 +412,7 @@ " batch_size=512,\n", " validation_data=(x_val, y_val)\n", " )\n", - "_After `fit` method finishes, the results are stored in `history` and thus could be visualized. Currently in analytics-zoo, `fit` method does not have any return. Results can only be checked via setting tensorboard._" + "_After `fit` method finishes, the results are stored in `history` and thus could be visualized. Currently in Analytics zoo, `fit` method does not have any return. 
Results can only be checked via setting tensorboard._" ] }, { @@ -451,7 +451,7 @@ "Then result could be visualized in either of following ways: \n", "\n", "* Start tensorboard web interface in terminal by `tensorboard --logdir ./` and go to web browser url `localhost:port_number` as shown in your terminal.\n", - "* Use analytics-zoo built-in method `get_scalar_from_summary` with parameter `Loss` or `Validation` to get the array of scalar, then visualize via `matplotlib`.\n", + "* Use Analytics zoo built-in method `get_scalar_from_summary` with parameter `Loss` or `Validation` to get the array of scalar, then visualize via `matplotlib`.\n", "\n", "We use the second approach here in order to directly show the result in this notebook." ] @@ -598,7 +598,7 @@ "_In Keras, one could just call following code to predict the test data_\n", "\n", " model.predict(x_test)\n", - "_In analytics-zoo, the return of `predict` is RDD, so you need to call `collect` method to get the result:_" + "_In Analytics zoo, the return of `predict` is RDD, so you need to call `collect` method to get the result:_" ] }, { From d55c0b7e562bd9cb1d7f65cab4804abe5ace3c2f Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Mon, 18 Mar 2019 16:41:13 +0800 Subject: [PATCH 28/46] Add files via upload --- keras/3.5-classifying-movie-reviews.ipynb | 2017 ++++++++++----------- 1 file changed, 1005 insertions(+), 1012 deletions(-) diff --git a/keras/3.5-classifying-movie-reviews.ipynb b/keras/3.5-classifying-movie-reviews.ipynb index 40822fb..9c38c15 100644 --- a/keras/3.5-classifying-movie-reviews.ipynb +++ b/keras/3.5-classifying-movie-reviews.ipynb @@ -36,7 +36,6 @@ "metadata": {}, "source": [ "# Classifying movie reviews: a binary classification example\n", - "This notebook is imported from Chapter 3, Section 5 of [Deep Learning with Python Notebook]().\n", "\n", "----\n", "\n", @@ -69,15 +68,7 @@ "cell_type": "code", "execution_count": 2, "metadata": {}, - "outputs": [ - { - "name": "stderr", - 
"output_type": "stream", - "text": [ - "Using TensorFlow backend.\n" - ] - } - ], + "outputs": [], "source": [ "from zoo.pipeline.api.keras.datasets import imdb\n", "(train_data, train_labels), (test_data, test_labels) = imdb.load_data(nb_words=10000)" @@ -136,7 +127,7 @@ { "data": { "text/plain": [ - "\"? this film was just brilliant casting location scenery story direction everyone's really suited the part they played and you could just imagine being there robert ? is an amazing actor and now the same being director ? father came from the same scottish island as myself so i loved the fact there was a real connection with this film the witty remarks throughout the film were great it was just brilliant so much that i bought the film as soon as it was released for ? and would recommend it to everyone to watch and the fly fishing was amazing really cried at the end it was so sad and you know what they say if you cry at a film it must have been good and this definitely was also ? to the two little boy's that played the ? of norman and paul they were just brilliant children are often left out of the ? list i think because the stars that play them all grown up are such a big profile for the whole film but these children are amazing and should be praised for what they have done don't you think the whole story was so lovely because it was true and was someone's life after all that was shared with us all\"" + "\"distracting ? a evil entertainment ? ? might ? might an films ? tries who because truly tries talent too br she man ? steven how determination will ? looks world's which can ? it screen that in have way gonna of of least ? want take toxic even paint ? similar ? japanese that ? would ? ? charles cover movie even ? moment ear ? ? not wanted involved ? ? ? ? quality ? ? ? point and sequences will ? bruckheimer how actually he way kinds ? genre fact fine l a either her ? ? movie ? cover and minute ? ending ? ? favorite ? of of private ? 
spoiler down remember and while ? ? having 200 while movie ? prove charisma pretty ? chuck and perspective seriously if a bed movie in ? cover was most five springer for to free film been but woods was showing director movie this ? ? display sinister much details to and ? ? many no there if i which explore is will ? paramount the we without on most ? eddie urban just the ? harold came even like cares charged ? be most comparison buck hollywood or mixed well 1 or have doesn't comes and point more ? meant scenes you'd for work doesn't school is pants ? was ? mask of of example is ? to friend flying making br any or ? seems as sending and you it this ? mcintire ? is style it role think parts guy was most feeling ? and awful if is close and down meets and shoot more lies ? anything question making for or ? try finally the way plot way car of of if assistant ? as a first is victim ? ? most corrupt be ? does few in a ? former teeth completely and top ? pets ? 50 score seems as will scenes ? plot done independent to ? waste the ? all john is talent just am ? for reading ? we ? this writing guy maintain jokes has on think team ? nudity been its film guy is 3 ? ? as and ? let's body from ground film was it terrified voice throughout distracting ? ? there and ? negative of of ? bland began been his most yourself was most enjoy the across cried ? luis for br been his it restaurant or better but shows ? very an off and comments in\"" ] }, "execution_count": 4, @@ -315,7 +306,7 @@ { "data": { "text/plain": [ - "" + "" ] }, "execution_count": 8, @@ -428,7 +419,9 @@ "metadata": {}, "outputs": [], "source": [ - "model.set_tensorboard('./', '3-5_summary')\n", + "import time\n", + "dir_name = '3-5 ' + str(time.ctime())\n", + "model.set_tensorboard('./', dir_name)\n", "model.fit(partial_x_train,\n", " partial_y_train,\n", " nb_epoch=20,\n", @@ -463,7 +456,7 @@ "outputs": [ { "data": { - "image/png": "\n", + "image/png": "\n", "text/plain": [ "
" ] @@ -560,7 +553,7 @@ { "data": { "text/plain": [ - "[0.3026159405708313, 0.8824800252914429]" + "[0.3063896596431732, 0.8806399703025818]" ] }, "execution_count": 15, @@ -609,1006 +602,1006 @@ { "data": { "text/plain": [ - "[array([0.9291455], dtype=float32),\n", - " array([0.96646625], dtype=float32),\n", - " array([0.9992888], dtype=float32),\n", - " array([0.6787984], dtype=float32),\n", - " array([0.8021081], dtype=float32),\n", - " array([0.21841279], dtype=float32),\n", - " array([0.00925558], dtype=float32),\n", - " array([0.03703438], dtype=float32),\n", - " array([0.9931043], dtype=float32),\n", - " array([0.77527326], dtype=float32),\n", - " array([0.57820714], dtype=float32),\n", - " array([0.9590181], dtype=float32),\n", - " array([0.23954743], dtype=float32),\n", - " array([0.99956185], dtype=float32),\n", - " array([0.99986804], dtype=float32),\n", - " array([0.89252], dtype=float32),\n", - " array([0.6362871], dtype=float32),\n", - " array([0.00453862], dtype=float32),\n", - " array([0.00022487], dtype=float32),\n", - " array([0.5878058], dtype=float32),\n", - " array([0.79077387], dtype=float32),\n", - " array([0.951337], dtype=float32),\n", - " array([0.02377308], dtype=float32),\n", - " array([0.01233012], dtype=float32),\n", - " array([2.1225938e-05], dtype=float32),\n", - " array([0.92153585], dtype=float32),\n", - " array([0.27290833], dtype=float32),\n", - " array([0.9678427], dtype=float32),\n", - " array([0.98325783], dtype=float32),\n", - " array([0.01623936], dtype=float32),\n", - " array([0.9855356], dtype=float32),\n", - " array([0.00476582], dtype=float32),\n", - " array([0.99998844], dtype=float32),\n", - " array([0.37596926], dtype=float32),\n", - " array([0.99066436], dtype=float32),\n", - " array([0.995574], dtype=float32),\n", - " array([0.92657614], dtype=float32),\n", - " array([0.00201875], dtype=float32),\n", - " array([0.04801914], dtype=float32),\n", - " array([0.9741373], dtype=float32),\n", - " array([0.11522112], 
dtype=float32),\n", - " array([0.26702783], dtype=float32),\n", - " array([0.9866962], dtype=float32),\n", - " array([0.9840716], dtype=float32),\n", - " array([0.7866682], dtype=float32),\n", - " array([0.8413062], dtype=float32),\n", - " array([0.5110062], dtype=float32),\n", - " array([0.9998983], dtype=float32),\n", - " array([0.80300266], dtype=float32),\n", - " array([0.914983], dtype=float32),\n", - " array([0.9802167], dtype=float32),\n", - " array([0.99499327], dtype=float32),\n", - " array([0.6802777], dtype=float32),\n", - " array([0.94399655], dtype=float32),\n", - " array([0.00192062], dtype=float32),\n", - " array([0.01228456], dtype=float32),\n", - " array([0.0108898], dtype=float32),\n", - " array([0.08844584], dtype=float32),\n", - " array([0.00843766], dtype=float32),\n", - " array([0.975319], dtype=float32),\n", - " array([0.9284045], dtype=float32),\n", - " array([0.28974563], dtype=float32),\n", - " array([0.00184639], dtype=float32),\n", - " array([0.7453014], dtype=float32),\n", - " array([0.08012109], dtype=float32),\n", - " array([0.00030172], dtype=float32),\n", - " array([0.9998647], dtype=float32),\n", - " array([0.9998778], dtype=float32),\n", - " array([0.99604577], dtype=float32),\n", - " array([0.9999883], dtype=float32),\n", - " array([0.03597537], dtype=float32),\n", - " array([0.9160098], dtype=float32),\n", - " array([0.1100359], dtype=float32),\n", - " array([0.21295448], dtype=float32),\n", - " array([0.45663244], dtype=float32),\n", - " array([0.9961888], dtype=float32),\n", - " array([0.3244737], dtype=float32),\n", - " array([0.99997187], dtype=float32),\n", - " array([0.12168489], dtype=float32),\n", - " array([0.04461992], dtype=float32),\n", - " array([0.05755902], dtype=float32),\n", - " array([0.16763222], dtype=float32),\n", - " array([0.87495023], dtype=float32),\n", - " array([0.21862818], dtype=float32),\n", - " array([0.01208456], dtype=float32),\n", - " array([0.10023356], dtype=float32),\n", - " 
array([0.29649988], dtype=float32),\n", - " array([0.99824023], dtype=float32),\n", - " array([0.39383507], dtype=float32),\n", - " array([0.3296326], dtype=float32),\n", - " array([0.68079424], dtype=float32),\n", - " array([0.7821922], dtype=float32),\n", - " array([0.99843687], dtype=float32),\n", - " array([0.998406], dtype=float32),\n", - " array([0.09952371], dtype=float32),\n", - " array([0.20677786], dtype=float32),\n", - " array([0.07357709], dtype=float32),\n", - " array([0.99855345], dtype=float32),\n", - " array([0.00219191], dtype=float32),\n", - " array([0.9438781], dtype=float32),\n", - " array([0.11632669], dtype=float32),\n", - " array([0.31134847], dtype=float32),\n", - " array([0.12542696], dtype=float32),\n", - " array([0.9763289], dtype=float32),\n", - " array([0.9640528], dtype=float32),\n", - " array([0.6855567], dtype=float32),\n", - " array([0.9791134], dtype=float32),\n", - " array([0.47091678], dtype=float32),\n", - " array([0.01279368], dtype=float32),\n", - " array([0.33637416], dtype=float32),\n", - " array([0.92876065], dtype=float32),\n", - " array([0.02191273], dtype=float32),\n", - " array([0.02534392], dtype=float32),\n", - " array([0.27622753], dtype=float32),\n", - " array([0.03425935], dtype=float32),\n", - " array([0.11640935], dtype=float32),\n", - " array([0.99640983], dtype=float32),\n", - " array([0.99434376], dtype=float32),\n", - " array([0.02413097], dtype=float32),\n", - " array([0.36645678], dtype=float32),\n", - " array([0.01748311], dtype=float32),\n", - " array([0.18354651], dtype=float32),\n", - " array([0.06130786], dtype=float32),\n", - " array([0.21773124], dtype=float32),\n", - " array([0.95380867], dtype=float32),\n", - " array([0.5504796], dtype=float32),\n", - " array([0.0219801], dtype=float32),\n", - " array([0.01981366], dtype=float32),\n", - " array([0.00031499], dtype=float32),\n", - " array([0.13779135], dtype=float32),\n", - " array([0.9984407], dtype=float32),\n", - " array([0.40540016], 
dtype=float32),\n", - " array([0.77313596], dtype=float32),\n", - " array([0.01747493], dtype=float32),\n", - " array([0.0557996], dtype=float32),\n", - " array([0.06081589], dtype=float32),\n", - " array([0.04389222], dtype=float32),\n", - " array([0.9974957], dtype=float32),\n", - " array([0.5977306], dtype=float32),\n", - " array([0.02096312], dtype=float32),\n", - " array([0.8821718], dtype=float32),\n", - " array([0.00831421], dtype=float32),\n", - " array([0.85648173], dtype=float32),\n", - " array([0.61687416], dtype=float32),\n", - " array([0.00555464], dtype=float32),\n", - " array([0.9578813], dtype=float32),\n", - " array([0.16184929], dtype=float32),\n", - " array([0.8980497], dtype=float32),\n", - " array([0.99223125], dtype=float32),\n", - " array([0.0132063], dtype=float32),\n", - " array([0.92253935], dtype=float32),\n", - " array([0.06267989], dtype=float32),\n", - " array([0.9891216], dtype=float32),\n", - " array([0.9419726], dtype=float32),\n", - " array([0.00210088], dtype=float32),\n", - " array([0.99873346], dtype=float32),\n", - " array([0.01406829], dtype=float32),\n", - " array([0.08238389], dtype=float32),\n", - " array([0.7304331], dtype=float32),\n", - " array([0.07515999], dtype=float32),\n", - " array([0.00137386], dtype=float32),\n", - " array([0.999446], dtype=float32),\n", - " array([0.06388262], dtype=float32),\n", - " array([0.1658486], dtype=float32),\n", - " array([0.99999976], dtype=float32),\n", - " array([0.9154651], dtype=float32),\n", - " array([0.9564062], dtype=float32),\n", - " array([0.9038421], dtype=float32),\n", - " array([0.9414884], dtype=float32),\n", - " array([0.023891], dtype=float32),\n", - " array([0.27174172], dtype=float32),\n", - " array([0.9541309], dtype=float32),\n", - " array([0.06518997], dtype=float32),\n", - " array([0.01072453], dtype=float32),\n", - " array([0.99960166], dtype=float32),\n", - " array([0.0525161], dtype=float32),\n", - " array([0.00074362], dtype=float32),\n", - " 
array([0.84470475], dtype=float32),\n", - " array([0.68433523], dtype=float32),\n", - " array([0.73134536], dtype=float32),\n", - " array([0.02615881], dtype=float32),\n", - " array([0.9435008], dtype=float32),\n", - " array([0.9924217], dtype=float32),\n", - " array([0.81417906], dtype=float32),\n", - " array([0.99532396], dtype=float32),\n", - " array([0.11175746], dtype=float32),\n", - " array([0.01164351], dtype=float32),\n", - " array([0.99615234], dtype=float32),\n", - " array([0.99891615], dtype=float32),\n", - " array([0.85309887], dtype=float32),\n", - " array([0.3838549], dtype=float32),\n", - " array([0.08728907], dtype=float32),\n", - " array([0.99386746], dtype=float32),\n", - " array([0.99560165], dtype=float32),\n", - " array([0.01668872], dtype=float32),\n", - " array([0.865859], dtype=float32),\n", - " array([0.00344433], dtype=float32),\n", - " array([0.10099232], dtype=float32),\n", - " array([0.18755046], dtype=float32),\n", - " array([0.17657793], dtype=float32),\n", - " array([0.99963737], dtype=float32),\n", - " array([0.1608229], dtype=float32),\n", - " array([0.99993265], dtype=float32),\n", - " array([0.9839023], dtype=float32),\n", - " array([0.8809537], dtype=float32),\n", - " array([0.16208851], dtype=float32),\n", - " array([0.9696871], dtype=float32),\n", - " array([0.9999682], dtype=float32),\n", - " array([0.06924216], dtype=float32),\n", - " array([0.03222934], dtype=float32),\n", - " array([0.01602055], dtype=float32),\n", - " array([0.16013445], dtype=float32),\n", - " array([0.13429435], dtype=float32),\n", - " array([0.9997141], dtype=float32),\n", - " array([0.5197483], dtype=float32),\n", - " array([0.02680757], dtype=float32),\n", - " array([0.9984849], dtype=float32),\n", - " array([0.00670389], dtype=float32),\n", - " array([0.04960306], dtype=float32),\n", - " array([0.05909725], dtype=float32),\n", - " array([0.07385926], dtype=float32),\n", - " array([0.01410465], dtype=float32),\n", - " array([0.4758584], 
dtype=float32),\n", - " array([0.9994578], dtype=float32),\n", - " array([0.00207514], dtype=float32),\n", - " array([0.98792577], dtype=float32),\n", + "[array([0.6597656], dtype=float32),\n", + " array([0.97529125], dtype=float32),\n", + " array([1.39369495e-05], dtype=float32),\n", + " array([0.9499197], dtype=float32),\n", + " array([0.69558215], dtype=float32),\n", + " array([0.98174447], dtype=float32),\n", + " array([0.01318819], dtype=float32),\n", + " array([0.9626703], dtype=float32),\n", + " array([0.98742026], dtype=float32),\n", + " array([0.00059057], dtype=float32),\n", + " array([0.6133139], dtype=float32),\n", + " array([0.978926], dtype=float32),\n", + " array([0.99840707], dtype=float32),\n", + " array([0.07168697], dtype=float32),\n", + " array([0.89191675], dtype=float32),\n", + " array([0.48994958], dtype=float32),\n", + " array([0.02672931], dtype=float32),\n", + " array([0.78033304], dtype=float32),\n", + " array([0.07513892], dtype=float32),\n", + " array([0.00686305], dtype=float32),\n", + " array([0.04119945], dtype=float32),\n", + " array([1.49263915e-05], dtype=float32),\n", + " array([0.99980336], dtype=float32),\n", + " array([0.8471216], dtype=float32),\n", + " array([0.00010777], dtype=float32),\n", + " array([0.9340031], dtype=float32),\n", + " array([0.8214722], dtype=float32),\n", + " array([0.9786547], dtype=float32),\n", + " array([0.00837058], dtype=float32),\n", + " array([0.9238503], dtype=float32),\n", + " array([0.00408007], dtype=float32),\n", + " array([0.18840362], dtype=float32),\n", + " array([0.999974], dtype=float32),\n", + " array([0.9948447], dtype=float32),\n", + " array([0.8062789], dtype=float32),\n", + " array([0.11027395], dtype=float32),\n", + " array([0.04690371], dtype=float32),\n", + " array([0.07576486], dtype=float32),\n", + " array([0.9307181], dtype=float32),\n", + " array([0.9578869], dtype=float32),\n", + " array([4.3841228e-05], dtype=float32),\n", + " array([0.97011423], dtype=float32),\n", + " 
array([0.372595], dtype=float32),\n", + " array([0.08670929], dtype=float32),\n", + " array([0.9922921], dtype=float32),\n", + " array([0.00444584], dtype=float32),\n", + " array([0.9995722], dtype=float32),\n", + " array([0.90575284], dtype=float32),\n", + " array([0.03082987], dtype=float32),\n", + " array([8.931183e-06], dtype=float32),\n", + " array([0.01119773], dtype=float32),\n", + " array([0.9681336], dtype=float32),\n", + " array([0.839909], dtype=float32),\n", + " array([0.00667274], dtype=float32),\n", + " array([0.99168044], dtype=float32),\n", + " array([0.99999154], dtype=float32),\n", + " array([1.9037469e-05], dtype=float32),\n", + " array([0.9974356], dtype=float32),\n", + " array([0.00046782], dtype=float32),\n", + " array([0.00524331], dtype=float32),\n", + " array([0.8870116], dtype=float32),\n", + " array([0.9076144], dtype=float32),\n", + " array([0.02826679], dtype=float32),\n", + " array([0.95415473], dtype=float32),\n", + " array([0.3839109], dtype=float32),\n", + " array([0.99069595], dtype=float32),\n", + " array([0.06462941], dtype=float32),\n", + " array([0.99408925], dtype=float32),\n", + " array([0.00728586], dtype=float32),\n", + " array([0.9963102], dtype=float32),\n", + " array([0.88912857], dtype=float32),\n", + " array([0.99318165], dtype=float32),\n", + " array([0.98711836], dtype=float32),\n", + " array([0.9997482], dtype=float32),\n", + " array([0.12893666], dtype=float32),\n", + " array([2.553328e-05], dtype=float32),\n", + " array([0.81136394], dtype=float32),\n", + " array([0.6672609], dtype=float32),\n", + " array([0.6661795], dtype=float32),\n", + " array([0.03229121], dtype=float32),\n", + " array([0.56833935], dtype=float32),\n", + " array([0.23906621], dtype=float32),\n", + " array([0.9886596], dtype=float32),\n", + " array([0.9827251], dtype=float32),\n", + " array([0.08567941], dtype=float32),\n", + " array([0.37140584], dtype=float32),\n", + " array([0.00025531], dtype=float32),\n", + " array([0.99791545], 
dtype=float32),\n", + " array([0.02411093], dtype=float32),\n", + " array([0.9877809], dtype=float32),\n", + " array([0.908092], dtype=float32),\n", + " array([0.8383248], dtype=float32),\n", + " array([0.00739653], dtype=float32),\n", + " array([0.00090695], dtype=float32),\n", + " array([0.9652902], dtype=float32),\n", + " array([0.01431155], dtype=float32),\n", + " array([0.93294597], dtype=float32),\n", + " array([0.99896336], dtype=float32),\n", + " array([0.9984067], dtype=float32),\n", + " array([0.93452567], dtype=float32),\n", + " array([0.99430794], dtype=float32),\n", + " array([0.36339617], dtype=float32),\n", + " array([0.8769031], dtype=float32),\n", + " array([0.9518878], dtype=float32),\n", + " array([0.83151025], dtype=float32),\n", + " array([0.9985399], dtype=float32),\n", + " array([0.0002125], dtype=float32),\n", + " array([0.714252], dtype=float32),\n", + " array([0.27901366], dtype=float32),\n", + " array([0.8523226], dtype=float32),\n", + " array([0.99559104], dtype=float32),\n", + " array([0.18001182], dtype=float32),\n", + " array([0.9432954], dtype=float32),\n", + " array([0.8350808], dtype=float32),\n", + " array([0.00853516], dtype=float32),\n", + " array([0.15583186], dtype=float32),\n", + " array([0.92990994], dtype=float32),\n", + " array([0.7541111], dtype=float32),\n", + " array([0.69654137], dtype=float32),\n", + " array([0.01848821], dtype=float32),\n", + " array([0.59170055], dtype=float32),\n", + " array([0.9971204], dtype=float32),\n", + " array([0.9903796], dtype=float32),\n", + " array([0.9991167], dtype=float32),\n", + " array([0.9316476], dtype=float32),\n", + " array([0.06031401], dtype=float32),\n", + " array([0.02550006], dtype=float32),\n", + " array([0.9999504], dtype=float32),\n", + " array([0.00857145], dtype=float32),\n", + " array([0.47920564], dtype=float32),\n", + " array([0.9485018], dtype=float32),\n", + " array([0.00464081], dtype=float32),\n", + " array([0.08251999], dtype=float32),\n", + " 
array([0.98797554], dtype=float32),\n", + " array([0.97623616], dtype=float32),\n", + " array([0.00270883], dtype=float32),\n", + " array([0.41065904], dtype=float32),\n", + " array([0.00041126], dtype=float32),\n", + " array([0.9735677], dtype=float32),\n", + " array([0.01444051], dtype=float32),\n", + " array([0.1193343], dtype=float32),\n", + " array([0.94883794], dtype=float32),\n", + " array([0.81132954], dtype=float32),\n", + " array([0.9701367], dtype=float32),\n", + " array([0.99988973], dtype=float32),\n", + " array([0.95782846], dtype=float32),\n", + " array([0.9999559], dtype=float32),\n", + " array([0.02463553], dtype=float32),\n", + " array([0.80905896], dtype=float32),\n", + " array([0.00272602], dtype=float32),\n", + " array([0.9443275], dtype=float32),\n", + " array([0.6925543], dtype=float32),\n", + " array([0.96254104], dtype=float32),\n", + " array([0.9993697], dtype=float32),\n", + " array([0.90027475], dtype=float32),\n", + " array([0.05616611], dtype=float32),\n", + " array([1.1050109e-05], dtype=float32),\n", + " array([0.8539005], dtype=float32),\n", + " array([0.7169908], dtype=float32),\n", + " array([0.06052893], dtype=float32),\n", + " array([0.03273512], dtype=float32),\n", + " array([0.98712534], dtype=float32),\n", + " array([0.00043659], dtype=float32),\n", + " array([0.9919195], dtype=float32),\n", + " array([0.5189989], dtype=float32),\n", + " array([0.01810263], dtype=float32),\n", + " array([0.00150598], dtype=float32),\n", + " array([0.06606124], dtype=float32),\n", + " array([0.00081787], dtype=float32),\n", + " array([0.01792734], dtype=float32),\n", + " array([0.9788325], dtype=float32),\n", + " array([0.95970446], dtype=float32),\n", + " array([0.09366837], dtype=float32),\n", + " array([0.01276378], dtype=float32),\n", + " array([0.9993555], dtype=float32),\n", + " array([0.027029], dtype=float32),\n", + " array([0.56499213], dtype=float32),\n", + " array([0.99708503], dtype=float32),\n", + " array([0.00154167], 
dtype=float32),\n", + " array([0.2801673], dtype=float32),\n", + " array([0.52925706], dtype=float32),\n", + " array([0.0010483], dtype=float32),\n", + " array([0.9990589], dtype=float32),\n", + " array([0.00761955], dtype=float32),\n", + " array([0.936439], dtype=float32),\n", + " array([0.9875731], dtype=float32),\n", + " array([0.05203724], dtype=float32),\n", + " array([0.9949458], dtype=float32),\n", + " array([0.12733188], dtype=float32),\n", + " array([0.01648956], dtype=float32),\n", + " array([0.7714576], dtype=float32),\n", + " array([0.7118609], dtype=float32),\n", + " array([0.09135327], dtype=float32),\n", + " array([0.94923663], dtype=float32),\n", + " array([0.00418737], dtype=float32),\n", + " array([0.39404547], dtype=float32),\n", + " array([0.98599905], dtype=float32),\n", + " array([0.7954801], dtype=float32),\n", + " array([0.42050537], dtype=float32),\n", + " array([0.02979656], dtype=float32),\n", + " array([0.9153005], dtype=float32),\n", + " array([0.7568136], dtype=float32),\n", + " array([0.5575319], dtype=float32),\n", + " array([0.9995894], dtype=float32),\n", + " array([0.9746347], dtype=float32),\n", + " array([6.51397e-05], dtype=float32),\n", + " array([0.14501932], dtype=float32),\n", + " array([0.97661], dtype=float32),\n", + " array([0.01651403], dtype=float32),\n", + " array([0.73719937], dtype=float32),\n", + " array([0.9063153], dtype=float32),\n", + " array([0.997982], dtype=float32),\n", + " array([0.91056806], dtype=float32),\n", + " array([0.00447078], dtype=float32),\n", + " array([0.09257668], dtype=float32),\n", + " array([0.9366054], dtype=float32),\n", + " array([0.9811677], dtype=float32),\n", + " array([0.0012391], dtype=float32),\n", + " array([0.00391587], dtype=float32),\n", + " array([0.00012618], dtype=float32),\n", + " array([0.0366583], dtype=float32),\n", + " array([0.00550616], dtype=float32),\n", + " array([0.890634], dtype=float32),\n", + " array([0.00715845], dtype=float32),\n", + " array([0.72381204], 
dtype=float32),\n", + " array([0.19576788], dtype=float32),\n", + " array([0.99990416], dtype=float32),\n", + " array([0.0158124], dtype=float32),\n", + " array([0.61522424], dtype=float32),\n", + " array([0.9689464], dtype=float32),\n", + " array([0.04064468], dtype=float32),\n", + " array([0.00022891], dtype=float32),\n", + " array([0.02944768], dtype=float32),\n", + " array([0.999653], dtype=float32),\n", + " array([0.40116826], dtype=float32),\n", + " array([0.9913776], dtype=float32),\n", + " array([0.0029448], dtype=float32),\n", + " array([0.32557806], dtype=float32),\n", + " array([0.6863088], dtype=float32),\n", + " array([0.00081112], dtype=float32),\n", + " array([0.97927356], dtype=float32),\n", + " array([0.19653757], dtype=float32),\n", + " array([0.9705768], dtype=float32),\n", + " array([0.04453946], dtype=float32),\n", + " array([0.00284266], dtype=float32),\n", + " array([0.03559921], dtype=float32),\n", + " array([0.9526187], dtype=float32),\n", + " array([0.7230885], dtype=float32),\n", + " array([0.8201464], dtype=float32),\n", + " array([0.00017875], dtype=float32),\n", + " array([0.97747767], dtype=float32),\n", + " array([0.5449069], dtype=float32),\n", + " array([0.09639208], dtype=float32),\n", + " array([0.90544367], dtype=float32),\n", + " array([0.167667], dtype=float32),\n", + " array([0.9997439], dtype=float32),\n", + " array([0.9310318], dtype=float32),\n", + " array([0.37656942], dtype=float32),\n", + " array([0.0002848], dtype=float32),\n", + " array([0.0001366], dtype=float32),\n", + " array([0.7440771], dtype=float32),\n", + " array([0.88802665], dtype=float32),\n", + " array([0.9152749], dtype=float32),\n", + " array([0.5734805], dtype=float32),\n", + " array([0.9993099], dtype=float32),\n", + " array([0.49408263], dtype=float32),\n", + " array([0.8506351], dtype=float32),\n", + " array([0.00250183], dtype=float32),\n", + " array([0.9945287], dtype=float32),\n", + " array([0.9684286], dtype=float32),\n", + " array([0.90822536], 
dtype=float32),\n", + " array([0.9937883], dtype=float32),\n", + " array([0.99190396], dtype=float32),\n", + " array([0.01760691], dtype=float32),\n", + " array([0.5422416], dtype=float32),\n", + " array([0.29439396], dtype=float32),\n", + " array([0.99019873], dtype=float32),\n", + " array([0.06950508], dtype=float32),\n", + " array([0.00818285], dtype=float32),\n", + " array([0.9632261], dtype=float32),\n", + " array([0.99473333], dtype=float32),\n", + " array([0.25060079], dtype=float32),\n", + " array([0.00048786], dtype=float32),\n", + " array([0.01472425], dtype=float32),\n", + " array([0.00318411], dtype=float32),\n", + " array([0.00093868], dtype=float32),\n", + " array([0.83109117], dtype=float32),\n", + " array([0.00123343], dtype=float32),\n", + " array([0.9713263], dtype=float32),\n", + " array([0.04610278], dtype=float32),\n", + " array([0.05665827], dtype=float32),\n", + " array([0.5868943], dtype=float32),\n", + " array([0.98522806], dtype=float32),\n", + " array([0.03351312], dtype=float32),\n", + " array([0.02006613], dtype=float32),\n", + " array([0.00033519], dtype=float32),\n", + " array([0.67317265], dtype=float32),\n", + " array([0.30107507], dtype=float32),\n", + " array([3.784242e-05], dtype=float32),\n", + " array([0.6087148], dtype=float32),\n", + " array([0.997804], dtype=float32),\n", + " array([0.32963577], dtype=float32),\n", + " array([0.03810342], dtype=float32),\n", + " array([0.99538136], dtype=float32),\n", + " array([0.5548133], dtype=float32),\n", + " array([0.9353912], dtype=float32),\n", + " array([0.9966528], dtype=float32),\n", + " array([0.00378726], dtype=float32),\n", + " array([0.43726218], dtype=float32),\n", + " array([0.95121735], dtype=float32),\n", + " array([0.9728295], dtype=float32),\n", + " array([3.875886e-06], dtype=float32),\n", + " array([0.98975426], dtype=float32),\n", + " array([0.9864806], dtype=float32),\n", + " array([0.00165366], dtype=float32),\n", + " array([0.1064606], dtype=float32),\n", + " 
array([0.89174306], dtype=float32),\n", + " array([0.00587977], dtype=float32),\n", + " array([0.98498905], dtype=float32),\n", + " array([0.06515972], dtype=float32),\n", + " array([0.06025562], dtype=float32),\n", + " array([0.0166713], dtype=float32),\n", + " array([0.93327284], dtype=float32),\n", + " array([0.36270353], dtype=float32),\n", + " array([0.99993503], dtype=float32),\n", + " array([0.75670844], dtype=float32),\n", + " array([0.8717547], dtype=float32),\n", + " array([0.3455405], dtype=float32),\n", + " array([0.79031855], dtype=float32),\n", + " array([0.28538352], dtype=float32),\n", + " array([0.9997949], dtype=float32),\n", + " array([0.26040974], dtype=float32),\n", + " array([0.9983621], dtype=float32),\n", + " array([0.04919887], dtype=float32),\n", + " array([0.00535334], dtype=float32),\n", + " array([0.33617225], dtype=float32),\n", + " array([0.07422278], dtype=float32),\n", + " array([0.15734425], dtype=float32),\n", + " array([0.8681399], dtype=float32),\n", + " array([3.36514e-05], dtype=float32),\n", + " array([0.220001], dtype=float32),\n", + " array([0.03030171], dtype=float32),\n", + " array([0.00071725], dtype=float32),\n", + " array([0.20411605], dtype=float32),\n", + " array([0.38738677], dtype=float32),\n", + " array([0.99825364], dtype=float32),\n", + " array([0.97874314], dtype=float32),\n", + " array([0.9536651], dtype=float32),\n", + " array([0.99999595], dtype=float32),\n", + " array([0.9274589], dtype=float32),\n", + " array([0.67642564], dtype=float32),\n", + " array([0.86876076], dtype=float32),\n", + " array([0.99380374], dtype=float32),\n", + " array([0.00764247], dtype=float32),\n", + " array([0.00141049], dtype=float32),\n", + " array([0.44760624], dtype=float32),\n", + " array([0.7392404], dtype=float32),\n", + " array([0.94820905], dtype=float32),\n", + " array([0.01543296], dtype=float32),\n", + " array([0.0030313], dtype=float32),\n", + " array([0.9983657], dtype=float32),\n", + " array([0.9877472], 
dtype=float32),\n", + " array([0.14449687], dtype=float32),\n", + " array([0.0175909], dtype=float32),\n", + " array([0.9933814], dtype=float32),\n", + " array([0.1099957], dtype=float32),\n", + " array([0.502743], dtype=float32),\n", + " array([0.0021092], dtype=float32),\n", + " array([0.4014902], dtype=float32),\n", + " array([8.531843e-05], dtype=float32),\n", + " array([0.0042778], dtype=float32),\n", + " array([0.91485137], dtype=float32),\n", + " array([0.02211919], dtype=float32),\n", + " array([0.00567074], dtype=float32),\n", + " array([0.06237838], dtype=float32),\n", + " array([0.9416742], dtype=float32),\n", + " array([0.0665731], dtype=float32),\n", + " array([0.8300122], dtype=float32),\n", + " array([0.93574494], dtype=float32),\n", + " array([0.99325573], dtype=float32),\n", + " array([0.24700274], dtype=float32),\n", + " array([0.99896765], dtype=float32),\n", + " array([0.93945384], dtype=float32),\n", + " array([0.18341716], dtype=float32),\n", + " array([0.00710799], dtype=float32),\n", + " array([0.00717159], dtype=float32),\n", + " array([0.9978796], dtype=float32),\n", + " array([0.39169902], dtype=float32),\n", + " array([0.9921503], dtype=float32),\n", + " array([0.33547845], dtype=float32),\n", + " array([0.97284275], dtype=float32),\n", " array([0.99999547], dtype=float32),\n", - " array([3.411692e-06], dtype=float32),\n", - " array([0.7747846], dtype=float32),\n", - " array([0.91780055], dtype=float32),\n", - " array([0.9927326], dtype=float32),\n", - " array([0.2352484], dtype=float32),\n", - " array([0.00142602], dtype=float32),\n", - " array([0.32317147], dtype=float32),\n", - " array([0.00565691], dtype=float32),\n", - " array([0.53445995], dtype=float32),\n", - " array([0.8927338], dtype=float32),\n", - " array([0.13075478], dtype=float32),\n", - " array([0.92551], dtype=float32),\n", - " array([0.06454863], dtype=float32),\n", - " array([0.945902], dtype=float32),\n", - " array([0.98765355], dtype=float32),\n", - " 
array([0.00029585], dtype=float32),\n", - " array([0.02011549], dtype=float32),\n", - " array([0.03295863], dtype=float32),\n", - " array([0.00324995], dtype=float32),\n", - " array([0.01008756], dtype=float32),\n", - " array([0.9823131], dtype=float32),\n", - " array([0.27388793], dtype=float32),\n", - " array([0.5470663], dtype=float32),\n", - " array([0.00781587], dtype=float32),\n", - " array([0.005428], dtype=float32),\n", - " array([0.9992046], dtype=float32),\n", - " array([0.11337327], dtype=float32),\n", - " array([0.0242104], dtype=float32),\n", - " array([0.06808829], dtype=float32),\n", - " array([0.9719501], dtype=float32),\n", - " array([0.960842], dtype=float32),\n", - " array([0.05452807], dtype=float32),\n", - " array([0.9993693], dtype=float32),\n", - " array([0.7771042], dtype=float32),\n", - " array([0.99957746], dtype=float32),\n", - " array([0.05997226], dtype=float32),\n", - " array([0.903902], dtype=float32),\n", - " array([0.86632144], dtype=float32),\n", - " array([0.99996936], dtype=float32),\n", - " array([0.69629955], dtype=float32),\n", - " array([0.8225713], dtype=float32),\n", - " array([0.00246663], dtype=float32),\n", - " array([0.0023386], dtype=float32),\n", - " array([0.9990294], dtype=float32),\n", - " array([0.9977755], dtype=float32),\n", - " array([0.9861735], dtype=float32),\n", - " array([0.03454849], dtype=float32),\n", - " array([0.08657297], dtype=float32),\n", - " array([0.9999199], dtype=float32),\n", - " array([0.9969818], dtype=float32),\n", - " array([0.05295636], dtype=float32),\n", - " array([0.99947375], dtype=float32),\n", - " array([0.01684208], dtype=float32),\n", - " array([0.00564773], dtype=float32),\n", - " array([0.00795649], dtype=float32),\n", - " array([0.9999298], dtype=float32),\n", - " array([0.06059966], dtype=float32),\n", - " array([0.9730349], dtype=float32),\n", - " array([0.3703475], dtype=float32),\n", - " array([0.00036947], dtype=float32),\n", - " array([0.3769037], dtype=float32),\n", - " 
array([0.00783887], dtype=float32),\n", - " array([0.0283057], dtype=float32),\n", - " array([0.04452141], dtype=float32),\n", - " array([0.47198933], dtype=float32),\n", - " array([0.99648005], dtype=float32),\n", - " array([0.9991768], dtype=float32),\n", - " array([0.99995506], dtype=float32),\n", - " array([0.1129663], dtype=float32),\n", - " array([0.01932632], dtype=float32),\n", - " array([0.01605185], dtype=float32),\n", - " array([0.9423293], dtype=float32),\n", - " array([0.06175272], dtype=float32),\n", - " array([0.99719644], dtype=float32),\n", - " array([0.10867236], dtype=float32),\n", - " array([0.02944934], dtype=float32),\n", - " array([0.9235288], dtype=float32),\n", - " array([0.47749949], dtype=float32),\n", - " array([0.88871026], dtype=float32),\n", - " array([0.11335868], dtype=float32),\n", - " array([0.9990363], dtype=float32),\n", - " array([0.03595558], dtype=float32),\n", - " array([0.19236687], dtype=float32),\n", - " array([0.99891937], dtype=float32),\n", - " array([0.28199562], dtype=float32),\n", - " array([0.01422782], dtype=float32),\n", - " array([0.0095924], dtype=float32),\n", - " array([0.05586888], dtype=float32),\n", - " array([0.90418166], dtype=float32),\n", - " array([0.06067238], dtype=float32),\n", - " array([0.12407589], dtype=float32),\n", - " array([0.00962292], dtype=float32),\n", - " array([0.01531602], dtype=float32),\n", - " array([0.00537257], dtype=float32),\n", - " array([0.3383139], dtype=float32),\n", - " array([0.9790156], dtype=float32),\n", - " array([0.99629223], dtype=float32),\n", - " array([0.999041], dtype=float32),\n", - " array([0.4543444], dtype=float32),\n", - " array([0.9975923], dtype=float32),\n", - " array([0.994676], dtype=float32),\n", - " array([0.9955705], dtype=float32),\n", - " array([0.00852212], dtype=float32),\n", - " array([0.00965995], dtype=float32),\n", - " array([0.04810032], dtype=float32),\n", - " array([0.03414589], dtype=float32),\n", - " array([0.19549885], 
dtype=float32),\n", - " array([0.04221633], dtype=float32),\n", - " array([0.999081], dtype=float32),\n", - " array([0.95819485], dtype=float32),\n", - " array([0.02422899], dtype=float32),\n", - " array([0.00078607], dtype=float32),\n", - " array([0.01256537], dtype=float32),\n", - " array([0.573112], dtype=float32),\n", - " array([0.97446126], dtype=float32),\n", - " array([0.01481443], dtype=float32),\n", - " array([0.91800165], dtype=float32),\n", - " array([0.04669483], dtype=float32),\n", - " array([0.12667258], dtype=float32),\n", - " array([0.9989146], dtype=float32),\n", - " array([0.11938414], dtype=float32),\n", - " array([0.8915277], dtype=float32),\n", - " array([0.01737923], dtype=float32),\n", - " array([0.9999982], dtype=float32),\n", - " array([0.96499765], dtype=float32),\n", - " array([0.02043628], dtype=float32),\n", - " array([0.6315207], dtype=float32),\n", - " array([0.9999362], dtype=float32),\n", - " array([0.34459296], dtype=float32),\n", - " array([0.98566204], dtype=float32),\n", - " array([0.97014564], dtype=float32),\n", - " array([0.99786866], dtype=float32),\n", - " array([0.01015446], dtype=float32),\n", - " array([0.8746796], dtype=float32),\n", - " array([0.9308818], dtype=float32),\n", - " array([0.00047523], dtype=float32),\n", - " array([0.99945456], dtype=float32),\n", - " array([0.00871587], dtype=float32),\n", - " array([0.87762976], dtype=float32),\n", - " array([0.00176486], dtype=float32),\n", - " array([0.9776403], dtype=float32),\n", - " array([0.00964555], dtype=float32),\n", - " array([0.38256386], dtype=float32),\n", - " array([0.9978903], dtype=float32),\n", - " array([0.48501348], dtype=float32),\n", - " array([0.9758839], dtype=float32),\n", - " array([0.76296157], dtype=float32),\n", - " array([0.00493866], dtype=float32),\n", - " array([0.05346973], dtype=float32),\n", - " array([0.9999949], dtype=float32),\n", - " array([0.00160436], dtype=float32),\n", - " array([0.00270788], dtype=float32),\n", - " 
array([0.2500689], dtype=float32),\n", - " array([0.01582536], dtype=float32),\n", - " array([0.8722655], dtype=float32),\n", - " array([0.95772576], dtype=float32),\n", - " array([0.9999635], dtype=float32),\n", - " array([0.86149096], dtype=float32),\n", - " array([0.12169719], dtype=float32),\n", - " array([0.28068733], dtype=float32),\n", - " array([0.00027394], dtype=float32),\n", - " array([0.00176786], dtype=float32),\n", - " array([0.00266076], dtype=float32),\n", - " array([0.00955428], dtype=float32),\n", - " array([0.06166862], dtype=float32),\n", - " array([0.96516556], dtype=float32),\n", - " array([0.99725515], dtype=float32),\n", - " array([0.86680585], dtype=float32),\n", - " array([0.7473102], dtype=float32),\n", - " array([0.09695191], dtype=float32),\n", - " array([0.00296136], dtype=float32),\n", - " array([0.00260568], dtype=float32),\n", - " array([0.9957995], dtype=float32),\n", - " array([0.99882144], dtype=float32),\n", - " array([0.00024917], dtype=float32),\n", - " array([0.9272133], dtype=float32),\n", - " array([0.00036355], dtype=float32),\n", - " array([0.9843275], dtype=float32),\n", - " array([0.02331734], dtype=float32),\n", - " array([0.10120519], dtype=float32),\n", - " array([0.47101545], dtype=float32),\n", - " array([0.02735368], dtype=float32),\n", - " array([0.9993734], dtype=float32),\n", - " array([0.9975063], dtype=float32),\n", - " array([0.9815989], dtype=float32),\n", - " array([0.9998317], dtype=float32),\n", - " array([0.08344828], dtype=float32),\n", - " array([0.9794001], dtype=float32),\n", - " array([0.9987134], dtype=float32),\n", - " array([0.00018663], dtype=float32),\n", - " array([0.8894404], dtype=float32),\n", - " array([0.04193362], dtype=float32),\n", - " array([0.99497104], dtype=float32),\n", - " array([0.04068065], dtype=float32),\n", - " array([0.00536961], dtype=float32),\n", - " array([0.06312026], dtype=float32),\n", - " array([0.05857188], dtype=float32),\n", - " array([0.68636453], 
dtype=float32),\n", - " array([0.9688917], dtype=float32),\n", - " array([0.19696496], dtype=float32),\n", - " array([0.06663571], dtype=float32),\n", - " array([0.03183109], dtype=float32),\n", - " array([0.96671313], dtype=float32),\n", - " array([0.05208089], dtype=float32),\n", - " array([0.09314187], dtype=float32),\n", - " array([0.9926622], dtype=float32),\n", - " array([0.92366344], dtype=float32),\n", - " array([0.9675213], dtype=float32),\n", - " array([0.01860829], dtype=float32),\n", - " array([0.9165048], dtype=float32),\n", - " array([0.94261616], dtype=float32),\n", - " array([0.0835857], dtype=float32),\n", - " array([0.00041597], dtype=float32),\n", - " array([0.00907566], dtype=float32),\n", - " array([0.94010156], dtype=float32),\n", - " array([0.9840152], dtype=float32),\n", - " array([0.00041544], dtype=float32),\n", - " array([0.8775654], dtype=float32),\n", - " array([0.32026634], dtype=float32),\n", - " array([0.01421119], dtype=float32),\n", - " array([0.0130173], dtype=float32),\n", - " array([0.9884545], dtype=float32),\n", - " array([0.04571154], dtype=float32),\n", - " array([1.6011174e-05], dtype=float32),\n", - " array([0.00115296], dtype=float32),\n", - " array([0.9790429], dtype=float32),\n", - " array([0.64808387], dtype=float32),\n", - " array([4.6414338e-05], dtype=float32),\n", - " array([0.5914479], dtype=float32),\n", - " array([0.00711486], dtype=float32),\n", - " array([0.82269937], dtype=float32),\n", - " array([0.66748506], dtype=float32),\n", - " array([0.9879972], dtype=float32),\n", - " array([6.153964e-05], dtype=float32),\n", - " array([0.99550027], dtype=float32),\n", - " array([0.00570012], dtype=float32),\n", - " array([0.46884078], dtype=float32),\n", - " array([0.00010328], dtype=float32),\n", - " array([0.03197822], dtype=float32),\n", - " array([0.9972145], dtype=float32),\n", - " array([0.05284659], dtype=float32),\n", - " array([0.9037368], dtype=float32),\n", - " array([0.04048614], dtype=float32),\n", + " 
array([0.04805868], dtype=float32),\n", + " array([0.6807831], dtype=float32),\n", + " array([0.38082442], dtype=float32),\n", + " array([0.7750744], dtype=float32),\n", + " array([0.99722785], dtype=float32),\n", + " array([0.77780694], dtype=float32),\n", + " array([0.9519044], dtype=float32),\n", + " array([0.00215464], dtype=float32),\n", + " array([0.29531085], dtype=float32),\n", + " array([0.9999316], dtype=float32),\n", + " array([0.7214245], dtype=float32),\n", + " array([0.8033163], dtype=float32),\n", + " array([0.6166736], dtype=float32),\n", + " array([0.26327613], dtype=float32),\n", + " array([0.21962917], dtype=float32),\n", + " array([0.10679483], dtype=float32),\n", + " array([0.04216451], dtype=float32),\n", + " array([0.00307667], dtype=float32),\n", + " array([0.99923015], dtype=float32),\n", + " array([0.00597921], dtype=float32),\n", + " array([0.99360764], dtype=float32),\n", + " array([0.973897], dtype=float32),\n", + " array([0.13671698], dtype=float32),\n", + " array([0.44968152], dtype=float32),\n", + " array([0.07701934], dtype=float32),\n", + " array([0.05103498], dtype=float32),\n", + " array([0.9994609], dtype=float32),\n", + " array([0.07936312], dtype=float32),\n", + " array([0.8839954], dtype=float32),\n", + " array([1.365624e-06], dtype=float32),\n", + " array([0.00480004], dtype=float32),\n", + " array([0.12765045], dtype=float32),\n", + " array([0.9904794], dtype=float32),\n", + " array([0.6438497], dtype=float32),\n", + " array([0.8862176], dtype=float32),\n", + " array([7.784928e-05], dtype=float32),\n", + " array([0.19045115], dtype=float32),\n", + " array([0.00067149], dtype=float32),\n", + " array([0.9358372], dtype=float32),\n", + " array([0.02452566], dtype=float32),\n", + " array([0.9958995], dtype=float32),\n", + " array([0.550974], dtype=float32),\n", + " array([0.30900526], dtype=float32),\n", + " array([0.99798125], dtype=float32),\n", + " array([0.01287526], dtype=float32),\n", + " array([0.01379994], 
dtype=float32),\n", + " array([0.12119947], dtype=float32),\n", + " array([0.665414], dtype=float32),\n", + " array([0.00102568], dtype=float32),\n", + " array([0.2067204], dtype=float32),\n", + " array([0.0050051], dtype=float32),\n", + " array([0.00433443], dtype=float32),\n", + " array([0.39867714], dtype=float32),\n", + " array([0.00024582], dtype=float32),\n", + " array([0.00571835], dtype=float32),\n", + " array([0.00590702], dtype=float32),\n", + " array([0.5449246], dtype=float32),\n", + " array([0.97699547], dtype=float32),\n", + " array([0.00366751], dtype=float32),\n", + " array([0.13479914], dtype=float32),\n", + " array([0.98704463], dtype=float32),\n", + " array([0.0312269], dtype=float32),\n", + " array([0.00039572], dtype=float32),\n", + " array([0.7193606], dtype=float32),\n", + " array([0.07044102], dtype=float32),\n", + " array([0.03585317], dtype=float32),\n", + " array([0.17524014], dtype=float32),\n", + " array([0.14926364], dtype=float32),\n", + " array([0.21622558], dtype=float32),\n", + " array([0.47393447], dtype=float32),\n", + " array([0.8796138], dtype=float32),\n", + " array([0.57277304], dtype=float32),\n", + " array([0.9692422], dtype=float32),\n", + " array([0.9952886], dtype=float32),\n", + " array([0.95525163], dtype=float32),\n", + " array([0.3414528], dtype=float32),\n", + " array([0.6035593], dtype=float32),\n", + " array([0.03257844], dtype=float32),\n", + " array([0.01301803], dtype=float32),\n", + " array([0.47819394], dtype=float32),\n", + " array([1.6677832e-08], dtype=float32),\n", + " array([0.22340754], dtype=float32),\n", + " array([0.9999951], dtype=float32),\n", + " array([0.96137166], dtype=float32),\n", + " array([0.9981943], dtype=float32),\n", + " array([0.05160893], dtype=float32),\n", + " array([0.99629396], dtype=float32),\n", + " array([0.9625849], dtype=float32),\n", + " array([0.0002911], dtype=float32),\n", + " array([0.980667], dtype=float32),\n", + " array([0.9892765], dtype=float32),\n", + " 
array([0.9987301], dtype=float32),\n", + " array([0.9874142], dtype=float32),\n", + " array([0.9936329], dtype=float32),\n", + " array([0.997771], dtype=float32),\n", + " array([0.5043148], dtype=float32),\n", + " array([0.8399789], dtype=float32),\n", + " array([0.9929483], dtype=float32),\n", + " array([0.31873196], dtype=float32),\n", + " array([0.0675632], dtype=float32),\n", + " array([0.00233161], dtype=float32),\n", + " array([0.98852634], dtype=float32),\n", + " array([0.9999845], dtype=float32),\n", + " array([0.08548676], dtype=float32),\n", + " array([0.00016344], dtype=float32),\n", + " array([0.06375157], dtype=float32),\n", + " array([0.98533106], dtype=float32),\n", + " array([0.9875267], dtype=float32),\n", + " array([0.02328171], dtype=float32),\n", + " array([0.7528208], dtype=float32),\n", + " array([0.6718994], dtype=float32),\n", + " array([0.7016442], dtype=float32),\n", + " array([0.29562166], dtype=float32),\n", + " array([0.21487534], dtype=float32),\n", + " array([0.05325569], dtype=float32),\n", + " array([0.98829865], dtype=float32),\n", + " array([0.0206712], dtype=float32),\n", + " array([0.39194584], dtype=float32),\n", + " array([0.05182257], dtype=float32),\n", + " array([0.12892328], dtype=float32),\n", + " array([0.98039585], dtype=float32),\n", + " array([0.07023581], dtype=float32),\n", + " array([0.998417], dtype=float32),\n", + " array([0.7812852], dtype=float32),\n", + " array([0.09137525], dtype=float32),\n", + " array([0.8678507], dtype=float32),\n", + " array([0.9933328], dtype=float32),\n", + " array([0.3079019], dtype=float32),\n", + " array([0.8708483], dtype=float32),\n", + " array([0.9929174], dtype=float32),\n", + " array([0.85494846], dtype=float32),\n", + " array([0.9882675], dtype=float32),\n", + " array([0.9930362], dtype=float32),\n", + " array([0.44101492], dtype=float32),\n", + " array([0.00028029], dtype=float32),\n", + " array([0.98733073], dtype=float32),\n", + " array([0.94348913], dtype=float32),\n", + " 
array([3.119138e-05], dtype=float32),\n", + " array([0.980949], dtype=float32),\n", + " array([0.9913406], dtype=float32),\n", + " array([0.99495846], dtype=float32),\n", + " array([0.9629638], dtype=float32),\n", + " array([0.0100573], dtype=float32),\n", + " array([0.02189975], dtype=float32),\n", + " array([0.99831617], dtype=float32),\n", + " array([0.98490876], dtype=float32),\n", + " array([0.54414076], dtype=float32),\n", + " array([0.06107181], dtype=float32),\n", + " array([0.9978096], dtype=float32),\n", + " array([0.9745584], dtype=float32),\n", + " array([0.00242021], dtype=float32),\n", + " array([0.03076136], dtype=float32),\n", + " array([0.35039175], dtype=float32),\n", + " array([0.83999205], dtype=float32),\n", + " array([0.99990547], dtype=float32),\n", + " array([0.05263938], dtype=float32),\n", + " array([0.8979464], dtype=float32),\n", + " array([0.03534276], dtype=float32),\n", + " array([0.00471485], dtype=float32),\n", + " array([0.99737906], dtype=float32),\n", + " array([0.929945], dtype=float32),\n", + " array([0.01993787], dtype=float32),\n", + " array([0.9856134], dtype=float32),\n", + " array([0.7457446], dtype=float32),\n", + " array([0.99158585], dtype=float32),\n", + " array([0.9860604], dtype=float32),\n", + " array([0.03886136], dtype=float32),\n", + " array([0.96496195], dtype=float32),\n", + " array([0.31795], dtype=float32),\n", + " array([0.99946743], dtype=float32),\n", + " array([0.996521], dtype=float32),\n", + " array([0.03773015], dtype=float32),\n", + " array([0.00583928], dtype=float32),\n", + " array([0.99041665], dtype=float32),\n", + " array([0.9955739], dtype=float32),\n", + " array([0.01058325], dtype=float32),\n", + " array([0.00011865], dtype=float32),\n", + " array([0.8401856], dtype=float32),\n", + " array([0.63474256], dtype=float32),\n", + " array([0.9829626], dtype=float32),\n", + " array([0.01037378], dtype=float32),\n", + " array([0.26479724], dtype=float32),\n", + " array([0.21121329], 
dtype=float32),\n", + " array([0.9914016], dtype=float32),\n", + " array([0.9588108], dtype=float32),\n", + " array([0.99756277], dtype=float32),\n", + " array([0.30543897], dtype=float32),\n", + " array([0.99640626], dtype=float32),\n", + " array([0.30586973], dtype=float32),\n", + " array([0.9993086], dtype=float32),\n", + " array([0.9949649], dtype=float32),\n", + " array([0.6421015], dtype=float32),\n", + " array([0.14092435], dtype=float32),\n", + " array([0.01815344], dtype=float32),\n", + " array([0.00090887], dtype=float32),\n", + " array([0.9869277], dtype=float32),\n", + " array([0.22545609], dtype=float32),\n", + " array([0.9994192], dtype=float32),\n", + " array([0.10223134], dtype=float32),\n", + " array([0.9989011], dtype=float32),\n", + " array([0.02059738], dtype=float32),\n", + " array([0.88542646], dtype=float32),\n", + " array([0.9960936], dtype=float32),\n", + " array([0.9262567], dtype=float32),\n", + " array([0.9434017], dtype=float32),\n", + " array([0.98046255], dtype=float32),\n", + " array([0.9889431], dtype=float32),\n", + " array([0.7408156], dtype=float32),\n", + " array([0.00285646], dtype=float32),\n", + " array([0.9890942], dtype=float32),\n", + " array([0.7398897], dtype=float32),\n", + " array([0.9671184], dtype=float32),\n", + " array([0.99998057], dtype=float32),\n", + " array([0.9491266], dtype=float32),\n", + " array([0.54299086], dtype=float32),\n", + " array([0.00412416], dtype=float32),\n", + " array([0.6694579], dtype=float32),\n", + " array([0.95415497], dtype=float32),\n", + " array([0.01549284], dtype=float32),\n", + " array([0.0003646], dtype=float32),\n", + " array([0.99999607], dtype=float32),\n", + " array([0.9999577], dtype=float32),\n", + " array([0.00113213], dtype=float32),\n", + " array([0.9941749], dtype=float32),\n", + " array([0.9958812], dtype=float32),\n", + " array([0.99189734], dtype=float32),\n", + " array([0.0017188], dtype=float32),\n", + " array([0.985795], dtype=float32),\n", + " array([0.9998721], 
dtype=float32),\n", + " array([0.99999976], dtype=float32),\n", + " array([0.98263794], dtype=float32),\n", + " array([0.58947575], dtype=float32),\n", + " array([0.00927054], dtype=float32),\n", + " array([0.9716789], dtype=float32),\n", + " array([0.84313625], dtype=float32),\n", + " array([0.96165526], dtype=float32),\n", + " array([0.9851811], dtype=float32),\n", + " array([0.9854842], dtype=float32),\n", + " array([0.00322469], dtype=float32),\n", + " array([0.9309462], dtype=float32),\n", + " array([0.20306274], dtype=float32),\n", + " array([0.04456307], dtype=float32),\n", + " array([0.9654337], dtype=float32),\n", + " array([0.01055153], dtype=float32),\n", + " array([0.99989104], dtype=float32),\n", + " array([0.03129936], dtype=float32),\n", + " array([0.2108155], dtype=float32),\n", + " array([0.98949], dtype=float32),\n", + " array([0.99999154], dtype=float32),\n", + " array([0.94526803], dtype=float32),\n", + " array([0.99107426], dtype=float32),\n", + " array([0.99824476], dtype=float32),\n", + " array([0.99930096], dtype=float32),\n", + " array([0.9494158], dtype=float32),\n", + " array([0.59529406], dtype=float32),\n", + " array([0.00836287], dtype=float32),\n", + " array([0.99950933], dtype=float32),\n", + " array([0.8118227], dtype=float32),\n", + " array([0.6227854], dtype=float32),\n", + " array([0.9727045], dtype=float32),\n", + " array([0.99001336], dtype=float32),\n", + " array([0.61210626], dtype=float32),\n", + " array([0.00018276], dtype=float32),\n", + " array([0.09038408], dtype=float32),\n", + " array([0.08299794], dtype=float32),\n", + " array([0.0105845], dtype=float32),\n", + " array([0.16678979], dtype=float32),\n", + " array([0.9531919], dtype=float32),\n", + " array([0.9998332], dtype=float32),\n", + " array([5.249855e-05], dtype=float32),\n", + " array([0.00057517], dtype=float32),\n", + " array([0.997013], dtype=float32),\n", + " array([0.12925929], dtype=float32),\n", + " array([0.07413327], dtype=float32),\n", + " 
array([0.98919934], dtype=float32),\n", + " array([0.5382614], dtype=float32),\n", + " array([0.9996692], dtype=float32),\n", + " array([0.8613375], dtype=float32),\n", + " array([0.71423596], dtype=float32),\n", + " array([0.09667405], dtype=float32),\n", + " array([0.9979893], dtype=float32),\n", + " array([0.00794561], dtype=float32),\n", + " array([0.00175152], dtype=float32),\n", + " array([0.21769904], dtype=float32),\n", + " array([0.94123036], dtype=float32),\n", + " array([0.96663105], dtype=float32),\n", + " array([0.01070287], dtype=float32),\n", + " array([0.07400733], dtype=float32),\n", + " array([0.012168], dtype=float32),\n", + " array([0.01236583], dtype=float32),\n", " array([0.998744], dtype=float32),\n", - " array([0.99049884], dtype=float32),\n", - " array([0.01359443], dtype=float32),\n", - " array([0.9997179], dtype=float32),\n", - " array([0.99999964], dtype=float32),\n", - " array([0.98721075], dtype=float32),\n", - " array([0.00063402], dtype=float32),\n", - " array([0.820883], dtype=float32),\n", - " array([0.4547376], dtype=float32),\n", - " array([0.891108], dtype=float32),\n", - " array([0.16223074], dtype=float32),\n", - " array([0.9726654], dtype=float32),\n", - " array([0.9003827], dtype=float32),\n", - " array([0.99944574], dtype=float32),\n", - " array([0.7704998], dtype=float32),\n", - " array([0.95534146], dtype=float32),\n", - " array([0.0062368], dtype=float32),\n", - " array([0.9787254], dtype=float32),\n", - " array([0.00126028], dtype=float32),\n", - " array([0.7004171], dtype=float32),\n", - " array([0.09580212], dtype=float32),\n", - " array([0.97376186], dtype=float32),\n", - " array([0.9920665], dtype=float32),\n", - " array([0.12573221], dtype=float32),\n", - " array([0.96389884], dtype=float32),\n", - " array([0.9980578], dtype=float32),\n", - " array([0.00578512], dtype=float32),\n", - " array([0.01206519], dtype=float32),\n", - " array([0.9992673], dtype=float32),\n", - " array([0.07898365], dtype=float32),\n", - " 
array([0.00214792], dtype=float32),\n", - " array([0.05026222], dtype=float32),\n", - " array([0.99995553], dtype=float32),\n", - " array([0.99840695], dtype=float32),\n", - " array([0.99762005], dtype=float32),\n", - " array([1.], dtype=float32),\n", - " array([0.7520437], dtype=float32),\n", - " array([0.03680907], dtype=float32),\n", - " array([0.5764651], dtype=float32),\n", - " array([0.99977905], dtype=float32),\n", - " array([0.20490777], dtype=float32),\n", - " array([0.00173326], dtype=float32),\n", - " array([0.9939528], dtype=float32),\n", - " array([1.9042971e-05], dtype=float32),\n", - " array([0.95325047], dtype=float32),\n", - " array([0.9958734], dtype=float32),\n", - " array([0.03266709], dtype=float32),\n", - " array([0.02785043], dtype=float32),\n", - " array([0.05891791], dtype=float32),\n", - " array([0.985227], dtype=float32),\n", - " array([0.00210726], dtype=float32),\n", - " array([0.9943926], dtype=float32),\n", - " array([0.02131884], dtype=float32),\n", - " array([0.99827754], dtype=float32),\n", - " array([0.8846837], dtype=float32),\n", - " array([0.99997437], dtype=float32),\n", - " array([0.9946067], dtype=float32),\n", - " array([0.99978703], dtype=float32),\n", - " array([0.00300169], dtype=float32),\n", - " array([0.00031111], dtype=float32),\n", - " array([0.9819504], dtype=float32),\n", - " array([0.00375891], dtype=float32),\n", - " array([0.63086605], dtype=float32),\n", - " array([0.83123654], dtype=float32),\n", - " array([0.63774806], dtype=float32),\n", - " array([0.6908987], dtype=float32),\n", - " array([0.15456767], dtype=float32),\n", - " array([0.4055819], dtype=float32),\n", - " array([0.0910763], dtype=float32),\n", - " array([0.24731727], dtype=float32),\n", - " array([0.994842], dtype=float32),\n", - " array([0.38033506], dtype=float32),\n", - " array([0.98958546], dtype=float32),\n", - " array([0.9998734], dtype=float32),\n", - " array([0.99897206], dtype=float32),\n", - " array([0.8665204], dtype=float32),\n", - 
" array([0.00639246], dtype=float32),\n", - " array([0.9556338], dtype=float32),\n", - " array([0.9666423], dtype=float32),\n", - " array([0.9984849], dtype=float32),\n", - " array([0.02892273], dtype=float32),\n", - " array([0.995031], dtype=float32),\n", - " array([0.72732645], dtype=float32),\n", - " array([0.998869], dtype=float32),\n", - " array([0.01101623], dtype=float32),\n", - " array([0.9236663], dtype=float32),\n", - " array([0.0053416], dtype=float32),\n", - " array([0.9376339], dtype=float32),\n", - " array([0.9097744], dtype=float32),\n", - " array([0.32959577], dtype=float32),\n", - " array([0.69777864], dtype=float32),\n", - " array([0.02821315], dtype=float32),\n", - " array([0.8764768], dtype=float32),\n", - " array([0.01698065], dtype=float32),\n", - " array([0.05337453], dtype=float32),\n", - " array([0.00699902], dtype=float32),\n", - " array([0.01098163], dtype=float32),\n", - " array([0.02664173], dtype=float32),\n", - " array([0.19115667], dtype=float32),\n", - " array([0.01039254], dtype=float32),\n", - " array([0.7853336], dtype=float32),\n", - " array([0.13310696], dtype=float32),\n", - " array([0.12221986], dtype=float32),\n", - " array([0.99144626], dtype=float32),\n", - " array([0.12488245], dtype=float32),\n", - " array([0.10422938], dtype=float32),\n", - " array([0.03960704], dtype=float32),\n", - " array([0.96108264], dtype=float32),\n", - " array([0.00321886], dtype=float32),\n", - " array([0.9612626], dtype=float32),\n", - " array([0.77753425], dtype=float32),\n", - " array([0.992634], dtype=float32),\n", - " array([0.08653396], dtype=float32),\n", - " array([0.573064], dtype=float32),\n", - " array([0.994193], dtype=float32),\n", - " array([0.9994568], dtype=float32),\n", - " array([0.00164113], dtype=float32),\n", - " array([0.02974954], dtype=float32),\n", - " array([0.00094306], dtype=float32),\n", - " array([0.01469964], dtype=float32),\n", - " array([0.01007712], dtype=float32),\n", - " array([0.5073839], dtype=float32),\n", 
- " array([0.00891581], dtype=float32),\n", - " array([0.01340619], dtype=float32),\n", - " array([0.19153547], dtype=float32),\n", - " array([0.00785635], dtype=float32),\n", - " array([0.00529725], dtype=float32),\n", - " array([0.9981645], dtype=float32),\n", - " array([0.89561176], dtype=float32),\n", - " array([0.44029003], dtype=float32),\n", - " array([0.9998586], dtype=float32),\n", - " array([0.09905386], dtype=float32),\n", - " array([0.9964311], dtype=float32),\n", - " array([0.9990119], dtype=float32),\n", - " array([0.85216373], dtype=float32),\n", - " array([0.9999999], dtype=float32),\n", - " array([0.9584791], dtype=float32),\n", - " array([0.00405316], dtype=float32),\n", - " array([0.9539604], dtype=float32),\n", - " array([0.9477786], dtype=float32),\n", - " array([0.99440527], dtype=float32),\n", - " array([0.9769205], dtype=float32),\n", - " array([0.89980936], dtype=float32),\n", - " array([0.00249994], dtype=float32),\n", - " array([0.8010222], dtype=float32),\n", - " array([0.20035101], dtype=float32),\n", - " array([0.00624598], dtype=float32),\n", - " array([0.80348605], dtype=float32),\n", - " array([0.19786465], dtype=float32),\n", - " array([0.0849141], dtype=float32),\n", - " array([0.9986082], dtype=float32),\n", - " array([0.13313013], dtype=float32),\n", - " array([0.16162278], dtype=float32),\n", - " array([0.993683], dtype=float32),\n", - " array([0.55489], dtype=float32),\n", - " array([0.07838932], dtype=float32),\n", - " array([0.008188], dtype=float32),\n", - " array([0.00667501], dtype=float32),\n", - " array([0.16437183], dtype=float32),\n", - " array([0.96623826], dtype=float32),\n", - " array([0.8830661], dtype=float32),\n", - " array([0.9948244], dtype=float32),\n", - " array([0.98182225], dtype=float32),\n", - " array([0.98158526], dtype=float32),\n", - " array([0.03434094], dtype=float32),\n", - " array([0.3056716], dtype=float32),\n", - " array([0.98550564], dtype=float32),\n", - " array([0.03481789], 
dtype=float32),\n", - " array([0.99663454], dtype=float32),\n", - " array([0.9985311], dtype=float32),\n", - " array([0.9939167], dtype=float32),\n", - " array([0.10510859], dtype=float32),\n", - " array([0.00508491], dtype=float32),\n", - " array([0.00165993], dtype=float32),\n", - " array([0.8245149], dtype=float32),\n", - " array([0.9556251], dtype=float32),\n", - " array([0.9887638], dtype=float32),\n", - " array([0.17581154], dtype=float32),\n", - " array([0.00011383], dtype=float32),\n", - " array([0.00106234], dtype=float32),\n", - " array([0.01681961], dtype=float32),\n", - " array([0.00525573], dtype=float32),\n", - " array([0.9957818], dtype=float32),\n", - " array([0.8603696], dtype=float32),\n", - " array([0.00199909], dtype=float32),\n", - " array([0.04100141], dtype=float32),\n", - " array([0.00017335], dtype=float32),\n", - " array([0.9992545], dtype=float32),\n", - " array([0.9659639], dtype=float32),\n", - " array([0.8995427], dtype=float32),\n", - " array([0.12103864], dtype=float32),\n", - " array([0.710389], dtype=float32),\n", - " array([0.69011146], dtype=float32),\n", - " array([0.05472863], dtype=float32),\n", - " array([0.01922707], dtype=float32),\n", - " array([0.8451995], dtype=float32),\n", - " array([0.99947244], dtype=float32),\n", - " array([0.09135215], dtype=float32),\n", - " array([0.00159073], dtype=float32),\n", - " array([0.05302845], dtype=float32),\n", - " array([0.98199064], dtype=float32),\n", - " array([0.50679266], dtype=float32),\n", - " array([0.17369047], dtype=float32),\n", - " array([0.9998796], dtype=float32),\n", - " array([0.6899049], dtype=float32),\n", - " array([0.00706529], dtype=float32),\n", - " array([0.00957006], dtype=float32),\n", - " array([0.78653455], dtype=float32),\n", - " array([0.00385365], dtype=float32),\n", - " array([0.3887317], dtype=float32),\n", - " array([0.03332283], dtype=float32),\n", - " array([0.04271935], dtype=float32),\n", - " array([0.00691515], dtype=float32),\n", - " 
array([0.51850504], dtype=float32),\n", - " array([0.00992748], dtype=float32),\n", - " array([0.29605916], dtype=float32),\n", - " array([0.00028048], dtype=float32),\n", - " array([0.9928925], dtype=float32),\n", - " array([0.00124131], dtype=float32),\n", - " array([0.9764488], dtype=float32),\n", - " array([0.7932969], dtype=float32),\n", - " array([0.97071785], dtype=float32),\n", - " array([0.98584795], dtype=float32),\n", - " array([0.07081781], dtype=float32),\n", - " array([0.04931527], dtype=float32),\n", - " array([0.15695621], dtype=float32),\n", - " array([0.9994659], dtype=float32),\n", - " array([0.5096887], dtype=float32),\n", - " array([0.9917842], dtype=float32),\n", - " array([0.9981895], dtype=float32),\n", - " array([0.12094765], dtype=float32),\n", - " array([0.11894753], dtype=float32),\n", - " array([0.90818393], dtype=float32),\n", - " array([0.98540974], dtype=float32),\n", - " array([0.94457304], dtype=float32),\n", - " array([0.9998313], dtype=float32),\n", - " array([0.9907136], dtype=float32),\n", - " array([0.72061336], dtype=float32),\n", - " array([0.99645764], dtype=float32),\n", - " array([0.8892745], dtype=float32),\n", - " array([0.999595], dtype=float32),\n", - " array([0.4836757], dtype=float32),\n", - " array([0.97150517], dtype=float32),\n", - " array([0.95611787], dtype=float32),\n", - " array([0.01308193], dtype=float32),\n", - " array([0.00030093], dtype=float32),\n", - " array([0.99999774], dtype=float32),\n", - " array([0.00015023], dtype=float32),\n", - " array([0.00129508], dtype=float32),\n", - " array([0.02929328], dtype=float32),\n", - " array([0.15967111], dtype=float32),\n", - " array([0.04437197], dtype=float32),\n", - " array([0.00077572], dtype=float32),\n", - " array([0.00117108], dtype=float32),\n", - " array([0.02031642], dtype=float32),\n", - " array([0.78561157], dtype=float32),\n", - " array([0.6640868], dtype=float32),\n", - " array([0.96112865], dtype=float32),\n", - " array([0.9977581], 
dtype=float32),\n", - " array([0.24247183], dtype=float32),\n", - " array([0.3876404], dtype=float32),\n", - " array([0.00950658], dtype=float32),\n", - " array([0.12216215], dtype=float32),\n", - " array([0.00086611], dtype=float32),\n", - " array([0.99396956], dtype=float32),\n", - " array([0.77168643], dtype=float32),\n", - " array([0.00386494], dtype=float32),\n", - " array([0.9824516], dtype=float32),\n", - " array([0.9933523], dtype=float32),\n", - " array([0.95625204], dtype=float32),\n", - " array([0.00056513], dtype=float32),\n", - " array([0.949406], dtype=float32),\n", - " array([0.91392314], dtype=float32),\n", - " array([0.998437], dtype=float32),\n", - " array([0.00321832], dtype=float32),\n", - " array([0.3081614], dtype=float32),\n", - " array([0.18779686], dtype=float32),\n", - " array([0.86671174], dtype=float32),\n", - " array([0.99736756], dtype=float32),\n", - " array([0.37101072], dtype=float32),\n", - " array([0.97278196], dtype=float32),\n", - " array([0.03783982], dtype=float32),\n", - " array([0.98899263], dtype=float32),\n", - " array([0.99997747], dtype=float32),\n", - " array([0.33761773], dtype=float32),\n", - " array([0.9922051], dtype=float32),\n", - " array([0.9986929], dtype=float32),\n", - " array([0.9734451], dtype=float32),\n", - " array([0.00104312], dtype=float32),\n", - " array([0.00905259], dtype=float32),\n", - " array([0.9999858], dtype=float32),\n", - " array([0.2685942], dtype=float32),\n", - " array([8.498155e-07], dtype=float32),\n", - " array([0.00194448], dtype=float32),\n", - " array([0.9610404], dtype=float32),\n", - " array([0.06272461], dtype=float32),\n", - " array([0.9734326], dtype=float32),\n", - " array([0.9998591], dtype=float32),\n", - " array([0.02384088], dtype=float32),\n", - " array([0.00458063], dtype=float32),\n", - " array([0.8619814], dtype=float32),\n", - " array([0.3280481], dtype=float32),\n", - " array([0.58994853], dtype=float32),\n", - " array([0.00738818], dtype=float32),\n", - " 
array([0.9968213], dtype=float32),\n", - " array([0.94588715], dtype=float32),\n", - " array([0.89741385], dtype=float32),\n", - " array([0.0001792], dtype=float32),\n", - " array([7.942238e-05], dtype=float32),\n", - " array([0.2487981], dtype=float32),\n", - " array([0.99818295], dtype=float32),\n", - " array([0.06495794], dtype=float32),\n", - " array([0.61300725], dtype=float32),\n", - " array([0.00142293], dtype=float32),\n", - " array([0.7782267], dtype=float32),\n", - " array([0.70798534], dtype=float32),\n", - " array([0.15175731], dtype=float32),\n", - " array([0.99284136], dtype=float32),\n", - " array([0.9841339], dtype=float32),\n", - " array([0.00554728], dtype=float32),\n", - " array([0.0500682], dtype=float32),\n", - " array([1.7751601e-06], dtype=float32),\n", - " array([0.12731266], dtype=float32),\n", - " array([0.01886535], dtype=float32),\n", - " array([0.9990376], dtype=float32),\n", - " array([0.27495182], dtype=float32),\n", - " array([0.90534323], dtype=float32),\n", - " array([0.8381721], dtype=float32),\n", - " array([0.12258686], dtype=float32),\n", - " array([0.23695664], dtype=float32),\n", - " array([0.04559099], dtype=float32),\n", - " array([0.798738], dtype=float32),\n", - " array([0.9249577], dtype=float32),\n", - " array([0.5790399], dtype=float32),\n", - " array([0.7356898], dtype=float32),\n", - " array([0.9420959], dtype=float32),\n", - " array([0.80315626], dtype=float32),\n", - " array([0.907965], dtype=float32),\n", - " array([0.18890426], dtype=float32),\n", - " array([0.04044292], dtype=float32),\n", - " array([0.00435959], dtype=float32),\n", - " array([0.01255109], dtype=float32),\n", - " array([0.973041], dtype=float32),\n", - " array([0.89595586], dtype=float32),\n", - " array([0.15041849], dtype=float32),\n", - " array([0.7386434], dtype=float32),\n", - " array([0.01395628], dtype=float32),\n", - " array([0.00037464], dtype=float32),\n", - " array([0.30354175], dtype=float32),\n", - " array([0.92193896], 
dtype=float32),\n", - " array([0.95892274], dtype=float32),\n", - " array([1.066259e-06], dtype=float32),\n", - " array([0.96353555], dtype=float32),\n", - " array([0.14788437], dtype=float32),\n", - " array([0.9997639], dtype=float32),\n", - " array([0.01777537], dtype=float32),\n", - " array([0.9861092], dtype=float32),\n", - " array([0.13082978], dtype=float32),\n", - " array([0.0002504], dtype=float32),\n", - " array([0.8804745], dtype=float32),\n", - " array([0.9967051], dtype=float32),\n", - " array([0.56104803], dtype=float32),\n", - " array([0.36787862], dtype=float32),\n", - " array([0.8360154], dtype=float32),\n", - " array([0.9998766], dtype=float32),\n", - " array([0.00568995], dtype=float32),\n", - " array([0.00194393], dtype=float32),\n", - " array([0.00631262], dtype=float32),\n", - " array([0.03533027], dtype=float32),\n", - " array([0.9103368], dtype=float32),\n", - " array([0.9982439], dtype=float32),\n", - " array([0.97841996], dtype=float32),\n", - " array([0.00286406], dtype=float32),\n", - " array([0.0708506], dtype=float32),\n", - " array([0.9432028], dtype=float32),\n", - " array([0.9654381], dtype=float32),\n", - " array([0.05079986], dtype=float32),\n", - " array([0.8743878], dtype=float32),\n", - " array([0.00240675], dtype=float32),\n", - " array([0.98993146], dtype=float32),\n", - " array([0.07532773], dtype=float32),\n", - " array([0.22899462], dtype=float32),\n", - " array([0.00091621], dtype=float32),\n", - " array([0.9989504], dtype=float32),\n", - " array([0.39317238], dtype=float32),\n", - " array([0.3326581], dtype=float32),\n", - " array([0.01213577], dtype=float32),\n", - " array([0.99774724], dtype=float32),\n", - " array([0.9886003], dtype=float32),\n", - " array([0.79621345], dtype=float32),\n", - " array([0.79079646], dtype=float32),\n", - " array([0.93861336], dtype=float32),\n", - " array([0.07021908], dtype=float32),\n", - " array([0.7411332], dtype=float32),\n", - " array([0.969042], dtype=float32),\n", - " 
array([0.9099184], dtype=float32),\n", - " array([0.02733893], dtype=float32),\n", - " array([0.9999924], dtype=float32),\n", - " array([0.9897418], dtype=float32),\n", - " array([0.03869773], dtype=float32),\n", - " array([0.97638786], dtype=float32),\n", - " array([0.08542448], dtype=float32),\n", - " array([0.05407662], dtype=float32),\n", - " array([0.9999993], dtype=float32),\n", - " array([0.14986295], dtype=float32),\n", - " array([0.999286], dtype=float32),\n", - " array([0.24805169], dtype=float32),\n", - " array([0.01673634], dtype=float32),\n", - " array([0.01463007], dtype=float32),\n", - " array([0.3670616], dtype=float32),\n", - " array([0.9926224], dtype=float32),\n", - " array([0.6253009], dtype=float32),\n", - " array([0.03401245], dtype=float32),\n", - " array([0.00030278], dtype=float32),\n", - " array([0.96080303], dtype=float32),\n", - " array([0.04573576], dtype=float32),\n", - " array([0.04926103], dtype=float32),\n", - " array([0.5770354], dtype=float32),\n", - " array([0.02184597], dtype=float32),\n", - " array([0.9933947], dtype=float32),\n", - " array([0.00422782], dtype=float32),\n", - " array([0.7942074], dtype=float32),\n", - " array([0.14047284], dtype=float32),\n", - " array([0.90892816], dtype=float32),\n", - " array([0.79335517], dtype=float32),\n", - " array([0.02081179], dtype=float32),\n", - " array([0.03224256], dtype=float32),\n", - " array([0.00269347], dtype=float32),\n", - " array([0.7325373], dtype=float32),\n", - " array([0.86657375], dtype=float32),\n", - " array([0.9994041], dtype=float32),\n", - " array([0.99819297], dtype=float32),\n", - " array([0.306308], dtype=float32),\n", - " array([0.9358532], dtype=float32),\n", - " array([0.00968082], dtype=float32),\n", - " array([0.22723815], dtype=float32),\n", - " array([0.88686043], dtype=float32),\n", - " array([0.00376564], dtype=float32),\n", - " array([0.9558993], dtype=float32),\n", - " array([0.03709094], dtype=float32),\n", - " array([0.9284992], 
dtype=float32),\n", - " array([0.01156035], dtype=float32),\n", - " array([0.6904194], dtype=float32),\n", - " array([0.7789368], dtype=float32),\n", - " array([0.06749155], dtype=float32),\n", - " array([0.83822256], dtype=float32),\n", - " array([0.00499537], dtype=float32),\n", - " array([0.96375054], dtype=float32),\n", - " array([0.99763095], dtype=float32),\n", - " array([0.00083689], dtype=float32),\n", - " array([0.1384925], dtype=float32),\n", - " array([0.99911016], dtype=float32),\n", - " array([0.18213369], dtype=float32),\n", - " array([0.01104294], dtype=float32),\n", - " array([0.9997731], dtype=float32),\n", - " array([0.00157826], dtype=float32),\n", - " array([0.45021382], dtype=float32),\n", - " array([0.70889956], dtype=float32),\n", - " array([0.99980146], dtype=float32),\n", - " array([0.14717786], dtype=float32),\n", - " array([0.9981312], dtype=float32),\n", - " array([0.99910754], dtype=float32),\n", - " array([0.00473733], dtype=float32),\n", - " array([0.00330126], dtype=float32),\n", - " array([0.17611578], dtype=float32),\n", - " array([0.69635725], dtype=float32),\n", - " array([0.39411786], dtype=float32),\n", - " array([0.26741236], dtype=float32),\n", - " array([0.56975543], dtype=float32),\n", - " array([0.06516983], dtype=float32),\n", - " array([0.70290774], dtype=float32),\n", - " array([0.1079508], dtype=float32),\n", - " array([0.9905323], dtype=float32),\n", - " array([0.07408904], dtype=float32),\n", - " array([0.99945086], dtype=float32),\n", - " array([0.08830733], dtype=float32),\n", - " array([0.47597456], dtype=float32),\n", - " array([0.08325432], dtype=float32),\n", - " array([0.9963791], dtype=float32),\n", - " array([0.99327046], dtype=float32),\n", - " array([0.987528], dtype=float32),\n", - " array([0.2695155], dtype=float32),\n", - " array([0.01687575], dtype=float32),\n", - " array([0.0887219], dtype=float32),\n", - " array([0.00404755], dtype=float32),\n", - " array([0.8474386], dtype=float32),\n", - " 
array([0.02510497], dtype=float32),\n", - " array([0.00147101], dtype=float32),\n", - " array([0.00696711], dtype=float32),\n", - " array([0.01805459], dtype=float32),\n", - " array([0.37892923], dtype=float32),\n", - " array([0.32513785], dtype=float32),\n", - " array([0.00713208], dtype=float32),\n", - " array([0.05214171], dtype=float32),\n", - " array([0.9894679], dtype=float32),\n", - " array([0.74764496], dtype=float32),\n", - " array([0.0094498], dtype=float32),\n", - " array([0.05753988], dtype=float32),\n", - " array([0.9815139], dtype=float32),\n", - " array([0.994449], dtype=float32),\n", - " array([0.0733721], dtype=float32),\n", - " array([0.03602724], dtype=float32),\n", + " array([0.00689602], dtype=float32),\n", + " array([0.9943845], dtype=float32),\n", + " array([0.81676173], dtype=float32),\n", + " array([0.9999511], dtype=float32),\n", + " array([0.00488768], dtype=float32),\n", + " array([0.33030394], dtype=float32),\n", + " array([1.3143154e-05], dtype=float32),\n", + " array([0.97804296], dtype=float32),\n", + " array([0.97198254], dtype=float32),\n", + " array([0.98943126], dtype=float32),\n", + " array([0.9713336], dtype=float32),\n", + " array([0.44176972], dtype=float32),\n", + " array([0.9996177], dtype=float32),\n", + " array([0.97341985], dtype=float32),\n", + " array([0.9889993], dtype=float32),\n", + " array([0.9999981], dtype=float32),\n", + " array([0.9906634], dtype=float32),\n", + " array([0.7313937], dtype=float32),\n", + " array([4.6735212e-07], dtype=float32),\n", + " array([0.9986381], dtype=float32),\n", + " array([0.5398831], dtype=float32),\n", + " array([0.5327877], dtype=float32),\n", + " array([0.99454075], dtype=float32),\n", + " array([0.7781688], dtype=float32),\n", + " array([0.00171901], dtype=float32),\n", + " array([0.9790917], dtype=float32),\n", + " array([1.7694707e-05], dtype=float32),\n", + " array([0.22174618], dtype=float32),\n", + " array([0.00032948], dtype=float32),\n", + " array([0.98750776], 
dtype=float32),\n", + " array([0.9930167], dtype=float32),\n", + " array([0.7805735], dtype=float32),\n", + " array([0.874757], dtype=float32),\n", + " array([0.10298155], dtype=float32),\n", + " array([0.00014828], dtype=float32),\n", + " array([0.01591556], dtype=float32),\n", + " array([0.96804875], dtype=float32),\n", + " array([0.91091835], dtype=float32),\n", + " array([0.00087433], dtype=float32),\n", + " array([0.02600787], dtype=float32),\n", + " array([0.00168016], dtype=float32),\n", + " array([0.93263006], dtype=float32),\n", + " array([0.19706792], dtype=float32),\n", + " array([0.9951959], dtype=float32),\n", + " array([0.024617], dtype=float32),\n", + " array([0.9766921], dtype=float32),\n", + " array([0.04694933], dtype=float32),\n", + " array([0.9548745], dtype=float32),\n", + " array([0.01036863], dtype=float32),\n", + " array([0.9931427], dtype=float32),\n", + " array([0.01541146], dtype=float32),\n", + " array([0.02152353], dtype=float32),\n", + " array([0.78170955], dtype=float32),\n", + " array([0.5403529], dtype=float32),\n", + " array([0.22647694], dtype=float32),\n", + " array([0.00169189], dtype=float32),\n", + " array([0.9999125], dtype=float32),\n", + " array([0.00411672], dtype=float32),\n", + " array([0.92826104], dtype=float32),\n", + " array([0.78801906], dtype=float32),\n", + " array([0.9407639], dtype=float32),\n", + " array([0.98959863], dtype=float32),\n", + " array([0.9553592], dtype=float32),\n", + " array([0.01293176], dtype=float32),\n", + " array([0.01023797], dtype=float32),\n", + " array([0.03475741], dtype=float32),\n", + " array([0.9997676], dtype=float32),\n", + " array([0.97791255], dtype=float32),\n", + " array([0.00023193], dtype=float32),\n", + " array([0.00889389], dtype=float32),\n", + " array([0.957821], dtype=float32),\n", + " array([0.8767215], dtype=float32),\n", + " array([0.12694164], dtype=float32),\n", + " array([0.00611601], dtype=float32),\n", + " array([0.3953812], dtype=float32),\n", + " 
array([0.0004641], dtype=float32),\n", + " array([0.9987463], dtype=float32),\n", + " array([0.01019047], dtype=float32),\n", + " array([0.5764202], dtype=float32),\n", + " array([0.01138657], dtype=float32),\n", + " array([0.5458222], dtype=float32),\n", + " array([0.9942966], dtype=float32),\n", + " array([0.00240233], dtype=float32),\n", + " array([0.7553514], dtype=float32),\n", + " array([0.0881868], dtype=float32),\n", + " array([0.34226933], dtype=float32),\n", + " array([0.4873583], dtype=float32),\n", + " array([0.33895075], dtype=float32),\n", + " array([0.03251609], dtype=float32),\n", + " array([0.00574167], dtype=float32),\n", + " array([0.988293], dtype=float32),\n", + " array([0.00064724], dtype=float32),\n", + " array([0.17580858], dtype=float32),\n", + " array([0.94925475], dtype=float32),\n", + " array([0.18242276], dtype=float32),\n", + " array([0.9117029], dtype=float32),\n", + " array([0.717524], dtype=float32),\n", + " array([0.9948232], dtype=float32),\n", + " array([0.41392937], dtype=float32),\n", + " array([0.39889827], dtype=float32),\n", + " array([0.21468543], dtype=float32),\n", + " array([0.00194653], dtype=float32),\n", + " array([0.8318713], dtype=float32),\n", + " array([0.9048981], dtype=float32),\n", + " array([0.00159927], dtype=float32),\n", + " array([0.01717789], dtype=float32),\n", + " array([0.99441886], dtype=float32),\n", + " array([0.9672044], dtype=float32),\n", + " array([0.9958038], dtype=float32),\n", + " array([0.8791019], dtype=float32),\n", + " array([0.9852657], dtype=float32),\n", + " array([0.09051052], dtype=float32),\n", + " array([0.00520805], dtype=float32),\n", + " array([0.4179241], dtype=float32),\n", + " array([0.02102872], dtype=float32),\n", + " array([0.999458], dtype=float32),\n", + " array([0.07276529], dtype=float32),\n", + " array([0.89086306], dtype=float32),\n", + " array([0.58678746], dtype=float32),\n", + " array([0.9981602], dtype=float32),\n", + " array([0.98019546], dtype=float32),\n", + " 
array([0.81768113], dtype=float32),\n", + " array([0.3091106], dtype=float32),\n", + " array([0.7304271], dtype=float32),\n", + " array([0.00713154], dtype=float32),\n", + " array([0.10799696], dtype=float32),\n", + " array([0.00034327], dtype=float32),\n", + " array([0.97954047], dtype=float32),\n", + " array([0.9953832], dtype=float32),\n", + " array([0.06257767], dtype=float32),\n", + " array([0.8372882], dtype=float32),\n", + " array([1.8113557e-05], dtype=float32),\n", + " array([0.04951284], dtype=float32),\n", + " array([0.04139359], dtype=float32),\n", + " array([0.5803639], dtype=float32),\n", + " array([0.01002938], dtype=float32),\n", + " array([0.44129696], dtype=float32),\n", + " array([0.88426584], dtype=float32),\n", + " array([0.01807107], dtype=float32),\n", + " array([0.87367356], dtype=float32),\n", + " array([0.09437197], dtype=float32),\n", + " array([0.98715776], dtype=float32),\n", + " array([0.06557368], dtype=float32),\n", + " array([0.9997048], dtype=float32),\n", + " array([0.5877887], dtype=float32),\n", + " array([0.10160982], dtype=float32),\n", + " array([0.2194032], dtype=float32),\n", + " array([0.996086], dtype=float32),\n", + " array([0.70603895], dtype=float32),\n", + " array([0.0575645], dtype=float32),\n", + " array([0.58087355], dtype=float32),\n", + " array([0.9330629], dtype=float32),\n", + " array([0.004917], dtype=float32),\n", + " array([0.19366205], dtype=float32),\n", + " array([0.99521846], dtype=float32),\n", + " array([0.9976768], dtype=float32),\n", + " array([0.01894422], dtype=float32),\n", + " array([0.5626045], dtype=float32),\n", + " array([0.99873656], dtype=float32),\n", + " array([0.98620087], dtype=float32),\n", + " array([0.20380375], dtype=float32),\n", + " array([0.00324226], dtype=float32),\n", + " array([0.03813465], dtype=float32),\n", + " array([0.07607552], dtype=float32),\n", + " array([0.02199142], dtype=float32),\n", + " array([0.7561464], dtype=float32),\n", + " array([0.9669124], 
dtype=float32),\n", + " array([0.86246103], dtype=float32),\n", + " array([0.189888], dtype=float32),\n", + " array([6.4221174e-05], dtype=float32),\n", + " array([0.61084515], dtype=float32),\n", + " array([0.9931891], dtype=float32),\n", + " array([0.95753783], dtype=float32),\n", + " array([0.96757764], dtype=float32),\n", + " array([0.99537355], dtype=float32),\n", + " array([0.05853846], dtype=float32),\n", + " array([0.9369336], dtype=float32),\n", + " array([0.99967706], dtype=float32),\n", + " array([0.48768336], dtype=float32),\n", + " array([0.38854727], dtype=float32),\n", + " array([0.16301523], dtype=float32),\n", + " array([0.44746688], dtype=float32),\n", + " array([0.9951616], dtype=float32),\n", + " array([0.9310025], dtype=float32),\n", + " array([0.9793833], dtype=float32),\n", + " array([0.9996581], dtype=float32),\n", + " array([0.06153212], dtype=float32),\n", + " array([0.99993515], dtype=float32),\n", + " array([8.6169755e-05], dtype=float32),\n", + " array([0.14121674], dtype=float32),\n", + " array([0.001046], dtype=float32),\n", + " array([0.96887445], dtype=float32),\n", + " array([0.9940006], dtype=float32),\n", + " array([0.20827933], dtype=float32),\n", + " array([1.3143304e-05], dtype=float32),\n", + " array([0.70770514], dtype=float32),\n", + " array([0.00062637], dtype=float32),\n", + " array([0.09923268], dtype=float32),\n", + " array([0.00062528], dtype=float32),\n", + " array([0.9974062], dtype=float32),\n", + " array([0.6399337], dtype=float32),\n", + " array([0.9582232], dtype=float32),\n", + " array([5.5980826e-07], dtype=float32),\n", + " array([0.98064935], dtype=float32),\n", + " array([0.9810916], dtype=float32),\n", + " array([0.02825824], dtype=float32),\n", + " array([0.00210933], dtype=float32),\n", + " array([0.03763315], dtype=float32),\n", + " array([0.9897635], dtype=float32),\n", + " array([0.38776097], dtype=float32),\n", + " array([0.01495247], dtype=float32),\n", + " array([0.00611806], dtype=float32),\n", + " 
array([0.998847], dtype=float32),\n", + " array([0.01276735], dtype=float32),\n", + " array([0.00442079], dtype=float32),\n", + " array([0.2124616], dtype=float32),\n", + " array([0.01237443], dtype=float32),\n", + " array([0.01144132], dtype=float32),\n", + " array([0.92837715], dtype=float32),\n", + " array([0.02206292], dtype=float32),\n", + " array([0.98381627], dtype=float32),\n", + " array([0.00593874], dtype=float32),\n", + " array([0.26435003], dtype=float32),\n", + " array([0.02000471], dtype=float32),\n", + " array([0.84790653], dtype=float32),\n", + " array([0.9852173], dtype=float32),\n", + " array([0.9987846], dtype=float32),\n", + " array([0.99995625], dtype=float32),\n", + " array([0.17164613], dtype=float32),\n", + " array([0.18840362], dtype=float32),\n", + " array([0.9717937], dtype=float32),\n", + " array([0.74185556], dtype=float32),\n", + " array([0.00340732], dtype=float32),\n", + " array([0.01526649], dtype=float32),\n", + " array([0.61485744], dtype=float32),\n", + " array([0.9119215], dtype=float32),\n", + " array([0.02722141], dtype=float32),\n", + " array([0.39047685], dtype=float32),\n", + " array([0.19983715], dtype=float32),\n", + " array([0.00018045], dtype=float32),\n", + " array([0.76507735], dtype=float32),\n", + " array([0.00108664], dtype=float32),\n", + " array([0.8838372], dtype=float32),\n", + " array([0.9674925], dtype=float32),\n", + " array([0.00014587], dtype=float32),\n", + " array([0.01428808], dtype=float32),\n", + " array([0.6684856], dtype=float32),\n", + " array([0.03062288], dtype=float32),\n", + " array([0.46116126], dtype=float32),\n", + " array([0.16899237], dtype=float32),\n", + " array([0.9975586], dtype=float32),\n", + " array([0.91609216], dtype=float32),\n", + " array([0.9852622], dtype=float32),\n", + " array([0.5730661], dtype=float32),\n", + " array([0.19011642], dtype=float32),\n", + " array([0.9962901], dtype=float32),\n", + " array([0.00494908], dtype=float32),\n", + " array([0.9681047], 
dtype=float32),\n", + " array([0.03208594], dtype=float32),\n", + " array([0.00147857], dtype=float32),\n", + " array([0.12340485], dtype=float32),\n", + " array([0.996431], dtype=float32),\n", + " array([0.9512111], dtype=float32),\n", + " array([0.9922307], dtype=float32),\n", + " array([0.02449521], dtype=float32),\n", + " array([0.9568155], dtype=float32),\n", + " array([0.99991953], dtype=float32),\n", + " array([0.9982376], dtype=float32),\n", + " array([0.1572257], dtype=float32),\n", + " array([0.34052122], dtype=float32),\n", + " array([0.6778389], dtype=float32),\n", + " array([0.9513396], dtype=float32),\n", + " array([0.99644357], dtype=float32),\n", + " array([0.3379453], dtype=float32),\n", + " array([0.9816772], dtype=float32),\n", + " array([0.01320378], dtype=float32),\n", + " array([0.00027732], dtype=float32),\n", " array([0.99997675], dtype=float32),\n", - " array([0.6763087], dtype=float32),\n", - " array([0.9927671], dtype=float32),\n", - " array([0.02451441], dtype=float32),\n", - " array([0.86146873], dtype=float32),\n", - " array([0.04389035], dtype=float32),\n", - " array([0.9999443], dtype=float32),\n", - " array([0.809564], dtype=float32),\n", - " array([0.99578035], dtype=float32),\n", - " array([0.4989446], dtype=float32),\n", - " array([0.02612785], dtype=float32),\n", - " array([0.87981015], dtype=float32),\n", - " array([0.6465501], dtype=float32),\n", - " array([0.576932], dtype=float32),\n", - " array([0.03007537], dtype=float32),\n", - " array([0.00870073], dtype=float32),\n", - " array([0.9998024], dtype=float32),\n", - " array([0.08114275], dtype=float32),\n", - " array([0.68397623], dtype=float32),\n", - " array([0.9999337], dtype=float32),\n", - " array([0.0099621], dtype=float32),\n", - " array([0.99060285], dtype=float32),\n", - " array([0.00027312], dtype=float32),\n", - " array([0.9289166], dtype=float32),\n", - " array([0.9932289], dtype=float32),\n", - " array([0.02628781], dtype=float32),\n", - " array([0.99826354], 
dtype=float32),\n", - " array([0.6789669], dtype=float32),\n", + " array([0.49815693], dtype=float32),\n", + " array([0.00038428], dtype=float32),\n", + " array([0.03885539], dtype=float32),\n", + " array([0.5476643], dtype=float32),\n", + " array([0.9998455], dtype=float32),\n", + " array([0.9970118], dtype=float32),\n", + " array([0.5124474], dtype=float32),\n", + " array([0.38307184], dtype=float32),\n", + " array([0.99099356], dtype=float32),\n", + " array([0.25695708], dtype=float32),\n", + " array([0.9953335], dtype=float32),\n", + " array([0.97055674], dtype=float32),\n", + " array([0.4068285], dtype=float32),\n", + " array([1.4898453e-06], dtype=float32),\n", + " array([0.66622144], dtype=float32),\n", + " array([0.99686724], dtype=float32),\n", + " array([0.00997034], dtype=float32),\n", + " array([0.2946419], dtype=float32),\n", + " array([0.70338255], dtype=float32),\n", + " array([0.02406825], dtype=float32),\n", + " array([0.99934345], dtype=float32),\n", + " array([0.03414964], dtype=float32),\n", + " array([0.00095879], dtype=float32),\n", + " array([0.99705076], dtype=float32),\n", + " array([0.21492238], dtype=float32),\n", + " array([0.87716794], dtype=float32),\n", + " array([0.47392538], dtype=float32),\n", + " array([0.24244678], dtype=float32),\n", + " array([0.03492213], dtype=float32),\n", + " array([0.9038005], dtype=float32),\n", + " array([0.51358217], dtype=float32),\n", + " array([0.3492779], dtype=float32),\n", + " array([0.37952748], dtype=float32),\n", + " array([0.9956209], dtype=float32),\n", + " array([0.05870749], dtype=float32),\n", + " array([0.93354183], dtype=float32),\n", + " array([0.45190257], dtype=float32),\n", + " array([0.99952877], dtype=float32),\n", + " array([0.35226253], dtype=float32),\n", " ...]" ] }, From 1d722fa75d9d1069bb7ec03c7b2a18b4ec71b138 Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Mon, 18 Mar 2019 16:58:14 +0800 Subject: [PATCH 29/46] Add files via upload --- keras/3.6-classifying-newswires.ipynb 
| 554 ++++++++++++++++++++++++++ 1 file changed, 554 insertions(+) create mode 100644 keras/3.6-classifying-newswires.ipynb diff --git a/keras/3.6-classifying-newswires.ipynb b/keras/3.6-classifying-newswires.ipynb new file mode 100644 index 0000000..09ac085 --- /dev/null +++ b/keras/3.6-classifying-newswires.ipynb @@ -0,0 +1,554 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "First of all, set environment variables and initialize spark context:" + ] + }, + { + "cell_type": "code", + "execution_count": 1, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "env: SPARK_DRIVER_MEMORY=8g\n", + "env: PYSPARK_PYTHON=/usr/bin/python3.5\n", + "env: PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n" + ] + } + ], + "source": [ + "%env SPARK_DRIVER_MEMORY=8g\n", + "%env PYSPARK_PYTHON=/usr/bin/python3.5\n", + "%env PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n", + "\n", + "from zoo.common.nncontext import *\n", + "sc = init_nncontext(init_spark_conf().setMaster(\"local[4]\"))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "# Classifying newswires: a multi-class classification example\n", + "\n", + "----\n", + "\n", + "In the previous section we saw how to classify vector inputs into two mutually exclusive classes using a densely-connected neural network. \n", + "But what happens when you have more than two classes? \n", + "\n", + "In this section, we will build a network to classify Reuters newswires into 46 different mutually-exclusive topics. Since we have many \n", + "classes, this problem is an instance of \"multi-class classification\", and since each data point should be classified into only one \n", + "category, the problem is more specifically an instance of \"single-label, multi-class classification\". 
If each data point could have \n", + "belonged to multiple categories (in our case, topics) then we would be facing a \"multi-label, multi-class classification\" problem." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## The Reuters dataset\n", + "\n", + "\n", + "We will be working with the _Reuters dataset_, a set of short newswires and their topics, published by Reuters in 1986. It's a very simple, \n", + "widely used toy dataset for text classification. There are 46 different topics; some topics are more represented than others, but each \n", + "topic has at least 10 examples in the training set.\n", + "\n", + "Like IMDB and MNIST, the Reuters dataset comes packaged as part of the Keras API of Analytics Zoo. Let's take a look right away:" + ] + }, + { + "cell_type": "code", + "execution_count": 2, + "metadata": {}, + "outputs": [], + "source": [ + "from zoo.pipeline.api.keras.datasets import reuters\n", + "(train_data, train_labels), (test_data, test_labels) = reuters.load_data(nb_words=10000)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "As with the IMDB dataset, the argument `nb_words=10000` restricts the data to the 10,000 most frequently occurring words found in the \n", + "data.\n", + "\n", + "We have 8,982 training examples and 2,246 test examples:" + ] + }, + { + "cell_type": "code", + "execution_count": 3, + "metadata": {}, + "outputs": [], + "source": [ + "word_index = reuters.get_word_index()\n", + "reverse_word_index = dict([(value, key) for (key, value) in word_index.items()])" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## Preparing the data\n", + "\n", + "We can vectorize the data with the exact same code as in our previous example:" + ] + }, + { + "cell_type": "code", + "execution_count": 4, + "metadata": {}, + "outputs": [], + "source": [ + "import numpy as np\n", + "def vectorize_sequences(sequences, dimension=10000):\n", + " results = 
np.zeros((len(sequences), dimension))\n", + " for i, sequence in enumerate(sequences):\n", + " results[i, sequence] = 1.\n", + " return results\n", + "\n", + "x_train = vectorize_sequences(train_data)\n", + "x_test = vectorize_sequences(test_data)\n", + "# TODO: decide between one-hot and integer label encoding (labels are left as integers here)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## Building our network\n", + "\n", + "\n", + "This topic classification problem looks very similar to our previous movie review classification problem: in both cases, we are trying to \n", + "classify short snippets of text. There is however a new constraint here: the number of output classes has gone from 2 to 46, i.e. the \n", + "dimensionality of the output space is much larger. \n", + "\n", + "In a stack of `Dense` layers like what we were using, each layer can only access information present in the output of the previous layer. \n", + "If one layer drops some information relevant to the classification problem, this information can never be recovered by later layers: each \n", + "layer can potentially become an \"information bottleneck\". In our previous example, we were using 16-dimensional intermediate layers, but a \n", + "16-dimensional space may be too limited to learn to separate 46 different classes: such small layers may act as information bottlenecks, \n", + "permanently dropping relevant information.\n", + "\n", + "For this reason we will use larger layers. 
Let's go with 64 units:" + ] + }, + { + "cell_type": "code", + "execution_count": 5, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "creating: createZooKerasSequential\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n" + ] + }, + { + "data": { + "text/plain": [ + "" + ] + }, + "execution_count": 5, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "from zoo.pipeline.api.keras import models\n", + "from zoo.pipeline.api.keras import layers\n", + "\n", + "model = models.Sequential()\n", + "model.add(layers.Dense(64, activation='relu', input_shape=(10000,)))\n", + "model.add(layers.Dense(64, activation='relu'))\n", + "model.add(layers.Dense(46, activation='softmax'))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "There are two other things you should note about this architecture:\n", + "\n", + "* We are ending the network with a `Dense` layer of size 46. This means that for each input sample, our network will output a \n", + "46-dimensional vector. Each entry in this vector (each dimension) will encode a different output class.\n", + "* The last layer uses a `softmax` activation. You have already seen this pattern in the MNIST example. It means that the network will \n", + "output a _probability distribution_ over the 46 different output classes, i.e. for every input sample, the network will produce a \n", + "46-dimensional output vector where `output[i]` is the probability that the sample belongs to class `i`. The 46 scores will sum to 1.\n", + "\n", + "The best loss function to use in this case is `categorical_crossentropy` (here its integer-label variant, `sparse_categorical_crossentropy`, since our labels are integers). It measures the distance between two probability distributions: \n", + "in our case, between the probability distribution output by our network, and the true distribution of the labels. 
By minimizing the \n", + "distance between these two distributions, we train our network to output something as close as possible to the true labels." + ] + }, + { + "cell_type": "code", + "execution_count": 6, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "creating: createRMSprop\n", + "creating: createZooKerasSparseCategoricalCrossEntropy\n", + "creating: createZooKerasSparseCategoricalAccuracy\n" + ] + } + ], + "source": [ + "model.compile(optimizer='rmsprop',\n", + " loss='sparse_categorical_crossentropy',\n", + " metrics=['accuracy'])" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## Validating our approach\n", + "\n", + "Let's set apart 1,000 samples in our training data to use as a validation set:" + ] + }, + { + "cell_type": "code", + "execution_count": 7, + "metadata": {}, + "outputs": [], + "source": [ + "x_val = x_train[:1000]\n", + "partial_x_train = x_train[1000:]\n", + "\n", + "y_val = train_labels[:1000]\n", + "partial_y_train = train_labels[1000:] # this line would return list\n", + "partial_y_train = np.array(partial_y_train) # convert list to ndarray" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Now let's train our network for 20 epochs:" + ] + }, + { + "cell_type": "code", + "execution_count": 8, + "metadata": {}, + "outputs": [], + "source": [ + "import time\n", + "dir_name = '3-5 ' + str(time.ctime())\n", + "model.set_tensorboard('./', dir_name)\n", + "model.fit(partial_x_train,\n", + " partial_y_train,\n", + " nb_epoch=20,\n", + " batch_size=512,\n", + " validation_data=(x_val, y_val))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "_INFO - Trained 512 records in 0.03322949 seconds. Throughput is 15408.001 records/second. 
Loss is 0.36856997.\n", + "Top1Accuracy is Accuracy(correct: 808, count: 1000, accuracy: 0.808)_" + ] + }, + { + "cell_type": "code", + "execution_count": 10, + "metadata": {}, + "outputs": [ + { + "data": { + "image/png": "\n", + "text/plain": [ + "
" + ] + }, + "metadata": { + "needs_background": "light" + }, + "output_type": "display_data" + } + ], + "source": [ + "train_loss = np.array(model.get_train_summary('Loss'))\n", + "val_loss = np.array(model.get_validation_summary('Loss'))\n", + "\n", + "import matplotlib.pyplot as plt\n", + "plt.plot(train_loss[:,0],train_loss[:,1],label='train loss')\n", + "plt.plot(val_loss[:,0],val_loss[:,1],label='validation loss',color='green')\n", + "plt.title('Training and validation loss')\n", + "plt.xlabel('Steps')\n", + "plt.ylabel('Loss')\n", + "plt.legend()\n", + "\n", + "plt.show()" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "It seems that the network starts overfitting after 8 epochs. Let's train a new network from scratch for 8 epochs, then let's evaluate it on \n", + "the test set:" + ] + }, + { + "cell_type": "code", + "execution_count": 14, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "creating: createZooKerasSequential\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createRMSprop\n", + "creating: createZooKerasSparseCategoricalCrossEntropy\n", + "creating: createZooKerasSparseCategoricalAccuracy\n" + ] + } + ], + "source": [ + "model = models.Sequential()\n", + "model.add(layers.Dense(64, activation='relu', input_shape=(10000,)))\n", + "model.add(layers.Dense(64, activation='relu'))\n", + "model.add(layers.Dense(46, activation='softmax'))\n", + "\n", + "model.compile(optimizer='rmsprop',\n", + " loss='sparse_categorical_crossentropy',\n", + " metrics=['accuracy'])\n", + "model.fit(partial_x_train,\n", + " partial_y_train,\n", + " nb_epoch=8,\n", + " batch_size=512,\n", + " validation_data=(x_val, y_val))\n", + "y_test = np.array(test_labels).astype('float32')\n", + "results = model.evaluate(x_test, y_test)" + ] + }, + { + "cell_type": "code", + "execution_count": 15, + "metadata": {}, + "outputs": [ + 
{ + "data": { + "text/plain": [ + "[0.9659086465835571, 0.8032057285308838]" + ] + }, + "execution_count": 15, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "results" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Our approach reaches an accuracy of ~80%. With a balanced binary classification problem, the accuracy reached by a purely random classifier \n", + "would be 50%, but in our case it is closer to 19%, so our results seem pretty good, at least when compared to a random baseline:" + ] + }, + { + "cell_type": "code", + "execution_count": 16, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "0.19011576135351738" + ] + }, + "execution_count": 16, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "import copy\n", + "\n", + "test_labels_copy = copy.copy(test_labels)\n", + "np.random.shuffle(test_labels_copy)\n", + "float(np.sum(np.array(test_labels) == np.array(test_labels_copy))) / len(test_labels)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## Generating predictions on new data\n", + "\n", + "We can verify that the `predict` method of our model instance returns a probability distribution over all 46 topics. 
Let's generate topic \n", + "predictions for all of the test data:" + ] + }, + { + "cell_type": "code", + "execution_count": 17, + "metadata": {}, + "outputs": [], + "source": [ + "predictions = model.predict(x_test).collect()" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Each entry in `predictions` is a vector of length 46:" + ] + }, + { + "cell_type": "code", + "execution_count": 18, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "(46,)" + ] + }, + "execution_count": 18, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "predictions[0].shape" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "The coefficients in this vector sum to 1:" + ] + }, + { + "cell_type": "code", + "execution_count": 19, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "0.99999994" + ] + }, + "execution_count": 19, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "np.sum(predictions[0])" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "The largest entry is the predicted class, i.e. the class with the highest probability:" + ] + }, + { + "cell_type": "code", + "execution_count": 20, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "4" + ] + }, + "execution_count": 20, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "np.argmax(predictions[0])" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## Further experiments\n", + "\n", + "* Try using larger or smaller layers: 32 units, 128 units...\n", + "* We were using two hidden layers. Now try to use a single hidden layer, or three hidden layers." 
+ ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## Wrapping up\n", + "\n", + "\n", + "Here's what you should take away from this example:\n", + "\n", + "* If you are trying to classify data points between N classes, your network should end with a `Dense` layer of size N.\n", + "* In a single-label, multi-class classification problem, your network should end with a `softmax` activation, so that it will output a \n", + "probability distribution over the N output classes.\n", + "* _Categorical crossentropy_ is almost always the loss function you should use for such problems. It minimizes the distance between the \n", + "probability distributions output by the network, and the true distribution of the targets.\n", + "* There are two ways to handle labels in multi-class classification:\n", + " ** Encoding the labels via \"categorical encoding\" (also known as \"one-hot encoding\") and using `categorical_crossentropy` as your loss \n", + "function.\n", + " ** Encoding the labels as integers and using the `sparse_categorical_crossentropy` loss function.\n", + "* If you need to classify data into a large number of categories, then you should avoid creating information bottlenecks in your network by having \n", + "intermediate layers that are too small." 
+ ] + } + ], + "metadata": { + "kernelspec": { + "display_name": "Python 3", + "language": "python", + "name": "python3" + }, + "language_info": { + "codemirror_mode": { + "name": "ipython", + "version": 3 + }, + "file_extension": ".py", + "mimetype": "text/x-python", + "name": "python", + "nbconvert_exporter": "python", + "pygments_lexer": "ipython3", + "version": "3.5.2" + } + }, + "nbformat": 4, + "nbformat_minor": 2 +} From 64cba48eba12f6199927beb38c3039cfa8b74de2 Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Thu, 21 Mar 2019 08:56:43 +0800 Subject: [PATCH 30/46] Add files via upload --- keras/3.7-regression.ipynb | 797 +++++++++++++++++++++++++++++++++++++ 1 file changed, 797 insertions(+) create mode 100644 keras/3.7-regression.ipynb diff --git a/keras/3.7-regression.ipynb b/keras/3.7-regression.ipynb new file mode 100644 index 0000000..0f37622 --- /dev/null +++ b/keras/3.7-regression.ipynb @@ -0,0 +1,797 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "First of all, set environment variables and initialize spark context:" + ] + }, + { + "cell_type": "code", + "execution_count": 1, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "env: SPARK_DRIVER_MEMORY=8g\n", + "env: PYSPARK_PYTHON=/usr/bin/python3.5\n", + "env: PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n" + ] + } + ], + "source": [ + "%env SPARK_DRIVER_MEMORY=8g\n", + "%env PYSPARK_PYTHON=/usr/bin/python3.5\n", + "%env PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n", + "\n", + "from zoo.common.nncontext import *\n", + "sc = init_nncontext(init_spark_conf().setMaster(\"local[4]\"))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "# Predicting house prices: a regression example\n", + "\n", + "\n", + "----\n", + "\n", + "\n", + "In our two previous examples, we were considering classification problems, where the goal was to predict a single discrete label of an \n", + "input data point. 
Another common type of machine learning problem is \"regression\", which consists of predicting a continuous value instead \n", + "of a discrete label. For instance, predicting the temperature tomorrow, given meteorological data, or predicting the time that a \n", + "software project will take to complete, given its specifications.\n", + "\n", + "Do not mix up \"regression\" with the algorithm \"logistic regression\": confusingly, \"logistic regression\" is not a regression algorithm, \n", + "it is a classification algorithm." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## The Boston Housing Price dataset\n", + "\n", + "\n", + "We will be attempting to predict the median price of homes in a given Boston suburb in the mid-1970s, given a few data points about the \n", + "suburb at the time, such as the crime rate, the local property tax rate, etc.\n", + "\n", + "The dataset we will be using has another interesting difference from our two previous examples: it has very few data points, only 506 in \n", + "total, split between 404 training samples and 102 test samples, and each \"feature\" in the input data (e.g. the crime rate is a feature) has \n", + "a different scale. 
For instance, some values are proportions, which take values between 0 and 1; others take values between 1 and 12, \n", + "others between 0 and 100...\n", + "\n", + "Let's take a look at the data:" + ] + }, + { + "cell_type": "code", + "execution_count": 2, + "metadata": {}, + "outputs": [], + "source": [ + "from zoo.pipeline.api.keras.datasets import boston_housing\n", + "(train_data, train_targets), (test_data, test_targets) = boston_housing.load_data()" + ] + }, + { + "cell_type": "code", + "execution_count": 3, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "(404, 13)" + ] + }, + "execution_count": 3, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "train_data.shape" + ] + }, + { + "cell_type": "code", + "execution_count": 4, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "(102, 13)" + ] + }, + "execution_count": 4, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "test_data.shape" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "As you can see, we have 404 training samples and 102 test samples. The data comprises 13 features. The 13 features in the input data are as \n", + "follows:\n", + "\n", + "1. Per capita crime rate.\n", + "2. Proportion of residential land zoned for lots over 25,000 square feet.\n", + "3. Proportion of non-retail business acres per town.\n", + "4. Charles River dummy variable (= 1 if tract bounds river; 0 otherwise).\n", + "5. Nitric oxides concentration (parts per 10 million).\n", + "6. Average number of rooms per dwelling.\n", + "7. Proportion of owner-occupied units built prior to 1940.\n", + "8. Weighted distances to five Boston employment centres.\n", + "9. Index of accessibility to radial highways.\n", + "10. Full-value property-tax rate per $10,000.\n", + "11. Pupil-teacher ratio by town.\n", + "12. 1000 * (Bk - 0.63) ** 2 where Bk is the proportion of Black people by town.\n", + "13. 
% lower status of the population.\n", + "\n", + "The targets are the median values of owner-occupied homes, in thousands of dollars:" + ] + }, + { + "cell_type": "code", + "execution_count": 5, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "array([22.6, 50. , 23. , 8.3, 21.2, 19.9, 20.6, 18.7, 16.1, 18.6, 8.8,\n", + " 17.2, 14.9, 10.5, 50. , 29. , 23. , 33.3, 29.4, 21. , 23.8, 19.1,\n", + " 20.4, 29.1, 19.3, 23.1, 19.6, 19.4, 38.7, 18.7, 14.6, 20. , 20.5,\n", + " 20.1, 23.6, 16.8, 5.6, 50. , 14.5, 13.3, 23.9, 20. , 19.8, 13.8,\n", + " 16.5, 21.6, 20.3, 17. , 11.8, 27.5, 15.6, 23.1, 24.3, 42.8, 15.6,\n", + " 21.7, 17.1, 17.2, 15. , 21.7, 18.6, 21. , 33.1, 31.5, 20.1, 29.8,\n", + " 15.2, 15. , 27.5, 22.6, 20. , 21.4, 23.5, 31.2, 23.7, 7.4, 48.3,\n", + " 24.4, 22.6, 18.3, 23.3, 17.1, 27.9, 44.8, 50. , 23. , 21.4, 10.2,\n", + " 23.3, 23.2, 18.9, 13.4, 21.9, 24.8, 11.9, 24.3, 13.8, 24.7, 14.1,\n", + " 18.7, 28.1, 19.8, 26.7, 21.7, 22. , 22.9, 10.4, 21.9, 20.6, 26.4,\n", + " 41.3, 17.2, 27.1, 20.4, 16.5, 24.4, 8.4, 23. , 9.7, 50. , 30.5,\n", + " 12.3, 19.4, 21.2, 20.3, 18.8, 33.4, 18.5, 19.6, 33.2, 13.1, 7.5,\n", + " 13.6, 17.4, 8.4, 35.4, 24. , 13.4, 26.2, 7.2, 13.1, 24.5, 37.2,\n", + " 25. , 24.1, 16.6, 32.9, 36.2, 11. , 7.2, 22.8, 28.7, 14.4, 24.4,\n", + " 18.1, 22.5, 20.5, 15.2, 17.4, 13.6, 8.7, 18.2, 35.4, 31.7, 33. ,\n", + " 22.2, 20.4, 23.9, 25. , 12.7, 29.1, 12. , 17.7, 27. , 20.6, 10.2,\n", + " 17.5, 19.7, 29.8, 20.5, 14.9, 10.9, 19.5, 22.7, 19.5, 24.6, 25. ,\n", + " 24.5, 50. , 14.3, 11.8, 31. , 28.7, 16.2, 43.5, 25. , 22. , 19.9,\n", + " 22.1, 46. , 22.9, 20.2, 43.1, 34.6, 13.8, 24.3, 21.5, 24.4, 21.2,\n", + " 23.8, 26.6, 25.1, 9.6, 19.4, 19.4, 9.5, 14. , 26.5, 13.8, 34.7,\n", + " 16.3, 21.7, 17.5, 15.6, 20.9, 21.7, 12.7, 18.5, 23.7, 19.3, 12.7,\n", + " 21.6, 23.2, 29.6, 21.2, 23.8, 17.1, 22. , 36.5, 18.8, 21.9, 23.1,\n", + " 20.2, 17.4, 37. , 24.1, 36.2, 15.7, 32.2, 13.5, 17.9, 13.3, 11.7,\n", + " 41.7, 18.4, 13.1, 25. , 21.2, 16. 
, 34.9, 25.2, 24.8, 21.5, 23.4,\n", + " 18.9, 10.8, 21. , 27.5, 17.5, 13.5, 28.7, 14.8, 19.1, 28.6, 13.1,\n", + " 19. , 11.3, 13.3, 22.4, 20.1, 18.2, 22.9, 20.6, 25. , 12.8, 34.9,\n", + " 23.7, 50. , 29. , 30.1, 22. , 15.6, 23.3, 30.1, 14.3, 22.8, 50. ,\n", + " 20.8, 6.3, 34.9, 32.4, 19.9, 20.3, 17.8, 23.1, 20.4, 23.2, 7. ,\n", + " 16.8, 46.7, 50. , 22.9, 23.9, 21.4, 21.7, 15.4, 15.3, 23.1, 23.9,\n", + " 19.4, 11.9, 17.8, 31.5, 33.8, 20.8, 19.8, 22.4, 5. , 24.5, 19.4,\n", + " 15.1, 18.2, 19.3, 27.1, 20.7, 37.6, 11.7, 33.4, 30.1, 21.4, 45.4,\n", + " 20.1, 20.8, 26.4, 10.4, 21.8, 32. , 21.7, 18.4, 37.9, 17.8, 28. ,\n", + " 28.2, 36. , 18.9, 15. , 22.5, 30.7, 20. , 19.1, 23.3, 26.6, 21.1,\n", + " 19.7, 20. , 12.1, 7.2, 14.2, 17.3, 27.5, 22.2, 10.9, 19.2, 32. ,\n", + " 14.5, 24.7, 12.6, 24. , 24.1, 50. , 16.1, 43.8, 26.6, 36.1, 21.8,\n", + " 29.9, 50. , 44. , 20.6, 19.6, 28.4, 19.1, 22.3, 20.9, 28.4, 14.4,\n", + " 32.7, 13.8, 8.5, 22.5, 35.1, 31.6, 17.8, 15.6])" + ] + }, + "execution_count": 5, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "train_targets" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "The prices are typically between \\$10,000 and \\$50,000. If that sounds cheap, remember this was the mid-1970s, and these prices are not \n", + "inflation-adjusted." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## Preparing the data\n", + "\n", + "\n", + "It would be problematic to feed into a neural network values that all take wildly different ranges. The network might be able to \n", + "automatically adapt to such heterogeneous data, but it would definitely make learning more difficult. 
A widespread best practice to deal \n", + "with such data is to do feature-wise normalization: for each feature in the input data (a column in the input data matrix), we \n", + "will subtract the mean of the feature and divide by the standard deviation, so that the feature will be centered around 0 and will have a \n", + "unit standard deviation. This is easily done in Numpy:" + ] + }, + { + "cell_type": "code", + "execution_count": 6, + "metadata": {}, + "outputs": [], + "source": [ + "mean = train_data.mean(axis=0)\n", + "train_data -= mean\n", + "std = train_data.std(axis=0)\n", + "train_data /= std\n", + "\n", + "test_data -= mean\n", + "test_data /= std" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Note that the quantities that we use for normalizing the test data have been computed using the training data. We should never use in our \n", + "workflow any quantity computed on the test data, even for something as simple as data normalization." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## Building our network\n", + "\n", + "\n", + "Because so few samples are available, we will be using a very small network with two \n", + "hidden layers, each with 64 units. In general, the less training data you have, the worse overfitting will be, and using \n", + "a small network is one way to mitigate overfitting." 
+ ] + }, + { + "cell_type": "code", + "execution_count": 7, + "metadata": {}, + "outputs": [], + "source": [ + "from zoo.pipeline.api.keras import models\n", + "from zoo.pipeline.api.keras import layers\n", + "\n", + "def build_model():\n", + " # Because we will need to instantiate\n", + " # the same model multiple times,\n", + " # we use a function to construct it.\n", + " model = models.Sequential()\n", + " model.add(layers.Dense(64, activation='relu',\n", + " input_shape=(train_data.shape[1],)))\n", + " model.add(layers.Dense(64, activation='relu'))\n", + " model.add(layers.Dense(1))\n", + " model.compile(optimizer='rmsprop', loss='mse', metrics=['mae'])\n", + " return model" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Our network ends with a single unit, and no activation (i.e. it will be linear layer). \n", + "This is a typical setup for scalar regression (i.e. regression where we are trying to predict a single continuous value). \n", + "Applying an activation function would constrain the range that the output can take; for instance if \n", + "we applied a `sigmoid` activation function to our last layer, the network could only learn to predict values between 0 and 1. Here, because \n", + "the last layer is purely linear, the network is free to learn to predict values in any range.\n", + "\n", + "Note that we are compiling the network with the `mse` loss function -- Mean Squared Error, the square of the difference between the \n", + "predictions and the targets, a widely used loss function for regression problems.\n", + "\n", + "We are also monitoring a new metric during training: `mae`. This stands for Mean Absolute Error. It is simply the absolute value of the \n", + "difference between the predictions and the targets. For instance, a MAE of 0.5 on this problem would mean that our predictions are off by \n", + "\\$500 on average." 
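+ ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Our network ends with a single unit, and no activation (i.e. it will be a linear layer). \n", + "This is a typical setup for scalar regression (i.e. regression where we are trying to predict a single continuous value). \n", + "Applying an activation function would constrain the range that the output can take; for instance if \n", + "we applied a `sigmoid` activation function to our last layer, the network could only learn to predict values between 0 and 1. Here, because \n", + "the last layer is purely linear, the network is free to learn to predict values in any range.\n", + "\n", + "Note that we are compiling the network with the `mse` loss function -- Mean Squared Error, the average of the squared differences between the \n", + "predictions and the targets, a widely used loss function for regression problems.\n", + "\n", + "We are also monitoring a new metric during training: `mae`. This stands for Mean Absolute Error. It is simply the average absolute value of the \n", + "difference between the predictions and the targets. For instance, a MAE of 0.5 on this problem would mean that our predictions are off by \n", + "\\$500 on average." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## Validating our approach using K-fold validation\n", + "\n", + "\n", + "To evaluate our network while we keep adjusting its parameters (such as the number of epochs used for training), we could simply split the \n", + "data into a training set and a validation set, as we were doing in our previous examples. However, because we have so few data points, the \n", + "validation set would end up being very small (e.g. about 100 examples). A consequence is that our validation scores may change a lot \n", + "depending on _which_ data points we choose to use for validation and which we choose for training, i.e. the validation scores may have a \n", + "high _variance_ with regard to the validation split. This would prevent us from reliably evaluating our model.\n", + "\n", + "The best practice in such situations is to use K-fold cross-validation. It consists of splitting the available data into K partitions \n", + "(typically K=4 or 5), then instantiating K identical models, and training each one on K-1 partitions while evaluating on the remaining \n", + "partition. The validation score for the model used would then be the average of the K validation scores obtained." 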
+ ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Then let's start our training:" + ] + }, + { + "cell_type": "code", + "execution_count": 8, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "processing fold # 0\n", + "creating: createZooKerasSequential\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createRMSprop\n", + "creating: createZooKerasMeanSquaredError\n", + "creating: createZooKerasMAE\n", + "processing fold # 1\n", + "creating: createZooKerasSequential\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createRMSprop\n", + "creating: createZooKerasMeanSquaredError\n", + "creating: createZooKerasMAE\n", + "processing fold # 2\n", + "creating: createZooKerasSequential\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createRMSprop\n", + "creating: createZooKerasMeanSquaredError\n", + "creating: createZooKerasMAE\n", + "processing fold # 3\n", + "creating: createZooKerasSequential\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createRMSprop\n", + "creating: createZooKerasMeanSquaredError\n", + "creating: createZooKerasMAE\n" + ] + } + ], + "source": [ + "import numpy as np\n", + "\n", + "k = 4\n", + "num_val_samples = len(train_data) // k\n", + "num_nb_epoch = 50\n", + "all_scores = []\n", + "for i in range(k):\n", + " print('processing fold #', i)\n", + " # Prepare the validation data: data from partition # k\n", + " val_data = train_data[i * num_val_samples: (i + 1) * num_val_samples]\n", + " val_targets = train_targets[i * num_val_samples: (i + 1) * num_val_samples]\n", + "\n", + " # Prepare the training data: data from all other partitions\n", + " partial_train_data = 
np.concatenate(\n", + " [train_data[:i * num_val_samples],\n", + " train_data[(i + 1) * num_val_samples:]],\n", + " axis=0)\n", + " partial_train_targets = np.concatenate(\n", + " [train_targets[:i * num_val_samples],\n", + " train_targets[(i + 1) * num_val_samples:]],\n", + " axis=0)\n", + "\n", + " # Build the model (already compiled)\n", + " model = build_model()\n", + " # Train the model (in silent mode, verbose=0)\n", + " #model.fit(partial_train_data, partial_train_targets,\n", + " # nb_epoch=num_nb_epoch, batch_size=1, verbose=0)\n", + " model.fit(partial_train_data, partial_train_targets,\n", + " nb_epoch=num_nb_epoch, batch_size=16)\n", + "\n", + " # Evaluate the model on the validation data\n", + " #val_mse, val_mae = model.evaluate(val_data, val_targets, verbose=0)\n", + " val_mse, val_mae = model.evaluate(val_data, val_targets)\n", + " all_scores.append(val_mae)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "_INFO - Trained 16 records in 0.011235845 seconds. Throughput is 1424.0139 records/second. Loss is 8.708786._\n", + "\n", + "_INFO - Trained 16 records in 0.009535034 seconds. Throughput is 1678.0223 records/second. Loss is 5.3613434._\n", + "\n", + "_INFO - Trained 16 records in 0.008636178 seconds. Throughput is 1852.6713 records/second. Loss is 18.106756._\n", + "\n", + "_INFO - Trained 16 records in 0.009207628 seconds. Throughput is 1737.6897 records/second. 
Loss is 7.0931993._" + ] + }, + { + "cell_type": "code", + "execution_count": 9, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "[3.291872501373291, 2.496018171310425, 2.221175193786621, 2.6994853019714355]" + ] + }, + "execution_count": 9, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "all_scores" + ] + }, + { + "cell_type": "code", + "execution_count": 10, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "2.677137792110443" + ] + }, + "execution_count": 10, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "np.mean(all_scores)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "As you can see, the different runs do indeed show rather different validation scores, from 2.2 to 3.3. Their average (2.7) is a much more \n", + "reliable metric than any single one of these scores -- that's the entire point of K-fold cross-validation. In this case, we are off by \\\\$2,700 on \n", + "average, which is still significant considering that the prices range from \\\\$10,000 to \\\\$50,000. \n", + "\n", + "Let's try training the network for a bit longer: 500 epochs. 
To keep a record of how well the model did at each epoch, we will modify our training loop \n", + "to save the per-epoch validation score log:" + ] + }, + { + "cell_type": "code", + "execution_count": 13, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "processing fold # 0\n", + "creating: createZooKerasSequential\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createRMSprop\n", + "creating: createZooKerasMeanSquaredError\n", + "creating: createZooKerasMAE\n", + "processing fold # 1\n", + "creating: createZooKerasSequential\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createRMSprop\n", + "creating: createZooKerasMeanSquaredError\n", + "creating: createZooKerasMAE\n", + "processing fold # 2\n", + "creating: createZooKerasSequential\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createRMSprop\n", + "creating: createZooKerasMeanSquaredError\n", + "creating: createZooKerasMAE\n", + "processing fold # 3\n", + "creating: createZooKerasSequential\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createRMSprop\n", + "creating: createZooKerasMeanSquaredError\n", + "creating: createZooKerasMAE\n" + ] + } + ], + "source": [ + "num_epochs = 500\n", + "all_mae_histories = []\n", + "for i in range(k):\n", + " print('processing fold #', i)\n", + " # Prepare the validation data: data from partition # k\n", + " val_data = train_data[i * num_val_samples: (i + 1) * num_val_samples]\n", + " val_targets = train_targets[i * num_val_samples: (i + 1) * num_val_samples]\n", + "\n", + " # Prepare the training data: data from all other partitions\n", + " partial_train_data = np.concatenate(\n", + " [train_data[:i * 
num_val_samples],\n", + " train_data[(i + 1) * num_val_samples:]],\n", + " axis=0)\n", + " partial_train_targets = np.concatenate(\n", + " [train_targets[:i * num_val_samples],\n", + " train_targets[(i + 1) * num_val_samples:]],\n", + " axis=0)\n", + "\n", + " # Build the model (already compiled)\n", + " model = build_model()\n", + " # Train the model (in silent mode, verbose=0)\n", + " import time\n", + " dir_name = '3-7 ' + str(time.ctime())\n", + " model.set_tensorboard('./', dir_name)\n", + " history = model.fit(partial_train_data, partial_train_targets,\n", + " validation_data=(val_data, val_targets),\n", + " nb_epoch=num_epochs, batch_size=16)\n", + " \n", + " #mae_history = history.history['val_mean_absolute_error']\n", + " mae_history = model.get_validation_summary(\"Loss\")\n", + " all_mae_histories.append(mae_history)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "We can then compute the average of the per-epoch MAE scores for all folds:" + ] + }, + { + "cell_type": "code", + "execution_count": 47, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "array([[[1.90000000e+01, 4.05375427e+02, 1.55307042e+09],\n", + " [3.80000000e+01, 2.64351837e+02, 1.55307042e+09],\n", + " [5.70000000e+01, 1.50977859e+02, 1.55307042e+09],\n", + " ...,\n", + " [9.46200000e+03, 2.07635689e+01, 1.55307053e+09],\n", + " [9.48100000e+03, 2.02473850e+01, 1.55307053e+09],\n", + " [9.50000000e+03, 2.02105141e+01, 1.55307053e+09]],\n", + "\n", + " [[1.90000000e+01, 4.76980957e+02, 1.55307053e+09],\n", + " [3.80000000e+01, 3.29584198e+02, 1.55307053e+09],\n", + " [5.70000000e+01, 1.80655548e+02, 1.55307053e+09],\n", + " ...,\n", + " [9.46200000e+03, 1.73588219e+01, 1.55307064e+09],\n", + " [9.48100000e+03, 1.78555279e+01, 1.55307064e+09],\n", + " [9.50000000e+03, 1.73744106e+01, 1.55307064e+09]],\n", + "\n", + " [[1.90000000e+01, 4.62182434e+02, 1.55307064e+09],\n", + " [3.80000000e+01, 3.34037567e+02, 1.55307064e+09],\n", + " 
[5.70000000e+01, 2.06141006e+02, 1.55307064e+09],\n", + " ...,\n", + " [9.46200000e+03, 1.72124062e+01, 1.55307075e+09],\n", + " [9.48100000e+03, 1.75751667e+01, 1.55307075e+09],\n", + " [9.50000000e+03, 1.74055386e+01, 1.55307075e+09]],\n", + "\n", + " [[1.90000000e+01, 5.21177673e+02, 1.55307075e+09],\n", + " [3.80000000e+01, 3.99685974e+02, 1.55307075e+09],\n", + " [5.70000000e+01, 2.67611786e+02, 1.55307075e+09],\n", + " ...,\n", + " [9.46200000e+03, 1.75390892e+01, 1.55307085e+09],\n", + " [9.48100000e+03, 1.76337471e+01, 1.55307085e+09],\n", + " [9.50000000e+03, 1.91227703e+01, 1.55307085e+09]]])" + ] + }, + "execution_count": 47, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "all_mae_histories = np.array(all_mae_histories)\n", + "all_mae_histories" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "As you can see, `all_mae_histories` is a 3-d array whose last dimension holds 3-element tuples. It is built from four 2-d arrays, one per fold, and the first elements of corresponding rows are equal across all four. The first element of each tuple is the training step and the third element is the timestamp. You do not need to worry about them; let's just calculate the average along the first axis of this 3-d array. We only want the second elements of this array, which stand for the MAE results. 
" + ] + }, + { + "cell_type": "code", + "execution_count": 48, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "array([[1.90000000e+01, 4.66429123e+02, 1.55307058e+09],\n", + " [3.80000000e+01, 3.31914894e+02, 1.55307058e+09],\n", + " [5.70000000e+01, 2.01346550e+02, 1.55307058e+09],\n", + " ...,\n", + " [9.46200000e+03, 1.82184715e+01, 1.55307069e+09],\n", + " [9.48100000e+03, 1.83279567e+01, 1.55307069e+09],\n", + " [9.50000000e+03, 1.85283084e+01, 1.55307069e+09]])" + ] + }, + "execution_count": 48, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "average_mae_history = np.mean(all_mae_histories, axis=0)\n", + "average_mae_history" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "As you can see, this operation does not mess up the first elements since they are all equal through the first axis. And we do not need to care about the third element because it is useless at this time.\n", + "\n", + "Let's plot this:" + ] + }, + { + "cell_type": "code", + "execution_count": 49, + "metadata": {}, + "outputs": [ + { + "data": { + "image/png": "\n", + "text/plain": [ + "
" + ] + }, + "metadata": { + "needs_background": "light" + }, + "output_type": "display_data" + } + ], + "source": [ + "import matplotlib.pyplot as plt\n", + "plt.plot(average_mae_history[:,0],average_mae_history[:,1])\n", + "plt.xlabel('Steps')\n", + "plt.ylabel('Validation MAE')\n", + "plt.ylim((14, 20))\n", + "plt.show()" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Let's plot this:" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "According to this plot, it seems that validation MAE stops improving significantly after 150 epochs. Past that point, we start overfitting.\n", + "\n", + "Once we are done tuning other parameters of our model (besides the number of epochs, we could also adjust the size of the hidden layers), we \n", + "can train a final \"production\" model on all of the training data, with the best parameters, then look at its performance on the test data:" + ] + }, + { + "cell_type": "code", + "execution_count": 50, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "creating: createZooKerasSequential\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createRMSprop\n", + "creating: createZooKerasMeanSquaredError\n", + "creating: createZooKerasMAE\n" + ] + } + ], + "source": [ + "# Get a fresh, compiled model.\n", + "model = build_model()\n", + "# Train it on the entirety of the data.\n", + "model.fit(train_data, train_targets,\n", + " nb_epoch=150, batch_size=16)\n", + "test_mse_score, test_mae_score = model.evaluate(test_data, test_targets)" + ] + }, + { + "cell_type": "code", + "execution_count": 51, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "1.7991065979003906" + ] + }, + "execution_count": 51, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "test_mae_score" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, 
+ "source": [ + "We are still off by about \\$1,800." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## Wrapping up\n", + "\n", + "\n", + "Here's what you should take away from this example:\n", + "\n", + "* Regression is done using different loss functions from classification; Mean Squared Error (MSE) is a commonly used loss function for \n", + "regression.\n", + "* Similarly, evaluation metrics to be used for regression differ from those used for classification; naturally the concept of \"accuracy\" \n", + "does not apply for regression. A common regression metric is Mean Absolute Error (MAE).\n", + "* When features in the input data have values in different ranges, each feature should be scaled independently as a preprocessing step.\n", + "* When there is little data available, using K-Fold validation is a great way to reliably evaluate a model.\n", + "* When little training data is available, it is preferable to use a small network with very few hidden layers (typically only one or two), \n", + "in order to avoid severe overfitting.\n", + "\n", + "This example concludes our series of three introductory practical examples. You are now able to handle common types of problems with vector data input:\n", + "\n", + "* Binary (2-class) classification.\n", + "* Multi-class, single-label classification.\n", + "* Scalar regression.\n", + "\n", + "In the next chapter, you will acquire a more formal understanding of some of the concepts you have encountered in these first examples, \n", + "such as data preprocessing, model evaluation, and overfitting." 
+ ] + } + ], + "metadata": { + "kernelspec": { + "display_name": "Python 3", + "language": "python", + "name": "python3" + }, + "language_info": { + "codemirror_mode": { + "name": "ipython", + "version": 3 + }, + "file_extension": ".py", + "mimetype": "text/x-python", + "name": "python", + "nbconvert_exporter": "python", + "pygments_lexer": "ipython3", + "version": "3.5.2" + } + }, + "nbformat": 4, + "nbformat_minor": 2 +} From 024733581dd9aed443b9131b9c5993d99e7250bd Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Thu, 21 Mar 2019 08:57:35 +0800 Subject: [PATCH 31/46] Add files via upload --- keras/3.7-predicting-house-prices.ipynb | 797 ++++++++++++++++++++++++ 1 file changed, 797 insertions(+) create mode 100644 keras/3.7-predicting-house-prices.ipynb diff --git a/keras/3.7-predicting-house-prices.ipynb b/keras/3.7-predicting-house-prices.ipynb new file mode 100644 index 0000000..0f37622 --- /dev/null +++ b/keras/3.7-predicting-house-prices.ipynb @@ -0,0 +1,797 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "First of all, set environment variables and initialize spark context:" + ] + }, + { + "cell_type": "code", + "execution_count": 1, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "env: SPARK_DRIVER_MEMORY=8g\n", + "env: PYSPARK_PYTHON=/usr/bin/python3.5\n", + "env: PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n" + ] + } + ], + "source": [ + "%env SPARK_DRIVER_MEMORY=8g\n", + "%env PYSPARK_PYTHON=/usr/bin/python3.5\n", + "%env PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n", + "\n", + "from zoo.common.nncontext import *\n", + "sc = init_nncontext(init_spark_conf().setMaster(\"local[4]\"))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "# Predicting house prices: a regression example\n", + "\n", + "\n", + "----\n", + "\n", + "\n", + "In our two previous examples, we were considering classification problems, where the goal was to predict a single 
discrete label of an \n", + "input data point. Another common type of machine learning problem is \"regression\", which consists of predicting a continuous value instead \n", + "of a discrete label. For instance, predicting the temperature tomorrow, given meteorological data, or predicting the time that a \n", + "software project will take to complete, given its specifications.\n", + "\n", + "Do not mix up \"regression\" with the algorithm \"logistic regression\": confusingly, \"logistic regression\" is not a regression algorithm, \n", + "it is a classification algorithm." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## The Boston Housing Price dataset\n", + "\n", + "\n", + "We will be attempting to predict the median price of homes in a given Boston suburb in the mid-1970s, given a few data points about the \n", + "suburb at the time, such as the crime rate, the local property tax rate, etc.\n", + "\n", + "The dataset we will be using has another interesting difference from our two previous examples: it has very few data points, only 506 in \n", + "total, split between 404 training samples and 102 test samples, and each \"feature\" in the input data (e.g. the crime rate is a feature) has \n", + "a different scale. 
For instance, some values are proportions, which take values between 0 and 1; others take values between 1 and 12, \n", + "others between 0 and 100...\n", + "\n", + "Let's take a look at the data:" + ] + }, + { + "cell_type": "code", + "execution_count": 2, + "metadata": {}, + "outputs": [], + "source": [ + "from zoo.pipeline.api.keras.datasets import boston_housing\n", + "(train_data, train_targets), (test_data, test_targets) = boston_housing.load_data()" + ] + }, + { + "cell_type": "code", + "execution_count": 3, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "(404, 13)" + ] + }, + "execution_count": 3, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "train_data.shape" + ] + }, + { + "cell_type": "code", + "execution_count": 4, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "(102, 13)" + ] + }, + "execution_count": 4, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "test_data.shape" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "As you can see, we have 404 training samples and 102 test samples. The data comprises 13 features. The 13 features in the input data are as \n", + "follows:\n", + "\n", + "1. Per capita crime rate.\n", + "2. Proportion of residential land zoned for lots over 25,000 square feet.\n", + "3. Proportion of non-retail business acres per town.\n", + "4. Charles River dummy variable (= 1 if tract bounds river; 0 otherwise).\n", + "5. Nitric oxides concentration (parts per 10 million).\n", + "6. Average number of rooms per dwelling.\n", + "7. Proportion of owner-occupied units built prior to 1940.\n", + "8. Weighted distances to five Boston employment centres.\n", + "9. Index of accessibility to radial highways.\n", + "10. Full-value property-tax rate per $10,000.\n", + "11. Pupil-teacher ratio by town.\n", + "12. 1000 * (Bk - 0.63) ** 2 where Bk is the proportion of Black people by town.\n", + "13. 
% lower status of the population.\n", + "\n", + "The targets are the median values of owner-occupied homes, in thousands of dollars:" + ] + }, + { + "cell_type": "code", + "execution_count": 5, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "array([22.6, 50. , 23. , 8.3, 21.2, 19.9, 20.6, 18.7, 16.1, 18.6, 8.8,\n", + " 17.2, 14.9, 10.5, 50. , 29. , 23. , 33.3, 29.4, 21. , 23.8, 19.1,\n", + " 20.4, 29.1, 19.3, 23.1, 19.6, 19.4, 38.7, 18.7, 14.6, 20. , 20.5,\n", + " 20.1, 23.6, 16.8, 5.6, 50. , 14.5, 13.3, 23.9, 20. , 19.8, 13.8,\n", + " 16.5, 21.6, 20.3, 17. , 11.8, 27.5, 15.6, 23.1, 24.3, 42.8, 15.6,\n", + " 21.7, 17.1, 17.2, 15. , 21.7, 18.6, 21. , 33.1, 31.5, 20.1, 29.8,\n", + " 15.2, 15. , 27.5, 22.6, 20. , 21.4, 23.5, 31.2, 23.7, 7.4, 48.3,\n", + " 24.4, 22.6, 18.3, 23.3, 17.1, 27.9, 44.8, 50. , 23. , 21.4, 10.2,\n", + " 23.3, 23.2, 18.9, 13.4, 21.9, 24.8, 11.9, 24.3, 13.8, 24.7, 14.1,\n", + " 18.7, 28.1, 19.8, 26.7, 21.7, 22. , 22.9, 10.4, 21.9, 20.6, 26.4,\n", + " 41.3, 17.2, 27.1, 20.4, 16.5, 24.4, 8.4, 23. , 9.7, 50. , 30.5,\n", + " 12.3, 19.4, 21.2, 20.3, 18.8, 33.4, 18.5, 19.6, 33.2, 13.1, 7.5,\n", + " 13.6, 17.4, 8.4, 35.4, 24. , 13.4, 26.2, 7.2, 13.1, 24.5, 37.2,\n", + " 25. , 24.1, 16.6, 32.9, 36.2, 11. , 7.2, 22.8, 28.7, 14.4, 24.4,\n", + " 18.1, 22.5, 20.5, 15.2, 17.4, 13.6, 8.7, 18.2, 35.4, 31.7, 33. ,\n", + " 22.2, 20.4, 23.9, 25. , 12.7, 29.1, 12. , 17.7, 27. , 20.6, 10.2,\n", + " 17.5, 19.7, 29.8, 20.5, 14.9, 10.9, 19.5, 22.7, 19.5, 24.6, 25. ,\n", + " 24.5, 50. , 14.3, 11.8, 31. , 28.7, 16.2, 43.5, 25. , 22. , 19.9,\n", + " 22.1, 46. , 22.9, 20.2, 43.1, 34.6, 13.8, 24.3, 21.5, 24.4, 21.2,\n", + " 23.8, 26.6, 25.1, 9.6, 19.4, 19.4, 9.5, 14. , 26.5, 13.8, 34.7,\n", + " 16.3, 21.7, 17.5, 15.6, 20.9, 21.7, 12.7, 18.5, 23.7, 19.3, 12.7,\n", + " 21.6, 23.2, 29.6, 21.2, 23.8, 17.1, 22. , 36.5, 18.8, 21.9, 23.1,\n", + " 20.2, 17.4, 37. , 24.1, 36.2, 15.7, 32.2, 13.5, 17.9, 13.3, 11.7,\n", + " 41.7, 18.4, 13.1, 25. , 21.2, 16. 
, 34.9, 25.2, 24.8, 21.5, 23.4,\n", + " 18.9, 10.8, 21. , 27.5, 17.5, 13.5, 28.7, 14.8, 19.1, 28.6, 13.1,\n", + " 19. , 11.3, 13.3, 22.4, 20.1, 18.2, 22.9, 20.6, 25. , 12.8, 34.9,\n", + " 23.7, 50. , 29. , 30.1, 22. , 15.6, 23.3, 30.1, 14.3, 22.8, 50. ,\n", + " 20.8, 6.3, 34.9, 32.4, 19.9, 20.3, 17.8, 23.1, 20.4, 23.2, 7. ,\n", + " 16.8, 46.7, 50. , 22.9, 23.9, 21.4, 21.7, 15.4, 15.3, 23.1, 23.9,\n", + " 19.4, 11.9, 17.8, 31.5, 33.8, 20.8, 19.8, 22.4, 5. , 24.5, 19.4,\n", + " 15.1, 18.2, 19.3, 27.1, 20.7, 37.6, 11.7, 33.4, 30.1, 21.4, 45.4,\n", + " 20.1, 20.8, 26.4, 10.4, 21.8, 32. , 21.7, 18.4, 37.9, 17.8, 28. ,\n", + " 28.2, 36. , 18.9, 15. , 22.5, 30.7, 20. , 19.1, 23.3, 26.6, 21.1,\n", + " 19.7, 20. , 12.1, 7.2, 14.2, 17.3, 27.5, 22.2, 10.9, 19.2, 32. ,\n", + " 14.5, 24.7, 12.6, 24. , 24.1, 50. , 16.1, 43.8, 26.6, 36.1, 21.8,\n", + " 29.9, 50. , 44. , 20.6, 19.6, 28.4, 19.1, 22.3, 20.9, 28.4, 14.4,\n", + " 32.7, 13.8, 8.5, 22.5, 35.1, 31.6, 17.8, 15.6])" + ] + }, + "execution_count": 5, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "train_targets" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "The prices are typically between \\$10,000 and \\$50,000. If that sounds cheap, remember this was the mid-1970s, and these prices are not \n", + "inflation-adjusted." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## Preparing the data\n", + "\n", + "\n", + "It would be problematic to feed into a neural network values that all take wildly different ranges. The network might be able to \n", + "automatically adapt to such heterogeneous data, but it would definitely make learning more difficult. 
A widespread best practice to deal \n", + "with such data is to do feature-wise normalization: for each feature in the input data (a column in the input data matrix), we \n", + "will subtract the mean of the feature and divide by the standard deviation, so that the feature will be centered around 0 and will have a \n", + "unit standard deviation. This is easily done in Numpy:" + ] + }, + { + "cell_type": "code", + "execution_count": 6, + "metadata": {}, + "outputs": [], + "source": [ + "mean = train_data.mean(axis=0)\n", + "train_data -= mean\n", + "std = train_data.std(axis=0)\n", + "train_data /= std\n", + "\n", + "test_data -= mean\n", + "test_data /= std" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Note that the quantities that we use for normalizing the test data have been computed using the training data. We should never use in our \n", + "workflow any quantity computed on the test data, even for something as simple as data normalization." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## Building our network\n", + "\n", + "\n", + "Because so few samples are available, we will be using a very small network with two \n", + "hidden layers, each with 64 units. In general, the less training data you have, the worse overfitting will be, and using \n", + "a small network is one way to mitigate overfitting." 
+ ] + }, + { + "cell_type": "code", + "execution_count": 7, + "metadata": {}, + "outputs": [], + "source": [ + "from zoo.pipeline.api.keras import models\n", + "from zoo.pipeline.api.keras import layers\n", + "\n", + "def build_model():\n", + " # Because we will need to instantiate\n", + " # the same model multiple times,\n", + " # we use a function to construct it.\n", + " model = models.Sequential()\n", + " model.add(layers.Dense(64, activation='relu',\n", + " input_shape=(train_data.shape[1],)))\n", + " model.add(layers.Dense(64, activation='relu'))\n", + " model.add(layers.Dense(1))\n", + " model.compile(optimizer='rmsprop', loss='mse', metrics=['mae'])\n", + " return model" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Our network ends with a single unit, and no activation (i.e. it will be a linear layer). \n", + "This is a typical setup for scalar regression (i.e. regression where we are trying to predict a single continuous value). \n", + "Applying an activation function would constrain the range that the output can take; for instance if \n", + "we applied a `sigmoid` activation function to our last layer, the network could only learn to predict values between 0 and 1. Here, because \n", + "the last layer is purely linear, the network is free to learn to predict values in any range.\n", + "\n", + "Note that we are compiling the network with the `mse` loss function -- Mean Squared Error, the average of the squared differences between the \n", + "predictions and the targets, a widely used loss function for regression problems.\n", + "\n", + "We are also monitoring a new metric during training: `mae`. This stands for Mean Absolute Error. It is simply the average absolute value of the \n", + "difference between the predictions and the targets. For instance, an MAE of 0.5 on this problem would mean that our predictions are off by \n", + "\\$500 on average." 
+ ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## Validating our approach using K-fold validation\n", + "\n", + "\n", + "To evaluate our network while we keep adjusting its parameters (such as the number of epochs used for training), we could simply split the \n", + "data into a training set and a validation set, as we were doing in our previous examples. However, because we have so few data points, the \n", + "validation set would end up being very small (e.g. about 100 examples). A consequence is that our validation scores may change a lot \n", + "depending on _which_ data points we choose to use for validation and which we choose for training, i.e. the validation scores may have a \n", + "high _variance_ with regard to the validation split. This would prevent us from reliably evaluating our model.\n", + "\n", + "The best practice in such situations is to use K-fold cross-validation. It consists of splitting the available data into K partitions \n", + "(typically K=4 or 5), then instantiating K identical models, and training each one on K-1 partitions while evaluating on the remaining \n", + "partition. The validation score for the model used would then be the average of the K validation scores obtained." 
+ ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Then let's start our training:" + ] + }, + { + "cell_type": "code", + "execution_count": 8, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "processing fold # 0\n", + "creating: createZooKerasSequential\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createRMSprop\n", + "creating: createZooKerasMeanSquaredError\n", + "creating: createZooKerasMAE\n", + "processing fold # 1\n", + "creating: createZooKerasSequential\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createRMSprop\n", + "creating: createZooKerasMeanSquaredError\n", + "creating: createZooKerasMAE\n", + "processing fold # 2\n", + "creating: createZooKerasSequential\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createRMSprop\n", + "creating: createZooKerasMeanSquaredError\n", + "creating: createZooKerasMAE\n", + "processing fold # 3\n", + "creating: createZooKerasSequential\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createRMSprop\n", + "creating: createZooKerasMeanSquaredError\n", + "creating: createZooKerasMAE\n" + ] + } + ], + "source": [ + "import numpy as np\n", + "\n", + "k = 4\n", + "num_val_samples = len(train_data) // k\n", + "num_nb_epoch = 50\n", + "all_scores = []\n", + "for i in range(k):\n", + " print('processing fold #', i)\n", + " # Prepare the validation data: data from partition # k\n", + " val_data = train_data[i * num_val_samples: (i + 1) * num_val_samples]\n", + " val_targets = train_targets[i * num_val_samples: (i + 1) * num_val_samples]\n", + "\n", + " # Prepare the training data: data from all other partitions\n", + " partial_train_data = 
np.concatenate(\n", + " [train_data[:i * num_val_samples],\n", + " train_data[(i + 1) * num_val_samples:]],\n", + " axis=0)\n", + " partial_train_targets = np.concatenate(\n", + " [train_targets[:i * num_val_samples],\n", + " train_targets[(i + 1) * num_val_samples:]],\n", + " axis=0)\n", + "\n", + " # Build the model (already compiled)\n", + " model = build_model()\n", + " # Train the model (in silent mode, verbose=0)\n", + " #model.fit(partial_train_data, partial_train_targets,\n", + " # nb_epoch=num_nb_epoch, batch_size=1, verbose=0)\n", + " model.fit(partial_train_data, partial_train_targets,\n", + " nb_epoch=num_nb_epoch, batch_size=16)\n", + "\n", + " # Evaluate the model on the validation data\n", + " #val_mse, val_mae = model.evaluate(val_data, val_targets, verbose=0)\n", + " val_mse, val_mae = model.evaluate(val_data, val_targets)\n", + " all_scores.append(val_mae)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "_INFO - Trained 16 records in 0.011235845 seconds. Throughput is 1424.0139 records/second. Loss is 8.708786._\n", + "\n", + "_INFO - Trained 16 records in 0.009535034 seconds. Throughput is 1678.0223 records/second. Loss is 5.3613434._\n", + "\n", + "_INFO - Trained 16 records in 0.008636178 seconds. Throughput is 1852.6713 records/second. Loss is 18.106756._\n", + "\n", + "_INFO - Trained 16 records in 0.009207628 seconds. Throughput is 1737.6897 records/second. 
 Loss is 7.0931993._" + ] + }, + { + "cell_type": "code", + "execution_count": 9, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "[3.291872501373291, 2.496018171310425, 2.221175193786621, 2.6994853019714355]" + ] + }, + "execution_count": 9, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "all_scores" + ] + }, + { + "cell_type": "code", + "execution_count": 10, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "2.677137792110443" + ] + }, + "execution_count": 10, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "np.mean(all_scores)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "As you can see, the different runs do indeed show rather different validation scores, from 2.2 to 3.3. Their average (2.7) is a much more \n", + "reliable metric than any single one of these scores -- that's the entire point of K-fold cross-validation. In this case, we are off by about \\$2,700 on \n", + "average, which is still significant considering that the prices range from \\$10,000 to \\$50,000. \n", + "\n", + "Let's try training the network for a bit longer: 500 epochs. 
To keep a record of how well the model did at each epoch, we will modify our training loop \n", + "to save the per-epoch validation score log:" + ] + }, + { + "cell_type": "code", + "execution_count": 13, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "processing fold # 0\n", + "creating: createZooKerasSequential\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createRMSprop\n", + "creating: createZooKerasMeanSquaredError\n", + "creating: createZooKerasMAE\n", + "processing fold # 1\n", + "creating: createZooKerasSequential\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createRMSprop\n", + "creating: createZooKerasMeanSquaredError\n", + "creating: createZooKerasMAE\n", + "processing fold # 2\n", + "creating: createZooKerasSequential\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createRMSprop\n", + "creating: createZooKerasMeanSquaredError\n", + "creating: createZooKerasMAE\n", + "processing fold # 3\n", + "creating: createZooKerasSequential\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createRMSprop\n", + "creating: createZooKerasMeanSquaredError\n", + "creating: createZooKerasMAE\n" + ] + } + ], + "source": [ + "num_epochs = 500\n", + "all_mae_histories = []\n", + "for i in range(k):\n", + " print('processing fold #', i)\n", + " # Prepare the validation data: data from partition # k\n", + " val_data = train_data[i * num_val_samples: (i + 1) * num_val_samples]\n", + " val_targets = train_targets[i * num_val_samples: (i + 1) * num_val_samples]\n", + "\n", + " # Prepare the training data: data from all other partitions\n", + " partial_train_data = np.concatenate(\n", + " [train_data[:i * 
num_val_samples],\n", + " train_data[(i + 1) * num_val_samples:]],\n", + " axis=0)\n", + " partial_train_targets = np.concatenate(\n", + " [train_targets[:i * num_val_samples],\n", + " train_targets[(i + 1) * num_val_samples:]],\n", + " axis=0)\n", + "\n", + " # Build the model (already compiled)\n", + " model = build_model()\n", + " # Train the model (in silent mode, verbose=0)\n", + " import time\n", + " dir_name = '3-7 ' + str(time.ctime())\n", + " model.set_tensorboard('./', dir_name)\n", + " history = model.fit(partial_train_data, partial_train_targets,\n", + " validation_data=(val_data, val_targets),\n", + " nb_epoch=num_epochs, batch_size=16)\n", + " \n", + " #mae_history = history.history['val_mean_absolute_error']\n", + " mae_history = model.get_validation_summary(\"Loss\")\n", + " all_mae_histories.append(mae_history)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "We can then compute the average of the per-epoch MAE scores for all folds:" + ] + }, + { + "cell_type": "code", + "execution_count": 47, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "array([[[1.90000000e+01, 4.05375427e+02, 1.55307042e+09],\n", + " [3.80000000e+01, 2.64351837e+02, 1.55307042e+09],\n", + " [5.70000000e+01, 1.50977859e+02, 1.55307042e+09],\n", + " ...,\n", + " [9.46200000e+03, 2.07635689e+01, 1.55307053e+09],\n", + " [9.48100000e+03, 2.02473850e+01, 1.55307053e+09],\n", + " [9.50000000e+03, 2.02105141e+01, 1.55307053e+09]],\n", + "\n", + " [[1.90000000e+01, 4.76980957e+02, 1.55307053e+09],\n", + " [3.80000000e+01, 3.29584198e+02, 1.55307053e+09],\n", + " [5.70000000e+01, 1.80655548e+02, 1.55307053e+09],\n", + " ...,\n", + " [9.46200000e+03, 1.73588219e+01, 1.55307064e+09],\n", + " [9.48100000e+03, 1.78555279e+01, 1.55307064e+09],\n", + " [9.50000000e+03, 1.73744106e+01, 1.55307064e+09]],\n", + "\n", + " [[1.90000000e+01, 4.62182434e+02, 1.55307064e+09],\n", + " [3.80000000e+01, 3.34037567e+02, 1.55307064e+09],\n", + " 
 [5.70000000e+01, 2.06141006e+02, 1.55307064e+09],\n", + " ...,\n", + " [9.46200000e+03, 1.72124062e+01, 1.55307075e+09],\n", + " [9.48100000e+03, 1.75751667e+01, 1.55307075e+09],\n", + " [9.50000000e+03, 1.74055386e+01, 1.55307075e+09]],\n", + "\n", + " [[1.90000000e+01, 5.21177673e+02, 1.55307075e+09],\n", + " [3.80000000e+01, 3.99685974e+02, 1.55307075e+09],\n", + " [5.70000000e+01, 2.67611786e+02, 1.55307075e+09],\n", + " ...,\n", + " [9.46200000e+03, 1.75390892e+01, 1.55307085e+09],\n", + " [9.48100000e+03, 1.76337471e+01, 1.55307085e+09],\n", + " [9.50000000e+03, 1.91227703e+01, 1.55307085e+09]]])" + ] + }, + "execution_count": 47, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "all_mae_histories = np.array(all_mae_histories)\n", + "all_mae_histories" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "As you can see, `all_mae_histories` is a 3-d array whose last dimension holds 3-element records. It is built from four 2-d arrays, one per fold, and the first element of each record is the same across all four. The first element of a record is the training step and the third element is the time stamp. You do not need to worry about them; let's just calculate the average over the first axis of this 3-d array. We only want the second element of each record, which holds the validation result. Note that the summary is requested with the `Loss` tag, so these values appear to be the validation loss (MSE for this model) rather than the MAE itself. 
" + ] + }, + { + "cell_type": "code", + "execution_count": 48, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "array([[1.90000000e+01, 4.66429123e+02, 1.55307058e+09],\n", + " [3.80000000e+01, 3.31914894e+02, 1.55307058e+09],\n", + " [5.70000000e+01, 2.01346550e+02, 1.55307058e+09],\n", + " ...,\n", + " [9.46200000e+03, 1.82184715e+01, 1.55307069e+09],\n", + " [9.48100000e+03, 1.83279567e+01, 1.55307069e+09],\n", + " [9.50000000e+03, 1.85283084e+01, 1.55307069e+09]])" + ] + }, + "execution_count": 48, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "average_mae_history = np.mean(all_mae_histories, axis=0)\n", + "average_mae_history" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "As you can see, this operation does not mess up the first elements since they are all equal through the first axis. And we do not need to care about the third element because it is useless at this time.\n", + "\n", + "Let's plot this:" + ] + }, + { + "cell_type": "code", + "execution_count": 49, + "metadata": {}, + "outputs": [ + { + "data": { + "image/png": "\n", + "text/plain": [ + "
" + ] + }, + "metadata": { + "needs_background": "light" + }, + "output_type": "display_data" + } + ], + "source": [ + "import matplotlib.pyplot as plt\n", + "plt.plot(average_mae_history[:,0],average_mae_history[:,1])\n", + "plt.xlabel('Steps')\n", + "plt.ylabel('Validation MAE')\n", + "plt.ylim((14, 20))\n", + "plt.show()" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Let's plot this:" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "According to this plot, it seems that validation MAE stops improving significantly after 150 epochs. Past that point, we start overfitting.\n", + "\n", + "Once we are done tuning other parameters of our model (besides the number of epochs, we could also adjust the size of the hidden layers), we \n", + "can train a final \"production\" model on all of the training data, with the best parameters, then look at its performance on the test data:" + ] + }, + { + "cell_type": "code", + "execution_count": 50, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "creating: createZooKerasSequential\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createRMSprop\n", + "creating: createZooKerasMeanSquaredError\n", + "creating: createZooKerasMAE\n" + ] + } + ], + "source": [ + "# Get a fresh, compiled model.\n", + "model = build_model()\n", + "# Train it on the entirety of the data.\n", + "model.fit(train_data, train_targets,\n", + " nb_epoch=150, batch_size=16)\n", + "test_mse_score, test_mae_score = model.evaluate(test_data, test_targets)" + ] + }, + { + "cell_type": "code", + "execution_count": 51, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "1.7991065979003906" + ] + }, + "execution_count": 51, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "test_mae_score" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, 
+ "source": [ + "We are still off by about \\$1,800." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## Wrapping up\n", + "\n", + "\n", + "Here's what you should take away from this example:\n", + "\n", + "* Regression is done using different loss functions from classification; Mean Squared Error (MSE) is a commonly used loss function for \n", + "regression.\n", + "* Similarly, evaluation metrics to be used for regression differ from those used for classification; naturally the concept of \"accuracy\" \n", + "does not apply for regression. A common regression metric is Mean Absolute Error (MAE).\n", + "* When features in the input data have values in different ranges, each feature should be scaled independently as a preprocessing step.\n", + "* When there is little data available, using K-Fold validation is a great way to reliably evaluate a model.\n", + "* When little training data is available, it is preferable to use a small network with very few hidden layers (typically only one or two), \n", + "in order to avoid severe overfitting.\n", + "\n", + "This example concludes our series of three introductory practical examples. You are now able to handle common types of problems with vector data input:\n", + "\n", + "* Binary (2-class) classification.\n", + "* Multi-class, single-label classification.\n", + "* Scalar regression.\n", + "\n", + "In the next chapter, you will acquire a more formal understanding of some of the concepts you have encountered in these first examples, \n", + "such as data preprocessing, model evaluation, and overfitting." 
+ ] + } + ], + "metadata": { + "kernelspec": { + "display_name": "Python 3", + "language": "python", + "name": "python3" + }, + "language_info": { + "codemirror_mode": { + "name": "ipython", + "version": 3 + }, + "file_extension": ".py", + "mimetype": "text/x-python", + "name": "python", + "nbconvert_exporter": "python", + "pygments_lexer": "ipython3", + "version": "3.5.2" + } + }, + "nbformat": 4, + "nbformat_minor": 2 +} From 6ec5eb4acc5fddc2d2336f0cf1dbda65c5edb0b0 Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Thu, 21 Mar 2019 08:57:44 +0800 Subject: [PATCH 32/46] Delete 3.7-regression.ipynb --- keras/3.7-regression.ipynb | 797 ------------------------------------- 1 file changed, 797 deletions(-) delete mode 100644 keras/3.7-regression.ipynb diff --git a/keras/3.7-regression.ipynb b/keras/3.7-regression.ipynb deleted file mode 100644 index 0f37622..0000000 --- a/keras/3.7-regression.ipynb +++ /dev/null @@ -1,797 +0,0 @@ -{ - "cells": [ - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "First of all, set environment variables and initialize spark context:" - ] - }, - { - "cell_type": "code", - "execution_count": 1, - "metadata": {}, - "outputs": [ - { - "name": "stdout", - "output_type": "stream", - "text": [ - "env: SPARK_DRIVER_MEMORY=8g\n", - "env: PYSPARK_PYTHON=/usr/bin/python3.5\n", - "env: PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n" - ] - } - ], - "source": [ - "%env SPARK_DRIVER_MEMORY=8g\n", - "%env PYSPARK_PYTHON=/usr/bin/python3.5\n", - "%env PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n", - "\n", - "from zoo.common.nncontext import *\n", - "sc = init_nncontext(init_spark_conf().setMaster(\"local[4]\"))" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "# Predicting house prices: a regression example\n", - "\n", - "\n", - "----\n", - "\n", - "\n", - "In our two previous examples, we were considering classification problems, where the goal was to predict a single discrete label of an \n", - "input data point. 
Another common type of machine learning problem is \"regression\", which consists of predicting a continuous value instead \n", - "of a discrete label. For instance, predicting the temperature tomorrow, given meteorological data, or predicting the time that a \n", - "software project will take to complete, given its specifications.\n", - "\n", - "Do not mix up \"regression\" with the algorithm \"logistic regression\": confusingly, \"logistic regression\" is not a regression algorithm, \n", - "it is a classification algorithm." - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "## The Boston Housing Price dataset\n", - "\n", - "\n", - "We will be attempting to predict the median price of homes in a given Boston suburb in the mid-1970s, given a few data points about the \n", - "suburb at the time, such as the crime rate, the local property tax rate, etc.\n", - "\n", - "The dataset we will be using has another interesting difference from our two previous examples: it has very few data points, only 506 in \n", - "total, split between 404 training samples and 102 test samples, and each \"feature\" in the input data (e.g. the crime rate is a feature) has \n", - "a different scale. 
For instance, some values are proportions, which take values between 0 and 1, others take values between 1 and 12, \n", - "others between 0 and 100...\n", - "\n", - "Let's take a look at the data:" - ] - }, - { - "cell_type": "code", - "execution_count": 2, - "metadata": {}, - "outputs": [], - "source": [ - "from zoo.pipeline.api.keras.datasets import boston_housing\n", - "(train_data, train_targets), (test_data, test_targets) = boston_housing.load_data()" - ] - }, - { - "cell_type": "code", - "execution_count": 3, - "metadata": {}, - "outputs": [ - { - "data": { - "text/plain": [ - "(404, 13)" - ] - }, - "execution_count": 3, - "metadata": {}, - "output_type": "execute_result" - } - ], - "source": [ - "train_data.shape" - ] - }, - { - "cell_type": "code", - "execution_count": 4, - "metadata": {}, - "outputs": [ - { - "data": { - "text/plain": [ - "(102, 13)" - ] - }, - "execution_count": 4, - "metadata": {}, - "output_type": "execute_result" - } - ], - "source": [ - "test_data.shape" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "As you can see, we have 404 training samples and 102 test samples. The data comprises 13 features. The 13 features in the input data are as \n", - "follows:\n", - "\n", - "1. Per capita crime rate.\n", - "2. Proportion of residential land zoned for lots over 25,000 square feet.\n", - "3. Proportion of non-retail business acres per town.\n", - "4. Charles River dummy variable (= 1 if tract bounds river; 0 otherwise).\n", - "5. Nitric oxides concentration (parts per 10 million).\n", - "6. Average number of rooms per dwelling.\n", - "7. Proportion of owner-occupied units built prior to 1940.\n", - "8. Weighted distances to five Boston employment centres.\n", - "9. Index of accessibility to radial highways.\n", - "10. Full-value property-tax rate per $10,000.\n", - "11. Pupil-teacher ratio by town.\n", - "12. 1000 * (Bk - 0.63) ** 2 where Bk is the proportion of Black people by town.\n", - "13. 
% lower status of the population.\n", - "\n", - "The targets are the median values of owner-occupied homes, in thousands of dollars:" - ] - }, - { - "cell_type": "code", - "execution_count": 5, - "metadata": {}, - "outputs": [ - { - "data": { - "text/plain": [ - "array([22.6, 50. , 23. , 8.3, 21.2, 19.9, 20.6, 18.7, 16.1, 18.6, 8.8,\n", - " 17.2, 14.9, 10.5, 50. , 29. , 23. , 33.3, 29.4, 21. , 23.8, 19.1,\n", - " 20.4, 29.1, 19.3, 23.1, 19.6, 19.4, 38.7, 18.7, 14.6, 20. , 20.5,\n", - " 20.1, 23.6, 16.8, 5.6, 50. , 14.5, 13.3, 23.9, 20. , 19.8, 13.8,\n", - " 16.5, 21.6, 20.3, 17. , 11.8, 27.5, 15.6, 23.1, 24.3, 42.8, 15.6,\n", - " 21.7, 17.1, 17.2, 15. , 21.7, 18.6, 21. , 33.1, 31.5, 20.1, 29.8,\n", - " 15.2, 15. , 27.5, 22.6, 20. , 21.4, 23.5, 31.2, 23.7, 7.4, 48.3,\n", - " 24.4, 22.6, 18.3, 23.3, 17.1, 27.9, 44.8, 50. , 23. , 21.4, 10.2,\n", - " 23.3, 23.2, 18.9, 13.4, 21.9, 24.8, 11.9, 24.3, 13.8, 24.7, 14.1,\n", - " 18.7, 28.1, 19.8, 26.7, 21.7, 22. , 22.9, 10.4, 21.9, 20.6, 26.4,\n", - " 41.3, 17.2, 27.1, 20.4, 16.5, 24.4, 8.4, 23. , 9.7, 50. , 30.5,\n", - " 12.3, 19.4, 21.2, 20.3, 18.8, 33.4, 18.5, 19.6, 33.2, 13.1, 7.5,\n", - " 13.6, 17.4, 8.4, 35.4, 24. , 13.4, 26.2, 7.2, 13.1, 24.5, 37.2,\n", - " 25. , 24.1, 16.6, 32.9, 36.2, 11. , 7.2, 22.8, 28.7, 14.4, 24.4,\n", - " 18.1, 22.5, 20.5, 15.2, 17.4, 13.6, 8.7, 18.2, 35.4, 31.7, 33. ,\n", - " 22.2, 20.4, 23.9, 25. , 12.7, 29.1, 12. , 17.7, 27. , 20.6, 10.2,\n", - " 17.5, 19.7, 29.8, 20.5, 14.9, 10.9, 19.5, 22.7, 19.5, 24.6, 25. ,\n", - " 24.5, 50. , 14.3, 11.8, 31. , 28.7, 16.2, 43.5, 25. , 22. , 19.9,\n", - " 22.1, 46. , 22.9, 20.2, 43.1, 34.6, 13.8, 24.3, 21.5, 24.4, 21.2,\n", - " 23.8, 26.6, 25.1, 9.6, 19.4, 19.4, 9.5, 14. , 26.5, 13.8, 34.7,\n", - " 16.3, 21.7, 17.5, 15.6, 20.9, 21.7, 12.7, 18.5, 23.7, 19.3, 12.7,\n", - " 21.6, 23.2, 29.6, 21.2, 23.8, 17.1, 22. , 36.5, 18.8, 21.9, 23.1,\n", - " 20.2, 17.4, 37. , 24.1, 36.2, 15.7, 32.2, 13.5, 17.9, 13.3, 11.7,\n", - " 41.7, 18.4, 13.1, 25. , 21.2, 16. 
, 34.9, 25.2, 24.8, 21.5, 23.4,\n", - " 18.9, 10.8, 21. , 27.5, 17.5, 13.5, 28.7, 14.8, 19.1, 28.6, 13.1,\n", - " 19. , 11.3, 13.3, 22.4, 20.1, 18.2, 22.9, 20.6, 25. , 12.8, 34.9,\n", - " 23.7, 50. , 29. , 30.1, 22. , 15.6, 23.3, 30.1, 14.3, 22.8, 50. ,\n", - " 20.8, 6.3, 34.9, 32.4, 19.9, 20.3, 17.8, 23.1, 20.4, 23.2, 7. ,\n", - " 16.8, 46.7, 50. , 22.9, 23.9, 21.4, 21.7, 15.4, 15.3, 23.1, 23.9,\n", - " 19.4, 11.9, 17.8, 31.5, 33.8, 20.8, 19.8, 22.4, 5. , 24.5, 19.4,\n", - " 15.1, 18.2, 19.3, 27.1, 20.7, 37.6, 11.7, 33.4, 30.1, 21.4, 45.4,\n", - " 20.1, 20.8, 26.4, 10.4, 21.8, 32. , 21.7, 18.4, 37.9, 17.8, 28. ,\n", - " 28.2, 36. , 18.9, 15. , 22.5, 30.7, 20. , 19.1, 23.3, 26.6, 21.1,\n", - " 19.7, 20. , 12.1, 7.2, 14.2, 17.3, 27.5, 22.2, 10.9, 19.2, 32. ,\n", - " 14.5, 24.7, 12.6, 24. , 24.1, 50. , 16.1, 43.8, 26.6, 36.1, 21.8,\n", - " 29.9, 50. , 44. , 20.6, 19.6, 28.4, 19.1, 22.3, 20.9, 28.4, 14.4,\n", - " 32.7, 13.8, 8.5, 22.5, 35.1, 31.6, 17.8, 15.6])" - ] - }, - "execution_count": 5, - "metadata": {}, - "output_type": "execute_result" - } - ], - "source": [ - "train_targets" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "The prices are typically between \\$10,000 and \\$50,000. If that sounds cheap, remember this was the mid-1970s, and these prices are not \n", - "inflation-adjusted." - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "## Preparing the data\n", - "\n", - "\n", - "It would be problematic to feed into a neural network values that all take wildly different ranges. The network might be able to \n", - "automatically adapt to such heterogeneous data, but it would definitely make learning more difficult. 
A widespread best practice to deal \n", - "with such data is to do feature-wise normalization: for each feature in the input data (a column in the input data matrix), we \n", - "will subtract the mean of the feature and divide by the standard deviation, so that the feature will be centered around 0 and will have a \n", - "unit standard deviation. This is easily done in Numpy:" - ] - }, - { - "cell_type": "code", - "execution_count": 6, - "metadata": {}, - "outputs": [], - "source": [ - "mean = train_data.mean(axis=0)\n", - "train_data -= mean\n", - "std = train_data.std(axis=0)\n", - "train_data /= std\n", - "\n", - "test_data -= mean\n", - "test_data /= std" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Note that the quantities that we use for normalizing the test data have been computed using the training data. We should never use in our \n", - "workflow any quantity computed on the test data, even for something as simple as data normalization." - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "## Building our network\n", - "\n", - "\n", - "Because so few samples are available, we will be using a very small network with two \n", - "hidden layers, each with 64 units. In general, the less training data you have, the worse overfitting will be, and using \n", - "a small network is one way to mitigate overfitting." 
- ] - }, - { - "cell_type": "code", - "execution_count": 7, - "metadata": {}, - "outputs": [], - "source": [ - "from zoo.pipeline.api.keras import models\n", - "from zoo.pipeline.api.keras import layers\n", - "\n", - "def build_model():\n", - " # Because we will need to instantiate\n", - " # the same model multiple times,\n", - " # we use a function to construct it.\n", - " model = models.Sequential()\n", - " model.add(layers.Dense(64, activation='relu',\n", - " input_shape=(train_data.shape[1],)))\n", - " model.add(layers.Dense(64, activation='relu'))\n", - " model.add(layers.Dense(1))\n", - " model.compile(optimizer='rmsprop', loss='mse', metrics=['mae'])\n", - " return model" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Our network ends with a single unit, and no activation (i.e. it will be a linear layer). \n", - "This is a typical setup for scalar regression (i.e. regression where we are trying to predict a single continuous value). \n", - "Applying an activation function would constrain the range that the output can take; for instance if \n", - "we applied a `sigmoid` activation function to our last layer, the network could only learn to predict values between 0 and 1. Here, because \n", - "the last layer is purely linear, the network is free to learn to predict values in any range.\n", - "\n", - "Note that we are compiling the network with the `mse` loss function -- Mean Squared Error, the average of the squared differences between the \n", - "predictions and the targets, a widely used loss function for regression problems.\n", - "\n", - "We are also monitoring a new metric during training: `mae`. This stands for Mean Absolute Error. It is simply the average absolute value of the \n", - "difference between the predictions and the targets. For instance, an MAE of 0.5 on this problem would mean that our predictions are off by \n", - "\\$500 on average." 
- ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "## Validating our approach using K-fold validation\n", - "\n", - "\n", - "To evaluate our network while we keep adjusting its parameters (such as the number of epochs used for training), we could simply split the \n", - "data into a training set and a validation set, as we were doing in our previous examples. However, because we have so few data points, the \n", - "validation set would end up being very small (e.g. about 100 examples). A consequence is that our validation scores may change a lot \n", - "depending on _which_ data points we choose to use for validation and which we choose for training, i.e. the validation scores may have a \n", - "high _variance_ with regard to the validation split. This would prevent us from reliably evaluating our model.\n", - "\n", - "The best practice in such situations is to use K-fold cross-validation. It consists of splitting the available data into K partitions \n", - "(typically K=4 or 5), then instantiating K identical models, and training each one on K-1 partitions while evaluating on the remaining \n", - "partition. The validation score for the model used would then be the average of the K validation scores obtained." 
- ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Then let's start our training:" - ] - }, - { - "cell_type": "code", - "execution_count": 8, - "metadata": {}, - "outputs": [ - { - "name": "stdout", - "output_type": "stream", - "text": [ - "processing fold # 0\n", - "creating: createZooKerasSequential\n", - "creating: createZooKerasDense\n", - "creating: createZooKerasDense\n", - "creating: createZooKerasDense\n", - "creating: createRMSprop\n", - "creating: createZooKerasMeanSquaredError\n", - "creating: createZooKerasMAE\n", - "processing fold # 1\n", - "creating: createZooKerasSequential\n", - "creating: createZooKerasDense\n", - "creating: createZooKerasDense\n", - "creating: createZooKerasDense\n", - "creating: createRMSprop\n", - "creating: createZooKerasMeanSquaredError\n", - "creating: createZooKerasMAE\n", - "processing fold # 2\n", - "creating: createZooKerasSequential\n", - "creating: createZooKerasDense\n", - "creating: createZooKerasDense\n", - "creating: createZooKerasDense\n", - "creating: createRMSprop\n", - "creating: createZooKerasMeanSquaredError\n", - "creating: createZooKerasMAE\n", - "processing fold # 3\n", - "creating: createZooKerasSequential\n", - "creating: createZooKerasDense\n", - "creating: createZooKerasDense\n", - "creating: createZooKerasDense\n", - "creating: createRMSprop\n", - "creating: createZooKerasMeanSquaredError\n", - "creating: createZooKerasMAE\n" - ] - } - ], - "source": [ - "import numpy as np\n", - "\n", - "k = 4\n", - "num_val_samples = len(train_data) // k\n", - "num_nb_epoch = 50\n", - "all_scores = []\n", - "for i in range(k):\n", - " print('processing fold #', i)\n", - " # Prepare the validation data: data from partition # k\n", - " val_data = train_data[i * num_val_samples: (i + 1) * num_val_samples]\n", - " val_targets = train_targets[i * num_val_samples: (i + 1) * num_val_samples]\n", - "\n", - " # Prepare the training data: data from all other partitions\n", - " partial_train_data = 
np.concatenate(\n", - " [train_data[:i * num_val_samples],\n", - " train_data[(i + 1) * num_val_samples:]],\n", - " axis=0)\n", - " partial_train_targets = np.concatenate(\n", - " [train_targets[:i * num_val_samples],\n", - " train_targets[(i + 1) * num_val_samples:]],\n", - " axis=0)\n", - "\n", - " # Build the model (already compiled)\n", - " model = build_model()\n", - " # Train the model (in silent mode, verbose=0)\n", - " #model.fit(partial_train_data, partial_train_targets,\n", - " # nb_epoch=num_nb_epoch, batch_size=1, verbose=0)\n", - " model.fit(partial_train_data, partial_train_targets,\n", - " nb_epoch=num_nb_epoch, batch_size=16)\n", - "\n", - " # Evaluate the model on the validation data\n", - " #val_mse, val_mae = model.evaluate(val_data, val_targets, verbose=0)\n", - " val_mse, val_mae = model.evaluate(val_data, val_targets)\n", - " all_scores.append(val_mae)" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "_INFO - Trained 16 records in 0.011235845 seconds. Throughput is 1424.0139 records/second. Loss is 8.708786._\n", - "\n", - "_INFO - Trained 16 records in 0.009535034 seconds. Throughput is 1678.0223 records/second. Loss is 5.3613434._\n", - "\n", - "_INFO - Trained 16 records in 0.008636178 seconds. Throughput is 1852.6713 records/second. Loss is 18.106756._\n", - "\n", - "_INFO - Trained 16 records in 0.009207628 seconds. Throughput is 1737.6897 records/second. 
Loss is 7.0931993._" - ] - }, - { - "cell_type": "code", - "execution_count": 9, - "metadata": {}, - "outputs": [ - { - "data": { - "text/plain": [ - "[3.291872501373291, 2.496018171310425, 2.221175193786621, 2.6994853019714355]" - ] - }, - "execution_count": 9, - "metadata": {}, - "output_type": "execute_result" - } - ], - "source": [ - "all_scores" - ] - }, - { - "cell_type": "code", - "execution_count": 10, - "metadata": {}, - "outputs": [ - { - "data": { - "text/plain": [ - "2.677137792110443" - ] - }, - "execution_count": 10, - "metadata": {}, - "output_type": "execute_result" - } - ], - "source": [ - "np.mean(all_scores)" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "As you can see, the different runs do indeed show rather different validation scores, from 2.2 to 3.3. Their average (2.7) is a much more \n", - "reliable metric than any single one of these scores -- that's the entire point of K-fold cross-validation. In this case, we are off by about \\\\$2,700 on \n", - "average, which is still significant considering that the prices range from \\\\$10,000 to \\\\$50,000. \n", - "\n", - "Let's try training the network for a bit longer: 500 epochs.
To keep a record of how well the model did at each epoch, we will modify our training loop \n", - "to save the per-epoch validation score log:" - ] - }, - { - "cell_type": "code", - "execution_count": 13, - "metadata": {}, - "outputs": [ - { - "name": "stdout", - "output_type": "stream", - "text": [ - "processing fold # 0\n", - "creating: createZooKerasSequential\n", - "creating: createZooKerasDense\n", - "creating: createZooKerasDense\n", - "creating: createZooKerasDense\n", - "creating: createRMSprop\n", - "creating: createZooKerasMeanSquaredError\n", - "creating: createZooKerasMAE\n", - "processing fold # 1\n", - "creating: createZooKerasSequential\n", - "creating: createZooKerasDense\n", - "creating: createZooKerasDense\n", - "creating: createZooKerasDense\n", - "creating: createRMSprop\n", - "creating: createZooKerasMeanSquaredError\n", - "creating: createZooKerasMAE\n", - "processing fold # 2\n", - "creating: createZooKerasSequential\n", - "creating: createZooKerasDense\n", - "creating: createZooKerasDense\n", - "creating: createZooKerasDense\n", - "creating: createRMSprop\n", - "creating: createZooKerasMeanSquaredError\n", - "creating: createZooKerasMAE\n", - "processing fold # 3\n", - "creating: createZooKerasSequential\n", - "creating: createZooKerasDense\n", - "creating: createZooKerasDense\n", - "creating: createZooKerasDense\n", - "creating: createRMSprop\n", - "creating: createZooKerasMeanSquaredError\n", - "creating: createZooKerasMAE\n" - ] - } - ], - "source": [ - "num_epochs = 500\n", - "all_mae_histories = []\n", - "for i in range(k):\n", - " print('processing fold #', i)\n", - " # Prepare the validation data: data from partition # k\n", - " val_data = train_data[i * num_val_samples: (i + 1) * num_val_samples]\n", - " val_targets = train_targets[i * num_val_samples: (i + 1) * num_val_samples]\n", - "\n", - " # Prepare the training data: data from all other partitions\n", - " partial_train_data = np.concatenate(\n", - " [train_data[:i * 
num_val_samples],\n", - " train_data[(i + 1) * num_val_samples:]],\n", - " axis=0)\n", - " partial_train_targets = np.concatenate(\n", - " [train_targets[:i * num_val_samples],\n", - " train_targets[(i + 1) * num_val_samples:]],\n", - " axis=0)\n", - "\n", - " # Build the model (already compiled)\n", - " model = build_model()\n", - " # Train the model (in silent mode, verbose=0)\n", - " import time\n", - " dir_name = '3-7 ' + str(time.ctime())\n", - " model.set_tensorboard('./', dir_name)\n", - " history = model.fit(partial_train_data, partial_train_targets,\n", - " validation_data=(val_data, val_targets),\n", - " nb_epoch=num_epochs, batch_size=16)\n", - " \n", - " #mae_history = history.history['val_mean_absolute_error']\n", - " mae_history = model.get_validation_summary(\"Loss\")\n", - " all_mae_histories.append(mae_history)" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "We can then compute the average of the per-epoch MAE scores for all folds:" - ] - }, - { - "cell_type": "code", - "execution_count": 47, - "metadata": {}, - "outputs": [ - { - "data": { - "text/plain": [ - "array([[[1.90000000e+01, 4.05375427e+02, 1.55307042e+09],\n", - " [3.80000000e+01, 2.64351837e+02, 1.55307042e+09],\n", - " [5.70000000e+01, 1.50977859e+02, 1.55307042e+09],\n", - " ...,\n", - " [9.46200000e+03, 2.07635689e+01, 1.55307053e+09],\n", - " [9.48100000e+03, 2.02473850e+01, 1.55307053e+09],\n", - " [9.50000000e+03, 2.02105141e+01, 1.55307053e+09]],\n", - "\n", - " [[1.90000000e+01, 4.76980957e+02, 1.55307053e+09],\n", - " [3.80000000e+01, 3.29584198e+02, 1.55307053e+09],\n", - " [5.70000000e+01, 1.80655548e+02, 1.55307053e+09],\n", - " ...,\n", - " [9.46200000e+03, 1.73588219e+01, 1.55307064e+09],\n", - " [9.48100000e+03, 1.78555279e+01, 1.55307064e+09],\n", - " [9.50000000e+03, 1.73744106e+01, 1.55307064e+09]],\n", - "\n", - " [[1.90000000e+01, 4.62182434e+02, 1.55307064e+09],\n", - " [3.80000000e+01, 3.34037567e+02, 1.55307064e+09],\n", - " 
[5.70000000e+01, 2.06141006e+02, 1.55307064e+09],\n", - " ...,\n", - " [9.46200000e+03, 1.72124062e+01, 1.55307075e+09],\n", - " [9.48100000e+03, 1.75751667e+01, 1.55307075e+09],\n", - " [9.50000000e+03, 1.74055386e+01, 1.55307075e+09]],\n", - "\n", - " [[1.90000000e+01, 5.21177673e+02, 1.55307075e+09],\n", - " [3.80000000e+01, 3.99685974e+02, 1.55307075e+09],\n", - " [5.70000000e+01, 2.67611786e+02, 1.55307075e+09],\n", - " ...,\n", - " [9.46200000e+03, 1.75390892e+01, 1.55307085e+09],\n", - " [9.48100000e+03, 1.76337471e+01, 1.55307085e+09],\n", - " [9.50000000e+03, 1.91227703e+01, 1.55307085e+09]]])" - ] - }, - "execution_count": 47, - "metadata": {}, - "output_type": "execute_result" - } - ], - "source": [ - "all_mae_histories = np.array(all_mae_histories)\n", - "all_mae_histories" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "As you can see, `all_mae_histories` is a 3-d array whose last dimension holds 3-element tuples. It is built from four 2-d arrays (one per fold), and the first elements of corresponding rows are equal across all four. The first element of each tuple is the training step and the third is the timestamp. You do not need to worry about them; let's just calculate the average over the first axis of this 3-d array. We only want the second element of each tuple, which stands for the MAE result.
" - ] - }, - { - "cell_type": "code", - "execution_count": 48, - "metadata": {}, - "outputs": [ - { - "data": { - "text/plain": [ - "array([[1.90000000e+01, 4.66429123e+02, 1.55307058e+09],\n", - " [3.80000000e+01, 3.31914894e+02, 1.55307058e+09],\n", - " [5.70000000e+01, 2.01346550e+02, 1.55307058e+09],\n", - " ...,\n", - " [9.46200000e+03, 1.82184715e+01, 1.55307069e+09],\n", - " [9.48100000e+03, 1.83279567e+01, 1.55307069e+09],\n", - " [9.50000000e+03, 1.85283084e+01, 1.55307069e+09]])" - ] - }, - "execution_count": 48, - "metadata": {}, - "output_type": "execute_result" - } - ], - "source": [ - "average_mae_history = np.mean(all_mae_histories, axis=0)\n", - "average_mae_history" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "As you can see, this operation does not mess up the first elements since they are all equal through the first axis. And we do not need to care about the third element because it is useless at this time.\n", - "\n", - "Let's plot this:" - ] - }, - { - "cell_type": "code", - "execution_count": 49, - "metadata": {}, - "outputs": [ - { - "data": { - "image/png": "\n", - "text/plain": [ - "
" - ] - }, - "metadata": { - "needs_background": "light" - }, - "output_type": "display_data" - } - ], - "source": [ - "import matplotlib.pyplot as plt\n", - "plt.plot(average_mae_history[:,0],average_mae_history[:,1])\n", - "plt.xlabel('Steps')\n", - "plt.ylabel('Validation MAE')\n", - "plt.ylim((14, 20))\n", - "plt.show()" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Let's plot this:" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "According to this plot, it seems that validation MAE stops improving significantly after 150 epochs. Past that point, we start overfitting.\n", - "\n", - "Once we are done tuning other parameters of our model (besides the number of epochs, we could also adjust the size of the hidden layers), we \n", - "can train a final \"production\" model on all of the training data, with the best parameters, then look at its performance on the test data:" - ] - }, - { - "cell_type": "code", - "execution_count": 50, - "metadata": {}, - "outputs": [ - { - "name": "stdout", - "output_type": "stream", - "text": [ - "creating: createZooKerasSequential\n", - "creating: createZooKerasDense\n", - "creating: createZooKerasDense\n", - "creating: createZooKerasDense\n", - "creating: createRMSprop\n", - "creating: createZooKerasMeanSquaredError\n", - "creating: createZooKerasMAE\n" - ] - } - ], - "source": [ - "# Get a fresh, compiled model.\n", - "model = build_model()\n", - "# Train it on the entirety of the data.\n", - "model.fit(train_data, train_targets,\n", - " nb_epoch=150, batch_size=16)\n", - "test_mse_score, test_mae_score = model.evaluate(test_data, test_targets)" - ] - }, - { - "cell_type": "code", - "execution_count": 51, - "metadata": {}, - "outputs": [ - { - "data": { - "text/plain": [ - "1.7991065979003906" - ] - }, - "execution_count": 51, - "metadata": {}, - "output_type": "execute_result" - } - ], - "source": [ - "test_mae_score" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, 
- "source": [ - "We are still off by about \\$1,800." - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "## Wrapping up\n", - "\n", - "\n", - "Here's what you should take away from this example:\n", - "\n", - "* Regression is done using different loss functions from classification; Mean Squared Error (MSE) is a commonly used loss function for \n", - "regression.\n", - "* Similarly, evaluation metrics to be used for regression differ from those used for classification; naturally the concept of \"accuracy\" \n", - "does not apply for regression. A common regression metric is Mean Absolute Error (MAE).\n", - "* When features in the input data have values in different ranges, each feature should be scaled independently as a preprocessing step.\n", - "* When there is little data available, using K-Fold validation is a great way to reliably evaluate a model.\n", - "* When little training data is available, it is preferable to use a small network with very few hidden layers (typically only one or two), \n", - "in order to avoid severe overfitting.\n", - "\n", - "This example concludes our series of three introductory practical examples. You are now able to handle common types of problems with vector data input:\n", - "\n", - "* Binary (2-class) classification.\n", - "* Multi-class, single-label classification.\n", - "* Scalar regression.\n", - "\n", - "In the next chapter, you will acquire a more formal understanding of some of the concepts you have encountered in these first examples, \n", - "such as data preprocessing, model evaluation, and overfitting." 
- ] - } - ], - "metadata": { - "kernelspec": { - "display_name": "Python 3", - "language": "python", - "name": "python3" - }, - "language_info": { - "codemirror_mode": { - "name": "ipython", - "version": 3 - }, - "file_extension": ".py", - "mimetype": "text/x-python", - "name": "python", - "nbconvert_exporter": "python", - "pygments_lexer": "ipython3", - "version": "3.5.2" - } - }, - "nbformat": 4, - "nbformat_minor": 2 -} From fb2f9ecfc8fa7ce6f800b6d8ee3493707a3b1cd2 Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Thu, 21 Mar 2019 16:09:14 +0800 Subject: [PATCH 33/46] Add files via upload --- keras/4.4-overfitting-and-underfitting.ipynb | 704 +++++++++++++++++++ 1 file changed, 704 insertions(+) create mode 100644 keras/4.4-overfitting-and-underfitting.ipynb diff --git a/keras/4.4-overfitting-and-underfitting.ipynb b/keras/4.4-overfitting-and-underfitting.ipynb new file mode 100644 index 0000000..d28bebd --- /dev/null +++ b/keras/4.4-overfitting-and-underfitting.ipynb @@ -0,0 +1,704 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "First of all, set environment variables and initialize spark context:" + ] + }, + { + "cell_type": "code", + "execution_count": 1, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "env: SPARK_DRIVER_MEMORY=32g\n", + "env: PYSPARK_PYTHON=/usr/bin/python3.5\n", + "env: PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n" + ] + } + ], + "source": [ + "%env SPARK_DRIVER_MEMORY=32g\n", + "%env PYSPARK_PYTHON=/usr/bin/python3.5\n", + "%env PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n", + "\n", + "from zoo.common.nncontext import *\n", + "sc = init_nncontext(init_spark_conf().setMaster(\"local[4]\"))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "# Overfitting and underfitting\n", + "\n", + "\n", + "----\n", + "\n", + "\n", + "In all the examples we saw in the previous chapter -- movie review sentiment prediction, topic classification, and 
house price regression -- \n", + "we could notice that the performance of our model on the held-out validation data would always peak after a few epochs and would then start \n", + "degrading, i.e. our model would quickly start to _overfit_ to the training data. Overfitting happens in every single machine learning \n", + "problem. Learning how to deal with overfitting is essential to mastering machine learning.\n", + "\n", + "The fundamental issue in machine learning is the tension between optimization and generalization. \"Optimization\" refers to the process of \n", + "adjusting a model to get the best performance possible on the training data (the \"learning\" in \"machine learning\"), while \"generalization\" \n", + "refers to how well the trained model would perform on data it has never seen before. The goal of the game is to get good generalization, of \n", + "course, but you do not control generalization; you can only adjust the model based on its training data.\n", + "\n", + "At the beginning of training, optimization and generalization are correlated: the lower your loss on training data, the lower your loss on \n", + "test data. While this is happening, your model is said to be _under-fit_: there is still progress to be made; the network hasn't yet \n", + "modeled all relevant patterns in the training data. But after a certain number of iterations on the training data, generalization stops \n", + "improving, validation metrics stall then start degrading: the model is then starting to over-fit, i.e. it is starting to learn patterns \n", + "that are specific to the training data but that are misleading or irrelevant when it comes to new data.\n", + "\n", + "To prevent a model from learning misleading or irrelevant patterns found in the training data, _the best solution is of course to get \n", + "more training data_. A model trained on more data will naturally generalize better.
When that is no longer possible, the next best solution \n", + "is to modulate the quantity of information that your model is allowed to store, or to add constraints on what information it is allowed to \n", + "store. If a network can only afford to memorize a small number of patterns, the optimization process will force it to focus on the most \n", + "prominent patterns, which have a better chance of generalizing well.\n", + "\n", + "The process of fighting overfitting in this way is called _regularization_. Let's review some of the most common regularization \n", + "techniques, and let's apply them in practice to improve our movie classification model from the previous chapter." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Note: in this notebook we will be using the IMDB test set as our validation set. It doesn't matter in this context.\n", + "\n", + "Let's prepare the data using the code from Chapter 3, Section 5:" + ] + }, + { + "cell_type": "code", + "execution_count": 2, + "metadata": {}, + "outputs": [], + "source": [ + "from zoo.pipeline.api.keras.datasets import imdb\n", + "import numpy as np\n", + "(train_data, train_labels), (test_data, test_labels) = imdb.load_data(nb_words=10000)\n", + "\n", + "def vectorize_sequences(sequences, dimension=10000):\n", + " # Create an all-zero matrix of shape (len(sequences), dimension)\n", + " results = np.zeros((len(sequences), dimension))\n", + " for i, sequence in enumerate(sequences):\n", + " results[i, sequence] = 1.
# set specific indices of results[i] to 1s\n", + " return results\n", + "\n", + "x_train = vectorize_sequences(train_data)\n", + "x_test = vectorize_sequences(test_data)\n", + "\n", + "y_train = np.asarray(train_labels).astype('float32')\n", + "y_test = np.asarray(test_labels).astype('float32')" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "# Fighting overfitting\n", + "\n", + "## Reducing the network's size\n", + "\n", + "\n", + "The simplest way to prevent overfitting is to reduce the size of the model, i.e. the number of learnable parameters in the model (which is \n", + "determined by the number of layers and the number of units per layer). In deep learning, the number of learnable parameters in a model is \n", + "often referred to as the model's \"capacity\". Intuitively, a model with more parameters will have more \"memorization capacity\" and therefore \n", + "will be able to easily learn a perfect dictionary-like mapping between training samples and their targets, a mapping without any \n", + "generalization power. For instance, a model with 500,000 binary parameters could easily be made to learn the class of every digit in the \n", + "MNIST training set: we would only need 10 binary parameters for each of the 50,000 digits. Such a model would be useless for classifying \n", + "new digit samples. Always keep this in mind: deep learning models tend to be good at fitting to the training data, but the real challenge \n", + "is generalization, not fitting.\n", + "\n", + "On the other hand, if the network has limited memorization resources, it will not be able to learn this mapping as easily, and thus, in \n", + "order to minimize its loss, it will have to resort to learning compressed representations that have predictive power regarding the targets \n", + "-- precisely the type of representations that we are interested in.
At the same time, keep in mind that you should be using models that have \n", + "enough parameters that they won't be underfitting: your model shouldn't be starved for memorization resources. There is a compromise to be \n", + "found between \"too much capacity\" and \"not enough capacity\".\n", + "\n", + "Unfortunately, there is no magical formula to determine what the right number of layers is, or what the right size for each layer is. You \n", + "will have to evaluate an array of different architectures (on your validation set, not on your test set, of course) in order to find the \n", + "right model size for your data. The general workflow to find an appropriate model size is to start with relatively few layers and \n", + "parameters, and start increasing the size of the layers or adding new layers until you see diminishing returns with regard to the \n", + "validation loss.\n", + "\n", + "Let's try this on our movie review classification network. Our original network was as such:" + ] + }, + { + "cell_type": "code", + "execution_count": 3, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "creating: createZooKerasSequential\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createRMSprop\n", + "creating: createZooKerasBinaryCrossEntropy\n", + "creating: createZooKerasBinaryAccuracy\n" + ] + } + ], + "source": [ + "from zoo.pipeline.api.keras import models\n", + "from zoo.pipeline.api.keras import layers\n", + "\n", + "original_model = models.Sequential()\n", + "original_model.add(layers.Dense(16, activation='relu', input_shape=(10000,)))\n", + "original_model.add(layers.Dense(16, activation='relu'))\n", + "original_model.add(layers.Dense(1, activation='sigmoid'))\n", + "\n", + "original_model.compile(optimizer='rmsprop',\n", + " loss='binary_crossentropy',\n", + " metrics=['acc'])\n", + "\n", + "import time\n", + "dir_name = '4-4 ' + 
str(time.ctime())\n", + "original_model.set_tensorboard('./', dir_name)\n", + "original_model.fit(x_train, y_train,\n", + " nb_epoch=20,\n", + " batch_size=512,\n", + " validation_data=(x_test, y_test))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "_INFO - Trained 512 records in 0.024455326 seconds. Throughput is 20936.135 records/second. Loss is 0.01585226.\n", + "Top1Accuracy is Accuracy(correct: 21341, count: 25000, accuracy: 0.85364)_" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Now let's try to replace it with this smaller network:" + ] + }, + { + "cell_type": "code", + "execution_count": 6, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "creating: createZooKerasSequential\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createRMSprop\n", + "creating: createZooKerasBinaryCrossEntropy\n", + "creating: createZooKerasBinaryAccuracy\n" + ] + } + ], + "source": [ + "smaller_model = models.Sequential()\n", + "smaller_model.add(layers.Dense(4, activation='relu', input_shape=(10000,)))\n", + "smaller_model.add(layers.Dense(4, activation='relu'))\n", + "smaller_model.add(layers.Dense(1, activation='sigmoid'))\n", + "\n", + "smaller_model.compile(optimizer='rmsprop',\n", + " loss='binary_crossentropy',\n", + " metrics=['acc'])\n", + "\n", + "dir_name = '4-4 ' + str(time.ctime())\n", + "smaller_model.set_tensorboard('./', dir_name)\n", + "smaller_model.fit(x_train, y_train,\n", + " nb_epoch=20,\n", + " batch_size=512,\n", + " validation_data=(x_test, y_test))" + ] + }, + { + "cell_type": "code", + "execution_count": 4, + "metadata": {}, + "outputs": [], + "source": [ + "import matplotlib.pyplot as plt\n", + "original_val_loss = np.array(original_model.get_validation_summary(\"Loss\"))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "As you can see, the 
smaller network starts overfitting later than the reference one (after 6 epochs rather than 4) and its performance \n", + "degrades much more slowly once it starts overfitting.\n", + "\n", + "Now, for kicks, let's add to this benchmark a network that has much more capacity, far more than the problem would warrant:" + ] + }, + { + "cell_type": "code", + "execution_count": 8, + "metadata": {}, + "outputs": [ + { + "data": { + "image/png": "\n", + "text/plain": [ + "
" + ] + }, + "metadata": { + "needs_background": "light" + }, + "output_type": "display_data" + } + ], + "source": [ + "smaller_val_loss = np.array(smaller_model.get_validation_summary(\"Loss\"))\n", + "\n", + "plt.plot(original_val_loss[:,0], original_val_loss[:,1], label='original model')\n", + "plt.plot(smaller_val_loss[:,0], smaller_val_loss[:,1],label='smaller model',color='green')\n", + "plt.xlabel('Steps')\n", + "plt.ylabel('Loss')\n", + "plt.legend()\n", + "plt.show()" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "As you can see, the smaller network starts overfitting later than the reference one (the original one starts overfitting at about 150 to 200 steps, which is about 3 or 4 epochs) and its performance \n", + "degrades much more slowly once it starts overfitting.\n", + "\n", + "Now, for kicks, let's add to this benchmark a network that has much more capacity, far more than the problem would warrant:" + ] + }, + { + "cell_type": "code", + "execution_count": 9, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "creating: createZooKerasSequential\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createRMSprop\n", + "creating: createZooKerasBinaryCrossEntropy\n", + "creating: createZooKerasBinaryAccuracy\n" + ] + } + ], + "source": [ + "bigger_model = models.Sequential()\n", + "bigger_model.add(layers.Dense(512, activation='relu', input_shape=(10000,)))\n", + "bigger_model.add(layers.Dense(512, activation='relu'))\n", + "bigger_model.add(layers.Dense(1, activation='sigmoid'))\n", + "\n", + "bigger_model.compile(optimizer='rmsprop',\n", + " loss='binary_crossentropy',\n", + " metrics=['acc'])" + ] + }, + { + "cell_type": "code", + "execution_count": 10, + "metadata": {}, + "outputs": [], + "source": [ + "dir_name = '4-4 ' + str(time.ctime())\n", + "bigger_model.set_tensorboard('./', dir_name)\n", + 
"bigger_model.fit(x_train, y_train,\n", + " nb_epoch=20,\n", + " batch_size=512,\n", + " validation_data=(x_test, y_test))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Here's how the bigger network fares compared to the reference one. The dots are the validation loss values of the bigger network, and the \n", + "crosses are the initial network." + ] + }, + { + "cell_type": "code", + "execution_count": 11, + "metadata": {}, + "outputs": [ + { + "data": { + "image/png": "\n", + "text/plain": [ + "
" + ] + }, + "metadata": { + "needs_background": "light" + }, + "output_type": "display_data" + } + ], + "source": [ + "bigger_val_loss = np.array(bigger_model.get_validation_summary(\"Loss\"))\n", + "\n", + "plt.plot(original_val_loss[:,0], original_val_loss[:,1], label='original model')\n", + "plt.plot(bigger_val_loss[:,0], bigger_val_loss[:,1],label='bigger model',color='green')\n", + "plt.xlabel('Steps')\n", + "plt.ylabel('Loss')\n", + "plt.legend()\n", + "plt.show()" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "The bigger network starts overfitting almost right away, after just one epoch, and overfits much more severely. Its validation loss is also \n", + "more noisy.\n", + "\n", + "Meanwhile, here are the training losses for our two networks:" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "As you can see, the bigger network gets its training loss near zero very quickly. The more capacity the network has, the quicker it will be \n", + "able to model the training data (resulting in a low training loss), but the more susceptible it is to overfitting (resulting in a large \n", + "difference between the training and validation loss)." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## Adding weight regularization\n", + "\n", + "\n", + "You may be familiar with _Occam's Razor_ principle: given two explanations for something, the explanation most likely to be correct is the \n", + "\"simplest\" one, the one that makes the least amount of assumptions. 
This also applies to the models learned by neural networks: given some \n", + "training data and a network architecture, there are multiple sets of weights values (multiple _models_) that could explain the data, and \n", + "simpler models are less likely to overfit than complex ones.\n", + "\n", + "A \"simple model\" in this context is a model where the distribution of parameter values has less entropy (or a model with fewer \n", + "parameters altogether, as we saw in the section above). Thus a common way to mitigate overfitting is to put constraints on the complexity \n", + "of a network by forcing its weights to only take small values, which makes the distribution of weight values more \"regular\". This is called \n", + "\"weight regularization\", and it is done by adding to the loss function of the network a _cost_ associated with having large weights. This \n", + "cost comes in two flavors:\n", + "\n", + "* L1 regularization, where the cost added is proportional to the _absolute value of the weights coefficients_ (i.e. to what is called the \n", + "\"L1 norm\" of the weights).\n", + "* L2 regularization, where the cost added is proportional to the _square of the value of the weights coefficients_ (i.e. to what is called \n", + "the \"L2 norm\" of the weights). L2 regularization is also called _weight decay_ in the context of neural networks. Don't let the different \n", + "name confuse you: weight decay is mathematically the exact same as L2 regularization.\n", + "\n", + "In Analytics-zoo Keras API, weight regularization is added by passing _weight regularizer instances_ to layers as keyword arguments. 
Let's add L2 weight \n", + "regularization to our movie review classification network:" + ] + }, + { + "cell_type": "code", + "execution_count": 12, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "creating: createZooKerasSequential\n", + "creating: createL2Regularizer\n", + "creating: createZooKerasDense\n", + "creating: createL2Regularizer\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n", + "creating: createRMSprop\n", + "creating: createZooKerasBinaryCrossEntropy\n", + "creating: createZooKerasBinaryAccuracy\n" + ] + } + ], + "source": [ + "from zoo.pipeline.api.keras import regularizers\n", + "\n", + "l2_model = models.Sequential()\n", + "l2_model.add(layers.Dense(16, W_regularizer=regularizers.l2(0.001),\n", + " activation='relu', input_shape=(10000,)))\n", + "l2_model.add(layers.Dense(16, W_regularizer=regularizers.l2(0.001),\n", + " activation='relu'))\n", + "l2_model.add(layers.Dense(1, activation='sigmoid'))\n", + "\n", + "l2_model.compile(optimizer='rmsprop',\n", + " loss='binary_crossentropy',\n", + " metrics=['acc'])" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "`l2(0.001)` means that every coefficient in the weight matrix of the layer will add `0.001 * weight_coefficient_value ** 2` to the total loss of \n", + "the network. 
Note that because this penalty is _only added at training time_, the loss for this network will be much higher at training \n", + "than at test time.\n", + "\n", + "Here's the impact of our L2 regularization penalty:" + ] + }, + { + "cell_type": "code", + "execution_count": 13, + "metadata": {}, + "outputs": [], + "source": [ + "dir_name = '4-4 ' + str(time.ctime())\n", + "l2_model.set_tensorboard('./', dir_name)\n", + "l2_model.fit(x_train, y_train,\n", + " nb_epoch=20,\n", + " batch_size=512,\n", + " validation_data=(x_test, y_test))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "_INFO - Trained 512 records in 0.024366594 seconds. Throughput is 21012.373 records/second. Loss is 0.13651785.\n", + "Top1Accuracy is Accuracy(correct: 21684, count: 25000, accuracy: 0.86736)_" + ] + }, + { + "cell_type": "code", + "execution_count": 14, + "metadata": {}, + "outputs": [ + { + "data": { + "image/png": "\n", + "text/plain": [ + "
" + ] + }, + "metadata": { + "needs_background": "light" + }, + "output_type": "display_data" + } + ], + "source": [ + "l2_val_loss = np.array(l2_model.get_validation_summary(\"Loss\"))\n", + "\n", + "plt.plot(original_val_loss[:,0], original_val_loss[:,1], label='original model')\n", + "plt.plot(l2_val_loss[:,0], l2_val_loss[:,1],label='L2-regularized model',color='green')\n", + "plt.xlabel('Steps')\n", + "plt.ylabel('Loss')\n", + "plt.legend()\n", + "plt.show()" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "As you can see, the model with L2 regularization (dots) has become much more resistant to overfitting than the reference model (crosses), \n", + "even though both models have the same number of parameters.\n", + "\n", + "As alternatives to L2 regularization, you could use one of the following Analytics-zoo Keras API weight regularizers: " + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "from zoo.pipeline.api.keras import regularizers\n", + "\n", + "# L1 regularization\n", + "regularizers.l1(0.001)\n", + "\n", + "# L1 and L2 regularization at the same time\n", + "regularizers.l1_l2(l1=0.001, l2=0.001)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## Adding dropout\n", + "\n", + "\n", + "Dropout is one of the most effective and most commonly used regularization techniques for neural networks, developed by Hinton and his \n", + "students at the University of Toronto. Dropout, applied to a layer, consists of randomly \"dropping out\" (i.e. setting to zero) a number of \n", + "output features of the layer during training. Let's say a given layer would normally have returned a vector `[0.2, 0.5, 1.3, 0.8, 1.1]` for a \n", + "given input sample during training; after applying dropout, this vector will have a few zero entries distributed at random, e.g. `[0, 0.5, \n", + "1.3, 0, 1.1]`. 
The \"dropout rate\" is the fraction of the features that are zeroed out; it is usually set between 0.2 and 0.5. At test \n", + "time, no units are dropped out, and instead the layer's output values are scaled down by a factor equal to the fraction of units kept (one minus the dropout rate), so as to \n", + "balance for the fact that more units are active than at training time.\n", + "\n", + "Consider a Numpy matrix containing the output of a layer, `layer_output`, of shape `(batch_size, features)`. At training time, we would \n", + "zero out at random a fraction of the values in the matrix." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "This technique may seem strange and arbitrary. Why would this help reduce overfitting? Geoff Hinton has said that he was inspired, among \n", + "other things, by a fraud prevention mechanism used by banks -- in his own words: _\"I went to my bank. The tellers kept changing and I asked \n", + "one of them why. He said he didn’t know but they got moved around a lot. I figured it must be because it would require cooperation \n", + "between employees to successfully defraud the bank. This made me realize that randomly removing a different subset of neurons on each \n", + "example would prevent conspiracies and thus reduce overfitting\"_.\n", + "\n", + "The core idea is that introducing noise in the output values of a layer can break up happenstance patterns that are not significant (what \n", + "Hinton refers to as \"conspiracies\"), which the network would start memorizing if no noise was present. 
\n", + "\n", + "In Analytics-zoo Keras API you can introduce dropout in a network via the `Dropout` layer, which gets applied to the output of layer right before it, e.g.:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "model.add(layers.Dropout(0.5))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Let's add two `Dropout` layers in our IMDB network to see how well they do at reducing overfitting:" + ] + }, + { + "cell_type": "code", + "execution_count": 5, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "creating: createZooKerasSequential\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDropout\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDropout\n", + "creating: createZooKerasDense\n", + "creating: createRMSprop\n", + "creating: createZooKerasBinaryCrossEntropy\n", + "creating: createZooKerasBinaryAccuracy\n" + ] + } + ], + "source": [ + "dpt_model = models.Sequential()\n", + "dpt_model.add(layers.Dense(16, activation='relu', input_shape=(10000,)))\n", + "dpt_model.add(layers.Dropout(0.5))\n", + "dpt_model.add(layers.Dense(16, activation='relu'))\n", + "dpt_model.add(layers.Dropout(0.5))\n", + "dpt_model.add(layers.Dense(1, activation='sigmoid'))\n", + "\n", + "dpt_model.compile(optimizer='rmsprop',\n", + " loss='binary_crossentropy',\n", + " metrics=['acc'])\n", + "\n", + "dir_name = '4-4 ' + str(time.ctime())\n", + "dpt_model.set_tensorboard('./', dir_name)\n", + "dpt_model.fit(x_train, y_train,\n", + " nb_epoch=20,\n", + " batch_size=512,\n", + " validation_data=(x_test, y_test))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "_INFO - Trained 512 records in 0.017992654 seconds. Throughput is 28456.057 records/second. Loss is 0.112769656. 
\n", + "Top1Accuracy is Accuracy(correct: 21871, count: 25000, accuracy: 0.87484)_" + ] + }, + { + "cell_type": "code", + "execution_count": 7, + "metadata": {}, + "outputs": [ + { + "data": { + "image/png": "\n", + "text/plain": [ + "
" + ] + }, + "metadata": { + "needs_background": "light" + }, + "output_type": "display_data" + } + ], + "source": [ + "dpt_val_loss = np.array(dpt_model.get_validation_summary(\"Loss\"))\n", + "\n", + "plt.plot(original_val_loss[:,0], original_val_loss[:,1], label='original model')\n", + "plt.plot(dpt_val_loss[:,0], dpt_val_loss[:,1],label='Dropout-regularized model',color='green')\n", + "plt.xlabel('Steps')\n", + "plt.ylabel('Loss')\n", + "plt.legend()\n", + "plt.show()" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Again, a clear improvement over the reference network.\n", + "\n", + "To recap: here the most common ways to prevent overfitting in neural networks:\n", + "\n", + "* Getting more training data.\n", + "* Reducing the capacity of the network.\n", + "* Adding weight regularization.\n", + "* Adding dropout." + ] + } + ], + "metadata": { + "kernelspec": { + "display_name": "Python 3", + "language": "python", + "name": "python3" + }, + "language_info": { + "codemirror_mode": { + "name": "ipython", + "version": 3 + }, + "file_extension": ".py", + "mimetype": "text/x-python", + "name": "python", + "nbconvert_exporter": "python", + "pygments_lexer": "ipython3", + "version": "3.5.2" + } + }, + "nbformat": 4, + "nbformat_minor": 2 +} From d92cd27cf4afd1239156aefe675ede80de2939f9 Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Thu, 21 Mar 2019 16:36:09 +0800 Subject: [PATCH 34/46] Add files via upload some writing style fix --- keras/4.4-overfitting-and-underfitting.ipynb | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/keras/4.4-overfitting-and-underfitting.ipynb b/keras/4.4-overfitting-and-underfitting.ipynb index d28bebd..c2a801b 100644 --- a/keras/4.4-overfitting-and-underfitting.ipynb +++ b/keras/4.4-overfitting-and-underfitting.ipynb @@ -409,7 +409,7 @@ "the \"L2 norm\" of the weights). L2 regularization is also called _weight decay_ in the context of neural networks. 
Don't let the different \n", "name confuse you: weight decay is mathematically the exact same as L2 regularization.\n", "\n", - "In Analytics-zoo Keras API, weight regularization is added by passing _weight regularizer instances_ to layers as keyword arguments. Let's add L2 weight \n", + "In Keras API of Analytics Zoo, weight regularization is added by passing _weight regularizer instances_ to layers as keyword arguments. Let's add L2 weight \n", "regularization to our movie review classification network:" ] }, @@ -518,7 +518,7 @@ "As you can see, the model with L2 regularization (dots) has become much more resistant to overfitting than the reference model (crosses), \n", "even though both models have the same number of parameters.\n", "\n", - "As alternatives to L2 regularization, you could use one of the following Analytics-zoo Keras API weight regularizers: " + "As alternatives to L2 regularization, you could use one of the following Keras API of Analytics Zoo weight regularizers: " ] }, { @@ -568,7 +568,7 @@ "The core idea is that introducing noise in the output values of a layer can break up happenstance patterns that are not significant (what \n", "Hinton refers to as \"conspiracies\"), which the network would start memorizing if no noise was present. 
\n", "\n", - "In Analytics-zoo Keras API you can introduce dropout in a network via the `Dropout` layer, which gets applied to the output of layer right before it, e.g.:" + "In Keras API of Analytics Zoo you can introduce dropout in a network via the `Dropout` layer, which gets applied to the output of layer right before it, e.g.:" ] }, { From 2b88f7880fee9c6f496947389dcc45f090b07dd7 Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Thu, 21 Mar 2019 16:37:59 +0800 Subject: [PATCH 35/46] Add files via upload --- keras/5.1-introduction-to-convnets.ipynb | 300 +++++++++++++++++++++++ 1 file changed, 300 insertions(+) create mode 100644 keras/5.1-introduction-to-convnets.ipynb diff --git a/keras/5.1-introduction-to-convnets.ipynb b/keras/5.1-introduction-to-convnets.ipynb new file mode 100644 index 0000000..0fbedc1 --- /dev/null +++ b/keras/5.1-introduction-to-convnets.ipynb @@ -0,0 +1,300 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "First of all, set environment variables and initialize spark context:" + ] + }, + { + "cell_type": "code", + "execution_count": 1, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "env: SPARK_DRIVER_MEMORY=8g\n", + "env: PYSPARK_PYTHON=/usr/bin/python3.5\n", + "env: PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n" + ] + } + ], + "source": [ + "%env SPARK_DRIVER_MEMORY=8g\n", + "%env PYSPARK_PYTHON=/usr/bin/python3.5\n", + "%env PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n", + "\n", + "from zoo.common.nncontext import *\n", + "sc = init_nncontext(init_spark_conf().setMaster(\"local[4]\"))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "# 5.1 - Introduction to convnets\n", + "\n", + "\n", + "----\n", + "\n", + "First, let's take a practical look at a very simple convnet example. 
We will use our convnet to classify MNIST digits, a task that you've already been \n", + "through in Chapter 2, using a densely-connected network (our test accuracy then was 97.8%). Even though our convnet will be very basic, its \n", + "accuracy will still blow out of the water that of the densely-connected model from Chapter 2.\n", + "\n", + "The 6 lines of code below show you what a basic convnet looks like. It's a stack of `Conv2D` and `MaxPooling2D` layers. We'll see in a \n", + "minute what they do concretely.\n", + "Importantly, in the Keras API of Analytics Zoo a convnet by default takes as input tensors of shape `(image_channels, image_height, image_width)` (not including the batch dimension). \n", + "In our case, we will configure our convnet to process inputs of size `(1, 28, 28)`, which is the format we will reshape the MNIST images into. We do this by \n", + "passing the argument `input_shape=(1, 28, 28)` to our first layer." + ] + }, + { + "cell_type": "code", + "execution_count": 2, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "creating: createZooKerasSequential\n", + "creating: createZooKerasConvolution2D\n", + "creating: createZooKerasMaxPooling2D\n", + "creating: createZooKerasConvolution2D\n", + "creating: createZooKerasMaxPooling2D\n", + "creating: createZooKerasConvolution2D\n" + ] + } + ], + "source": [ + "from zoo.pipeline.api.keras import layers\n", + "from zoo.pipeline.api.keras import models\n", + "\n", + "model = models.Sequential()\n", + "model.add(layers.Conv2D(32, nb_col=3, nb_row=3, activation='relu', input_shape=(1,28,28)))\n", + "model.add(layers.MaxPooling2D((2, 2)))\n", + "model.add(layers.Conv2D(64, nb_col=3, nb_row=3, activation='relu'))\n", + "model.add(layers.MaxPooling2D((2, 2)))\n", + "model.add(layers.Conv2D(64, nb_col=3, nb_row=3, activation='relu'))\n", + "\n", + "model.summary()" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "_In Keras the model summary appears directly in the cell output; in the Keras API of 
Analytics Zoo, the summary is printed to the console along with the INFO log messages._" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "From the summary you can see that the output of every `Conv2D` and `MaxPooling2D` layer is a 3D tensor of shape `(channels, height, width)`. The width \n", + "and height dimensions tend to shrink as we go deeper in the network. The number of channels is controlled by the first argument passed to \n", + "the `Conv2D` layers (e.g. 32 or 64).\n", + "\n", + "The next step would be to feed our last output tensor (of shape `(64, 3, 3)`) into a densely-connected classifier network like those you are \n", + "already familiar with: a stack of `Dense` layers. These classifiers process vectors, which are 1D, whereas our current output is a 3D tensor. \n", + "So first, we will have to flatten our 3D outputs to 1D, and then add a few `Dense` layers on top:" + ] + }, + { + "cell_type": "code", + "execution_count": 3, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "creating: createZooKerasFlatten\n", + "creating: createZooKerasDense\n", + "creating: createZooKerasDense\n" + ] + }, + { + "data": { + "text/plain": [ + "" + ] + }, + "execution_count": 3, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "model.add(layers.Flatten())\n", + "model.add(layers.Dense(64, activation='relu'))\n", + "model.add(layers.Dense(10, activation='softmax'))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "We are going to do 10-way classification, so we use a final layer with 10 outputs and a softmax activation. 
Now here's what our network \n", + "looks like:" + ] + }, + { + "cell_type": "code", + "execution_count": 4, + "metadata": {}, + "outputs": [], + "source": [ + "model.summary()" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "As you can see, our `(64, 3, 3)` outputs were flattened into vectors of shape `(576,)`, before going through two `Dense` layers.\n", + "\n", + "Now, let's train our convnet on the MNIST digits. We will reuse a lot of the code we have already covered in the MNIST example from Chapter \n", + "2." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "#### CNN input shape\n", + "_Once we get the dataset, we need to reshape the images. In Keras the shape of the dataset is `(sample_size, height, width, channel)`, as in the Keras code below:_\n", + " \n", + " train_images = train_images.reshape((60000, 28, 28, 1))\n", + "_In the Keras API of Analytics Zoo, the default order is Theano-style NCHW `(sample_size, channel, height, width)`, so the data is reshaped as in the code cell that follows this note._\n", + "\n", + "_Alternatively, you can use TensorFlow-style NHWC, the Keras default, by setting `Convolution2D(dim_ordering=\"tf\")`._" 
+ ] + }, + { + "cell_type": "code", + "execution_count": 5, + "metadata": {}, + "outputs": [ + { + "name": "stderr", + "output_type": "stream", + "text": [ + "Using TensorFlow backend.\n" + ] + } + ], + "source": [ + "from keras.datasets import mnist\n", + "(train_images, train_labels), (test_images, test_labels) = mnist.load_data()\n", + "\n", + "train_images = train_images.reshape((60000, 1, 28, 28))\n", + "train_images = train_images.astype('float32') / 255\n", + "\n", + "test_images = test_images.reshape((10000, 1, 28, 28))\n", + "test_images = test_images.astype('float32') / 255" + ] + }, + { + "cell_type": "code", + "execution_count": 6, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "creating: createRMSprop\n", + "creating: createZooKerasSparseCategoricalCrossEntropy\n", + "creating: createZooKerasSparseCategoricalAccuracy\n" + ] + } + ], + "source": [ + "model.compile(optimizer='rmsprop',\n", + " loss='sparse_categorical_crossentropy',\n", + " metrics=['acc'])\n", + "\n", + "model.fit(train_images, train_labels, nb_epoch=5, batch_size=64)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Trained 64 records in 0.03212866 seconds. Throughput is 1991.9911 records/second. Loss is 0.0023578003." 
+ ] + }, + { + "cell_type": "code", + "execution_count": 7, + "metadata": {}, + "outputs": [], + "source": [ + "test_loss, test_acc = model.evaluate(test_images, test_labels)" + ] + }, + { + "cell_type": "code", + "execution_count": 8, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "0.9912999868392944" + ] + }, + "execution_count": 8, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "test_acc" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "While our densely-connected network from Chapter 2 had a test accuracy of 97.8%, our basic convnet has a test accuracy of 99.1%: we \n", + "decreased our error rate by over 50% (relative). Not bad! " + ] + } + ], + "metadata": { + "kernelspec": { + "display_name": "Python 3", + "language": "python", + "name": "python3" + }, + "language_info": { + "codemirror_mode": { + "name": "ipython", + "version": 3 + }, + "file_extension": ".py", + "mimetype": "text/x-python", + "name": "python", + "nbconvert_exporter": "python", + "pygments_lexer": "ipython3", + "version": "3.5.2" + } + }, + "nbformat": 4, + "nbformat_minor": 2 +} From 5dc2650d7b841a08bae2d6619331141f48f28110 Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Mon, 25 Mar 2019 14:31:05 +0800 Subject: [PATCH 36/46] Add files via upload --- ...erstanding-recurrent-neural-networks.ipynb | 441 ++++++++++++++++++ 1 file changed, 441 insertions(+) create mode 100644 keras/6.2-understanding-recurrent-neural-networks.ipynb diff --git a/keras/6.2-understanding-recurrent-neural-networks.ipynb b/keras/6.2-understanding-recurrent-neural-networks.ipynb new file mode 100644 index 0000000..5b19bf2 --- /dev/null +++ b/keras/6.2-understanding-recurrent-neural-networks.ipynb @@ -0,0 +1,441 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "First of all, set environment variables and initialize spark context:" + ] + }, + { + "cell_type": "code", + "execution_count": 1, + "metadata": 
{}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "env: SPARK_DRIVER_MEMORY=16g\n", + "env: PYSPARK_PYTHON=/usr/bin/python3.5\n", + "env: PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n", + "Prepending /home/litchy/.local/lib/python3.5/site-packages/bigdl/share/conf/spark-bigdl.conf to sys.path\n" + ] + } + ], + "source": [ + "%env SPARK_DRIVER_MEMORY=16g\n", + "%env PYSPARK_PYTHON=/usr/bin/python3.5\n", + "%env PYSPARK_DRIVER_PYTHON=/usr/bin/python3.5\n", + "\n", + "from zoo.common.nncontext import *\n", + "sc = init_nncontext(init_spark_conf().setMaster(\"local[4]\"))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "# Understanding recurrent neural networks\n", + "\n", + "----\n", + "\n", + "In this section we will build recurrent neural networks to tackle the same task as in Chapter 3: classifying IMDB movie reviews." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## A first recurrent layer in the Keras API of Analytics Zoo\n", + "\n", + "The simple recurrent process that the book implements naively in Numpy corresponds to an actual layer, the `SimpleRNN` layer:" + ] + }, + { + "cell_type": "code", + "execution_count": 2, + "metadata": {}, + "outputs": [], + "source": [ + "from zoo.pipeline.api.keras.layers import SimpleRNN" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "There is just one minor difference: `SimpleRNN` processes batches of sequences, like all other layers in the Keras API of Analytics Zoo, not just a single sequence like \n", + "in the book's Numpy example. 
This means that it takes inputs of shape `(batch_size, timesteps, input_features)`, rather than `(timesteps, \n", + "input_features)`.\n", + "\n", + "Like all recurrent layers in the Keras API of Analytics Zoo, `SimpleRNN` can be run in two different modes: it can return either the full sequences of successive \n", + "outputs for each timestep (a 3D tensor of shape `(batch_size, timesteps, output_features)`), or it can return only the last output for each \n", + "input sequence (a 2D tensor of shape `(batch_size, output_features)`). These two modes are controlled by the `return_sequences` constructor \n", + "argument. Let's take a look at an example:" + ] + }, + { + "cell_type": "code", + "execution_count": 3, + "metadata": {}, + "outputs": [], + "source": [ + "from zoo.pipeline.api.keras.models import Sequential\n", + "from zoo.pipeline.api.keras.layers import Embedding, SimpleRNN" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "The following is the preprocessing function. You do not need to worry about the details of its implementation; essentially, `pad_sequences` pads or truncates all the sequences to the same length." 
+ ] + }, + { + "cell_type": "code", + "execution_count": 4, + "metadata": {}, + "outputs": [], + "source": [ + "def pad_sequences(sequences, maxlen=None, dtype='int32',\n", + " padding='pre', truncating='pre', value=0.): \n", + " lengths = [len(s) for s in sequences]\n", + "\n", + " nb_samples = len(sequences)\n", + " if maxlen is None:\n", + " maxlen = np.max(lengths)\n", + "\n", + " # take the sample shape from the first non empty sequence\n", + " # checking for consistency in the main loop below.\n", + " sample_shape = tuple()\n", + " for s in sequences:\n", + " if len(s) > 0:\n", + " sample_shape = np.asarray(s).shape[1:]\n", + " break\n", + "\n", + " x = (np.ones((nb_samples, maxlen) + sample_shape) * value).astype(dtype)\n", + " for idx, s in enumerate(sequences):\n", + " if not len(s):\n", + " continue # empty list/array was found\n", + " if truncating == 'pre':\n", + " trunc = s[-maxlen:]\n", + " elif truncating == 'post':\n", + " trunc = s[:maxlen]\n", + " else:\n", + " raise ValueError('Truncating type \"%s\" not understood' % truncating)\n", + "\n", + " # check `trunc` has expected shape\n", + " trunc = np.asarray(trunc, dtype=dtype)\n", + " if trunc.shape[1:] != sample_shape:\n", + " raise ValueError('Shape of sample %s of sequence at position %s is different from expected shape %s' %\n", + " (trunc.shape[1:], idx, sample_shape))\n", + "\n", + " if padding == 'post':\n", + " x[idx, :len(trunc)] = trunc\n", + " elif padding == 'pre':\n", + " x[idx, -len(trunc):] = trunc\n", + " else:\n", + " raise ValueError('Padding type \"%s\" not understood' % padding)\n", + " return x" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Now let's try to use such a model on the IMDB movie review classification problem. First, let's preprocess the data. 
" + ] + }, + { + "cell_type": "code", + "execution_count": 5, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "input_train shape: (25000, 500)\n", + "input_test shape: (25000, 500)\n" + ] + } + ], + "source": [ + "from zoo.pipeline.api.keras.datasets import imdb\n", + "\n", + "max_features = 10000 # number of words to consider as features\n", + "maxlen = 500 # cut texts after this number of words (among top max_features most common words)\n", + "batch_size = 32\n", + "\n", + "(input_train, y_train), (input_test, y_test) = imdb.load_data(nb_words=max_features)\n", + "input_train = pad_sequences(input_train, maxlen=maxlen)\n", + "input_test = pad_sequences(input_test, maxlen=maxlen)\n", + "print('input_train shape:', input_train.shape)\n", + "print('input_test shape:', input_test.shape)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "It is sometimes useful to stack several recurrent layers one after the other in order to increase the representational power of a network. 
\n", + "In such a setup, you have to get all intermediate layers to return full sequences:" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "#### Specify input shape\n", + "_One could add an embedding layer as our first layer in Keras as following:_\n", + " \n", + " model = Sequential()\n", + " model.add(Embedding(10000, 32))\n", + "_In Keras API of Analytics Zoo, you need to specify the input shape of first layer, in this example, the sequence length is 500, as is shown above, so we could build our model as following:_" + ] + }, + { + "cell_type": "code", + "execution_count": 6, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "creating: createZooKerasSequential\n", + "creating: createZooKerasEmbedding\n", + "creating: createZooKerasSimpleRNN\n", + "creating: createZooKerasSimpleRNN\n", + "creating: createZooKerasSimpleRNN\n", + "creating: createZooKerasSimpleRNN\n" + ] + } + ], + "source": [ + "model = Sequential()\n", + "model.add(Embedding(10000, 32, input_shape=(500,)))\n", + "model.add(SimpleRNN(32, return_sequences=True))\n", + "model.add(SimpleRNN(32, return_sequences=True))\n", + "model.add(SimpleRNN(32, return_sequences=True))\n", + "model.add(SimpleRNN(32)) # This last layer only returns the last outputs.\n", + "model.summary()" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Let's train a simple recurrent network using an `Embedding` layer and a `SimpleRNN` layer:" + ] + }, + { + "cell_type": "code", + "execution_count": 7, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "creating: createZooKerasSequential\n", + "creating: createZooKerasEmbedding\n", + "creating: createZooKerasSimpleRNN\n", + "creating: createZooKerasDense\n", + "creating: createRMSprop\n", + "creating: createZooKerasBinaryCrossEntropy\n", + "creating: createZooKerasBinaryAccuracy\n" + ] + } + ], + "source": [ + "from 
zoo.pipeline.api.keras.layers import Dense\n", + "\n", + "model = Sequential()\n", + "model.add(Embedding(max_features, 32, input_shape=(500,)))\n", + "model.add(SimpleRNN(32))\n", + "model.add(Dense(1, activation='sigmoid'))\n", + "\n", + "model.compile(optimizer='rmsprop', loss='binary_crossentropy', metrics=['acc'])\n", + "\n", + "import time\n", + "dir_name = '6-2 ' + str(time.ctime())\n", + "model.set_tensorboard('./', dir_name)\n", + "model.fit(input_train, y_train,\n", + " nb_epoch=10,\n", + " batch_size=128,\n", + " validation_split=0.2)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "_INFO - Trained 128 records in 0.046239497 seconds. Throughput is 2768.1963 records/second. Loss is 0.16970885.\n", + "\n", + "Top1Accuracy is Accuracy(correct: 4167, count: 5000, accuracy: 0.8334)_" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Let's display the training and validation loss:" + ] + }, + { + "cell_type": "code", + "execution_count": 9, + "metadata": {}, + "outputs": [ + { + "data": { + "image/png": "\n", + "text/plain": [ + "
" + ] + }, + "metadata": { + "needs_background": "light" + }, + "output_type": "display_data" + } + ], + "source": [ + "train_loss = np.array(model.get_train_summary('Loss'))\n", + "val_loss = np.array(model.get_validation_summary('Loss'))\n", + "\n", + "import matplotlib.pyplot as plt\n", + "plt.plot(train_loss[:,0],train_loss[:,1],label='train loss')\n", + "plt.plot(val_loss[:,0],val_loss[:,1],label='validation loss',color='green')\n", + "plt.xlabel('Steps')\n", + "plt.ylabel('Loss')\n", + "plt.legend()\n", + "plt.show()" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "As a reminder, in chapter 3, our very first naive approach to this very dataset got us to 88% test accuracy. Unfortunately, our small \n", + "recurrent network doesn't perform very well at all compared to this baseline (only up to 85% validation accuracy). Part of the problem is \n", + "that our inputs only consider the first 500 words rather the full sequences -- \n", + "hence our RNN has access to less information than our earlier baseline model. The remainder of the problem is simply that `SimpleRNN` isn't very good at processing long sequences, like text. Other types of recurrent layers perform much better. Let's take a look at some \n", + "more advanced layers." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## A concrete LSTM example in Keras API of Analytics Zoo\n", + "\n", + "Now let's switch to more practical concerns: we will set up a model using a LSTM layer and train it on the IMDB data. Here's the network, \n", + "similar to the one with `SimpleRNN` that we just presented. We only specify the output dimensionality of the LSTM layer, and leave every \n", + "other argument (there are lots) to the Keras API of Analytics Zoo defaults, which has good defaults, and things will almost always \"just work\" without you \n", + "having to spend time tuning parameters by hand." 
+ ] + }, + { + "cell_type": "code", + "execution_count": 10, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "creating: createZooKerasSequential\n", + "creating: createZooKerasEmbedding\n", + "creating: createZooKerasLSTM\n", + "creating: createZooKerasDense\n", + "creating: createRMSprop\n", + "creating: createZooKerasBinaryCrossEntropy\n", + "creating: createZooKerasBinaryAccuracy\n" + ] + } + ], + "source": [ + "from zoo.pipeline.api.keras.layers import LSTM\n", + "\n", + "model = Sequential()\n", + "model.add(Embedding(max_features, 32, input_shape=(500,)))\n", + "model.add(LSTM(32))\n", + "model.add(Dense(1, activation='sigmoid'))\n", + "\n", + "model.compile(optimizer='rmsprop',\n", + " loss='binary_crossentropy',\n", + " metrics=['acc'])\n", + "\n", + "dir_name = '6-2 ' + str(time.ctime())\n", + "model.set_tensorboard('./', dir_name)\n", + "model.fit(input_train, y_train,\n", + " nb_epoch=10,\n", + " batch_size=128,\n", + " validation_split=0.2)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "_INFO - Trained 128 records in 0.335889472 seconds. Throughput is 381.07776 records/second. Loss is 0.14791179.\n", + "\n", + "Top1Accuracy is Accuracy(correct: 4358, count: 5000, accuracy: 0.8716)_" + ] + }, + { + "cell_type": "code", + "execution_count": 11, + "metadata": {}, + "outputs": [ + { + "data": { + "image/png": "\n", + "text/plain": [ + "
" + ] + }, + "metadata": { + "needs_background": "light" + }, + "output_type": "display_data" + } + ], + "source": [ + "train_loss = np.array(model.get_train_summary('Loss'))\n", + "val_loss = np.array(model.get_validation_summary('Loss'))\n", + "\n", + "plt.plot(train_loss[:,0],train_loss[:,1],label='train loss')\n", + "plt.plot(val_loss[:,0],val_loss[:,1],label='validation loss',color='green')\n", + "plt.xlabel('Steps')\n", + "plt.ylabel('Loss')\n", + "plt.legend()\n", + "plt.show()" + ] + } + ], + "metadata": { + "kernelspec": { + "display_name": "Python 3", + "language": "python", + "name": "python3" + }, + "language_info": { + "codemirror_mode": { + "name": "ipython", + "version": 3 + }, + "file_extension": ".py", + "mimetype": "text/x-python", + "name": "python", + "nbconvert_exporter": "python", + "pygments_lexer": "ipython3", + "version": "3.5.2" + } + }, + "nbformat": 4, + "nbformat_minor": 2 +} From c875f97f6bdd37d4e540fedbcf01ef61582fcb58 Mon Sep 17 00:00:00 2001 From: Kai Huang Date: Mon, 25 Mar 2019 19:53:24 +0800 Subject: [PATCH 37/46] Add bert run_classifier original code (#13) --- tensorflow/bert/run_classifier.py | 981 ++++++++++++++++++++++++++++++ 1 file changed, 981 insertions(+) create mode 100644 tensorflow/bert/run_classifier.py diff --git a/tensorflow/bert/run_classifier.py b/tensorflow/bert/run_classifier.py new file mode 100644 index 0000000..817b147 --- /dev/null +++ b/tensorflow/bert/run_classifier.py @@ -0,0 +1,981 @@ +# coding=utf-8 +# Copyright 2018 The Google AI Language Team Authors. +# +# Licensed under the Apache License, Version 2.0 (the "License"); +# you may not use this file except in compliance with the License. +# You may obtain a copy of the License at +# +# http://www.apache.org/licenses/LICENSE-2.0 +# +# Unless required by applicable law or agreed to in writing, software +# distributed under the License is distributed on an "AS IS" BASIS, +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. 
+# See the License for the specific language governing permissions and +# limitations under the License. +"""BERT finetuning runner.""" + +from __future__ import absolute_import +from __future__ import division +from __future__ import print_function + +import collections +import csv +import os +import modeling +import optimization +import tokenization +import tensorflow as tf + +flags = tf.flags + +FLAGS = flags.FLAGS + +## Required parameters +flags.DEFINE_string( + "data_dir", None, + "The input data dir. Should contain the .tsv files (or other data files) " + "for the task.") + +flags.DEFINE_string( + "bert_config_file", None, + "The config json file corresponding to the pre-trained BERT model. " + "This specifies the model architecture.") + +flags.DEFINE_string("task_name", None, "The name of the task to train.") + +flags.DEFINE_string("vocab_file", None, + "The vocabulary file that the BERT model was trained on.") + +flags.DEFINE_string( + "output_dir", None, + "The output directory where the model checkpoints will be written.") + +## Other parameters + +flags.DEFINE_string( + "init_checkpoint", None, + "Initial checkpoint (usually from a pre-trained BERT model).") + +flags.DEFINE_bool( + "do_lower_case", True, + "Whether to lower case the input text. Should be True for uncased " + "models and False for cased models.") + +flags.DEFINE_integer( + "max_seq_length", 128, + "The maximum total input sequence length after WordPiece tokenization. 
" + "Sequences longer than this will be truncated, and sequences shorter " + "than this will be padded.") + +flags.DEFINE_bool("do_train", False, "Whether to run training.") + +flags.DEFINE_bool("do_eval", False, "Whether to run eval on the dev set.") + +flags.DEFINE_bool( + "do_predict", False, + "Whether to run the model in inference mode on the test set.") + +flags.DEFINE_integer("train_batch_size", 32, "Total batch size for training.") + +flags.DEFINE_integer("eval_batch_size", 8, "Total batch size for eval.") + +flags.DEFINE_integer("predict_batch_size", 8, "Total batch size for predict.") + +flags.DEFINE_float("learning_rate", 5e-5, "The initial learning rate for Adam.") + +flags.DEFINE_float("num_train_epochs", 3.0, + "Total number of training epochs to perform.") + +flags.DEFINE_float( + "warmup_proportion", 0.1, + "Proportion of training to perform linear learning rate warmup for. " + "E.g., 0.1 = 10% of training.") + +flags.DEFINE_integer("save_checkpoints_steps", 1000, + "How often to save the model checkpoint.") + +flags.DEFINE_integer("iterations_per_loop", 1000, + "How many steps to make in each estimator call.") + +flags.DEFINE_bool("use_tpu", False, "Whether to use TPU or GPU/CPU.") + +tf.flags.DEFINE_string( + "tpu_name", None, + "The Cloud TPU to use for training. This should be either the name " + "used when creating the Cloud TPU, or a grpc://ip.address.of.tpu:8470 " + "url.") + +tf.flags.DEFINE_string( + "tpu_zone", None, + "[Optional] GCE zone where the Cloud TPU is located in. If not " + "specified, we will attempt to automatically detect the GCE project from " + "metadata.") + +tf.flags.DEFINE_string( + "gcp_project", None, + "[Optional] Project name for the Cloud TPU-enabled project. If not " + "specified, we will attempt to automatically detect the GCE project from " + "metadata.") + +tf.flags.DEFINE_string("master", None, "[Optional] TensorFlow master URL.") + +flags.DEFINE_integer( + "num_tpu_cores", 8, + "Only used if `use_tpu` is True. 
Total number of TPU cores to use.") + + +class InputExample(object): + """A single training/test example for simple sequence classification.""" + + def __init__(self, guid, text_a, text_b=None, label=None): + """Constructs an InputExample. + + Args: + guid: Unique id for the example. + text_a: string. The untokenized text of the first sequence. For single + sequence tasks, only this sequence must be specified. + text_b: (Optional) string. The untokenized text of the second sequence. + Only must be specified for sequence pair tasks. + label: (Optional) string. The label of the example. This should be + specified for train and dev examples, but not for test examples. + """ + self.guid = guid + self.text_a = text_a + self.text_b = text_b + self.label = label + + +class PaddingInputExample(object): + """Fake example so the num input examples is a multiple of the batch size. + + When running eval/predict on the TPU, we need to pad the number of examples + to be a multiple of the batch size, because the TPU requires a fixed batch + size. The alternative is to drop the last batch, which is bad because it means + the entire output data won't be generated. + + We use this class instead of `None` because treating `None` as padding + batches could cause silent errors. 
+ """ + + +class InputFeatures(object): + """A single set of features of data.""" + + def __init__(self, + input_ids, + input_mask, + segment_ids, + label_id, + is_real_example=True): + self.input_ids = input_ids + self.input_mask = input_mask + self.segment_ids = segment_ids + self.label_id = label_id + self.is_real_example = is_real_example + + +class DataProcessor(object): + """Base class for data converters for sequence classification data sets.""" + + def get_train_examples(self, data_dir): + """Gets a collection of `InputExample`s for the train set.""" + raise NotImplementedError() + + def get_dev_examples(self, data_dir): + """Gets a collection of `InputExample`s for the dev set.""" + raise NotImplementedError() + + def get_test_examples(self, data_dir): + """Gets a collection of `InputExample`s for prediction.""" + raise NotImplementedError() + + def get_labels(self): + """Gets the list of labels for this data set.""" + raise NotImplementedError() + + @classmethod + def _read_tsv(cls, input_file, quotechar=None): + """Reads a tab separated value file.""" + with tf.gfile.Open(input_file, "r") as f: + reader = csv.reader(f, delimiter="\t", quotechar=quotechar) + lines = [] + for line in reader: + lines.append(line) + return lines + + +class XnliProcessor(DataProcessor): + """Processor for the XNLI data set.""" + + def __init__(self): + self.language = "zh" + + def get_train_examples(self, data_dir): + """See base class.""" + lines = self._read_tsv( + os.path.join(data_dir, "multinli", + "multinli.train.%s.tsv" % self.language)) + examples = [] + for (i, line) in enumerate(lines): + if i == 0: + continue + guid = "train-%d" % (i) + text_a = tokenization.convert_to_unicode(line[0]) + text_b = tokenization.convert_to_unicode(line[1]) + label = tokenization.convert_to_unicode(line[2]) + if label == tokenization.convert_to_unicode("contradictory"): + label = tokenization.convert_to_unicode("contradiction") + examples.append( + InputExample(guid=guid, 
text_a=text_a, text_b=text_b, label=label)) + return examples + + def get_dev_examples(self, data_dir): + """See base class.""" + lines = self._read_tsv(os.path.join(data_dir, "xnli.dev.tsv")) + examples = [] + for (i, line) in enumerate(lines): + if i == 0: + continue + guid = "dev-%d" % (i) + language = tokenization.convert_to_unicode(line[0]) + if language != tokenization.convert_to_unicode(self.language): + continue + text_a = tokenization.convert_to_unicode(line[6]) + text_b = tokenization.convert_to_unicode(line[7]) + label = tokenization.convert_to_unicode(line[1]) + examples.append( + InputExample(guid=guid, text_a=text_a, text_b=text_b, label=label)) + return examples + + def get_labels(self): + """See base class.""" + return ["contradiction", "entailment", "neutral"] + + +class MnliProcessor(DataProcessor): + """Processor for the MultiNLI data set (GLUE version).""" + + def get_train_examples(self, data_dir): + """See base class.""" + return self._create_examples( + self._read_tsv(os.path.join(data_dir, "train.tsv")), "train") + + def get_dev_examples(self, data_dir): + """See base class.""" + return self._create_examples( + self._read_tsv(os.path.join(data_dir, "dev_matched.tsv")), + "dev_matched") + + def get_test_examples(self, data_dir): + """See base class.""" + return self._create_examples( + self._read_tsv(os.path.join(data_dir, "test_matched.tsv")), "test") + + def get_labels(self): + """See base class.""" + return ["contradiction", "entailment", "neutral"] + + def _create_examples(self, lines, set_type): + """Creates examples for the training and dev sets.""" + examples = [] + for (i, line) in enumerate(lines): + if i == 0: + continue + guid = "%s-%s" % (set_type, tokenization.convert_to_unicode(line[0])) + text_a = tokenization.convert_to_unicode(line[8]) + text_b = tokenization.convert_to_unicode(line[9]) + if set_type == "test": + label = "contradiction" + else: + label = tokenization.convert_to_unicode(line[-1]) + examples.append( + 
InputExample(guid=guid, text_a=text_a, text_b=text_b, label=label)) + return examples + + +class MrpcProcessor(DataProcessor): + """Processor for the MRPC data set (GLUE version).""" + + def get_train_examples(self, data_dir): + """See base class.""" + return self._create_examples( + self._read_tsv(os.path.join(data_dir, "train.tsv")), "train") + + def get_dev_examples(self, data_dir): + """See base class.""" + return self._create_examples( + self._read_tsv(os.path.join(data_dir, "dev.tsv")), "dev") + + def get_test_examples(self, data_dir): + """See base class.""" + return self._create_examples( + self._read_tsv(os.path.join(data_dir, "test.tsv")), "test") + + def get_labels(self): + """See base class.""" + return ["0", "1"] + + def _create_examples(self, lines, set_type): + """Creates examples for the training and dev sets.""" + examples = [] + for (i, line) in enumerate(lines): + if i == 0: + continue + guid = "%s-%s" % (set_type, i) + text_a = tokenization.convert_to_unicode(line[3]) + text_b = tokenization.convert_to_unicode(line[4]) + if set_type == "test": + label = "0" + else: + label = tokenization.convert_to_unicode(line[0]) + examples.append( + InputExample(guid=guid, text_a=text_a, text_b=text_b, label=label)) + return examples + + +class ColaProcessor(DataProcessor): + """Processor for the CoLA data set (GLUE version).""" + + def get_train_examples(self, data_dir): + """See base class.""" + return self._create_examples( + self._read_tsv(os.path.join(data_dir, "train.tsv")), "train") + + def get_dev_examples(self, data_dir): + """See base class.""" + return self._create_examples( + self._read_tsv(os.path.join(data_dir, "dev.tsv")), "dev") + + def get_test_examples(self, data_dir): + """See base class.""" + return self._create_examples( + self._read_tsv(os.path.join(data_dir, "test.tsv")), "test") + + def get_labels(self): + """See base class.""" + return ["0", "1"] + + def _create_examples(self, lines, set_type): + """Creates examples for the training 
and dev sets.""" + examples = [] + for (i, line) in enumerate(lines): + # Only the test set has a header + if set_type == "test" and i == 0: + continue + guid = "%s-%s" % (set_type, i) + if set_type == "test": + text_a = tokenization.convert_to_unicode(line[1]) + label = "0" + else: + text_a = tokenization.convert_to_unicode(line[3]) + label = tokenization.convert_to_unicode(line[1]) + examples.append( + InputExample(guid=guid, text_a=text_a, text_b=None, label=label)) + return examples + + +def convert_single_example(ex_index, example, label_list, max_seq_length, + tokenizer): + """Converts a single `InputExample` into a single `InputFeatures`.""" + + if isinstance(example, PaddingInputExample): + return InputFeatures( + input_ids=[0] * max_seq_length, + input_mask=[0] * max_seq_length, + segment_ids=[0] * max_seq_length, + label_id=0, + is_real_example=False) + + label_map = {} + for (i, label) in enumerate(label_list): + label_map[label] = i + + tokens_a = tokenizer.tokenize(example.text_a) + tokens_b = None + if example.text_b: + tokens_b = tokenizer.tokenize(example.text_b) + + if tokens_b: + # Modifies `tokens_a` and `tokens_b` in place so that the total + # length is less than the specified length. + # Account for [CLS], [SEP], [SEP] with "- 3" + _truncate_seq_pair(tokens_a, tokens_b, max_seq_length - 3) + else: + # Account for [CLS] and [SEP] with "- 2" + if len(tokens_a) > max_seq_length - 2: + tokens_a = tokens_a[0:(max_seq_length - 2)] + + # The convention in BERT is: + # (a) For sequence pairs: + # tokens: [CLS] is this jack ##son ##ville ? [SEP] no it is not . [SEP] + # type_ids: 0 0 0 0 0 0 0 0 1 1 1 1 1 1 + # (b) For single sequences: + # tokens: [CLS] the dog is hairy . [SEP] + # type_ids: 0 0 0 0 0 0 0 + # + # Where "type_ids" are used to indicate whether this is the first + # sequence or the second sequence. 
The embedding vectors for `type=0` and + # `type=1` were learned during pre-training and are added to the wordpiece + # embedding vector (and position vector). This is not *strictly* necessary + # since the [SEP] token unambiguously separates the sequences, but it makes + # it easier for the model to learn the concept of sequences. + # + # For classification tasks, the first vector (corresponding to [CLS]) is + # used as the "sentence vector". Note that this only makes sense because + # the entire model is fine-tuned. + tokens = [] + segment_ids = [] + tokens.append("[CLS]") + segment_ids.append(0) + for token in tokens_a: + tokens.append(token) + segment_ids.append(0) + tokens.append("[SEP]") + segment_ids.append(0) + + if tokens_b: + for token in tokens_b: + tokens.append(token) + segment_ids.append(1) + tokens.append("[SEP]") + segment_ids.append(1) + + input_ids = tokenizer.convert_tokens_to_ids(tokens) + + # The mask has 1 for real tokens and 0 for padding tokens. Only real + # tokens are attended to. + input_mask = [1] * len(input_ids) + + # Zero-pad up to the sequence length. 
+ while len(input_ids) < max_seq_length: + input_ids.append(0) + input_mask.append(0) + segment_ids.append(0) + + assert len(input_ids) == max_seq_length + assert len(input_mask) == max_seq_length + assert len(segment_ids) == max_seq_length + + label_id = label_map[example.label] + if ex_index < 5: + tf.logging.info("*** Example ***") + tf.logging.info("guid: %s" % (example.guid)) + tf.logging.info("tokens: %s" % " ".join( + [tokenization.printable_text(x) for x in tokens])) + tf.logging.info("input_ids: %s" % " ".join([str(x) for x in input_ids])) + tf.logging.info("input_mask: %s" % " ".join([str(x) for x in input_mask])) + tf.logging.info("segment_ids: %s" % " ".join([str(x) for x in segment_ids])) + tf.logging.info("label: %s (id = %d)" % (example.label, label_id)) + + feature = InputFeatures( + input_ids=input_ids, + input_mask=input_mask, + segment_ids=segment_ids, + label_id=label_id, + is_real_example=True) + return feature + + +def file_based_convert_examples_to_features( + examples, label_list, max_seq_length, tokenizer, output_file): + """Convert a set of `InputExample`s to a TFRecord file.""" + + writer = tf.python_io.TFRecordWriter(output_file) + + for (ex_index, example) in enumerate(examples): + if ex_index % 10000 == 0: + tf.logging.info("Writing example %d of %d" % (ex_index, len(examples))) + + feature = convert_single_example(ex_index, example, label_list, + max_seq_length, tokenizer) + + def create_int_feature(values): + f = tf.train.Feature(int64_list=tf.train.Int64List(value=list(values))) + return f + + features = collections.OrderedDict() + features["input_ids"] = create_int_feature(feature.input_ids) + features["input_mask"] = create_int_feature(feature.input_mask) + features["segment_ids"] = create_int_feature(feature.segment_ids) + features["label_ids"] = create_int_feature([feature.label_id]) + features["is_real_example"] = create_int_feature( + [int(feature.is_real_example)]) + + tf_example = 
tf.train.Example(features=tf.train.Features(feature=features)) + writer.write(tf_example.SerializeToString()) + writer.close() + + +def file_based_input_fn_builder(input_file, seq_length, is_training, + drop_remainder): + """Creates an `input_fn` closure to be passed to TPUEstimator.""" + + name_to_features = { + "input_ids": tf.FixedLenFeature([seq_length], tf.int64), + "input_mask": tf.FixedLenFeature([seq_length], tf.int64), + "segment_ids": tf.FixedLenFeature([seq_length], tf.int64), + "label_ids": tf.FixedLenFeature([], tf.int64), + "is_real_example": tf.FixedLenFeature([], tf.int64), + } + + def _decode_record(record, name_to_features): + """Decodes a record to a TensorFlow example.""" + example = tf.parse_single_example(record, name_to_features) + + # tf.Example only supports tf.int64, but the TPU only supports tf.int32. + # So cast all int64 to int32. + for name in list(example.keys()): + t = example[name] + if t.dtype == tf.int64: + t = tf.to_int32(t) + example[name] = t + + return example + + def input_fn(params): + """The actual input function.""" + batch_size = params["batch_size"] + + # For training, we want a lot of parallel reading and shuffling. + # For eval, we want no shuffling and parallel reading doesn't matter. + d = tf.data.TFRecordDataset(input_file) + if is_training: + d = d.repeat() + d = d.shuffle(buffer_size=100) + + d = d.apply( + tf.contrib.data.map_and_batch( + lambda record: _decode_record(record, name_to_features), + batch_size=batch_size, + drop_remainder=drop_remainder)) + + return d + + return input_fn + + +def _truncate_seq_pair(tokens_a, tokens_b, max_length): + """Truncates a sequence pair in place to the maximum length.""" + + # This is a simple heuristic which will always truncate the longer sequence + # one token at a time. 
This makes more sense than truncating an equal percent + # of tokens from each, since if one sequence is very short then each token + # that's truncated likely contains more information than a longer sequence. + while True: + total_length = len(tokens_a) + len(tokens_b) + if total_length <= max_length: + break + if len(tokens_a) > len(tokens_b): + tokens_a.pop() + else: + tokens_b.pop() + + +def create_model(bert_config, is_training, input_ids, input_mask, segment_ids, + labels, num_labels, use_one_hot_embeddings): + """Creates a classification model.""" + model = modeling.BertModel( + config=bert_config, + is_training=is_training, + input_ids=input_ids, + input_mask=input_mask, + token_type_ids=segment_ids, + use_one_hot_embeddings=use_one_hot_embeddings) + + # In the demo, we are doing a simple classification task on the entire + # segment. + # + # If you want to use the token-level output, use model.get_sequence_output() + # instead. + output_layer = model.get_pooled_output() + + hidden_size = output_layer.shape[-1].value + + output_weights = tf.get_variable( + "output_weights", [num_labels, hidden_size], + initializer=tf.truncated_normal_initializer(stddev=0.02)) + + output_bias = tf.get_variable( + "output_bias", [num_labels], initializer=tf.zeros_initializer()) + + with tf.variable_scope("loss"): + if is_training: + # I.e., 0.1 dropout + output_layer = tf.nn.dropout(output_layer, keep_prob=0.9) + + logits = tf.matmul(output_layer, output_weights, transpose_b=True) + logits = tf.nn.bias_add(logits, output_bias) + probabilities = tf.nn.softmax(logits, axis=-1) + log_probs = tf.nn.log_softmax(logits, axis=-1) + + one_hot_labels = tf.one_hot(labels, depth=num_labels, dtype=tf.float32) + + per_example_loss = -tf.reduce_sum(one_hot_labels * log_probs, axis=-1) + loss = tf.reduce_mean(per_example_loss) + + return (loss, per_example_loss, logits, probabilities) + + +def model_fn_builder(bert_config, num_labels, init_checkpoint, learning_rate, + num_train_steps, 
num_warmup_steps, use_tpu, + use_one_hot_embeddings): + """Returns `model_fn` closure for TPUEstimator.""" + + def model_fn(features, labels, mode, params): # pylint: disable=unused-argument + """The `model_fn` for TPUEstimator.""" + + tf.logging.info("*** Features ***") + for name in sorted(features.keys()): + tf.logging.info(" name = %s, shape = %s" % (name, features[name].shape)) + + input_ids = features["input_ids"] + input_mask = features["input_mask"] + segment_ids = features["segment_ids"] + label_ids = features["label_ids"] + is_real_example = None + if "is_real_example" in features: + is_real_example = tf.cast(features["is_real_example"], dtype=tf.float32) + else: + is_real_example = tf.ones(tf.shape(label_ids), dtype=tf.float32) + + is_training = (mode == tf.estimator.ModeKeys.TRAIN) + + (total_loss, per_example_loss, logits, probabilities) = create_model( + bert_config, is_training, input_ids, input_mask, segment_ids, label_ids, + num_labels, use_one_hot_embeddings) + + tvars = tf.trainable_variables() + initialized_variable_names = {} + scaffold_fn = None + if init_checkpoint: + (assignment_map, initialized_variable_names + ) = modeling.get_assignment_map_from_checkpoint(tvars, init_checkpoint) + if use_tpu: + + def tpu_scaffold(): + tf.train.init_from_checkpoint(init_checkpoint, assignment_map) + return tf.train.Scaffold() + + scaffold_fn = tpu_scaffold + else: + tf.train.init_from_checkpoint(init_checkpoint, assignment_map) + + tf.logging.info("**** Trainable Variables ****") + for var in tvars: + init_string = "" + if var.name in initialized_variable_names: + init_string = ", *INIT_FROM_CKPT*" + tf.logging.info(" name = %s, shape = %s%s", var.name, var.shape, + init_string) + + output_spec = None + if mode == tf.estimator.ModeKeys.TRAIN: + + train_op = optimization.create_optimizer( + total_loss, learning_rate, num_train_steps, num_warmup_steps, use_tpu) + + output_spec = tf.contrib.tpu.TPUEstimatorSpec( + mode=mode, + loss=total_loss, + 
train_op=train_op, + scaffold_fn=scaffold_fn) + elif mode == tf.estimator.ModeKeys.EVAL: + + def metric_fn(per_example_loss, label_ids, logits, is_real_example): + predictions = tf.argmax(logits, axis=-1, output_type=tf.int32) + accuracy = tf.metrics.accuracy( + labels=label_ids, predictions=predictions, weights=is_real_example) + loss = tf.metrics.mean(values=per_example_loss, weights=is_real_example) + return { + "eval_accuracy": accuracy, + "eval_loss": loss, + } + + eval_metrics = (metric_fn, + [per_example_loss, label_ids, logits, is_real_example]) + output_spec = tf.contrib.tpu.TPUEstimatorSpec( + mode=mode, + loss=total_loss, + eval_metrics=eval_metrics, + scaffold_fn=scaffold_fn) + else: + output_spec = tf.contrib.tpu.TPUEstimatorSpec( + mode=mode, + predictions={"probabilities": probabilities}, + scaffold_fn=scaffold_fn) + return output_spec + + return model_fn + + +# This function is not used by this file but is still used by the Colab and +# people who depend on it. +def input_fn_builder(features, seq_length, is_training, drop_remainder): + """Creates an `input_fn` closure to be passed to TPUEstimator.""" + + all_input_ids = [] + all_input_mask = [] + all_segment_ids = [] + all_label_ids = [] + + for feature in features: + all_input_ids.append(feature.input_ids) + all_input_mask.append(feature.input_mask) + all_segment_ids.append(feature.segment_ids) + all_label_ids.append(feature.label_id) + + def input_fn(params): + """The actual input function.""" + batch_size = params["batch_size"] + + num_examples = len(features) + + # This is for demo purposes and does NOT scale to large data sets. We do + # not use Dataset.from_generator() because that uses tf.py_func which is + # not TPU compatible. The right way to load data is with TFRecordReader. 
 + d = tf.data.Dataset.from_tensor_slices({ + "input_ids": + tf.constant( + all_input_ids, shape=[num_examples, seq_length], + dtype=tf.int32), + "input_mask": + tf.constant( + all_input_mask, + shape=[num_examples, seq_length], + dtype=tf.int32), + "segment_ids": + tf.constant( + all_segment_ids, + shape=[num_examples, seq_length], + dtype=tf.int32), + "label_ids": + tf.constant(all_label_ids, shape=[num_examples], dtype=tf.int32), + }) + + if is_training: + d = d.repeat() + d = d.shuffle(buffer_size=100) + + d = d.batch(batch_size=batch_size, drop_remainder=drop_remainder) + return d + + return input_fn + + +# This function is not used by this file but is still used by the Colab and +# people who depend on it. +def convert_examples_to_features(examples, label_list, max_seq_length, + tokenizer): + """Convert a set of `InputExample`s to a list of `InputFeatures`.""" + + features = [] + for (ex_index, example) in enumerate(examples): + if ex_index % 10000 == 0: + tf.logging.info("Writing example %d of %d" % (ex_index, len(examples))) + + feature = convert_single_example(ex_index, example, label_list, + max_seq_length, tokenizer) + + features.append(feature) + return features + + +def main(_): + tf.logging.set_verbosity(tf.logging.INFO) + + processors = { + "cola": ColaProcessor, + "mnli": MnliProcessor, + "mrpc": MrpcProcessor, + "xnli": XnliProcessor, + } + + tokenization.validate_case_matches_checkpoint(FLAGS.do_lower_case, + FLAGS.init_checkpoint) + + if not FLAGS.do_train and not FLAGS.do_eval and not FLAGS.do_predict: + raise ValueError( + "At least one of `do_train`, `do_eval` or `do_predict` must be True.") + + bert_config = modeling.BertConfig.from_json_file(FLAGS.bert_config_file) + + if FLAGS.max_seq_length > bert_config.max_position_embeddings: + raise ValueError( + "Cannot use sequence length %d because the BERT model " + "was only trained up to sequence length %d" % + (FLAGS.max_seq_length, bert_config.max_position_embeddings)) + + 
tf.gfile.MakeDirs(FLAGS.output_dir) + + task_name = FLAGS.task_name.lower() + + if task_name not in processors: + raise ValueError("Task not found: %s" % (task_name)) + + processor = processors[task_name]() + + label_list = processor.get_labels() + + tokenizer = tokenization.FullTokenizer( + vocab_file=FLAGS.vocab_file, do_lower_case=FLAGS.do_lower_case) + + tpu_cluster_resolver = None + if FLAGS.use_tpu and FLAGS.tpu_name: + tpu_cluster_resolver = tf.contrib.cluster_resolver.TPUClusterResolver( + FLAGS.tpu_name, zone=FLAGS.tpu_zone, project=FLAGS.gcp_project) + + is_per_host = tf.contrib.tpu.InputPipelineConfig.PER_HOST_V2 + run_config = tf.contrib.tpu.RunConfig( + cluster=tpu_cluster_resolver, + master=FLAGS.master, + model_dir=FLAGS.output_dir, + save_checkpoints_steps=FLAGS.save_checkpoints_steps, + tpu_config=tf.contrib.tpu.TPUConfig( + iterations_per_loop=FLAGS.iterations_per_loop, + num_shards=FLAGS.num_tpu_cores, + per_host_input_for_training=is_per_host)) + + train_examples = None + num_train_steps = None + num_warmup_steps = None + if FLAGS.do_train: + train_examples = processor.get_train_examples(FLAGS.data_dir) + num_train_steps = int( + len(train_examples) / FLAGS.train_batch_size * FLAGS.num_train_epochs) + num_warmup_steps = int(num_train_steps * FLAGS.warmup_proportion) + + model_fn = model_fn_builder( + bert_config=bert_config, + num_labels=len(label_list), + init_checkpoint=FLAGS.init_checkpoint, + learning_rate=FLAGS.learning_rate, + num_train_steps=num_train_steps, + num_warmup_steps=num_warmup_steps, + use_tpu=FLAGS.use_tpu, + use_one_hot_embeddings=FLAGS.use_tpu) + + # If TPU is not available, this will fall back to normal Estimator on CPU + # or GPU. 
+ estimator = tf.contrib.tpu.TPUEstimator( + use_tpu=FLAGS.use_tpu, + model_fn=model_fn, + config=run_config, + train_batch_size=FLAGS.train_batch_size, + eval_batch_size=FLAGS.eval_batch_size, + predict_batch_size=FLAGS.predict_batch_size) + + if FLAGS.do_train: + train_file = os.path.join(FLAGS.output_dir, "train.tf_record") + file_based_convert_examples_to_features( + train_examples, label_list, FLAGS.max_seq_length, tokenizer, train_file) + tf.logging.info("***** Running training *****") + tf.logging.info(" Num examples = %d", len(train_examples)) + tf.logging.info(" Batch size = %d", FLAGS.train_batch_size) + tf.logging.info(" Num steps = %d", num_train_steps) + train_input_fn = file_based_input_fn_builder( + input_file=train_file, + seq_length=FLAGS.max_seq_length, + is_training=True, + drop_remainder=True) + estimator.train(input_fn=train_input_fn, max_steps=num_train_steps) + + if FLAGS.do_eval: + eval_examples = processor.get_dev_examples(FLAGS.data_dir) + num_actual_eval_examples = len(eval_examples) + if FLAGS.use_tpu: + # TPU requires a fixed batch size for all batches, therefore the number + # of examples must be a multiple of the batch size, or else examples + # will get dropped. So we pad with fake examples which are ignored + # later on. These do NOT count towards the metric (all tf.metrics + # support a per-instance weight, and these get a weight of 0.0). 
+ while len(eval_examples) % FLAGS.eval_batch_size != 0: + eval_examples.append(PaddingInputExample()) + + eval_file = os.path.join(FLAGS.output_dir, "eval.tf_record") + file_based_convert_examples_to_features( + eval_examples, label_list, FLAGS.max_seq_length, tokenizer, eval_file) + + tf.logging.info("***** Running evaluation *****") + tf.logging.info(" Num examples = %d (%d actual, %d padding)", + len(eval_examples), num_actual_eval_examples, + len(eval_examples) - num_actual_eval_examples) + tf.logging.info(" Batch size = %d", FLAGS.eval_batch_size) + + # This tells the estimator to run through the entire set. + eval_steps = None + # However, if running eval on the TPU, you will need to specify the + # number of steps. + if FLAGS.use_tpu: + assert len(eval_examples) % FLAGS.eval_batch_size == 0 + eval_steps = int(len(eval_examples) // FLAGS.eval_batch_size) + + eval_drop_remainder = True if FLAGS.use_tpu else False + eval_input_fn = file_based_input_fn_builder( + input_file=eval_file, + seq_length=FLAGS.max_seq_length, + is_training=False, + drop_remainder=eval_drop_remainder) + + result = estimator.evaluate(input_fn=eval_input_fn, steps=eval_steps) + + output_eval_file = os.path.join(FLAGS.output_dir, "eval_results.txt") + with tf.gfile.GFile(output_eval_file, "w") as writer: + tf.logging.info("***** Eval results *****") + for key in sorted(result.keys()): + tf.logging.info(" %s = %s", key, str(result[key])) + writer.write("%s = %s\n" % (key, str(result[key]))) + + if FLAGS.do_predict: + predict_examples = processor.get_test_examples(FLAGS.data_dir) + num_actual_predict_examples = len(predict_examples) + if FLAGS.use_tpu: + # TPU requires a fixed batch size for all batches, therefore the number + # of examples must be a multiple of the batch size, or else examples + # will get dropped. So we pad with fake examples which are ignored + # later on. 
+ while len(predict_examples) % FLAGS.predict_batch_size != 0: + predict_examples.append(PaddingInputExample()) + + predict_file = os.path.join(FLAGS.output_dir, "predict.tf_record") + file_based_convert_examples_to_features(predict_examples, label_list, + FLAGS.max_seq_length, tokenizer, + predict_file) + + tf.logging.info("***** Running prediction*****") + tf.logging.info(" Num examples = %d (%d actual, %d padding)", + len(predict_examples), num_actual_predict_examples, + len(predict_examples) - num_actual_predict_examples) + tf.logging.info(" Batch size = %d", FLAGS.predict_batch_size) + + predict_drop_remainder = True if FLAGS.use_tpu else False + predict_input_fn = file_based_input_fn_builder( + input_file=predict_file, + seq_length=FLAGS.max_seq_length, + is_training=False, + drop_remainder=predict_drop_remainder) + + result = estimator.predict(input_fn=predict_input_fn) + + output_predict_file = os.path.join(FLAGS.output_dir, "test_results.tsv") + with tf.gfile.GFile(output_predict_file, "w") as writer: + num_written_lines = 0 + tf.logging.info("***** Predict results *****") + for (i, prediction) in enumerate(result): + probabilities = prediction["probabilities"] + if i >= num_actual_predict_examples: + break + output_line = "\t".join( + str(class_probability) + for class_probability in probabilities) + "\n" + writer.write(output_line) + num_written_lines += 1 + assert num_written_lines == num_actual_predict_examples + + +if __name__ == "__main__": + flags.mark_flag_as_required("data_dir") + flags.mark_flag_as_required("task_name") + flags.mark_flag_as_required("vocab_file") + flags.mark_flag_as_required("bert_config_file") + flags.mark_flag_as_required("output_dir") + tf.app.run() From 4361683c7b79ddf0e7115be73c78fca909269a70 Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Tue, 26 Mar 2019 14:47:51 +0800 Subject: [PATCH 38/46] Add files via upload --- keras/3.5-classifying-movie-reviews.ipynb | 61 +++++++++++++++++++++++ 1 file changed, 61 insertions(+) 
diff --git a/keras/3.5-classifying-movie-reviews.ipynb b/keras/3.5-classifying-movie-reviews.ipynb index 9c38c15..2e87dc0 100644 --- a/keras/3.5-classifying-movie-reviews.ipynb +++ b/keras/3.5-classifying-movie-reviews.ipynb @@ -31,6 +31,13 @@ "sc = init_nncontext(init_spark_conf().setMaster(\"local[4]\"))" ] }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Note that you have to allocate 32g memory to `SPARK_DRIVER_MEMORY` if you are about to finish the contents in this notebook. Perhaps there is no such memory left on your machine, see memory saving approach [this link would be added after merge]()." + ] + }, { "cell_type": "markdown", "metadata": {}, @@ -1653,6 +1660,60 @@ "* As they get better on their training data, neural networks eventually start _overfitting_ and end up obtaining increasingly worse results on data \n", "never-seen-before. Make sure to always monitor performance on data that is outside of the training set.\n" ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## \\* Memory saving\n", + "To run this notebook with the code above, you need 32g of `SPARK_DRIVER_MEMORY`, which is expensive. The following approach can reduce the required `SPARK_DRIVER_MEMORY` to 12g.\n", + "\n", + "Recall the point where you have compiled the model and prepared the datasets as `ndarray`s. In the code above, the next step is to call `fit`:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "model.fit(partial_x_train,\n", + " partial_y_train,\n", + " nb_epoch=20,\n", + " batch_size=512,\n", + " validation_data=(x_val, y_val))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Just hold on here! 
Instead of calling `fit` this way, use the following code for training to save memory:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "from bigdl.util.common import to_sample_rdd\n", + "\n", + "train = to_sample_rdd(partial_x_train, partial_y_train)\n", + "val = to_sample_rdd(x_val, y_val)\n", + "\n", + "model.fit(train, None,\n", + " nb_epoch=20,\n", + " batch_size=512,\n", + " validation_data=val)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "This code zips the training data and labels into an RDD. It works because every time the `fit` method takes an `ndarray` as input, it transforms the `ndarray` into an RDD, and some memory is used for caching in this process. In this notebook, we use the same dataset as input repeatedly. If we do this conversion only once and reuse the RDD afterwards, all the subsequent memory use is avoided." + ] } ], "metadata": { From c257c264a101cf3e4f4d2303974d839dc2ef2494 Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Wed, 27 Mar 2019 13:26:49 +0800 Subject: [PATCH 39/46] Update 3.5-classifying-movie-reviews.ipynb --- keras/3.5-classifying-movie-reviews.ipynb | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/keras/3.5-classifying-movie-reviews.ipynb b/keras/3.5-classifying-movie-reviews.ipynb index 2e87dc0..1bbf7b5 100644 --- a/keras/3.5-classifying-movie-reviews.ipynb +++ b/keras/3.5-classifying-movie-reviews.ipynb @@ -35,7 +35,7 @@ "cell_type": "markdown", "metadata": {}, "source": [ - "Note that you have to allocate 32g memory to `SPARK_DRIVER_MEMORY` if you are about to finish the contents in this notebook. Perhaps there is no such memory left on your machine, see memory saving approach [this link would be added after merge]()." + "Note that you have to allocate 32g memory to `SPARK_DRIVER_MEMORY` if you are about to finish the contents in this notebook. 
Perhaps there is no such memory left on your machine, see memory saving approach [here](https://render.githubusercontent.com/view/ipynb?commit=453119be8480aa5e7b6c11071cf72510ccc8d7fe&enc_url=68747470733a2f2f7261772e67697468756275736572636f6e74656e742e636f6d2f696e74656c2d616e616c79746963732f7a6f6f2d7475746f7269616c732f343533313139626538343830616135653762366331313037316366373235313063636338643766652f6b657261732f332e352d636c617373696679696e672d6d6f7669652d726576696577732e6970796e623f746f6b656e3d414e675f3838585a706332444f457544334778614b38394e3631494c7a766a716b7335636d77635a7741253344253344&nwo=intel-analytics%2Fzoo-tutorials&path=keras%2F3.5-classifying-movie-reviews.ipynb&repository_id=172436179&repository_type=Repository#*-Memory-saving)." ] }, { From ab198a5895b87cb4bf92648cddcefc2a22b56a76 Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Wed, 27 Mar 2019 13:44:52 +0800 Subject: [PATCH 40/46] Update 3.5-classifying-movie-reviews.ipynb --- keras/3.5-classifying-movie-reviews.ipynb | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/keras/3.5-classifying-movie-reviews.ipynb b/keras/3.5-classifying-movie-reviews.ipynb index 1bbf7b5..85dd72e 100644 --- a/keras/3.5-classifying-movie-reviews.ipynb +++ b/keras/3.5-classifying-movie-reviews.ipynb @@ -35,7 +35,7 @@ "cell_type": "markdown", "metadata": {}, "source": [ - "Note that you have to allocate 32g memory to `SPARK_DRIVER_MEMORY` if you are about to finish the contents in this notebook. 
Perhaps there is no such memory left on your machine, see memory saving approach [here](https://render.githubusercontent.com/view/ipynb?commit=453119be8480aa5e7b6c11071cf72510ccc8d7fe&enc_url=68747470733a2f2f7261772e67697468756275736572636f6e74656e742e636f6d2f696e74656c2d616e616c79746963732f7a6f6f2d7475746f7269616c732f343533313139626538343830616135653762366331313037316366373235313063636338643766652f6b657261732f332e352d636c617373696679696e672d6d6f7669652d726576696577732e6970796e623f746f6b656e3d414e675f3838585a706332444f457544334778614b38394e3631494c7a766a716b7335636d77635a7741253344253344&nwo=intel-analytics%2Fzoo-tutorials&path=keras%2F3.5-classifying-movie-reviews.ipynb&repository_id=172436179&repository_type=Repository#*-Memory-saving)." + "Note that you have to allocate 32g memory to `SPARK_DRIVER_MEMORY` if you are about to finish the contents in this notebook. Perhaps there is no such memory left on your machine, see memory saving approach [here](*-memory-saving)." ] }, { From a6f5a5818f444f89872b67c25bcc0a66a2ac5e34 Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Wed, 27 Mar 2019 14:04:05 +0800 Subject: [PATCH 41/46] Update 3.5-classifying-movie-reviews.ipynb --- keras/3.5-classifying-movie-reviews.ipynb | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/keras/3.5-classifying-movie-reviews.ipynb b/keras/3.5-classifying-movie-reviews.ipynb index 85dd72e..0206619 100644 --- a/keras/3.5-classifying-movie-reviews.ipynb +++ b/keras/3.5-classifying-movie-reviews.ipynb @@ -35,7 +35,7 @@ "cell_type": "markdown", "metadata": {}, "source": [ - "Note that you have to allocate 32g memory to `SPARK_DRIVER_MEMORY` if you are about to finish the contents in this notebook. Perhaps there is no such memory left on your machine, see memory saving approach [here](*-memory-saving)." + "Note that you have to allocate 32g memory to `SPARK_DRIVER_MEMORY` if you are about to finish the contents in this notebook. 
Perhaps there is no such memory left on your machine, see memory saving approach [here](#*-memory-saving)." ] }, { From e007c136075745e8efb48cd4c78037e5ae40c497 Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Wed, 27 Mar 2019 14:05:35 +0800 Subject: [PATCH 42/46] Update 3.5-classifying-movie-reviews.ipynb --- keras/3.5-classifying-movie-reviews.ipynb | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/keras/3.5-classifying-movie-reviews.ipynb b/keras/3.5-classifying-movie-reviews.ipynb index 0206619..c44415d 100644 --- a/keras/3.5-classifying-movie-reviews.ipynb +++ b/keras/3.5-classifying-movie-reviews.ipynb @@ -35,7 +35,7 @@ "cell_type": "markdown", "metadata": {}, "source": [ - "Note that you have to allocate 32g memory to `SPARK_DRIVER_MEMORY` if you are about to finish the contents in this notebook. Perhaps there is no such memory left on your machine, see memory saving approach [here](#*-memory-saving)." + "Note that you have to allocate 32g memory to `SPARK_DRIVER_MEMORY` if you are about to finish the contents in this notebook. Perhaps there is no such memory left on your machine, see memory saving approach [here](##\*-memory-saving)." ] }, { From cb6371fdd9908a17be2f47611eb651bdc13217aa Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Wed, 27 Mar 2019 14:09:30 +0800 Subject: [PATCH 43/46] Update 3.5-classifying-movie-reviews.ipynb --- keras/3.5-classifying-movie-reviews.ipynb | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/keras/3.5-classifying-movie-reviews.ipynb b/keras/3.5-classifying-movie-reviews.ipynb index c44415d..f5c2251 100644 --- a/keras/3.5-classifying-movie-reviews.ipynb +++ b/keras/3.5-classifying-movie-reviews.ipynb @@ -35,7 +35,7 @@ "cell_type": "markdown", "metadata": {}, "source": [ - "Note that you have to allocate 32g memory to `SPARK_DRIVER_MEMORY` if you are about to finish the contents in this notebook. 
Perhaps there is no such memory left on your machine, see memory saving approach [here](##\*-memory-saving)." + "Note that you have to allocate 32g memory to `SPARK_DRIVER_MEMORY` if you are about to finish the contents in this notebook. Perhaps there is no such memory left on your machine, see memory saving approach [here](#\*-memory-saving)." ] }, { From 4f6c88490f97aca85be7db14f516e58f5b6279a4 Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Wed, 27 Mar 2019 15:28:00 +0800 Subject: [PATCH 44/46] Update 3.5-classifying-movie-reviews.ipynb --- keras/3.5-classifying-movie-reviews.ipynb | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/keras/3.5-classifying-movie-reviews.ipynb b/keras/3.5-classifying-movie-reviews.ipynb index f5c2251..0c73aba 100644 --- a/keras/3.5-classifying-movie-reviews.ipynb +++ b/keras/3.5-classifying-movie-reviews.ipynb @@ -35,7 +35,7 @@ "cell_type": "markdown", "metadata": {}, "source": [ - "Note that you have to allocate 32g memory to `SPARK_DRIVER_MEMORY` if you are about to finish the contents in this notebook. Perhaps there is no such memory left on your machine, see memory saving approach [here](#\*-memory-saving)." + "Note that you have to allocate 32g memory to `SPARK_DRIVER_MEMORY` if you are about to finish the contents in this notebook. Perhaps there is no such memory left on your machine, see memory saving approach [here]()." 
] }, { From 24683102be42959443137f1ca06f7807d4e86abb Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Wed, 27 Mar 2019 15:37:08 +0800 Subject: [PATCH 45/46] Add files via upload --- keras/3.5-classifying-movie-reviews.ipynb | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/keras/3.5-classifying-movie-reviews.ipynb b/keras/3.5-classifying-movie-reviews.ipynb index 2e87dc0..2a2d22c 100644 --- a/keras/3.5-classifying-movie-reviews.ipynb +++ b/keras/3.5-classifying-movie-reviews.ipynb @@ -35,7 +35,7 @@ "cell_type": "markdown", "metadata": {}, "source": [ - "Note that you have to allocate 32g memory to `SPARK_DRIVER_MEMORY` if you are about to finish the contents in this notebook. Perhaps there is no such memory left on your machine, see memory saving approach [this link would be added after merge]()." + "Note that you have to allocate 32g memory to `SPARK_DRIVER_MEMORY` if you are about to finish the contents in this notebook. Perhaps there is no such memory left on your machine, see memory saving approach at the end of this notebook." ] }, { From 07f0401a1b82fa6e5be3fa9d4ee182545babab83 Mon Sep 17 00:00:00 2001 From: Jiaming Song Date: Thu, 4 Apr 2019 14:55:45 +0800 Subject: [PATCH 46/46] Add files via upload --- keras/4.4-overfitting-and-underfitting.ipynb | 7 +++++++ 1 file changed, 7 insertions(+) diff --git a/keras/4.4-overfitting-and-underfitting.ipynb b/keras/4.4-overfitting-and-underfitting.ipynb index c2a801b..3e9c188 100644 --- a/keras/4.4-overfitting-and-underfitting.ipynb +++ b/keras/4.4-overfitting-and-underfitting.ipynb @@ -31,6 +31,13 @@ "sc = init_nncontext(init_spark_conf().setMaster(\"local[4]\"))" ] }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Note that you have to allocate 32g memory to `SPARK_DRIVER_MEMORY` if you are about to finish the contents in this notebook. 
If your machine does not have that much memory, see the memory saving approach at the end of [Chapter 3.5](https://github.com/intel-analytics/zoo-tutorials/blob/master/keras/3.5-classifying-movie-reviews.ipynb)." + ] + }, { "cell_type": "markdown", "metadata": {},