Mix.install([
{:kino, "~> 0.8.0", override: true},
{:kino_vega_lite, "~> 0.1.7"},
{:benchee, "~> 1.1"},
{:hidden_cell, github: "BrooklinJazz/hidden_cell"}
])
-
Scalability
-
Speed
-
Compiled and run on the Erlang VM ("BEAM"). (Renowned for efficiency)
-
Much better "garbage collection" than virtually any other VM
-
Many tiny processes (as opposed to "threads" which are more difficult to manage)
-
Functional language with dynamic typing
-
Immutable data so "state" is always predictable!
-
High reliability, availability and fault tolerance (because of Erlang) means apps built with
Elixir
are run in production for years without any "downtime"! -
Real-time web apps are "easy" (or at least easier than many other languages!) as WebSockets & streaming are baked-in
Things will go wrong with code, and Elixir
provides supervisors which describe how to restart parts of your system when things don't go as planned.
If you have the time, these videos give a nice contextual introduction into what Elixir
is, what it's used for and how it works:
- Code School's Try Elixir, 3 videos (25mins 🎥 plus exercises, totalling 90mins). The 'Try' course is free (there is an extended paid for course).
- Pete Broderick's Intro to Elixir (41 mins 🎥)
- Jessica Kerr's Elixir Should Take Over the World (58 mins 🎥)
Not a video learner? Looking for a specific learning? https://elixirschool.com/ is an excellent, free, open-source resource that explains all things Elixir
📖 ❤️.
Before you learn Elixir
as a language you will need to have it installed on your machine.
To do so you can go to http://elixir-lang.org/install.html or follow our guide here:
Using the Homebrew package manager:
brew install elixir
- Add the Erlang Solutions repo:
wget https://packages.erlang-solutions.com/erlang-solutions_2.0_all.deb && sudo dpkg -i erlang-solutions_2.0_all.deb
- Run:
sudo apt-get update
- Install the Erlang/OTP platform and all of its applications:
sudo apt-get install esl-erlang
- Install Elixir:
sudo apt-get install elixir
-
Web installer
- Download the installer
- Click next, next, ..., finish
-
Chocolatey (Package Manager)
choco install elixir
Elixir
is an interpreted language. The script runs in the BEAM
, the Erlang
virtual machine.
You have several ways to run an Elixir script. You can firstly use the REPL (Read-Eval-Print Loop) command-line tool to write and run simple Elixir
. In other words, it is a program running in the console that gives you a shell to run Elixir
commands
-
After installing
Elixir
you can open the interactive shell by typingiex
. This allows you to type in anyElixir
expression and see the result in the terminal. -
Type in
h
followed by thefunction
name at any time to see documentation information about any given built-in function and how to use it. E.g If you typeh round
into the (iex) terminal you should see something like this:
- Typing
i
followed by the value name will give you information about a value in your code:
You can alternatively use Livebook
. This is an interactive program that runs on top of Elixir
and lets you run Elixir
code, write Markdown, draw graphics... You can download it from the website or via the github repo.
This page itself is a Livebook
.
This section brings together the key information from Elixir's Getting Started documentation and multiple other sources. It will take you through some examples to practice using and familiarise yourself with Elixir's 7 basic types.
Elixir's 7 basic types:
integers
floats
booleans
atoms
strings
lists
tuples
The cell below runs Elixir
commands, the same way you would do it when you open a console and run IEx
. It is an editable cell. You just need to click on "evaluate" and you get the result below.
Lets look at two basic numerical operators: +
and /
.
1 + 2
When using the /
with two integers this gives a float
(5.0).
10 / 3
This operator can be invoked by the Kernel./
routine:
Kernel./(10, 3)
If you want to do integer division or get the euclidean division remainder you can use the div
or rem
functions
a = 10
b = 3
div(a, b)
The rem
is the remainder of the euclidean division
rem(a, b)
a == div(a, b) * b + rem(a, b)
You can find more information in the docs
Elixir supports true
and false
as booleans. We use below the type-checking routine is_boolean
to check if the input is a boolean or not:
true
false
is_boolean(true)
is_boolean(1)
Besides the booleans true
and false
Elixir
also has the
concept of a "truthy" or "falsy" value.
- a value is truthy when it is neither
false
nornil
- a value is falsy when it is
false
ornil
Elixir has functions, like and/2
, that only work with
booleans, but also functions that work with these
truthy/falsy values, like &&/2
and !/1
.
The syntax <function_name>/<number>
is the convention
used in Elixir to identify a function named
<function_name>
that takes <number>
parameters.
The value <number>
is also referred to as the function
arity.
In Elixir each function is identified univocally both by
its name and its arity. More information can be found here.
We can check the truthiness of a value by using the !/1
function twice.
Truthy values:
!true
!!true
!5
!!5
!false
!!false
!nil
!!nil
Atoms are constants where their name is their own value (some other languages call these Symbols).
:hello
!:hello == :world
true
and false
are actually atoms in Elixir
Names of modules in Elixir
are also atoms. MyApp.MyModule
is a valid atom, even if no such module has been declared yet.
is_atom(MyApp.MyModule)
Atoms are also used to reference modules from Erlang
libraries,
including built-in ones. In fact, Elixir
can natively run Erlang
code
pi = :erlang.term_to_binary(3.14)
:erlang.binary_to_term(pi)
:crypto.strong_rand_bytes(3)
You notice in the example above that the cells are linked in
Livebook
. You declare a variablepi
and this value is available afterwards. You must evaluate the first cell to be able to use it in the next cell.
Strings are surrounded by double quotes.
"Hello World"
You can use comments in the code (and it becomes gray)
# a comment starts with "#"
You can print a string using the IO
module. You have 2 routines, IO.puts/1
and IO.inspect/2
. Both return the atom :ok
.
IO.puts("hello")
You have string interpolation with #{variable}
.
a = "ok"
IO.inspect(a, label: "the value is: ")
IO.puts("the value is still: #{a}")
Prefer inspect/2
to IO.puts
is you print anything else but a string. For example, you can't use IO.puts
to print a list. You need to use IO.inspect
. The line below with IO.puts
doesn't work - it returns nothing - but IO.inspect
does work because we are printing a list, which is not a string.
t = [1, 2]
IO.puts("#{t}")
This works:
IO.puts("#{inspect(t)}")
Elixir uses square brackets []
to make a list. Lists are enumerable and can use the Enum module to perform iterative functions such as mapping.
my_list = [1, 2]
length(my_list)
is_list(my_list)
You have operators on lists. You can concatenating lists together with the ++
operator and substract lists with --
my_list ++ [4, 5, 6]
[1, true, 2, false, 3, true] -- [true, false]
You can prepend to a list with |
:
list = [2, 3]
[1 | list]
Elixir uses curly brackets to make a tuple. Tuples are not enumerable and there are far fewer functions available in the Tuple module. You can reference tuple values by index but you cannot iterate over them.
If you must treat your tuple as a list, then convert it using Tuple.to_list(your_tuple)
Tuples are similar to lists but are not suited to data sets that need to be updated or added to regularly.
t = {:ok, "hello", "John"}
is_tuple(t)
elem(t, 2)
tuple_size(t)
l = Tuple.to_list(t)
is_list(l)
If you need to iterate over the values use a list.
When dealing with large lists or tuples:
-
Updating
alist
(adding or removing elements) is fast -
Updating
atuple
is slow -
Reading
alist
(getting its length or selecting an element) is slow -
Reading
atuple
is fast
source: http://stackoverflow.com/questions/31192923/lists-vs-tuples-what-to-use-and-when
Pattern matching is heavily used in Elixir
. This is like destructuring in Javascript. The examples below are self explanatory
{first, second, third} = {1, :ok, "fine"}
IO.inspect(first)
IO.inspect(second)
IO.inspect(third)
[head | tail] = [1, 2, 3, "four"]
IO.inspect(head, label: "head: ")
IO.inspect(tail, label: "tail: ")
# when we don't need a variable, use the underscore "_" and it becomes gray.
[_h | [_h1 | t]] = [1, :two, 3, "four"]
IO.puts("t is: #{inspect(t)}")
A very popular usage of pattern matching is showed further down in the Function paragraph.
As every functional language, Elixir
implements anonymous functions. These start with fn
and end with end
, and can be binded to a variable.
add = fn a, b -> a + b end
add.(1, 2)
# beware of the dot "."
Note a dot
.
between the variableadd
and parenthesis is required to invoke an anonymous function.
In Elixir, functions are first class citizens
meaning that they can be passed as arguments to other functions the same way integers and strings can.
is_function(add)
This uses the inbuilt function is_function
which checks to see if the parameter passed is a function and returns a bool.
Anonymous functions are closures (named functions are not) and as such they can access variables that are in scope when the function is defined.
Example: you can define a new anonymous function double
that uses the add
anonymous function we have previously defined and a variable b
. Both are in the scope of the new anonymous function double/1
:
b = 3
double = fn a -> add.(a, a) end
double.(b)
These functions can be useful but will no longer be available to you.
If you want to make something more permanent then you can create a module
.
With modules you're able to group several functions together. Most of the time it is convenient to write modules into files so they can be compiled and reused.
In order to create your own modules in Elixir, use the defmodule
macro, then use the def
macro to define functions in that module with the do ... end
. So in this case the module is "Math" and the function is "sum".
The first letter of the module name must be in uppercase.
defmodule Math do
def sum(a, b) do
a + b
end
def double_sum(a, b), do: sum(a, b) * 2
end
Note how we simplified the way we wrote the second function with the
, do:
isntead of thedo ... end
. This is a short cut when the body of the function is simple.
Math.sum(1, 2)
Math.double_sum(1, é)
We have seen the anonymous function can access to outer variables within their scope. For example, we can apply via the module Enum
an anonymous function to each element of an enumerable such as a list because the variable is in the scope.
Enum.map([1, 2], fn x -> x + 2 end)
This is not true for named functions, those declared within a module. Furthermore, only named function are exported from a module, not anonymous functions.
Look at the example below. The outer variable a
is not accessible within a module. In the cell below, you can see that a
is underlined in red. If you evaluate this cell, you get an error and a warning on var
. The variable b
is not accessible as well.
Bad example:
a = 4
defmodule ExMod do
def print_a do
a
end
end
ExMod.print_a()
You can get around this via module attributes. These are values prepended with @
that are accessible to any function in the module. Note there is no =
sign between the variable declaration @a
and the value.
b = 5
defmodule ExMod2 do
@b b
def print_b, do: @b
end
ExMod2.print_b()
Note that this turns the function to be impure; if b
changes, the n the result of ExMod2.print_b
changes. You can consider them as environmental variables: you may need to use them in a controlled way.
The control flow if
below uses the variable n
. It is in the scope, accessible. The macro if
is itself a function that returns something.
n = 5
result =
if rem(n, 2) == 0 do
n + 1
else
n + 2
end
IO.inspect(result, label: "the macro 'if' returns: ")
IO.inspect(n, label: "the value of 'n' is unchanged: ")
We can rebind the variable n
.
n =
if rem(n, 2) == 0 do
n + 1
else
n + 2
end
IO.inspect(n, label: "the value of 'n' is now: ")
Imagine you have a function that behaves differently depending on a variable. For example, the function adds if
statement like this:
defmodule Ex1 do
def add(x) do
if rem(x, 2) == 0 do
x + 1
else
x + 2
end
end
end
Ex1.add(5)
We can instead use a guard clause. You define our function twice like this:
defmodule Ex2 do
def add(x) when rem(x, 2) == 0, do: x + 1
def add(x), do: x + 2
end
Ex2.add(3)
This works again by pattern matching. This means the order in which you defined the headers is extremely important. If you reverse the order, the second clause will never match since the first will match all cases.
A use case is recursion, which is heavily used in functional code. Guards allows you yo write recursive functions very easily like this:
defmodule Ex3 do
def sum(n) when n == 1, do: 1
def sum(n), do: n + sum(n - 1)
end
Ex3.sum(4)
We need to add a "stop" condition in a recursion. This is provided by the first header of the function sum
. Every recursive function will be written in this form with 2 declarations.
What if we wanted to chain our functions? We can do this with the pipe |>
operator.
An example. Consider the two functions below. Suppose that times
receives as first argument the output of double
, then we can pipe them: double.(2) |> times.(3)
.
Note we write only the second argument of
times
because his first argument is implicit. If we write it, you get an error because the compiler will consider thattimes
has 3 arguments whereas we only definedtimes
with 2 arguments. The number of arguments is called the "arity".
We can even pipe inspect
in the middle of the piping sequence to check what double
is sending to times
. This can be very usefull.
double = fn x -> x * 2 end
times = fn x, y -> x * y end
double.(2)
|> IO.inspect(label: "'add' sends")
|> times.(3)
One popular use of atoms in Elixir
is to use them as messages for pattern matching.
Let's say you have a function which processes an http
request. The outcome of this process is either going to be a success or an error. You could therefore use atoms to indicate whether or not this process is successful. If the result of our process is successful, it will return {:ok, lines}
, however if it fails (e.g. returns nil) then it will return an error {:error, "failed to process response}
. This will allows us to pattern match on this result.
defmodule HTTP do
def process(http_request) do
lines = http_request |> String.split("\n")
case lines == [""] do
false ->
{:ok, lines}
true ->
{:error, "failed to process response"}
end
end
end
We can pattern match on the response:
{status1, response1} = HTTP.process("the request response is text.txt\n and is very long")
{status2, response2} = HTTP.process("")
IO.inspect(status1)
IO.inspect(response1)
IO.inspect(status2)
IO.inspect(response2)
The usage of the pattern below is very popular in Elixir
:
case HTTP.process("") do
{:ok, response} -> response
{:error, msg} -> msg
end
The &
symbol is called the
capture operator,
which can be used to quickly generate anonymous functions that expect at least one argument.
The arguments can be accessed inside the capture operator &()
with &X
, where
X
refers to the input number of the argument.
There is no difference between:
add_capture = &(&1 + &2)
add_fn = fn a, b -> a + b end
add_capture.(1, 2) == add_fn.(1, 2)
We can use this to pass a short anonymous function on an enumerable. The readability is a matter of taste.
list = [1, 2]
add_one = &(&1 + 1)
eval1 = Enum.map(list, fn x -> x + 1 end) == Enum.map(list, &(&1 + 1))
eval2 = Enum.map(list, &(&1 + 1)) == Enum.map(list, add_one)
IO.puts("First is: #{eval1}")
IO.puts("Second is: #{eval2}")
Note that
list
is immutable.
To get started with your first Elixir
project that doesn't use a Livebook
, you need to make use of the
Mix
build tool that comes with Elixir
.
Mix allows you to do a number of things including:
- Create projects
- Compile projects
- Run tasks
- Testing
- Generate documentation
- Manage dependencies
To generate a new project follow these steps:
Initialise a project by typing the following command in your terminal, replacing [project_name] with the name of your project:
mix new [project_name]
e.g:
mix new animals
We have chosen to call our project 'animals'
This will create a new folder with the given name of your project and should also print something that looks like this to the command line:
* creating README.md
* creating .formatter.exs
* creating .gitignore
* creating mix.exs
* creating lib
* creating lib/animals.ex
* creating test
* creating test/test_helper.exs
* creating test/animals_test.exs
Your Mix project was created successfully.
You can use "mix" to compile it, test it, and more:
cd animals
mix test
Run "mix help" for more commands.
Navigate to your newly created directory:
> cd animals
Open the directory in your text editor. You will be able to see that Elixir
has
generated a few files for us that are specific to our project:
lib/animals.ex
test/animals_test.ex
Open up the animals.ex
file in the lib directory. You should already see some hello
boilerplate.
Elixir
has created a module with the name of your project along with a function
that prints out a :world
atom when called.
It's also added boilerplate for module and function documentation - the first part of the file. (we will go into more detail about documentation later).
Let's add some functionalities in it:
defmodule Animals do
@moduledoc false
@doc """
Hello world.
## Examples
iex> Animals.hello()
:world
"""
def hello do
:world
end
@doc """
`create_zoo/0` returns a list of zoo animals
## Examples
iex> Animals.create_zoo
["lion", "tiger", "gorilla", "elephant", "monkey", "giraffe"]
"""
def create_zoo do
["lion", "tiger", "gorilla", "elephant", "monkey", "giraffe"]
end
@doc """
`randomise/1` takes a list of zoo animals and returns a new randomised list with
the same elements as the first.
## Examples
iex> zoo = Animals.create_zoo
iex> Animals.randomise(zoo)
"""
def randomise(zoo) do
Enum.shuffle(zoo)
end
@doc """
contains? takes a list of zoo animals and a single animal and returns a boolean
as to whether or not the list contains the given animal.
## Examples
iex> zoo = Animals.create_zoo
iex> Animals.contains?(zoo, "gorilla")
true
"""
def contains?(zoo, animal) do
Enum.member?(zoo, animal)
end
@doc """
`see_animals/2` takes a list of zoo animals and the number of animals that
you want to see and then returns a list
> Note: `Enum.split` returns a tuple so we have to pattern match on the result
to get the value we want out.
## Examples
iex> zoo = Animals.create_zoo
iex> Animals.see_animals(zoo, 2)
["monkey", "giraffe"]
"""
def see_animals(zoo, count) do
{_seen, to_see} = Enum.split(zoo, -count)
to_see
end
@doc """
`save/2` takes a list of zoo animals and a filename and saves the list to that file
## Examples
iex> zoo = Animals.create_zoo
iex> Animals.save(zoo, "my_animals")
:ok
"""
def save(zoo, filename) do
# erlang is converting the zoo list to something that can be written to the file system
binary = :erlang.term_to_binary(zoo)
File.write(filename, binary)
end
@doc """
`load/1` takes filename and returns a list of animals if the file exists
> Note: here we are running a case expression on the result of File.read(filename)
- if we receive an :ok then we want to return the list
- if we receive an error then we want to give the user an error-friendly message
## Examples
iex> Animals.load("my_animals")
["lion", "tiger", "gorilla", "elephant", "monkey", "giraffe"]
iex> Animals.load("aglkjhdfg")
"File does not exist"
"""
def load(filename) do
case File.read(filename) do
{:ok, binary} -> :erlang.binary_to_term(binary)
{:error, _reason} -> "File does not exist"
end
end
@doc """
`selection/1` takes a number, creates a zoo, randomises it and then returns a list
of animals of length selected
> Note: We are using the pipe operator here. It takes the value returned from the
expression and passes it down as the first argument in the expression below.
`see_animals` takes two arguments but only one needs to be specified
as the first is provided by the pipe operator
## Examples
iex> Animals.selection(2)
"""
def selection(number_of_animals) do
Animals.create_zoo()
|> Animals.randomise()
|> Animals.see_animals(number_of_animals)
end
end
Let's test out the boilerplate code. In your project directory type the following command:
> iex -S mix
What this means is: "Start the Elixir
REPL and compile with the context of my current project".
This allows you to access modules and functions created within the file tree.
Call the hello
function given to us by Elixir
. It should print out the :world
atom to the command line:
> Animals.hello
# :world
We then added some functions with their documentation: create_zoo/0
, randomise/1
, contains?/2
.
NOTE: we are making use of a pre-built module called Enum
which has a list of functions that you can use on enumerables such as lists. Documentation available at:
hexdocs.pm/elixir/Enum.html
NOTE: It's convention when writing a function that returns a boolean to add a question mark after the name of the method.
zoo = Animals.create_zoo()
shuffled_zoo = Animals.randomise(zoo)
If you run this Livebook locally, you will see the documentation of the function when you pass the cursor above it.
Animals.contains?(shuffled_zoo, "gorilla")
We have a pattern matching example in the module with the function Animals.see_animals
.
Animals.see_animals(shuffled_zoo, 3)
The function save/2
writes to the file system. Note the conversion :erlang.term_to_binary/1
before using the File
module.
Animals.save(zoo, "zoo.txt")
This will create a new file in your file tree with the name of the file that you specified in the function. It will contain some odd characters:
�l\����m����lionm����tigerm����gorillam����elephantm����monkeym����giraffej
You can load back the file with Animals.load/1
. Note the case <something> do
switch. The value of <something>
is returned by the function File.read
. It returns a tuple whose first element is the atom :ok
or :error
, so returns {:ok, value}
or {:error, reason}
. You pattern matching on this return, thus have 2 cases:
- the success case, where the first element of the response tuple is
:ok
, you use the pattern matching binding to the second element of it's tuple response to the variablevalue
so that we can use it. - the error case. In case the first element of the response tuple is the atom
:error
, you ignore the second element (you underscore it_reason
) and return an error message.
Note the opposite conversion :erlang.binary_to_term/1
Animals.load("zoo.txt")
Animals.load("blabla")
If you do not use
Livebook
but a code editor instead, each time you modify your module, you need to recompile it:
recompile()
If you use
Livebook
, each time you change the Animals module, you need to "evaluate" the cell.
What if we wanted to call some of our functions in succession to another? It takes the output of a function as the input of the first variable of the next function. When you "pipe" two functions, you musn't write the first argument of the second function because it is implicit. This way, we can write clean and shorter code.
This is done in the code of the function Animals.selection/1
.
Animals.selection(2)
When we created a new project with mix, it created a file for us called mix.exs
which is referred to as the 'MixFile'. This file holds information about our
project and its dependencies.
At the bottom of the file it gives us a function called deps
which manages all
of the dependencies in our project. To install a third party package we need to
manually write it in the deps function (accepts a tuple of the package name and
the version) and then install it in the command line. Let's install ex_doc
as
an example:
Add the following to the deps function in your mix.exs
file:
def deps do
[
{:ex_doc, "~> 0.21"}
]
end
Then in the command line quit your iex
shell and enter the following to install
the ex_docs
dependency:
> mix deps.get
You might receive an error saying:
Could not find Hex, which is needed to build dependency :ex_doc
Shall I install Hex? (if running non-interactively,
use: "mix local.hex --force") [Yn]
If you do then just enter y
and then press enter. This will install the dependencies that you need.
Once ex_docs
has been installed, run run the following command to generate documentation (make sure you're not in iex
):
> mix docs
This will generate documentation that can be viewed if you copy the file path of the index.html
file within the newly created doc
folder and then paste it in your browser.
If you have added documentation to your module and functions as per the examples above,
you should see something like the following:
It looks exactly like the format of the official Elixir
docs because they used the same tool to create theirs. Here is what the method documentation should look like if you click on Animals
:
This is an incredibly powerful tool that comes 'baked in' with elixir. It means that other developers who are joining the project can be brought up to speed incredibly quickly!
When you generate a project with Elixir
it automatically gives you a number of
files and directories. One of these directories is called test
and it holds two
files like should have names like:
[project_name]_test.exs
test_helper.exs
We are running this code in a Livebook
. It is slightly different.
https://www.elixirnewbie.com/blog/writing-tests-in-livebook
Since we are running the code in a Livebook
, run the following:
ExUnit.start(auto_run: false)
!!!! For the moment, the doctest
functionality does not work on Fly.io. We can nevertheless run tests.
NOTE: you need to run the command above and set
async: false
to run test in Livebook.
defmodule AnimalsTest do
use ExUnit.Case, async: false
doctest Animals
describe "first test" do
test "greets the world" do
assert Animals.hello() == :world
end
end
describe "test Animal module" do
test "contains?" do
zoo = Animals.create_zoo()
assert true == Animals.contains?(zoo, "gorilla")
end
end
end
ExUnit.run()
If you want to learn about code coverage then check out the following tutorial:
https://github.com/dwyl/learn-elixir/tree/master/codecov_example.
A blog post that explains how to run tests in Livebook.
The following is not a concern for Livebook.
In Elixir
version 1.6 the mix format
task was introduced.
See: elixir-lang/elixir#6643
mix format
is a built-in way to format your Elixir
code according to the community-agreed consistent style.
This means all code will look consistent across projects (personal, "work" & hex.pm packages)
which makes learning faster and maintainability easier!
At present, using the formatter is optional,
however most Elixir
projects have adopted it.
To use the mix task in your project, you can either check files individually, e.g:
mix format path/to/file.ex
Or you can define a pattern for types of files you want to check the format of:
mix format "lib/**/*.{ex,exs}"
will check all the .ex
and .exs
files in the lib/
directory.
Having to type this pattern each time you want to check the files is tedious. Thankfully, Elixir
has you covered.
In the root of your Elixir
project, you will find a .formatter.exs
config file with the following code:
# Used by "mix format"
[
inputs: ["{mix,.formatter}.exs", "{config,lib,test}/**/*.{ex,exs}"]
]
This means that if you run mix format
it will check the mix.exs
file
and all .ex
and .exs
files in the config
, lib/
and test
directories.
This is the most common pattern for running mix format. Unless you have a reason to "deviate" from it, it's a good practice to keep it as it is.
Simply run:
mix format
And your code will now follow Elixir's formatting guidelines.
You may also use credo
, a static code analyzer.
We recommend installing a plugin in your Text Editor to auto-format:
-
Atom Text Editor Auto-formatter: https://atom.io/packages/atom-elixir-formatter
-
Vim
Elixir
Formatter: https://github.com/mhinz/vim-mix-format -
VSCode: https://marketplace.visualstudio.com/items?itemName=sammkj.vscode-elixir-formatter
-
Read the
mix/tasks/format.ex
source to understand how it works: https://github.com/elixir-lang/elixir/blob/master/lib/mix/lib/mix/tasks/format.ex -
https://hashrocket.com/blog/posts/format-your-elixir-code-now
-
https://devonestes.herokuapp.com/everything-you-need-to-know-about-elixirs-new-formatter
To publish your Elixir
package to Hex.pm:
-
Check the version in
mix.exs
is up to date and that it follows the semantic versioning format:MAJOR.MINOR.PATCH where
MAJOR version when you make incompatible API changes MINOR version when you add functionality in a backwards-compatible manner PATCH version when you make backwards-compatible bug fixes
-
Check that the main properties of the project are defined in
mix.exs
- name: The name of the package
- description: A short description of the package
- licenses: The names of the licenses of the package
- NB. dwyl's
cid
repo contains an example of a more advancedmix.exs
file where you can see this in action
-
Create a Hex.pm account if you do not have one already.
-
Make sure that ex_doc is added as a dependency in you project
defp deps do
[
{:ex_doc, "~> 0.21", only: :dev}
]
end
When publishing a package, the documentation will be automatically generated.
So if the dependency ex_doc
is not declared, the package won't be able to be published
- Run
mix hex.publish
and if all the information are correct replyY
If you have not logged into your Hex.pm account in your command line before running the above command, you will be met with the following...
No authenticated user found. Do you want to authenticate now? [Yn]
You will need to reply Y
and follow the on-screen instructions
to enter your Hex.pm username and password.
After you have been authenticated, Hex will ask you for a local password that applies only to the machine you are using for security purposes.
Create a password for this and follow the onscreen instructions to enter it.
- Now that your package is published you can create a new git tag with the name of the version:
git tag -a 0.1.0 -m "0.1.0 release"
git push --tags
That's it, you've generated, formatted
and published your first Elixir
project.
If you want a more detailed example
of publishing a real-world package
and re-using it in a real-world project,
see:
code-reuse-hexpm.md
Maps are very similar to Object
literals in JavaScript
. They have almost the samesyntax except for a %
symbol. They look like this:
animal = %{
name: "Rex",
type: "dog",
legs: 4
}
Values can be accessed in a couple of ways, exactly like Javascript.The first is by dot notation just like JavaScript
and the square bracket[key]
animal.type
key = :type
animal[key]
The third way values can be accessed is by pattern matching, similar to "destructuring" in Javascript.
Let's say we wanted to assign values to the variables for each of the key-value pairs in the map. We would write something that looks like this:
%{
name: var_name,
type: var_type,
legs: var_legs
} = animal
You can pattern match on a part of the map. Note that you don't have the Javascript short cut of not writting the value explicitely; you need to write key: value
.
%{name: name} = animal
name
We now have access to the values by typing the variable names. We used the string interpolation #{variable}
as seen above to shorten the output. We check that the variables var_name
, var_type
and var_legs
have a value found by pattern matching:
IO.puts("#{var_name}, #{var_type}, #{var_legs}")
Due to the immutability of Elixir
, you cannot update a map using dot notation.
For example, if we try to reassign a value to the map animal
, we have an error:
animal.name = "Max"
In Elixir
we can only create new data structures as opposed to manipulating existing
ones. So when we update a map, we are creating a new map with our new values.
This can be done in a couple of ways:
- Function
- Syntax
- Using a function
We can update a map usingMap.put(map, key, value)
. This takes the map you want to update followed by the key we want to reassign and lastly the value that we want to reassign to the key:
updatedAnimal = Map.put(animal, :name, "Max")
- Using syntax
We can use a special syntax for updating a map in Elixir. It looks like this:
%{animal | legs: 5}
Remark that it didn't take into account the first change we made to
animal.name
, but onlyanimal.legs
is changed.
NOTE: Unlike the function method above, this syntax can only be used to UPDATE a current key-value pair inside the map, it cannot add a new key value pair.
If we want to effectively change animal
, then we have to "re-bind" it. We add IO.inspect/2
to show the intermediate results. Only the last one should be printed otherwise.
IO.inspect(animal)
animal = %{animal | legs: 2} |> IO.inspect()
animal = %{animal | name: "Max"}
When looking into Elixir
you may have heard about its
processes and its support for concurrency.
In fact we even mention processes as one of the key advantages. If you're anything like us,you're probably wondering what this actually means for you and your code.
This section aims to help you understand what they are and how they can help improve
your Elixir
projects.
Elixir-lang
describes processes as:
In
Elixir
, all code runs inside processes. Processes are isolated from each other, run concurrent to one another and communicate via message passing. Processes are not only the basis for concurrency inElixir
, but they also provide the means for building distributed and fault-tolerant programs.
-
a nice video to watch about processes: The Soul of Erlang and Elixir • Sasa Juric • GOTO 2019
-
a blog: understand processes
Let's define a function.
defmodule Math2 do
@doc """
iex> Math2.add(1,2)
3
"""
def add(a, b) do
(a + b) |> IO.inspect()
end
end
Now that we have a definition, let's start by spawning our first process. We can spawn
a process by:
- supplying an anonymous function
- or via a declarative way
<module>, <function>, <args>
spawn(Math2, :add, [1, 2]) |> IO.inspect()
# equivalently:
spawn(fn -> Math2.add(1, 2) end)
The log returns a process identifier
, PID for short, and the result of the Math2.add
function.
A PID is a unique id for a process. It could be unique among all processes in the world, but here it's just unique for your application.
So what just happened here. We called the
spawn/3
function and passed it 3 arguments. The module name, the function name (as an atom), and a list of the arguments that we want to give to our function.
This one line of code spawned a process for us 🎉 🥳
Normally we would not see the result of the function (3 in this case). The only reason we have is because of the
IO.inspect
in the "add" function. If we removed this the only log we would have is the PID itself.
This might make you wonder, what good is spawning a process if I can't get access to the data it returns ?! This is where messages come in.
But before, let's introduce a useful routine, self/0
.
The function self/0
; it returns the PID of the running process. In this case, it is the shell, the main process. We see that the returned PID from the spawn/1
function is different from the main process.
IO.inspect(self(), label: "main process")
pid = spawn(fn -> Math2.add(2, 2) end)
IO.puts("spawned process with PID: #{inspect(pid)}")
In a Livebook
, there is a nice way to visualize the processes with the help of the package Kino.Process
.
IO.inspect(self())
Kino.Process.render_seq_trace(fn -> spawn(Math2, :add, [2, 2]) end)
Now let's run the following module. We added a bunch of IO.inspect/2
to be easier to follow what happens between each step.
defmodule Math3 do
def add(a, b) do
IO.inspect(self(), label: "'add' PID is: ")
receive do
senders_pid ->
IO.inspect(senders_pid, label: "'add' receives this message: ")
IO.puts(
"'add' will 'send' to the process with PID #{inspect(senders_pid)} the message #{a + b}"
)
send(senders_pid, a + b)
end
end
def double(n) do
IO.inspect(self(), label: "process 'double' PID is: ")
spawn(Math3, :add, [n, n])
|> send(self())
|> IO.inspect(label: "double sends his PID: ")
receive do
doubled ->
IO.inspect(doubled, label: "double received the message: ")
doubled
end
end
end
IO.inspect(self(), label: "main process: ")
Math3.double(10)
sequenceDiagram
participant D as double
participant A as add
D->>A: SPAWN process 'add' with argument '[n,n]'
Note left of A: process 'add' is created with arg '[n,n]'
D->>A: SEND his PID: send( pid_D )
loop receive do
A->>A: RECEIVE do ( &handle_a/1 ) end
end
Note over A,A: this listener receives: "pid_D" <br/> and returns: "send(pid_D, n + n)"
Note over A,A: handle_a = send(pid_D, n + n)
A->>D: SEND send(pid_D, n + n)
loop receive do
D->>D: RECEIVE do( &handle_d/1 ) end
end
Note over D,D: the listener receives: "n + n" <br/> and returns: "return n + n"
Let's go through the code.
We have a function called double
This function spawns the Math.add/2
function. Remember the spawn function returnes a PID. We pipe |>
the "spawn" with a send/2
. This means the output of the spawn (the PID) is used as the first argument of send
.
The function send/2
takes two arguments, a destination and a message. Because we "pipped" "spawn" with "send", the first argument will be what "spawn" returns, so the destination is the PID
created by the spawn
function on the line above. The second argument, the message, is
self/0
, the PID of the calling process (the PID of double).
The last instruction of double
is to call
receive/1
. This is a listener which checks if there is a message matching the clauses in the current process. It works very similarly to a case
statement. In this case, the variable doubled
will match anything, so it will capture anything message sent to this process, and just returns whatever the message was.
The add/2
function also contains a listener receive
. This listener receives a message, supposed to be the PID of the sender. It returns a function that send
a message. The message is the result of the addition a+b
and the destination is the process whose PID is the one reiceved, so back to the sender.
This will trigger the receive block in our double function. As mentioned above, it simply returns the message it receives which is the answer from add.
Now that we can create processes that can send messages to each other, let's see if we can use them for something a little more intensive than doubling an integer.
Parallelism is about using multiple cores, whilst concurrency is about starting multiple tasks at the same time, independantly of the number of cores.
"Concurrency and parallelism are related terms but don't mean precisely the same thing. A concurrent program is one where multiple tasks can be "in progress," but at any single point in time, only one task is executing on the CPU (e.g., executing one task while another is waiting for IO such as reading or writing to the disk or a network). On the other hand, a parallel program is capable of executing multiple tasks at the same time on multiple CPU cores."
In Elixir, processes are not OS processes, and have separate contexts, independant execution contexts (isolation). You can have hundreds of thousands of processes on a single CPU. If your computer has multiple cores, the BEAM - the VM that runs the Elixir
code - will automaticaly run processes on each of them in parallel.
Something can run concurrently, but that doesn't mean it will be parallel. If you run 2 CPU-bound concurrent tasks with one CPU core, they won't run in parallel. Concurrency doesn't always mean that it will be faster. However, if something is running in parallel, that means that it is running concurrently."
The speedup with concurrency is largely dependent on whether the task is IO- or CPU-bound, and whether there is more than 1 CPU core available.
Let's give an example. In the code below, we use a comprehension to enumerate over a range. It is an iteration loop and uses the for command. In the first loop, we iterate over the range 1..4
and ask to print the index and the present time every 500ms. The index and time will appear sequentially. In the second loop, we iterate over the range 5..8
and run concurrently processes by spawning the function above. As a response, the PIDs will be printed immediately, and then all the processes will end all together 500ms after: this is concurrency.
sleep = fn i, t ->
Process.sleep(t)
IO.puts("#{i}: #{Time.utc_now()}")
end
# single process
Kino.Process.render_seq_trace(fn -> spawn(fn -> for i <- 1..4, do: sleep.(i, 500) end) end)
# concurrent processes
Kino.Process.render_seq_trace(fn ->
for i <- 5..8, do: spawn(fn -> sleep.(i, 500) end)
end)
In the example below we will aim to see exactly how concurrency can be used to speed up a function (and in turn, hopefully a project).
We are going to do this by solving factorials using two different approaches. One will solve them on a single process and the other will solve them using multiple processes.
Recall that the factorial of a number
$n$ is the product of all the integers below it: e.g $\rm{factorial}(4) = 123*4=24$
It is also note
Note: this livebook is running on fly.io, thus limited in terms of CPU and cores. You are limited to small values, not more than 10_000. You might not fully appreciate this unless you fork this repo and run it on your computer.
Run the cell below. You should see 1, whilst probably the result is at least 4 on your computer. Parallelism on fly.io is not possible with this free tier, thus only concurrency can be used. If your computer has more than 1 core, then the BEAM (the Erlang virtual machine that runs the code) will automatically make in run parallel.
:erlang.system_info(:logical_processors_available)
This represents the maximum number of VM processes that can be executing at the same time is given by:
:erlang.system_info(:schedulers_online)
Now, consider the following module that computes the factorial of a given number. It computes the factorial in 3 differents ways:
- via recursion,
Factorial.facto
, - via reduction,
Factorial.calc_product
, - and via concurrency, with
Factorial.spawn
andFactorial.stream
We added a helper function at the end for the rendering.
-
the recursion works simply by using the formula
$n! = n\times (n-1)!$ . In other words, a function that calls himself: $$ \rm{factorial}(n) = n \cdot \rm{factorial}(n-1) $$ -
the
calc_product
works by reduction: sending the functionfn x, acc -> x * acc
to the list, and the result is accumulated in theacc
variable. -
the concurrent version will calculate concurrently "chunked" subproducts. Given a number
$n$ , we generate a list$[1,\dots, n]$ and group them by say 4: we get a list of sublists of 4 consecutive numbers. Then we apply anEnum.map
function to this modified list. It sends a spawned version of a function to compute the subproduct$n \times n+1 \times n+2 \times n+3$ . This function is the reductioncalc_product
that sends back to the sender a subproduct. This is the return of the spawn. Since we ranEnum.map
, these responses will by collected in a list. It is then sufficient to reduce this new list by again usingcalc_product
.
We have 2 concurrent versions:
- one with
spawn
that uses the message passingreceive do
andsend
, - one that uses the Stream module. This generates concurrency and is designed ofr handling data.
chunk = 4
defmodule Factorial do
@chunk chunk
# concurrent "spawn" version
def spawn(n) do
1..n
|> Enum.chunk_every(@chunk)
|> Enum.map(fn list ->
spawn(Factorial, :_spawn_function, [list])
|> send(self())
receive do
chunked_product -> chunked_product
end
end)
|> calc_product()
end
def _spawn_function(list) do
receive do
sender ->
chunked_product = calc_product(list)
send(sender, chunked_product)
end
end
## Reduction ######################################
@doc """
Used on the single process, the last loop of "spawn"
iex> Factorial.calc_product(4)
24
"""
def calc_product(n) when is_integer(n) do
Enum.reduce(1..n, 1, fn x, acc -> x * acc end)
end
# used with multiple processes, in the spawned function
def calc_product(list), do: Enum.reduce(list, 1, &(&1 * &2))
### Recursion #########################################
@doc """
iex> Factorial.facto(4)
24
"""
def facto(0), do: 1
def facto(n), do: n * facto(n - 1)
# concurrent with Stream module ##########################
@doc """
Concurrenct with Stream
iex> Factorial.stream(4)
24
"""
def stream(n) do
1..n
|> Stream.chunk_every(10)
|> Stream.map(&calc_product/1)
|> Enum.to_list()
|> Enum.reduce(1, &(&1 * &2))
end
### Helper
def run(f_name, args) do
:timer.tc(Factorial, f_name, args)
# only displays the time as I didn't want to log numbers that could have thousands of digits
|> elem(0)
end
end
Before we go any further, let's take a quick look at the calc_product/1
function.
You will see that there are 2 definitions for this function. One which takes a list and another which takes an integer and turns it into a range. Other than this, the functions work in exactly the same way. They both call reduce
on an enumerable and multiply the current value with the accumulator.
As a side note, recall that we can used the equivalent shorthand notation with &
instead of the anonymous function being passed to the reduce
. In the example below, you create 2 equivalent anonymous functions:
prod1 = &(&1 * &2)
# equivalent to:
prod2 = fn x, acc -> x * acc end
# check:
prod1.(2, 3) == prod2.(2, 3)
The reason both
calc_product(n)
andcalc_product(list)
work the same way is so that we could see the effect multiple processes running concurrently have on how long it takes for us to get the results of our factorial. I didn't want differences in a functions approach to be the reason for changes in time. Also these factorial functions are not perfect and do not need to be. That is not what we are testing here.
Let's run the two functions below:
Factorial.facto(11)
Factorial.calc_product(11)
You just solved a factorial on a single process.
This works well on a smaller scale but what if we need/want to work out factorial(100_000)
.
If we use this approach it will take quite some time before it we get the answer returned (something we will log a little later). The reason for this is because this massive sum is being run on a single process.
This is where spawning multiple processes comes in. By spawning multiple processes, instead of giving all of the work to a single process, we can share the load between any number of processes. This way each process is only handling a portion of the work and we should be able to get our solution faster.
This sounds good in theory but let's see if we can put it into practice.
First, let's look through the spawn
function and try to work out what it is
doing exactly.
def spawn(n) do
1..n
|> Enum.chunk_every(@nbc)
|> Enum.map(fn list ->
spawn(Factorial, :_spawn_function, [list])
|> send(self())
receive do
chunked_product -> chunked_product
end
end)
|> calc_product()
end
The function starts by converting an integer into a range which it then 'chunks' into a list of lists with 4 elements. The number 4 itself is not important, it could have been 5, 10, or 1000. What is important about it, is that it influences the number of processes we will be spawning. The larger the size of the 'chunks' the fewer processes are spawned.
This illustrates this step: we have a list made of sublists of length 4
[1, 2, 3, 4, 5, 6, 7, 8, 9, 10] ->
[[1, 2, 3, 4], [5, 6, 7, 8], [9, 10]]
All sublist have length 4 except maybe the last chunk, which may contain fewer elements.
The Enum.map
will iterate over the list, and apply a function to each element (a sublist). The function that is applied is a spawn
that calls the function _spawn_function(sublist)
.
This _spawned_function
receives a sublist, and computes a subproduct with calc_product
. Since the main process send a message with his PID, and since _spawned_function
has a listener (the receive do
), he will receive this PID. Then we ask to _spawned_function
to send this PID the subproduct.
The _spawn_function
function is pretty simple.
def _spawn_function(sublist) do
receive do
sender ->
chunked_product = calc_product(sublist)
send(sender, chunked_product)
end
end
The Enum.map
function ends with a listener (a receive do
). Whenever this listener receives something, the iteration of Enum.map
is asked to return this something. Knowing that the returns of each iteration of Enum.map
is accumulated in a list, we end this step with a new list that contains all the subproducts.
We eventually then call the calc_product
once more via a reduction: tihs turn the list of subproduct into a single integer, the grand total product, or factorial.
Now that we have been through the code the only things left are to run the code and to time the code.
Let's see how many processes are run when we evaluate Factorial.spawn(11)
with a chunk size of 4. We saw that we will spawn 3 functions that will compute subproducts.
Kino.Process.render_seq_trace(fn ->
Factorial.spawn(11)
end)
This concurrent version Factorial.stream/1
uses the Stream module. This allows lazy evaluation. Lazily means that it doesn't returns an immediately usable response but rather a function that returns a response; the returned function will only be run once it is called. In our case, these functions are executed when we further apply a reduction on this list: we there once again run a reduction to accumulate the product but on the list of subproducts. This eventually calculates the grand product, the factorial.
def stream(n) do
1..n
|> Stream.chunk_every(10)
|> Stream.map(&calc_product/1)
|> Enum.to_list()
|> Enum.reduce(1, &(&1*&2))
end
Performance will largely depend upon your machine, in particular the number of cores for parallelisation. The concurrent stream
will be the most performant method for big numbers. It runs concurrently (and parallel when possible) and has no overhead when compared to spawn
. When the number to evaluate is small, then the simple recursion function will be the most performant (see the benchmark tests further).
When you are using a single core machine like the one used here on Fly.io, the results might be different as there is no chance to run computations in parallel.
Parallelism (mutli-core) is of outmost importance for computations which are heavily CPU-bound operations. Note that IO-bound operations, among which we find web apps with HTTP requests benefit of concurrency only; Javascript is a famous example of a natively concurrent language.
To time the execution of our code, we use the Factorial.run
function to evaluate the execution time (ms) of the function Factorial.facto/1
. It is a wrapper of the routine :timer.tc
. We further use the package benchee.
❗❗ When running on fly.io, there are limitations due to the memory limits so you can't run higher values than 9_000. The fly.io instance has 1 CPU, but probably not your machine so we check the numbers of cores to avoid overflow and process down.
nb_input = Kino.Input.text("You can enter a value for n")
max = 9_000
chunk = 4
n = Kino.Input.read(nb_input)
# avoids blank values and parsing a string into a integer
n = if n == "", do: 1, else: n |> Integer.parse() |> elem(0)
# prevents from running too high values if running on fly.io
nb =
case n > max && :erlang.system_info(:logical_processors_available) == 1 do
true -> max
false -> n
end
# result
n_conc = Factorial.run(:spawn, [nb]) / 1_000
n_stream = Factorial.run(:stream, [nb]) / 1_000
n_rec = Factorial.run(:facto, [nb]) / 1_000
# n_task = Factorial.run(:worker_task, [nb]) / 1000
# output
IO.puts("Time (ms) per process for n= #{nb} with a chunk size of #{chunk}")
IO.puts("- concurrent: #{n_conc}")
IO.puts("- stream: #{n_stream}")
IO.puts("- recursion: #{n_rec}")
# IO.puts("- task_stream: #{n_task}")
We can play a bit more. In the previous example, we chunked the list to spawn a lot of process to compute subproducts, and then simply computed the grand product by reducing the new list.
We can do better with recursion. Once we get a sublist of products, why not reusing the technic to produce a sub-sublist, and redo this until we get a list of length 1? To better understand, take
[1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14]
2. 2. 2. 2. 1
[6, 120, 504, 1320, 182]
2 1
[362880, 241969]
1
87805710720
Details of the algorithm fo compute the sequence of processes
GivenWe can repeat: since
We can compute the number of spawned processes. As the algorithm above suggests, the code is naturally a recursion. Note how we need to reuse the header of a function with a guard clause. This first clause makes the code to stop, the second and third add
Firstly, recall how you can preprend an element as the first element of a list:
list = [3, 2, 1]
[4 | list]
The code of this algorithm:
defmodule Serie do
@doc """
iex> Serie.calc(14,3)
[5, 2, 1]
"""
# stop guard clause
def calc(n, c) when n <= c, do: [1]
def calc(n, c) when rem(n, c) == 0 do
n = div(n, c)
[n | calc(n, c)]
end
# recursion to build the list
def calc(n, c) do
n = div(n, c) + 1
[n | calc(n, c)]
end
@doc """
Calculates the sum of the elements of the list by reduction
iex> Serie.processes(14,3)
8
"""
def processes(n, c) do
Serie.calc(n, c)
|> Enum.reduce(0, &(&1 + &2))
end
@doc """
Calculates how much represents the first step
iex> Serie.summary(14,3)
50.0
"""
def summary(n, c) do
processes(n, c)
|> then(fn d ->
Float.round(div(n, c) / d * 100, 3)
end)
end
end
form =
Kino.Control.form(
[
c: Kino.Input.text("Chunk size"),
n: Kino.Input.text("Enter a number:")
],
submit: "Run"
)
Re-evaluate the "hidden" cell below to refresh (you can double-click to see the code)
frame = Kino.Frame.new() |> Kino.render()
form
|> Kino.Control.stream()
|> Kino.listen(fn stream ->
%{data: %{n: n, c: c}} = stream
n = Integer.parse(n) |> elem(0)
c = Integer.parse(c) |> elem(0)
Kino.Frame.append(
frame,
"For n: #{n} and a chunk of #{c}, the serie of async processes are: #{inspect(Serie.calc(n, c))}"
)
Kino.Frame.append(
frame,
"For {#{n}, #{c}}, the total number of processes is: #{Serie.processes(n, c)}"
)
Kino.Frame.append(frame, "The first step represents #{Serie.summary(n, c)}% of all processes")
end)
The cost is the complication in the code for the compiler and the memory used. If we take
Nevertheless, the algorithm above is coded below quite easily with recursion. This is where functional code shines.
chunk = 5
defmodule FactorialPlus do
@chunk chunk
@doc """
Takes an integer and returns the list version of this function
iex> FactorialPlus.spawn(4)
24
"""
def spawn(n) when is_integer(n) do
Enum.to_list(1..n)
|> FactorialPlus.spawn()
end
# recursion stop
def spawn(list) when is_list(list) and length(list) == 1 do
List.first(list)
end
# "list" version
def spawn(list) when is_list(list) do
list
|> Enum.chunk_every(@chunk)
|> Enum.map(fn list ->
spawn(FactorialPlus, :_spawn_function, [list])
|> send(self())
receive do
chunked_product -> chunked_product
end
end)
# <------ recursion call
|> FactorialPlus.spawn()
end
# subproduct of size "chunk" calculation
def _spawn_function(list) do
receive do
sender ->
chunked_product = FactorialPlus.calc_product(list)
send(sender, chunked_product)
end
end
####### using Stream module ############
@doc """
Takes an integer and returns the list version of this function
iex> FactorialPlus.stream(4)
24
"""
def stream(n, chunk_size \\ @chunk)
def stream(n, chunk_size) when is_integer(n) do
Enum.to_list(1..n) |> FactorialPlus.stream(chunk_size)
end
# recursion stop
def stream(l, _chunk_size) when is_list(l) and length(l) == 1, do: List.first(l)
# list version
def stream(list, chunk_size) when is_list(list) do
list
|> Stream.chunk_every(10)
|> Stream.map(&calc_product/1)
|> Enum.to_list()
# <---- recursion call
|> FactorialPlus.stream(chunk_size)
end
@doc """
iex> FactorialPlus.calc_product([1,2,3])
6
"""
def calc_product(list), do: Enum.reduce(list, 1, &(&1 * &2))
# Time measurement helper
def run(f_name, args) do
:timer.tc(FactorialPlus, f_name, args)
# only displays the time as I didn't want to log numbers that could have thousands of digits
|> elem(0)
end
end
❗❗ Limit yourself to say 8000 if you run this in the cloud!
form =
Kino.Control.form(
[
chunk: Kino.Input.text("Chunk size"),
n: Kino.Input.text("Compute the factorial of:")
],
submit: "Run"
)
frame = Kino.Frame.new() |> Kino.render()
Kino.Control.stream(form)
|> Kino.listen(nil, fn %{data: %{n: n, chunk: chunk}}, _res ->
n = Integer.parse(n) |> elem(0)
fpspw = FactorialPlus.run(:spawn, [n]) / 1_000
fspw = Factorial.run(:spawn, [n]) / 1_000
fstr = Factorial.run(:stream, [n]) / 1_000
fpstr = FactorialPlus.run(:stream, [n]) / 1_000
Kino.Frame.append(frame, number: n, chunk: chunk, spawn_recursive: fpspw)
Kino.Frame.append(frame, number: n, chunk: chunk, spawn: fspw)
Kino.Frame.append(frame, number: n, chunk: chunk, stream: fstr)
Kino.Frame.append(frame, number: n, chunk: chunk, stream_recursive: fpstr)
{:cont, nil}
end)
You can use the library benchee to benchmark the implementations. It compares how fast the implementation is and the memory usage.
Note: it might not be relevate on Fly.io due to the single core and memory limitation (no more than 5000 with Fly.io).
Note: recall that an anonymous function
fn input -> Factorial.spawn(input) end
can be written with the&
operator:&Factorial.spawn/1
We pass the functions to test as maps %{ "name" => function to evaluate, ...}
to the function Benchee.run/2
test? = true
max = 20_000
if test?,
do:
Benchee.run(
%{
"concurrent_spawn" => &Factorial.spawn/1,
"concurrent_stream" => &Factorial.stream/1,
# "single_process" => &Factorial.facto/1,
"concurrent_spawn_recursive" => &FactorialPlus.spawn/1,
"concurrent_stream_recursive" => &FactorialPlus.stream/1
},
memory_time: 2,
inputs: [small: 1_000, larger: max]
)
We want to visualize the computation time for a given number. With the Livebook, we can do this easily with the "smart cell" Chart
. The module VegaLite
needs data in the form of the map of lists, or a list of maps as below:
# map of lists
%{
x: [..n..],
y: [...time...]
}
# or list of maps
[
%{x: 1, y: 1}, %{x: 2, y: 2}, ...
]
To plots 2 curves and get a legend, add another
key: value
as shnown below.
We want to measure the time taken to compute each factorial from 1 to
Note that using concurrency to run all these functions - some Task.async_stream(range, fn i -> time(factorial(i)) end)
- doesn't help as the timer needs to be run as a single process.
We will plot concurrent (spawn) and single process (reduction).
If you run this on Fly.io, limit
$n$ to$8000$ maximum.
defmodule Plot do
@i 2_000
@j 1_000
@max 8_000
def guard(n) do
case n > @max && :erlang.system_info(:logical_processors_available) == 1 do
true -> @max
false -> n
end
end
def range(f, n) do
max = guard(n)
case f do
:calc_product ->
0..max//@j
:stream ->
0..max//@i
end
end
def plot(module, f, n) do
range(f, n)
|> Enum.map(fn i ->
%{number: i, time: module.run(f, [i]) / 1_000, group: f}
end)
end
end
n = 16_000
data_conc_rec = Plot.plot(Factorial, :stream, n)
data_single = Plot.plot(Factorial, :calc_product, n)
VegaLite.new(
width: 400,
height: 400,
title: "Computation time, recursive_stream vs single process (reduction)"
)
|> VegaLite.layers([
VegaLite.new()
|> VegaLite.data_from_values(data_single, only: ["number", "time", "group"])
|> VegaLite.mark(:point)
|> VegaLite.encode_field(:x, "number", type: :quantitative, title: "number")
|> VegaLite.encode_field(:y, "time", type: :quantitative, title: "time(ms)")
|> VegaLite.encode_field(:color, "group", type: :nominal),
VegaLite.new()
|> VegaLite.data_from_values(data_conc_rec, only: ["number", "time", "group"])
|> VegaLite.mark(:line)
|> VegaLite.encode_field(:x, "number", type: :quantitative, title: "number")
|> VegaLite.encode_field(:y, "time", type: :quantitative)
|> VegaLite.encode_field(:color, "group", type: :nominal)
])
Note: this is definitely not a "reason" to switch programming languages, but one of our (totally unscientific) reasons for deciding to investigate other options for programming languages was the fact that JavaScript (with the introduction of ES2015) now has Six Ways to Declare a Function: https://rainsoft.io/6-ways-to-declare-javascript-functions/ which means that there is ambiguity and "debate" as to which is "best practice", Go,
Elixir
and Rust don't suffer from this problem. Sure there are "anonymous" functions in Elixir (required for functional programming!) but there are still only Two Ways to define afunction
(and both have specific use-cases), which is way easier to explain to a beginner than the JS approach. see: http://stackoverflow.com/questions/18011784/why-are-there-two-kinds-of-functions-in-elixir
- Crash Course in Elixir
- Elixir School, which is available translated at least partially in over 20 languages and functions as a great succinct guide to core concepts.
- 30 Days of Elixir
is a walk through the
Elixir
language in 30 exercises. - Learn
Elixir
- List of Curated Resources - Explanation video of Pattern Matching in Elixir
- Sign up to: https://elixirweekly.net/ for regular (relevant) updates!
- List of more useful resources and sample apps
- If you want to know what's next it's worth check out What's Ahead for Elixir? (53 mins) by José Valim (the creator of Elixir)
- Interview with José Valim (the creator of Elixir) on why he made it! https://www.sitepoint.com/an-interview-with-elixir-creator-jose-valim/
- What was "wrong" with just writing directly in Erlang? read: http://www.unlimitednovelty.com/2011/07/trouble-with-erlang-or-erlang-is-ghetto.html
- While
Elixir
by itself is pretty amazing, where the language really shines is in the Phoenix Web Framework!! So once you know the basics of the language LearnPhoenix
Web Development.