Machine Learning @ Applied Statistics with Network Analysis, HSE, Moscow

This repository contains the teaching materials for the elective course Machine Learning, taught in the HSE Moscow master's programme Applied Statistics with Network Analysis. The materials are organized in sections corresponding to lecture days. Each section provides a brief outline of the topics addressed, links to the lecture slides, an outline of the practical exercises and seminars, and references to the relevant literature.

For further information on the course, students can email the lecturers: Nada Lavrač ([email protected]) and Ljupčo Todorovski ([email protected]).

Tentative Course Schedule for the Academic Year 2020/21

You can follow the lectures on Zoom at https://fmf-uni-lj-si.zoom.us/j/97756216461 or by joining the meeting with the ID 977 562 16461.

Date	Topic/Section
Thursday, 14th of January 2021	Introduction to Machine Learning
Tuesday, 19th of January 2021	Learning Rules
Thursday, 21st of January 2021	Relational Learning
Tuesday, 26th of January 2021	Learning from Heterogeneous Data
Thursday, 28th of January 2021	Learning Ensembles
Tuesday, 2nd of February 2021	Support Vector Machines and Kernels
Thursday, 4th of February 2021	Artificial Neural Networks and Deep Learning
Tuesday, 9th of February 2021	Complex Data Types and Embeddings
Thursday, 11th of February 2021	Dimensionality Reduction with Autoencoders
Tuesday, 16th of February 2021	TBA

1: Introduction to Machine Learning

Basic definitions and taxonomy of learning tasks
Three generations of machine learning and data mining methods
Understanding the error of machine learning models
The curse of dimensionality
Rough overview of the course topics
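
The curse of dimensionality can be made concrete with a small calculation. The following is a minimal Python sketch (the course exercises themselves use R; Python is used here only for illustration). It assumes data spread uniformly over a unit hypercube and asks how long the edge of a sub-cube must be to capture a given fraction of the data. The function name `edge_length` is hypothetical.

```python
def edge_length(fraction: float, dim: int) -> float:
    """Edge of a sub-cube of the unit hypercube holding `fraction` of its volume.

    Assumes uniformly distributed data, so the fraction of points equals the
    fraction of volume: edge ** dim == fraction.
    """
    if not 0.0 < fraction <= 1.0:
        raise ValueError("fraction must lie in (0, 1]")
    if dim < 1:
        raise ValueError("dim must be a positive integer")
    return fraction ** (1.0 / dim)


# Edge length needed to capture 10% of the data as the dimension grows.
for dim in (1, 2, 10, 100):
    print(dim, round(edge_length(0.1, dim), 3))
```

Under this assumption, capturing 10% of the data in 10 dimensions already requires a cube spanning roughly 79% of each axis, so a "local" neighbourhood is no longer local.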

Slides

First part, Nada Lavrač
Second part, Ljupčo Todorovski
Last update: 14th of January 2021

Literature

James G, Witten D, Hastie T and Tibshirani R (2013) An Introduction to Statistical Learning. Springer, New York. Available at https://statlearning.com/. Sections 1 and 2, check also the exercises at the end of Section 2.
Bramer M (2007) Principles of Data Mining. Springer, Berlin. DOI:10.1007/978-1-84628-766-4. An introductory textbook for refreshing your knowledge of the basics of data mining. The first edition of the textbook is also available on ResearchGate, https://www.researchgate.net/publication/220688376_Principles_of_Data_Mining

2: Learning Rules

Covering algorithm and its variants
Association rules and subgroup discovery
Evaluating rules and rule sets
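
As an illustration of rule evaluation, the sketch below computes three standard measures for an association rule: support, confidence, and lift. It is a minimal Python example (the course exercises use R), and the transactions and function names are made up for the illustration.

```python
from typing import FrozenSet, Iterable, List


def support(itemset: Iterable[str], transactions: List[FrozenSet[str]]) -> float:
    """Fraction of transactions that contain every item of `itemset`."""
    items = frozenset(itemset)
    return sum(items <= t for t in transactions) / len(transactions)


def confidence(antecedent, consequent, transactions) -> float:
    """Estimate of P(consequent | antecedent); antecedent must occur at least once."""
    both = frozenset(antecedent) | frozenset(consequent)
    return support(both, transactions) / support(antecedent, transactions)


def lift(antecedent, consequent, transactions) -> float:
    """Confidence relative to the baseline frequency of the consequent."""
    return confidence(antecedent, consequent, transactions) / support(
        consequent, transactions
    )


# Toy, made-up market-basket data.
transactions = [
    frozenset({"bread", "milk"}),
    frozenset({"bread", "butter"}),
    frozenset({"bread", "milk", "butter"}),
    frozenset({"milk"}),
]

print(support({"bread", "milk"}, transactions))
print(confidence({"bread"}, {"milk"}, transactions))
print(lift({"bread"}, {"milk"}, transactions))
```

A lift below 1, as for the rule bread → milk on this toy data, indicates that the antecedent makes the consequent less likely than its baseline frequency.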

Exercises

3: Relational Learning

Learning relational rules
Inductive logic programming
Propositionalization
Wordification and Python-RDM
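
The core idea of wordification can be sketched in a few lines: each individual in the main table becomes a "document" whose "words" have the form table_attribute_value, collected from the main table and from related rows. The Python sketch below is a simplified illustration under that assumption; it does not use the Python-RDM API, it omits the weighting of words (for example with TF-IDF) applied by the published method, and all table and function names are hypothetical.

```python
from collections import Counter
from typing import Dict, List, Mapping

Row = Mapping[str, object]


def wordify(
    main_rows: List[Row],
    related_rows: List[Row],
    key: str,
    main_name: str,
    related_name: str,
) -> Dict[object, Counter]:
    """Map each main-table key value to a bag of table_attribute_value words."""
    docs: Dict[object, Counter] = {}
    for row in main_rows:
        doc: Counter = Counter()
        for attr, value in row.items():
            if attr != key:
                doc[f"{main_name}_{attr}_{value}"] += 1
        docs[row[key]] = doc
    for row in related_rows:
        doc = docs.get(row[key])
        if doc is None:
            continue  # related row without a matching main-table entry
        for attr, value in row.items():
            if attr != key:
                doc[f"{related_name}_{attr}_{value}"] += 1
    return docs


# Toy, made-up relational data: trains and the cars attached to them.
trains = [
    {"train_id": 1, "load": "heavy"},
    {"train_id": 2, "load": "light"},
]
cars = [
    {"train_id": 1, "shape": "rectangle", "roof": "none"},
    {"train_id": 1, "shape": "rectangle", "roof": "peaked"},
    {"train_id": 2, "shape": "oval", "roof": "none"},
]

docs = wordify(trains, cars, "train_id", "train", "car")
for train_id, bag in docs.items():
    print(train_id, dict(bag))
```

The resulting bags of words form a single flat table (one row per train), which any propositional learner can then process.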

4: Learning from Heterogeneous Data

Propositionalization of heterogeneous information networks
Learning from taxonomies and ontologies
Semantic learning with HINMINE

5: Learning Ensembles

Why ensembles: bias-variance decomposition of the predictive error
Boosting, bagging, feature subspaces, random forests
Out-of-bag error estimate, feature importance
Random forests in R
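
The out-of-bag error estimate relies on the fact that a bootstrap sample leaves out a sizeable share of the training examples. The following minimal Python sketch (the course exercises use R) compares the exact left-out fraction with a simulation; the function names and the sample size are chosen for illustration only.

```python
import math
import random


def oob_fraction_exact(n: int) -> float:
    """Probability that a given example is absent from a bootstrap sample of size n."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    return (1.0 - 1.0 / n) ** n


def oob_fraction_simulated(n: int, rng: random.Random) -> float:
    """Draw one bootstrap sample of size n and return the share of unseen examples."""
    drawn = {rng.randrange(n) for _ in range(n)}
    return 1.0 - len(drawn) / n


rng = random.Random(0)  # fixed seed so the run is reproducible
n = 10_000
print("exact:    ", oob_fraction_exact(n))
print("simulated:", oob_fraction_simulated(n, rng))
print("limit 1/e:", math.exp(-1))
```

For large n the left-out fraction approaches 1/e, about 36.8%, so each tree in a bagged ensemble can be evaluated on the examples it never saw during training.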

6: Support Vector Machines and Kernels

Linear support vector machine
Non-linearity and kernel functions
Selecting kernels, setting hyper-parameters
Support vector machines and kernels in R
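
To illustrate what a kernel function computes, the sketch below evaluates the Gaussian (RBF) kernel k(x, y) = exp(-gamma * ||x - y||^2) and builds the kernel matrix for a few points. It is a minimal Python example (the course exercises use R); the points, the function names, and the value of the hyper-parameter `gamma` are assumptions made for the illustration.

```python
import math
from typing import List, Sequence


def rbf_kernel(x: Sequence[float], y: Sequence[float], gamma: float = 1.0) -> float:
    """Gaussian (RBF) kernel: exp(-gamma * squared Euclidean distance)."""
    if len(x) != len(y):
        raise ValueError("x and y must have the same dimension")
    squared_distance = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * squared_distance)


def gram_matrix(points: Sequence[Sequence[float]], gamma: float = 1.0) -> List[List[float]]:
    """Kernel matrix K with K[i][j] = k(points[i], points[j])."""
    return [[rbf_kernel(p, q, gamma) for q in points] for p in points]


# Toy, made-up points in the plane.
points = [(0.0, 0.0), (1.0, 0.0), (0.0, 2.0)]
K = gram_matrix(points, gamma=1.0)
for row in K:
    print([round(value, 4) for value in row])
```

A larger `gamma` makes the kernel narrower, so only very close points count as similar; this is one of the hyper-parameters that typically has to be tuned.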

Slides and Exercise Materials

Slides
Exercises
Last update: 14th of January 2021

Literature

James G, Witten D, Hastie T and Tibshirani R (2013) An Introduction to Statistical Learning. Springer, New York. Available at https://statlearning.com/. Section 9, check also exercises 1-8 in the same section.

7: Artificial Neural Networks and Deep Learning

General introduction to artificial neural networks (ANNs)
Feed-forward networks and back propagation
Towards deep networks: Convolutional networks
ANNs in R
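
Back propagation is the chain rule applied layer by layer. The minimal Python sketch below (the course exercises use R) shows it for the smallest possible case, a single sigmoid neuron with squared-error loss, followed by one gradient-descent step. The weights, the input, the target, and the learning rate are made-up values, and the function names are hypothetical.

```python
import math
from typing import List, Sequence, Tuple


def sigmoid(z: float) -> float:
    return 1.0 / (1.0 + math.exp(-z))


def forward(w: Sequence[float], b: float, x: Sequence[float]) -> float:
    """Output of one sigmoid neuron: sigmoid(w . x + b)."""
    return sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)


def loss(w: Sequence[float], b: float, x: Sequence[float], y: float) -> float:
    """Squared-error loss 0.5 * (output - y)^2 for one example."""
    return 0.5 * (forward(w, b, x) - y) ** 2


def gradients(
    w: Sequence[float], b: float, x: Sequence[float], y: float
) -> Tuple[List[float], float]:
    """Gradient of the loss with respect to w and b via the chain rule."""
    out = forward(w, b, x)
    # dL/dz = dL/dout * dout/dz, where dout/dz = out * (1 - out) for the sigmoid.
    delta = (out - y) * out * (1.0 - out)
    return [delta * xi for xi in x], delta


# Made-up parameters and a single training example.
w, b = [0.5, -0.3], 0.1
x, y = [1.0, 2.0], 1.0
learning_rate = 0.5

grad_w, grad_b = gradients(w, b, x, y)
new_w = [wi - learning_rate * gi for wi, gi in zip(w, grad_w)]
new_b = b - learning_rate * grad_b

print("loss before:", loss(w, b, x, y))
print("loss after: ", loss(new_w, new_b, x, y))
```

In a multi-layer network the same `delta` term is propagated backwards through each layer, which is where the method gets its name.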

8: Complex Data Types and Embeddings

Embeddings for text data, word2vec and doc2vec
Embeddings for network data, node2vec
Embeddings in R
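
Once words, documents, or network nodes are embedded as vectors, similarity between them is commonly measured with the cosine of the angle between the vectors. The minimal Python sketch below (the course exercises use R) uses tiny made-up two-dimensional vectors; real embeddings from word2vec, doc2vec, or node2vec are learned from data and have many more dimensions.

```python
import math
from typing import Sequence


def cosine_similarity(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine of the angle between two non-zero vectors of equal dimension."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same dimension")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        raise ValueError("cosine similarity is undefined for zero vectors")
    return dot / (norm_u * norm_v)


# Made-up vectors standing in for learned embeddings.
embeddings = {
    "king": (0.8, 0.6),
    "queen": (0.6, 0.8),
    "apple": (-0.6, 0.8),
}

print(cosine_similarity(embeddings["king"], embeddings["queen"]))
print(cosine_similarity(embeddings["king"], embeddings["apple"]))
```

Values close to 1 indicate vectors pointing in nearly the same direction, while values near 0 indicate unrelated directions.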

9: Dimensionality Reduction with Autoencoders

Autoencoders as a general embedding approach
Taxonomy of autoencoders: regularization and denoising
Autoencoders in R
