Skip to content

Commit

Permalink
Deployed 7343def with MkDocs version: 1.5.2
Browse files Browse the repository at this point in the history
  • Loading branch information
d9w committed Sep 23, 2024
1 parent 7cc530f commit a9ef561
Show file tree
Hide file tree
Showing 123 changed files with 2,473 additions and 5,922 deletions.
46 changes: 43 additions & 3 deletions 0_1_databases.html
Original file line number Diff line number Diff line change
Expand Up @@ -16,7 +16,7 @@
<link rel="prev" href="ctf.html">


<link rel="next" href="0_2_postgres.html">
<link rel="next" href="0_2_ETL.html">


<link rel="icon" href="https://raw.githubusercontent.com/SupaeroDataScience/DE/master/src/favicon.ico">
Expand Down Expand Up @@ -455,7 +455,27 @@


<li class="md-nav__item">
<a href="0_2_postgres.html" class="md-nav__link">
<a href="0_2_ETL.html" class="md-nav__link">


<span class="md-ellipsis">
ETL
</span>


</a>
</li>









<li class="md-nav__item">
<a href="0_3_postgres.html" class="md-nav__link">


<span class="md-ellipsis">
Expand All @@ -469,6 +489,26 @@








<li class="md-nav__item">
<a href="0_4_project.html" class="md-nav__link">


<span class="md-ellipsis">
Project
</span>


</a>
</li>




</ul>
</nav>

Expand Down Expand Up @@ -1145,7 +1185,7 @@ <h1 id="data-storage">Data Storage<a class="headerlink" href="#data-storage" tit
<p>In this module on databases, database management systems will be covered. A basic understanding of SQL is considered as a prerequisite, and students can refer to the slides and additional resources if needed. For evaluation, students will install and explore the advantages of different DBMSs as a graded project.</p>
<p>In this first class, we introduce the basics of database management systems and cover high level DBMS functionality.</p>
<p><a href="https://raw.githubusercontent.com/SupaeroDataScience/DE/master/readings/bdd_sql.pdf">Slides</a></p>
<p>For the <a href="0_2_postgres.html">next class</a>, students should install PostgreSQL and MongoDB on their local machines.</p>
<p>For the <a href="0_3_postgres.html">next class</a>, students should install PostgreSQL and MongoDB on their local machines.</p>
<h2 id="additional-resources">Additional Resources<a class="headerlink" href="#additional-resources" title="Permanent link">🔗</a></h2>
<ul>
<li><a href="https://www.postgresql.org/docs/manuals/">PostgreSQL documentation</a></li>
Expand Down
104 changes: 68 additions & 36 deletions 0_3_project.html → 0_2_ETL.html
Original file line number Diff line number Diff line change
Expand Up @@ -10,17 +10,21 @@



<link rel="canonical" href="https://supaerodatascience.github.io/DE/0_3_project.html">
<link rel="canonical" href="https://supaerodatascience.github.io/DE/0_2_ETL.html">


<link rel="prev" href="0_1_databases.html">


<link rel="next" href="0_3_postgres.html">


<link rel="icon" href="https://raw.githubusercontent.com/SupaeroDataScience/DE/master/src/favicon.ico">
<meta name="generator" content="mkdocs-1.5.2, mkdocs-material-9.2.8">



<title>NoSQL Databases Project - Data Engineering</title>
<title>ETL - Data Engineering</title>



Expand Down Expand Up @@ -71,7 +75,7 @@
<div data-md-component="skip">


<a href="#nosql-databases-project" class="md-skip">
<a href="#extract-transform-load-etl" class="md-skip">
Skip to content
</a>

Expand Down Expand Up @@ -106,7 +110,7 @@
<div class="md-header__topic" data-md-component="header-topic">
<span class="md-ellipsis">

NoSQL Databases Project
ETL

</span>
</div>
Expand Down Expand Up @@ -344,19 +348,21 @@











<li class="md-nav__item md-nav__item--section md-nav__item--nested">
<li class="md-nav__item md-nav__item--active md-nav__item--section md-nav__item--nested">





<input class="md-nav__toggle md-toggle " type="checkbox" id="__nav_3" >
<input class="md-nav__toggle md-toggle " type="checkbox" id="__nav_3" checked>

<label class="md-nav__link" for="__nav_3" id="__nav_3_label" tabindex="0">

Expand All @@ -369,7 +375,7 @@
<span class="md-nav__icon md-icon"></span>
</label>

<nav class="md-nav" data-md-level="1" aria-labelledby="__nav_3_label" aria-expanded="false">
<nav class="md-nav" data-md-level="1" aria-labelledby="__nav_3_label" aria-expanded="true">
<label class="md-nav__title" for="__nav_3">
<span class="md-nav__icon md-icon"></span>
Data Storage
Expand Down Expand Up @@ -400,9 +406,39 @@






<li class="md-nav__item md-nav__item--active">

<input class="md-nav__toggle md-toggle" type="checkbox" id="__toc">





<a href="0_2_ETL.html" class="md-nav__link md-nav__link--active">


<span class="md-ellipsis">
ETL
</span>


</a>

</li>









<li class="md-nav__item">
<a href="0_2_postgres.html" class="md-nav__link">
<a href="0_3_postgres.html" class="md-nav__link">


<span class="md-ellipsis">
Expand All @@ -416,6 +452,26 @@








<li class="md-nav__item">
<a href="0_4_project.html" class="md-nav__link">


<span class="md-ellipsis">
Project
</span>


</a>
</li>




</ul>
</nav>

Expand Down Expand Up @@ -1073,34 +1129,10 @@



<h1 id="nosql-databases-project">NoSQL Databases Project<a class="headerlink" href="#nosql-databases-project" title="Permanent link">🔗</a></h1>
<p><img alt="NoSQL databases" src="https://raw.githubusercontent.com/SupaeroDataScience/DE/master/src/reveal/static/img/NoSQL-Database-Types.jpg" /></p>
<p>The evaluation of the databases class is a presentation of a specific DBMS. You can work in teams of 4 of your choosing.</p>
<p>The idea of this project is to study new DBMSs, with a focus on NoSQL DBMSs. There is a <a href="https://lms.isae.fr/mod/resource/view.php?id=36332">presentation</a> on the differences between relational databases and various NoSQL DBMSs and another <a href="https://lms.isae.fr/mod/resource/view.php?id=81798">here</a> on the history of NoSQL. In this project, students should compare their DBMS with a relational DBMS (PostgreSQL) to understand the advantages and disadvantages of various NoSQL DBMSs.</p>
<p>You are working in a company which is looking to replace a relational DBMS currently in use. Each team should present a feasability study of a specific DBMS, showing its advantages, disadvantages, and use cases. We will organize the subjects on 06/10/2021, which is a work class dedicated to the project. Up to two different teams can work on each DBMS.</p>
<p>The possible DBMSs are:</p>
<ol>
<li><a href="https://www.mongodb.org/">MongoDB</a>, a DBMS for documents used, for example, by CERN</li>
<li><a href="https://cassandra.apache.org/">Cassandra</a>, a distributed data storage system for handling very large amounts of structured data</li>
<li><a href="http://redis.io/">Redis</a>, a very efficient key/value DBMS</li>
<li><a href="https://hbase.apache.org/">HBase</a>, a distributed and non-relationed column-based DBMS</li>
<li><a href="http://neo4j.com/">Neo4j</a>, a native graph DBMS</li>
<li><a href="https://ravendb.net/">RavenDB</a>, a document DBMS with ACID integrity</li>
<li><a href="https://www.couchbase.com/">Couchbase</a>, a document DBMS for interactive web applications</li>
<li><a href="https://couchdb.apache.org/">CouchDB</a>, a JSON-based DBMS with native JavaScript support</li>
<li><a href="https://www.influxdata.com/get-influxdb/">InfluxDB</a>, a distributed DBMS optimized for timeseries data</li>
<li><a href="https://orientdb.org/">OrientDB</a>, a DMBS for graph data</li>
</ol>
<p>Each team should:</p>
<ul>
<li>install their DBMS</li>
<li>test the DBMS on a relevant database (datasets from <a href="https://datasetsearch.research.google.com/">Google</a>, and <a href="https://www.kaggle.com/datasets">kaggle</a>)</li>
<li>compare their DBMS with a relational database system</li>
<li>prepare a presentation of their DBMS and example database which presents convincing argument for using this DBMS</li>
<li>evaluate how ACID or BASE principles are met by their DBMS</li>
</ul>
<p>A good example from previous years is <a href="https://raw.githubusercontent.com/SupaeroDataScience/DE/master/readings/mongodb.pdf">here</a>. This was in the form of a report, but currently a report is not required, just the oral presentation.</p>
<p>Presentations will take place on 02/11/2021. You should upload your presentation materials to the LMS before midnight on 01/11/2021.</p>
<h1 id="extract-transform-load-etl">Extract, Transform, Load (ETL)<a class="headerlink" href="#extract-transform-load-etl" title="Permanent link">🔗</a></h1>
<p>In this module on ETL, we will cover the fundamental concepts and practices of data integration and processing. A basic understanding of databases and SQL is considered a prerequisite. Students can refer to the slides and additional resources if they need to refresh their knowledge. For evaluation, students will design and implement an ETL pipeline as a graded <a href="0_4_project.html">project</a>.</p>
<p>In this first class, we introduce the basics of ETL processes and cover high-level ETL functionality and tools.</p>
<p><a href="https://raw.githubusercontent.com/SupaeroDataScience/DE/master/readings/ETL.pdf">Slides</a></p>



Expand Down
42 changes: 41 additions & 1 deletion 0_3_dbms.html
Original file line number Diff line number Diff line change
Expand Up @@ -402,7 +402,27 @@


<li class="md-nav__item">
<a href="0_2_postgres.html" class="md-nav__link">
<a href="0_2_ETL.html" class="md-nav__link">


<span class="md-ellipsis">
ETL
</span>


</a>
</li>









<li class="md-nav__item">
<a href="0_3_postgres.html" class="md-nav__link">


<span class="md-ellipsis">
Expand All @@ -416,6 +436,26 @@








<li class="md-nav__item">
<a href="0_4_project.html" class="md-nav__link">


<span class="md-ellipsis">
Project
</span>


</a>
</li>




</ul>
</nav>

Expand Down
48 changes: 44 additions & 4 deletions 0_2_postgres.html → 0_3_postgres.html
Original file line number Diff line number Diff line change
Expand Up @@ -10,13 +10,13 @@



<link rel="canonical" href="https://supaerodatascience.github.io/DE/0_2_postgres.html">
<link rel="canonical" href="https://supaerodatascience.github.io/DE/0_3_postgres.html">


<link rel="prev" href="0_1_databases.html">
<link rel="prev" href="0_2_ETL.html">


<link rel="next" href="1_1_overview.html">
<link rel="next" href="0_4_project.html">


<link rel="icon" href="https://raw.githubusercontent.com/SupaeroDataScience/DE/master/src/favicon.ico">
Expand Down Expand Up @@ -406,6 +406,26 @@




<li class="md-nav__item">
<a href="0_2_ETL.html" class="md-nav__link">


<span class="md-ellipsis">
ETL
</span>


</a>
</li>











Expand All @@ -428,7 +448,7 @@
<span class="md-nav__icon md-icon"></span>
</label>

<a href="0_2_postgres.html" class="md-nav__link md-nav__link--active">
<a href="0_3_postgres.html" class="md-nav__link md-nav__link--active">


<span class="md-ellipsis">
Expand Down Expand Up @@ -531,6 +551,26 @@








<li class="md-nav__item">
<a href="0_4_project.html" class="md-nav__link">


<span class="md-ellipsis">
Project
</span>


</a>
</li>




</ul>
</nav>

Expand Down
Loading

0 comments on commit a9ef561

Please sign in to comment.