[DOC] Adds Iñigo's blogpost of Week 1 (#39)

* [DOC] Adds blogpost Iñigo's blogpost of Week 1 * FIX: RCorrects bad RST URL syntax & deletes mentions to week 0 work * DOC: Adds Iñigo's week 1 and 2 blogposts * FIX: Single underscore instead of two for RST URLs 6219f3c
dipy · Jun 10, 2024 · 8b47716 · 8b47716
1 parent 7d34b5f
commit 8b47716
Show file tree

Hide file tree

Showing 49 changed files with 3,992 additions and 476 deletions.
diff --git a/_images/fibercup_preliminary_results.png b/_images/fibercup_preliminary_results.png
diff --git a/_sources/posts/2024/2024_05_31_Inigo_week_1.rst.txt b/_sources/posts/2024/2024_05_31_Inigo_week_1.rst.txt
@@ -0,0 +1,24 @@
+First Week into GSoC 2024: Building the AutoEncoder, writing the training loop
+==============================================================================
+
+.. post:: May 31 2024
+    :author: Iñigo Tellaetxe
+    :tags: google
+    :category: gsoc
+
+
+What I did this week
+~~~~~~~~~~~~~~~~~~~~
+I finished becoming familiar with the TensorFlow + Keras basics and I wrote the training loop and a couple of scripts for instantiating and training the AutoEncoder.
+Data loading was also addressed and I am able to load the data from the FiberCup dataset in .trk format using `NiBabel <https://nipy.org/nibabel/>`_, transform it into NumPy arrays, and feed it into the network.
+
+What is coming up next week
+~~~~~~~~~~~~~~~~~~~~~~~~~~~
+Because the training loop is taking too long, I will refactor the code to make it more modular and more in the TensorFlow style. Also, other DIPY models are implemented in this fashion, what will contribute to consistency across the library.
+I think I have yet to get the hang of the TensorFlow way of doing things.
+I plan to use a class for the ``Encoder`` and another one for the ``Decoder``. Then I will bring them together under an ``AutoEncoder`` class that inherits from the Keras ``Model`` class.
+This will allow me to use the ``fit`` method from the Keras ``Model`` class and make the training loop more efficient, together with easing its usage. I will just have all the relevant training parameters to the ``compile`` method, to later call the ``fit`` method to train the model, taking care of the weight updates more efficiently than my handmade training loop.
+
+Did I get stuck anywhere
+~~~~~~~~~~~~~~~~~~~~~~~~
+Getting the handmade training loop up and running gave me a couple of headaches because the weight update was taking ages. I am still stuck here and that is why I will refactor the code next week.
diff --git a/_sources/posts/2024/2024_06_07_Inigo_week_2.rst.txt b/_sources/posts/2024/2024_06_07_Inigo_week_2.rst.txt
@@ -0,0 +1,29 @@
+Second Week into GSoC 2024: Refactoring the AutoEncoder, preliminary results
+============================================================================
+
+.. post:: June 7 2024
+    :author: Iñigo Tellaetxe
+    :tags: google
+    :category: gsoc
+
+
+What I did this week
+~~~~~~~~~~~~~~~~~~~~
+This week I refactored the AutoEncoder code to match the design patterns and the organization of other Deep Learning models in the DIPY repo; and to make the training loop more efficient and easy to use. I transferred my code to a `separate repo <https://github.com/itellaetxe/tractoencoder_gsoc>`_ to keep the DIPY repo clean and to experiment freely. Once the final product is working, I will merge it into DIPY. I also packaged the whole repo so I can use it as a library.
+Training experiments were run for a maximum of a 150 epochs, with variable results. They are not amazing, but at least we get some reconstruction of the input tracts from FiberCup, which seems to be on the right track. I also implemented training logs that report the parameters I used for training, so I can reproduce the results at any time. This still needs work though, because not all parameters are stored. Need to polish!
+The left image shows the input tracts, and the middle and right images show two reconstructions from two different training experiments.
+
+.. image:: /_static/images/fibercup_preliminary_results.png
+    :alt: Preliminary results of the AutoEncoder training for a subset of plausible fibers of the FiberCup dataset.
+    :width: 600
+
+What is coming up next week
+~~~~~~~~~~~~~~~~~~~~~~~~~~~
+With the help of my mentors, we identified possible improvements to the AutoEncoder training process. Yesterday I investigated how PyTorch weights are initialized in convolutional kernels and in Keras Dense layers using the `He Initialization <https://paperswithcode.com/paper/delving-deep-into-rectifiers-surpassing-human>`_. Using custom initializers, one can mimic the same behavior in TensorFlow, which I started to implement also yesterday.
+This week should focus on trying to reproduce the small implementation differences that might be causing the model to not converge as the PyTorch one. I will also try to finish implementing the He Initialization in TensorFlow.
+
+
+Did I get stuck anywhere
+~~~~~~~~~~~~~~~~~~~~~~~~
+I got a bit stuck refactoring the code to match the DIPY design patterns and also with the TensorFlow implementation itself, because the output shape of the ``Encoder`` and the input shape of the ``Decoder`` were not matching.
+After investigating what caused this issue, I discovered that ``tf.shape`` was not giving me the usual (and expected) shape of the tensor, conveniently stored in a ``tuple``. I found this behavior strange, but I solved the problem just calling a ``.shape`` method on the Tensor, which does give me the shape tuple I needed.
diff --git a/_static/images/fibercup_preliminary_results.png b/_static/images/fibercup_preliminary_results.png
diff --git a/blog.html b/blog.html
@@ -809,6 +809,81 @@ <h1>
   </h1>
 
 
+  <div class="section ablog-post">
+    <h2 class="ablog-post-title">
+      <a href="posts/2024/2024_06_07_Inigo_week_2.html">Second Week into GSoC 2024: Refactoring the AutoEncoder, preliminary results</a>
+    </h2>
+    <ul class="ablog-archive">
+      <li>
+
+
+        <i class="fa fa-calendar"></i>
+
+        07 June 2024
+
+      </li>
+      <div class="ablog-sidebar-item ablog__postcard2">
+
+
+<li id="ablog-sidebar-item author ablog__author">
+  <span>
+
+    <i class="fa-fw fa fa-user"></i>
+
+    </span>
+
+
+  <a href="blog/author/inigo-tellaetxe.html">Iñigo Tellaetxe</a>
+
+
+
+</li>
+
+
+
+
+<li id="ablog-sidebar-item category ablog__category">
+  <span>
+
+    <i class="fa-fw fa fa-folder-open"></i>
+
+    </span>
+
+
+  <a href="blog/category/gsoc.html">gsoc</a>
+
+
+
+</li>
+
+
+<li id="ablog-sidebar-item tags ablog__tags">
+  <span>
+
+
+    <i class="fa-fw fa fa-tag"></i>
+
+    </span>
+
+
+  <a href="blog/tag/google.html">google</a>
+
+
+
+</li>
+
+
+</div>
+    </ul>
+    <p class="ablog-post-excerpt"><p>This week I refactored the AutoEncoder code to match the design patterns and the organization of other Deep Learning models in the DIPY repo; and to make the training loop more efficient and easy to use. I transferred my code to a <a class="reference external" href="https://github.com/itellaetxe/tractoencoder_gsoc">separate repo</a> to keep the DIPY repo clean and to experiment freely. Once the final product is working, I will merge it into DIPY. I also packaged the whole repo so I can use it as a library.
+Training experiments were run for a maximum of a 150 epochs, with variable results. They are not amazing, but at least we get some reconstruction of the input tracts from FiberCup, which seems to be on the right track. I also implemented training logs that report the parameters I used for training, so I can reproduce the results at any time. This still needs work though, because not all parameters are stored. Need to polish!
+The left image shows the input tracts, and the middle and right images show two reconstructions from two different training experiments.</p>
+</p>
+
+    <p class="ablog-post-expand"><a href="posts/2024/2024_06_07_Inigo_week_2.html"><em>Read more ...</em></a></p>
+    <hr/>
+  </div>
+
   <div class="section ablog-post">
     <h2 class="ablog-post-title">
       <a href="posts/2024/2024_06_03_kaustav_week_1.html">My Journey Continues: Week 1 Progress with DIPY</a>
@@ -883,6 +958,80 @@ <h2 class="ablog-post-title">
     <hr/>
   </div>
 
+  <div class="section ablog-post">
+    <h2 class="ablog-post-title">
+      <a href="posts/2024/2024_05_31_Inigo_week_1.html">First Week into GSoC 2024: Building the AutoEncoder, writing the training loop</a>
+    </h2>
+    <ul class="ablog-archive">
+      <li>
+
+
+        <i class="fa fa-calendar"></i>
+
+        31 May 2024
+
+      </li>
+      <div class="ablog-sidebar-item ablog__postcard2">
+
+
+<li id="ablog-sidebar-item author ablog__author">
+  <span>
+
+    <i class="fa-fw fa fa-user"></i>
+
+    </span>
+
+
+  <a href="blog/author/inigo-tellaetxe.html">Iñigo Tellaetxe</a>
+
+
+
+</li>
+
+
+
+
+<li id="ablog-sidebar-item category ablog__category">
+  <span>
+
+    <i class="fa-fw fa fa-folder-open"></i>
+
+    </span>
+
+
+  <a href="blog/category/gsoc.html">gsoc</a>
+
+
+
+</li>
+
+
+<li id="ablog-sidebar-item tags ablog__tags">
+  <span>
+
+
+    <i class="fa-fw fa fa-tag"></i>
+
+    </span>
+
+
+  <a href="blog/tag/google.html">google</a>
+
+
+
+</li>
+
+
+</div>
+    </ul>
+    <p class="ablog-post-excerpt"><p>I finished becoming familiar with the TensorFlow + Keras basics and I wrote the training loop and a couple of scripts for instantiating and training the AutoEncoder.
+Data loading was also addressed and I am able to load the data from the FiberCup dataset in .trk format using <a class="reference external" href="https://nipy.org/nibabel/">NiBabel</a>, transform it into NumPy arrays, and feed it into the network.</p>
+</p>
+
+    <p class="ablog-post-expand"><a href="posts/2024/2024_05_31_Inigo_week_1.html"><em>Read more ...</em></a></p>
+    <hr/>
+  </div>
+
   <div class="section ablog-post">
     <h2 class="ablog-post-title">
       <a href="posts/2024/2024_05_27_kaustav_week0.html">My Journey Begins: Community Bonding Period with DIPY</a>

diff --git a/blog/2023.html b/blog/2023.html
@@ -759,32 +759,32 @@ <h3>
 
 
   <li>
-    <a href="../posts/2024/2024_06_03_kaustav_week_1.html">
-      03 June - My Journey Continues: Week 1 Progress with DIPY
+    <a href="../posts/2024/2024_06_07_Inigo_week_2.html">
+      07 June - Second Week into GSoC 2024: Refactoring the AutoEncoder, preliminary results
     </a>
   </li>
 
   <li>
-    <a href="../posts/2024/2024_05_27_kaustav_week0.html">
-      27 May - My Journey Begins: Community Bonding Period with DIPY
+    <a href="../posts/2024/2024_06_03_kaustav_week_1.html">
+      03 June - My Journey Continues: Week 1 Progress with DIPY
     </a>
   </li>
 
   <li>
-    <a href="../posts/2024/2024_05_27_Inigo_week_0.html">
-      27 May - Community Bonding Period Summary and first impressions
+    <a href="../posts/2024/2024_05_31_Inigo_week_1.html">
+      31 May - First Week into GSoC 2024: Building the AutoEncoder, writing the training loop
     </a>
   </li>
 
   <li>
-    <a href="../posts/2023/2023_08_28_Shilpi_Week14.html">
-      28 August - Doing Final Touch-Ups: Week 14
+    <a href="../posts/2024/2024_05_27_kaustav_week0.html">
+      27 May - My Journey Begins: Community Bonding Period with DIPY
     </a>
   </li>
 
   <li>
-    <a href="../posts/2023/2023_08_22_Shilpi_Week13.html">
-      22 August - Writing Tests & Making Documentation: Week 13
+    <a href="../posts/2024/2024_05_27_Inigo_week_0.html">
+      27 May - Community Bonding Period Summary and first impressions
     </a>
   </li>
 
@@ -814,7 +814,7 @@ <h3>
 
 
   <li>
-    <a href="category/gsoc.html">gsoc (26)</a>
+    <a href="category/gsoc.html">gsoc (28)</a>
   </li>
 
 
@@ -830,7 +830,7 @@ <h3>
 
 
   <li>
-    <a href="2024.html">2024 (3)</a>
+    <a href="2024.html">2024 (5)</a>
   </li>