I am a graduate student at Polytechnique Montreal, finishing my Master's in Biomedical Engineering (Instrumentation and Medical Imaging). My background is a Bachelor's in Pure Physics and a Master's in Medical Physics (Safety and Security). I am interested in finding new imaging techniques coupled with AI and hyperspectral retinal imaging, and I believe that BHS will help me get a step closer to achieving a personal project and become familiar with machine learning and data analysis.
Khaled Ashkar
This project aims to develop a deep learning model that generates reliable synthetic CT scans from MRIs alone, agnostic to contrast and field strength, to eliminate the need for additional CT scans in hospitals and clinics.
Over the past decades, medical imaging has significantly enhanced the diagnosis and treatment of oncological patients, particularly in radiotherapy (RT). Traditionally, 3D computed tomography (CT) is the primary imaging modality used in RT for accurate patient geometry, dose calculations, and plan optimization. However, CT exposes patients to ionizing radiation, which is a concern, especially for repeated simulations or vulnerable populations such as pediatric patients. Magnetic resonance imaging (MRI), with its superior soft-tissue contrast, has proven valuable for tumor and organ-at-risk delineation, patient positioning, and monitoring. Despite its advantages, MRI lacks the necessary tissue attenuation information required for accurate dose calculations, making CT still necessary in RT workflows. To overcome these challenges, recent advancements have focused on generating synthetic CT (sCT) images from MRIs. This approach aims to combine the superior imaging capabilities of MRI with the essential dose calculation information of CT, reducing the patient's exposure to radiation and simplifying the workflow. Artificial intelligence (AI) techniques, particularly machine learning and deep learning, have emerged as leading methods for creating sCT images from MRI data. These AI-driven methods have the potential to enhance the accuracy of RT planning and delivery. However, the field still faces the challenge of a lack of public datasets and standardized benchmarks to validate and compare different AI approaches. This project aims to develop and refine AI algorithms to generate high-quality CT scans from MRIs, thereby improving RT workflows and patient outcomes by leveraging the best features of both imaging modalities.
- On the personal level: gain basic knowledge of open-source data, dataset manipulation, machine learning, deep learning models and data visualization.
- Short term: provide a pipeline to generate synthetic CTs from MRIs (training/testing).
- Long term: generalize the model across different contrasts and field strengths.
This project relied on numerous tools such as:
- Git and GitHub;
- Bash terminal;
- Machine learning;
- Open-source websites: Zenodo and Kaggle;
- High-performance computing (HPC), such as Google Colab, for training the model;
- Python packages, such as matplotlib, pandas and torch.
The dataset can be downloaded from https://doi.org/10.5281/zenodo.7260705, and a detailed description is provided in "synthRAD2023_dataset_description.pdf". The training dataset for Task 1 is in Task1.zip, while that for Task 2 is in Task2.zip. After unzipping, each task is organized according to the following folder structure:

```
Task1.zip
└── Task1/
    └── brain/
        └── 1Bxxxx/
            ├── mr.nii.gz
            ├── ct.nii.gz
            └── mask.nii.gz
```
Each patient folder has a unique name that encodes the task, anatomy, center and patient ID, following the convention below:

| [Task] | [Anatomy] | [Center] | [PatientID] |
|--------|-----------|----------|-------------|
| 1      | B         | A        | 001         |

In each patient folder, three files can be found:

- ct.nii.gz: CT image;
- mr.nii.gz or cbct.nii.gz (depending on the task): MR/CBCT image;
- mask.nii.gz: image containing a binary mask of the dilated patient outline.

For each task and anatomy, an overview folder is provided which contains the following files:

- [task]_[anatomy]_train.xlsx: information about the image acquisition protocol for each patient;
- [task][anatomy][center][PatientID]_train.png: for each patient, a PNG showing axial, coronal and sagittal slices of the CBCT/MR, CT, mask and the difference between CBCT/MR and CT, meant to provide a quick visual overview of the data.
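Below is a minimal sketch of loading one patient folder under this structure. It assumes the nibabel package is installed; the patient path is hypothetical and should be adapted to wherever Task1.zip was unzipped.

```python
import os
import nibabel as nib  # assumed installed: pip install nibabel

# Hypothetical example folder: Task 1, Brain, Center A, Patient 001
patient_dir = os.path.join("Task1", "brain", "1BA001")

# Load the paired MR, CT and dilated patient-outline mask as numpy arrays;
# all three volumes are rigidly registered onto the same grid.
mr = nib.load(os.path.join(patient_dir, "mr.nii.gz")).get_fdata()
ct = nib.load(os.path.join(patient_dir, "ct.nii.gz")).get_fdata()
mask = nib.load(os.path.join(patient_dir, "mask.nii.gz")).get_fdata()

print(mr.shape, ct.shape, mask.shape)
```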
The data used in this project covered 30 Brain images taken with MRI and CT only. For the sake of time constraints and computational resources, the models were trained on 1, 3, 7, 10 and 20 subjects at a time to analyze the performance of each and the effect of the hyperparameters on the model convergence and results.
This challenge dataset contains imaging data of patients who underwent radiotherapy in the brain or pelvis region. Overall, the population is predominantly adult, and no gender restrictions were considered during data collection. For Task 1, the inclusion criteria were the acquisition of a CT and an MRI during treatment planning, while for Task 2, acquisitions of a CT and a CBCT, used for patient positioning, were required. The datasets for Tasks 1 and 2 do not necessarily contain the same patients, given the different image acquisitions for the different tasks. Data was collected at 3 Dutch university medical centers:
- Radboud University Medical Center
- University Medical Center Utrecht
- University Medical Center Groningen

For anonymization purposes, from here on, institution names are substituted with A, B and C, without specifying which institute each letter refers to. All images were acquired with the clinically used scanners and imaging protocols of the respective centers and reflect typical images found in clinical routine. As a result, imaging protocols and scanners can vary between patients. A detailed description of the imaging protocol for each image can be found in spreadsheets that are part of the dataset release (see dataset structure).
Data was acquired with the following scanners:

| Center | MRI | CT | CBCT |
|--------|-----|----|------|
| A | Philips Ingenia 1.5T/3.0T | Philips Brilliance Big Bore or Siemens Biograph20 PET-CT | Elekta XVI |
| B | Siemens MAGNETOM Aera 1.5T or MAGNETOM Avanto_fit 1.5T | Siemens SOMATOM Definition AS | IBA Proteus+ or Elekta XVI |
| C | Siemens Avanto fit 1.5T or Siemens MAGNETOM Vida fit 3.0T | Philips Brilliance Big Bore | Elekta XVI |
For Task 1, MRIs were acquired with a T1-weighted gradient echo or an inversion-prepared turbo field echo (TFE) sequence and collected along with the corresponding planning CTs for all subjects. The exact acquisition parameters vary between patients and centers. For centers B and C, the selected MRIs were acquired with gadolinium contrast, while the selected MRIs of center A were acquired without contrast. For Task 2, the CBCTs used for image-guided radiotherapy, ensuring accurate patient positioning, were selected for all subjects along with the corresponding planning CT. The following pre-processing steps were performed on the data:

- Conversion from DICOM to compressed NIfTI (nii.gz);
- Rigid registration between CT and MR/CBCT;
- Anonymization (face removal, brain patients only);
- Patient outline segmentation (provided as a binary mask);
- Cropping of MR/CBCT, CT and mask to remove background and reduce file sizes.

The code used to preprocess the images can be found at https://github.com/SynthRAD2023/; an illustrative sketch of the cropping step is shown below. Detailed information about the dataset is provided in SynthRAD2023_dataset_description.pdf, published along with the data, and will also be submitted to Medical Physics.
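The official preprocessing code lives in the SynthRAD2023 repository linked above; the snippet below is only an illustrative sketch of the final cropping step, assuming mr, ct and mask are numpy arrays loaded as in the earlier snippet and that cropping to the mask's bounding box (plus a small margin) is acceptable.

```python
import numpy as np

def crop_to_mask(volume: np.ndarray, mask: np.ndarray, margin: int = 5) -> np.ndarray:
    """Crop a 3D volume to the bounding box of a binary mask, plus a margin."""
    coords = np.argwhere(mask > 0)                   # voxel indices inside the outline
    lo = np.maximum(coords.min(axis=0) - margin, 0)  # clamp to the volume bounds
    hi = np.minimum(coords.max(axis=0) + margin + 1, mask.shape)
    return volume[lo[0]:hi[0], lo[1]:hi[1], lo[2]:hi[2]]

# Apply the same crop to all three volumes so they stay aligned.
mr_cropped = crop_to_mask(mr, mask)
ct_cropped = crop_to_mask(ct, mask)
mask_cropped = crop_to_mask(mask, mask)
```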
- GitHub repository containing all the data needed to train the model, a description of the project, and an explanation of the results;
- Two pre-trained models, trained on 3 and 7 brain subjects respectively;
- The wrap-up presentation given at the end of the Brainhack School (14-06-2024);
- Two versions of the model, 'GAN_MRI_to_CT_Visualstudio' and 'GAN_MRI_to_CT_Colab' (ready to be run on Google Colab for faster computing, or locally using a CPU only). These two files could not be uploaded due to their size; they are available on request at [email protected].
The MRI data used to train the model had different sizes, while the model expects (128x128) arrays. The first step was to pad all the images (MRI, CT and the mask) with 50 to 100 pixels to reach a (256x256) array. Next, I resized the images to (128x128) to match the model's input. I could have trained the model at a higher resolution, but I chose a lower one because of limited computational resources and to obtain a model capable of delivering good results even on degraded images. Using ITK-SNAP, I verified that the images and the mask were aligned, which saved a lot of work. The mask covered the brain area in all the images, so no additional segmentation was needed. A minimal sketch of this padding/resizing step is shown below.
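This sketch assumes 2D axial slices as numpy arrays and uses torch for the bilinear downsampling; the exact pad widths in the project varied (50 to 100 pixels) with each image's original size.

```python
import numpy as np
import torch
import torch.nn.functional as F

def pad_and_resize(slice_2d: np.ndarray, target: int = 128) -> np.ndarray:
    """Zero-pad a 2D slice to 256x256, then resize it to target x target."""
    h, w = slice_2d.shape
    assert h <= 256 and w <= 256, "expects slices no larger than 256x256"
    pad_h, pad_w = 256 - h, 256 - w
    padded = np.pad(slice_2d, ((pad_h // 2, pad_h - pad_h // 2),
                               (pad_w // 2, pad_w - pad_w // 2)))
    # F.interpolate expects a (batch, channel, H, W) tensor
    t = torch.from_numpy(padded).float()[None, None]
    resized = F.interpolate(t, size=(target, target), mode="bilinear",
                            align_corners=False)
    return resized[0, 0].numpy()
```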
After trying more than 22 combinations of number of subjects, number of epochs and batch size, I decided to compare 3 models:

1. 20 epochs, batch size 4, 1 subject;
2. 100 epochs, batch size 10, 7 subjects;
3. 200 epochs, batch size 5, 3 subjects.

During validation on the 20% of the training data that the model had not seen, all 3 models showed moderate to good results. However, during validation on external data, the model (100ep_b10_7sub) showed better performance, having been trained on more subjects; it was not clear whether the effect was due to the batch size, the number of subjects, or both. The evaluation between the original CT scan and the generated one was based on the following metrics (a sketch of computing them follows the list):
- Mean absolute error (MAE)
- Peak signal-to-noise ratio (PSNR)
- Structural similarity index (SSIM)
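A short sketch of computing the three metrics, assuming scikit-image is installed and that ct and sct are same-shape numpy arrays (the real and synthetic CT); in practice the comparison should be restricted to the voxels inside the patient mask.

```python
import numpy as np
from skimage.metrics import peak_signal_noise_ratio, structural_similarity

def evaluate(ct: np.ndarray, sct: np.ndarray) -> dict:
    """Compare a real CT with a synthetic CT using MAE, PSNR and SSIM."""
    data_range = float(ct.max() - ct.min())
    return {
        "MAE": float(np.mean(np.abs(ct - sct))),
        "PSNR": peak_signal_noise_ratio(ct, sct, data_range=data_range),
        "SSIM": structural_similarity(ct, sct, data_range=data_range),
    }
```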
In this instance, the model was unable to generate reliable synthetic CT scans from the available MRI dataset, due to various factors such as the selection of the hyperparameters, the size of the dataset, the choice of the model, the resolution of the input data and the architecture of the loss function. All of these should be considered when implementing such a model to ensure results good enough to start the next step of this project: extracting the Hounsfield units from the generated CT scans in order to begin treatment planning and dose calculation. I believe the major limitation researchers have encountered is establishing a quantitative relation between the MRI and the CT scans to fairly evaluate the success and reliability of the results.
I would like to thank the Brainhack team of 2024 for their constructive insights throughout the course, with particular thanks to Dr. Eva Alonso Ortiz, the TAs Daniel Ridani and Jan Valosek, and my colleague Nilser Laines Medina for providing their support and expertise throughout the project.
- Gupta, D., Kim, M., Vineberg, K. A., & Balter, J. M. (2019). Generation of synthetic CT images from MRI for treatment planning and patient positioning using a 3-Channel U-Net trained on sagittal images. Frontiers in Oncology, 9. https://doi.org/10.3389/fonc.2019.00964
- Liu, Y., Lei, Y., Wang, Y., Shafai-Erfani, G., Wang, T., Tian, S., Patel, P., Jani, A. B., McDonald, M., Curran, W. J., Liu, T., Zhou, J., & Yang, X. (2019). Evaluation of a deep learning-based pelvic synthetic CT generation technique for MRI-based prostate proton treatment planning. Physics in Medicine & Biology, 64(20), 205022. https://doi.org/10.1088/1361-6560/ab41af
- Tahri, S., Texier, B., Nunes, J., Hemon, C., Lekieffre, P., Collot, E., Chourak, H., Guevelou, J. L., Greer, P., Dowling, J., Acosta, O., Bessieres, I., Marage, L., Boue-Rafle, A., De Crevoisier, R., Lafond, C., & Barateau, A. (2023). A deep learning model to generate synthetic CT for prostate MR-only radiotherapy dose planning: a multicenter study. Frontiers in Oncology, 13. https://doi.org/10.3389/fonc.2023.1279750
- SynthRAD2023 - Grand Challenge. (n.d.). grand-challenge.org. https://synthrad2023.grand-challenge.org/
- Spadea, M. F., Maspero, M., Zaffino, P., & Seco, J. (2021). Deep learning based synthetic-CT generation in radiotherapy and PET: A review. Medical Physics, 48(11), 6537-6566. https://doi.org/10.1002/mp.15150
- Deep learning to generate synthetic CT images from MR for radiotherapy treatment planning. MVision AI.