-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How is "train_data_processed_w_static.csv" obtained for out-of-domain task on MIMIC? #23
Comments
Hey! Sorry for the late response. I haven't touched this project in a long while, but it suspect that perhaps the preprocessing pipeline from the MIMIC benchmark produces train / test splits that are named differently than in this repo. The names of the splits are defined in this module, and you could try changing them according to your output files in your local clone. It occurs to me that based on this script that at least the test split output is simply named |
No worries and thank you for the response. After running the pipeline, the What format should the files in that module be? I also notice that |
Mhh, is that the output even after running the |
Yes. Though this is expected, as it is mentioned here: |
Heyo! I was looking at some code again. Since this project happened some time ago and I hadn't worked on that specific aspect of it, I am not sure if I can comprehensively help you solving this problem, unfortunately. Since I do not work at Pacmed anymore, I also don't have access to data, so I can't provide any more detail on the format of the dataset. It should correspond to the names in this pickle file, though. What I feel like I am understanding is the following:
I am very sorry reproducing this part is being such a hassle! Also mentioning @Giovannicina in case he might be able to provide some more info on this problem, and best of luck. In case you do figure out the procedure, let me know here so that we can improve the documentation of the repo in this regard. |
Hi there. @jjgarciac, did you find a solution to this issue? |
When executing
python3 src/experiments/out_of_domain.py --models PPCA
, I encounter:FileNotFoundError: (...)/in-hospital-mortality/train_data_processed_w_static.csv
I performed the 6 pre-processing steps listed here to setup the MIMIC mortality benchmark. The resulting directory does not include the file causing the error. It is as follows:
-in-hospital-mortality/
--train/
--test/
Note: I am using the MIMIC-III-demo dataset; I was able to run
python -um mimic3models.in_hospital_mortality.logistic.main --l2 --C 0.001 --output_dir mimic3models/in_hospital_mortality/logistic
from mimic3-benchmarks.The text was updated successfully, but these errors were encountered: