-
Notifications
You must be signed in to change notification settings - Fork 34
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Develop model to label replacement modes when not filled out by user #783
Comments
Currently have tested Random Forest, Multinomial/Mixed Logit, and Gradient Boosted Decision Tree models for replacement labeling on all labeled data: Performance is about the same for the MNL/MXL and the Gradient Boosted Decision Tree ~75% accuracy across all classes, ~71% F1 score when weighted by class support. Given the simplicity of the implementation for the Sklearn models (GBDT/RF) it makes the most sense to implement those in the pipeline, while keeping things modular enough to plug in other models such as the MNL/MXL in the future. This notebook also contains many "replacement" visualizations that were used in the TRB paper. These may be useful for the dashboard when it is configured in "program" (not yet implemented #781). |
The testing notebook is now here: And the implementation is here: Both are work in progress, but mostly on the implementation. |
TODO:
Model Storage:
e-mission/e-mission-server#874
Model Structure:
e-mission/e-mission-server#852
The text was updated successfully, but these errors were encountered: