Investigate performance drop for react-native repo with new tokens structure. #595
Comments
What I can see from the reports:
I also took the repo with the biggest quality improvement (storybook) to see whether it shows similar patterns. It helps to understand what is going on. In the next tables v 010 means
test classification report comparison for react-native
test classification report comparison for storybook
What we can see:
Here is the comparison between the confusion matrices for
I also spent a lot of time digging deeper, but I did not find anything suspicious there.
What I did:
So the whole problem was about imperfect
I also see one important outcome here that is not related to this particular issue. Summary:
If we research feature selection again, it is worth studying feature agglomeration instead of selection. It is also worth discussing running the feature selection once on a selected set of repos rather than training it again for each repo. Also, now that we have found that the better features are not selected, it would be nice to find out why: they should be selected during selector fitting if they really help classification.
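To make the agglomeration idea concrete, here is a minimal stdlib-only sketch. It is not the project's code: `agglomerate_features`, `column_distance` and the toy `rows` matrix are hypothetical names and data, and the linkage (squared Euclidean distance between averaged columns) is an assumption chosen for brevity. Instead of dropping features, it merges the closest feature columns until `n_clusters` groups remain and replaces each group with its mean.

```python
from itertools import combinations
from statistics import fmean


def column_distance(a, b):
    """Squared Euclidean distance between two feature columns."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def agglomerate_features(rows, n_clusters):
    """Merge the closest feature columns until n_clusters groups remain.

    Returns (groups, pooled_rows): groups lists the original column indices
    in each cluster, pooled_rows holds one averaged value per cluster for
    every sample.
    """
    n_features = len(rows[0])
    columns = [[row[j] for row in rows] for j in range(n_features)]
    groups = [[j] for j in range(n_features)]
    centroids = [list(col) for col in columns]

    while len(groups) > n_clusters:
        # Find the pair of clusters whose averaged columns are closest.
        a, b = min(
            combinations(range(len(groups)), 2),
            key=lambda p: column_distance(centroids[p[0]], centroids[p[1]]),
        )
        merged = groups[a] + groups[b]
        # The merged cluster is represented by the mean of its member columns.
        centroid = [fmean(columns[j][i] for j in merged) for i in range(len(rows))]
        # b > a always holds, so deleting b keeps index a valid.
        del groups[b], centroids[b]
        groups[a], centroids[a] = merged, centroid

    pooled_rows = [
        [centroids[g][i] for g in range(len(groups))] for i in range(len(rows))
    ]
    return groups, pooled_rows


# Toy data (hypothetical): columns 0/1 and 2/3 carry almost the same signal.
rows = [
    [1.0, 1.1, 5.0, 5.2],
    [2.0, 2.1, 3.0, 3.1],
    [3.0, 2.9, 1.0, 0.9],
    [4.0, 4.2, 0.0, 0.1],
]
groups, pooled = agglomerate_features(rows, n_clusters=2)
print(groups)
```

The point of the comparison: selection would keep one column of each redundant pair and discard the other, while agglomeration keeps the information from both in a single pooled feature.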
@m09 Can we add feature selection to our Optimizer? We could raise the initial number of features from 500 to 2000 and then pick 500 from those 2000.
@vmarkovtsev Yes, it should be doable. We can select over the 4k in hyperparameter-opt (or aggregate, as mentioned above). The step that is really too expensive to run in hyperparameter-opt is feature extraction, even though we would need to. Selection is fast enough, I think.
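A rough sketch of the split discussed above: extract a larger candidate set once, outside the optimization loop, then run the cheap selection step inside it. Everything here is hypothetical and only for illustration: the function names, the feature names, the toy `samples`, and the `class_margin` score are assumptions, not the Optimizer's actual API or scoring.

```python
from collections import Counter


def extract_candidates(samples, n_candidates):
    """Expensive step, run once: keep the n_candidates most frequent features."""
    counts = Counter(f for features, _ in samples for f in features)
    # Sort by descending frequency, then by name so ties are deterministic.
    ranked = sorted(counts, key=lambda f: (-counts[f], f))
    return ranked[:n_candidates]


def class_margin(feature, samples):
    """Hypothetical relevance score: occurrences in the feature's most common
    class minus occurrences in all other classes."""
    per_class = Counter(label for features, label in samples if feature in features)
    if not per_class:
        return 0
    top = max(per_class.values())
    return top - (sum(per_class.values()) - top)


def select_top_k(candidates, samples, k):
    """Cheap step, safe to repeat per trial: keep the k best-scoring candidates."""
    ranked = sorted(candidates, key=lambda f: (-class_margin(f, samples), f))
    return ranked[:k]


# Toy data (hypothetical): each sample is (features present, class label).
samples = [
    (("semicolon", "indent_2", "single_quote"), "style_a"),
    (("semicolon", "indent_2", "trailing_comma"), "style_a"),
    (("semicolon", "indent_4", "double_quote"), "style_b"),
    (("semicolon", "indent_4", "double_quote", "trailing_comma"), "style_b"),
]

# Extraction happens once; 5 stands in for the larger initial pool (e.g. 2000).
candidates = extract_candidates(samples, n_candidates=5)

# Each trial of the search only re-runs selection on the cached candidates.
selections = {k: select_top_k(candidates, samples, k) for k in (2, 3)}
print(selections)
```

In this toy data the most frequent feature appears equally in both classes, so it ranks first during extraction but is not picked by the selector. That is the case where starting from a larger pool and selecting afterwards can help.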
Let's continue here: #637. Please add anything I missed in the description.
Context: #586 (comment)