-
Notifications
You must be signed in to change notification settings - Fork 48
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
move rag answer correctness metrics tests to test_metrics.py #1131
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think this test should be at the preparation card without the if __name__ == "__main__":
so it will work every time someone import this file. The difference is that here its an activation of existing class (MetricPipline and its inner metric) . This will be tested that way every preparation test for every PR.
Related: #1132 |
hmm... for Also, if you look at https://github.com/IBM/unitxt/blame/main/prepare/metrics/rag_answer_correctness.py#L63, @eladven explicitly added the |
@elronbandel @dafnapension @yoavkatz So, what should i do with this PR? (close it?) Generally, where should tests that involve models be? |
There are two issues here. (1) Where to place the tests (2) when we run the tests The rule is that testing python metric classes should be done in test_metrics, while checking catalog assets should be done at the prepare file when the catalog assets are added. Since in your case, you are testing assets, it should be in the prepare file. The second point is that we can not afford to run tests that require large model loading on each PR. This is because it has to download large models to a new VM on each PR. This is why long tests are executed only with the @elronbandel - how do you suggest to avoid this and still keep reasonable runtimes? |
Thanks @yoavkatz . This PR tests with a model, so i will close it (and we keep the test in the prepare script, under the A question about another test please: Generally, tests in |
Yes. It makes sense to move test_context_correctness to the prepare file. And yes , the test_metrics are run in the Test Library Code git action and the prepare cards in Test Catalog Preparation git action. This is done on every PR. |
No description provided.