Fix the test_accuracy function by modifying the assertion logic #867

kgao · 2024-03-29T19:39:38Z

The original code tests the accuracy of different estimators by checking whether the true Average Treatment Effect (ATE) falls within the calculated confidence interval. The assertion requires that more than 50% of the values lie inside the 90% confidence interval, which is a reasonable idea, but it is evaluated on the ate, which returns a single value. With a single point the proportion is either 0 or 1, so the threshold is irrelevant: the point is either in the interval or not. One draw is not enough to validate an estimator's performance, and the test failed whenever that single interval missed the true ATE.
The new logic: generate W, D, and Y several times and check that the true ATE is inside the bounds most of the time (for example, generate 10 sets of W, D, Y and check that the true ATE was inside the interval in at least 8 of them).
The same logic is also applied to test_accuracy_iv, and the sample size is reduced to n=1000 to improve the test time.
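
The repeated-draw check described above can be sketched as follows. This is an illustration only, not the code in this PR: it uses plain NumPy, a difference-in-means estimate with a normal-approximation 90% interval instead of the econml estimators, and the helper names `ci_covers_true_ate` and `count_covered` and the data-generating process are assumptions made for the example.

```python
import numpy as np


def ci_covers_true_ate(rng, n=1000, true_ate=0.3, z=1.645):
    """Draw one data set (W, D, Y) and report whether the 90% CI covers true_ate.

    Hypothetical stand-in for fitting an estimator and calling its
    interval method; z=1.645 is the two-sided 90% normal quantile.
    """
    W = rng.uniform(-1.0, 1.0, size=n)      # one observed control
    D = rng.binomial(1, 0.5, size=n)        # randomized binary treatment
    p = 0.3 + true_ate * D + 0.05 * W       # stays inside [0.25, 0.65]
    Y = rng.binomial(1, p)                  # discrete (binary) outcome

    y1, y0 = Y[D == 1], Y[D == 0]
    est = y1.mean() - y0.mean()             # difference-in-means ATE estimate
    se = np.sqrt(y1.var(ddof=1) / len(y1) + y0.var(ddof=1) / len(y0))
    lower, upper = est - z * se, est + z * se
    return bool(lower <= true_ate <= upper)


def count_covered(n_trials=10, seed=0):
    """Regenerate the data n_trials times and count how often the CI covers the truth."""
    rng = np.random.default_rng(seed)
    return sum(ci_covers_true_ate(rng) for _ in range(n_trials))


# Number of the 10 simulated data sets whose interval contained the true ATE.
print(count_covered(n_trials=10, seed=0))
```

A test built on this pattern would assert something like `count_covered(10, seed) >= 8`. With a nominal 90% interval, at least 8 of 10 independent draws are covered only about 93% of the time, so fixing the seed (or loosening the threshold) keeps such a test from failing at random.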

fverac · 2024-03-29T20:27:23Z

Nice fix!

I wonder if it makes sense to apply the same logic to the test_accuracy_iv method in test_discrete_outcome.py, since the logic there also uses the odd proportion_in_interval check, which doesn't make much sense with a single ate value (but might make more sense the new way you are doing it).

kbattocchi

This is a good start, but the logic is not quite right.

In addition, please also address @fverac's comment about making parallel changes to the corresponding IV test.

econml/tests/test_discrete_outcome.py


Fix the test_accuracy function by modifying the assertion logic

cd82563

Signed-off-by: kgao <[email protected]>

kgao assigned kbattocchi and kgao Mar 29, 2024

kgao requested a review from kbattocchi March 29, 2024 19:40

kgao unassigned kbattocchi Mar 29, 2024

kbattocchi requested changes Apr 4, 2024

econml/tests/test_discrete_outcome.py (resolved)

econml/tests/test_discrete_outcome.py (resolved)

kgao added 2 commits April 9, 2024 18:36

Update: 1.Apply the logic for each estimator 2. Apply the logic for t…

ae66642

…est_accuracy_iv 3. Fix linting Signed-off-by: kgao <[email protected]>

update: cleanup - linting and code format

78d3f18

Signed-off-by: kgao <[email protected]>

kbattocchi approved these changes Apr 10, 2024


kbattocchi enabled auto-merge (squash) April 10, 2024 14:38

kbattocchi merged commit 52efc8e into main Apr 10, 2024
78 checks passed

kbattocchi deleted the fix_test_discrete_outcome branch April 10, 2024 16:02
