precision-recall functionality #48
Replies: 3 comments 5 replies
-
For I do not know why you're observing the behavior you describe though! My only guess is that the return_bundle contains a missing value for recall and the pandas groupby operation that calculates precision_recall per group (here) cannot handle a missing value. Ah! But maybe this is the fundamental issue... in that groupby command, with only one element listed in |
Beta Was this translation helpful? Give feedback.
-
Ok, I have figured out why my first try did not create any sensible results. I was using the first for loop to query for a singe Moa and then if I calculate the precision, it will obviously be 1!
So that code above is incorrect, right? I should be running precision-recall over the whole data frame? |
Beta Was this translation helpful? Give feedback.
-
@gwaygenomics Why are the precisions so low? shouldn't they be 100%? since we only have two MOAs |
Beta Was this translation helpful? Give feedback.
-
I am currently struggling to fully understand the functionality of the precision-recall operation.
Thus far I have taken a data sample from the LINCS database and followed the demo. Doing that I have noticed that I do not understand what the replicate_groups do.
I have done a first evaluation here
replicate_groups=["Metadata_moa"]
but that only gave the precision column as output and all precisions were = 1."Metadata_broad_sample"]
which resulted in a recall column - so I think I am using it correctly now. Not sure about the actual values tho.I have spent a little time looking at the precision_recall.py but haven't gotten very far with understanding that.
So, again my question in short:
If Precision at k is the proportion of recommended items in the top-k set that are relevant, then why do we need two replicate_group values?
Beta Was this translation helpful? Give feedback.
All reactions