-
Hi @StijnKas, directing this question to you since we have already discussed this topic before. I am trying to calculate the metrics mentioned in the title using the model export. It was easier to process this data in BigQuery, so I uploaded the tables there. I now see a number of columns that could be of interest for calculating these metrics, but they appear to be aggregated and named accordingly. What I am looking for: the number of true positives, false positives, false negatives, and true negatives. I expect this information to be present in the model exports. I see column names such as pyNegatives, pyPositives, pyBinPositives, pyBinNegatives, pyBinPositivesPercentage, pyBinNegativesPercentage, pyTotalBins, pyPerformance, and pyBinResponseCount. I believe I need to use the numbers from these columns, but I am not sure how to get there. Could you help me understand these columns and how to arrive at the numbers I am trying to calculate?
Replies: 1 comment 1 reply
-
Hi Sushant, it is pyBinPositives and pyBinNegatives that you'll need. Whether those positives and negatives count as "true" or "false" depends on the classification threshold you choose, which is why the area under the ROC curve (AUC) is a better metric when you have a continuous score. FYI, see the "ADM explained" articles in the PDS tools documentation for how we use these bin positives/negatives to calculate AUC.
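To make the threshold dependence concrete, here is a minimal sketch of how one could derive a confusion matrix and an AUC from per-bin counts. The bin values below are made up for illustration; in a real export they would come from the pyBinPositives and pyBinNegatives columns, with bins assumed to be ordered from lowest to highest propensity. The function names are hypothetical, not part of pdstools.

```python
def confusion_matrix(bin_positives, bin_negatives, threshold_bin):
    """Treat every bin at index >= threshold_bin as 'predicted positive'.

    Bins are assumed ordered from lowest to highest propensity.
    Returns (tp, fp, fn, tn).
    """
    tp = sum(bin_positives[threshold_bin:])  # positives predicted positive
    fp = sum(bin_negatives[threshold_bin:])  # negatives predicted positive
    fn = sum(bin_positives[:threshold_bin])  # positives predicted negative
    tn = sum(bin_negatives[:threshold_bin])  # negatives predicted negative
    return tp, fp, fn, tn


def auc_from_bins(bin_positives, bin_negatives):
    """AUC via the trapezoidal rule over cumulative TPR/FPR,
    sweeping the threshold from the highest bin downwards."""
    total_pos = sum(bin_positives)
    total_neg = sum(bin_negatives)
    tpr, fpr = [0.0], [0.0]
    cum_pos = cum_neg = 0
    for p, n in zip(reversed(bin_positives), reversed(bin_negatives)):
        cum_pos += p
        cum_neg += n
        tpr.append(cum_pos / total_pos)
        fpr.append(cum_neg / total_neg)
    return sum(
        (fpr[i] - fpr[i - 1]) * (tpr[i] + tpr[i - 1]) / 2
        for i in range(1, len(fpr))
    )


# Made-up example: three bins, 10 positives and 10 negatives in total.
positives = [1, 2, 7]
negatives = [7, 2, 1]
print(confusion_matrix(positives, negatives, threshold_bin=2))  # (7, 1, 3, 9)
print(auc_from_bins(positives, negatives))
```

Note how moving `threshold_bin` shifts counts between the (TP, FN) and (FP, TN) pairs, which is exactly why there is no single "number of true positives" in the export: only the per-bin counts, from which any threshold's confusion matrix follows.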