[Question] Calculation of precision in `KPrecision` #18

wei-ann-Github · 2024-03-07T04:19:39Z

Referring to

instruct-qa/instruct_qa/evaluation/faithfulness_metrics.py

Line 528 in 89118ad

precision = 1.0 * num_common / len(prediction_tokens)

`num_common` is a count of unique overlapping tokens. When calculating precision, however, the denominator is `len(prediction_tokens)`, and `prediction_tokens` does not seem to be restricted to unique tokens.

May I know:

Is there a reason for using unique tokens in the numerator but all tokens in the denominator when calculating precision?
Would using `set(prediction_tokens)` as the denominator affect the metric's correlation with human judgement? If so, would it be less correlated or more?
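
To make the mismatch concrete, here is a minimal sketch of the variants in question. It is not the repository's code: `precision_variants`, the dictionary keys, and the toy sentences are hypothetical, and the "unique" numerator follows my reading of `num_common` above, which may be wrong. The multiset variant is included only for comparison.

```python
from collections import Counter


def precision_variants(prediction_tokens, reference_tokens):
    """Compare hypothetical precision variants on tokenized text."""
    if not prediction_tokens:
        # Avoid division by zero for an empty prediction.
        return {"unique_over_all": 0.0, "unique_over_unique": 0.0, "multiset_over_all": 0.0}

    unique_prediction = set(prediction_tokens)

    # Assumption: numerator counts each overlapping token once.
    num_common_unique = len(unique_prediction & set(reference_tokens))

    # Comparison: multiset overlap, each token counted min(pred, ref) times.
    overlap = Counter(prediction_tokens) & Counter(reference_tokens)
    num_common_multiset = sum(overlap.values())

    return {
        # Unique numerator, all tokens in the denominator (the case asked about).
        "unique_over_all": num_common_unique / len(prediction_tokens),
        # Unique numerator, unique tokens in the denominator.
        "unique_over_unique": num_common_unique / len(unique_prediction),
        # Multiset numerator, all tokens in the denominator.
        "multiset_over_all": num_common_multiset / len(prediction_tokens),
    }


prediction = "the cat sat on the mat the end".split()
reference = "the cat and the mat".split()
scores = precision_variants(prediction, reference)
print(scores)
```

With a repeated token such as "the" in the prediction, the first variant is penalized by repetition while the second is not, which is the difference the questions above are about.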
