fix(metrics): fixed NDCG calculation and added comprehensive test cases #17126

iamarunbrahma · 2024-12-02T15:20:26Z

Description

This PR fixes a bug in the NDCG (Normalized Discounted Cumulative Gain) metric calculation where the IDCG was incorrectly calculated using retrieved_ids instead of expected_ids. The fix ensures IDCG represents the best possible ranking of relevant documents.

Key changes:

Fixed IDCG calculation to use min(len(retrieved_ids), len(expected_ids))
Added proper handling for empty expected_ids case
Added comprehensive test cases covering various scenarios including:
- Perfect ranking
- Partial matches
- No relevant docs
- More relevant docs than retrieved
- Single relevant doc
- Different modes (linear/exponential)
- Empty expected_ids

Fixes #17070

New Package?

Did I fill in the tool.llamahub section in the pyproject.toml and provide a detailed README.md for my new integration or package?

Yes
No

Version Bump?

Did I bump the version in the pyproject.toml file of the package I am updating? (Except for the llama-index-core package)

Yes
No

Type of Change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update

How Has This Been Tested?

I added new unit tests to cover this change
- Added comprehensive test cases in test_metrics.py covering various NDCG scenarios
- Tests include edge cases like empty expected_ids and different ranking modes

…test cases (run-llama#17126)" This reverts commit ccfe6d7.

…test cases (run-llama#17126)" This reverts commit ccfe6d7. Signed-off-by: Alexey Rodriguez Yakushev <[email protected]>

fix(metrics): fixed NDCG calculation and added comprehensive test cases

e24a364

dosubot bot added the size:M This PR changes 30-99 lines, ignoring generated files. label Dec 2, 2024

iamarunbrahma added 2 commits December 2, 2024 21:08

test: update test_metrics.py

be02ca9

Merge branch 'main' into idcg_fix

9a57a16

logan-markewich approved these changes Dec 3, 2024

View reviewed changes

dosubot bot added the lgtm This PR has been approved by a maintainer label Dec 3, 2024

logan-markewich merged commit ccfe6d7 into run-llama:main Dec 3, 2024
11 checks passed

alexeyrodriguez added a commit to alexeyrodriguez/llama_index that referenced this pull request Dec 10, 2024

Revert "fix(metrics): fixed NDCG calculation and added comprehensive …

4ddcf78

…test cases (run-llama#17126)" This reverts commit ccfe6d7.

alexeyrodriguez mentioned this pull request Dec 10, 2024

fix(metrics): fixed NDCG calculation and updated previous tests #17236

Merged

18 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(metrics): fixed NDCG calculation and added comprehensive test cases #17126

fix(metrics): fixed NDCG calculation and added comprehensive test cases #17126

iamarunbrahma commented Dec 2, 2024

fix(metrics): fixed NDCG calculation and added comprehensive test cases #17126

fix(metrics): fixed NDCG calculation and added comprehensive test cases #17126

Conversation

iamarunbrahma commented Dec 2, 2024

Description

New Package?

Version Bump?

Type of Change

How Has This Been Tested?