Skip to content

Fast-Forward Indexes v0.2.0

Compare
Choose a tag to compare
@mrjleo mrjleo released this 10 Mar 20:36
· 113 commits to main since this release
4fbca6e

Index structures

  • New: OnDiskIndex is based on HDF5 and can be accessed on-demand from disk
  • Indexes can now grow dynamically in size

Performance

  • Data is now represented using pandas data frames internally
  • Many operations have been vectorized to improve performance
  • Early stopping now works in batches rather than per query

Misc

  • New: Indexer class for indexing corpora
  • New: PyTerrier transformers are provided for scoring and interpolation using Fast-Forward indexes

API changes

Many parts of the API have changed. Some of the most important breaking changes:

  • Scores are now computed using Index.__call__
  • Queries are not explicitly provided anymore but attached to the ranking
  • InMemoryIndex objects cannot be saved to or loaded from disk anymore