-
Notifications
You must be signed in to change notification settings - Fork 5.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add metadata for DatabaseReader #13799
base: main
Are you sure you want to change the base?
Conversation
Add metadata for DatabaseReader
Hi maintainers, I have updated the Thank you! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @cluic for the contribution. I've left some comments.
@@ -73,16 +73,21 @@ def __init__( | |||
"set of credentials." | |||
) | |||
|
|||
def load_data(self, query: str) -> List[Document]: | |||
def load_data(self, query: str, metadata: dict=None, doc_cols: list=None, metadata_cols: list=None) -> List[Document]: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
from typing import Dict, List
def load_data(
self,
query: str,
metadata: Dict = {},
doc_cols: List = [],
metadata_cols: List = []
)
Rather than Optional, I think you can just set defaults as I did above.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please also use the types that I've indicated above, otherwise mypy might complain
Please also run |
Description
Add Metadata Functionality to DatabaseReader
In the original implementation, the DatabaseReader loaded all the data into text format. However, sometimes we don't need to retrieve all the columns; we only need some fields as metadata. Therefore, I have updated the load_data method.
With this update, we can now load data from the database and specify certain fields to be included as metadata. Here a simple example:
Fixes # (issue)New Package?
Did I fill in the
tool.llamahub
section in thepyproject.toml
and provide a detailed README.md for my new integration or package?Version Bump?
Did I bump the version in the
pyproject.toml
file of the package I am updating? (Except for thellama-index-core
package)Type of Change
Please delete options that are not relevant.
Bug fix (non-breaking change which fixes an issue)Breaking change (fix or feature that would cause existing functionality to not work as expected)This change requires a documentation updateHow Has This Been Tested?
Please describe the tests that you ran to verify your changes. Provide instructions so we can reproduce. Please also list any relevant details for your test configuration
Suggested Checklist:
make format; make lint
to appease the lint gods