v0.2.3
What's Changed
We now have two CLI commands for sharing Lilac datasets via Hugging Face. To upload a local dataset:
lilac upload local/Capybara --url_or_repo=lilacai/Capybara
To download the dataset to a local project directory:
lilac download lilacai/Capybara
For more details on sharing datasets, see the Sharing Guide.
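The upload and download commands can also be scripted, for example from a deployment job. The following is a minimal sketch, assuming the `lilac` CLI is installed and on the PATH. The helper names `upload_cmd`, `download_cmd`, and `run` are hypothetical and not part of Lilac; only the arguments shown in these notes are used.

```python
import shlex
import subprocess
from typing import List


def upload_cmd(dataset: str, repo: str) -> List[str]:
    """Build the argv for `lilac upload <dataset> --url_or_repo=<repo>`."""
    return ["lilac", "upload", dataset, f"--url_or_repo={repo}"]


def download_cmd(repo: str) -> List[str]:
    """Build the argv for `lilac download <repo>`."""
    return ["lilac", "download", repo]


def run(cmd: List[str]) -> None:
    """Run a command; raises CalledProcessError on a non-zero exit code."""
    subprocess.run(cmd, check=True)


# Print the commands instead of running them, so the sketch is safe to execute
# on a machine without Lilac installed. Pass either list to run() to execute it.
print(shlex.join(upload_cmd("local/Capybara", "lilacai/Capybara")))
print(shlex.join(download_cmd("lilacai/Capybara")))
```

Passing the arguments as a list, rather than a single shell string, avoids quoting problems if a dataset or repository name ever contains unusual characters.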
With this change, we added a new environment variable, USE_TABLE_INDEX, useful for frozen demos. It dramatically improves query performance by using a cached DuckDB table, but it slows down labeling and any other edits, because the table is recomputed on each change.
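The trade-off behind USE_TABLE_INDEX can be illustrated with a toy model. This is a sketch of the general idea only, not Lilac's implementation: `ToyDataset` and `use_table_index` are hypothetical names, a sorted Python list stands in for the cached DuckDB table, and the accepted values of the variable (`1` or `true` here) are an assumption, since these notes do not specify them.

```python
import os


def use_table_index() -> bool:
    # Assumption: a truthy string enables the flag; the notes do not say
    # which values Lilac actually accepts.
    return os.environ.get("USE_TABLE_INDEX", "").lower() in ("1", "true")


class ToyDataset:
    """Toy stand-in for a dataset with an optional cached query table."""

    def __init__(self, rows, cached):
        self.rows = list(rows)
        self.cached = cached
        self.rebuilds = 0
        self._table = None
        if cached:
            self._rebuild()

    def _rebuild(self):
        # Stands in for materializing the table: cost grows with dataset size.
        self._table = sorted(self.rows)
        self.rebuilds += 1

    def query_min(self):
        # Cached: read the precomputed table. Uncached: recompute per query.
        table = self._table if self.cached else sorted(self.rows)
        return table[0]

    def add_label(self, row):
        self.rows.append(row)
        if self.cached:
            self._rebuild()  # every edit pays the full rebuild cost


os.environ["USE_TABLE_INDEX"] = "true"
ds = ToyDataset([3, 1, 2], cached=use_table_index())
ds.add_label(0)
print(ds.query_min(), ds.rebuilds)
```

In this model, queries against the cached table do no extra work, while each edit triggers a rebuild. That is why the setting suits frozen, read-only demos better than datasets that are still being labeled.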
Upload / Download
Bug fixes
- Fix a bug with CSV source reader for TSV files, and named columns. by @nsthorat in #1040
- Progress bar by @brilee in #1043
- Fix bug with ItemMedia not rendering media fields that are deeply nested siblings. by @nsthorat in #1044
- Fix clustering an enriched field by @dsmilkov in #1048
- Propagate filters in the group by panel by @dsmilkov in #1041
Performance
UI
- Add clustering in the UI by @dsmilkov in #1045
- Add search to the cluster UI. Add some polish. by @nsthorat in #1054
- Add clusters to the schema menu. Migrate to a custom carousel component so the page doesn't freeze. by @nsthorat in #1050
Clustering
- Add dataset.cluster(input) where input can be any lambda func by @dsmilkov in #1042
- dataset.cluster() flattens any repeated before clustering by @dsmilkov in #1051
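To make the two clustering changes concrete, here is a small pure-Python sketch of the idea: the clustering input is either a field name or a function of the row, and nested (repeated) values are flattened into a single list of texts before clustering. It does not call Lilac and is not its implementation; `texts_to_cluster`, `flatten`, and the sample rows are hypothetical.

```python
from typing import Any, Callable, Dict, Iterator, List, Union

Row = Dict[str, Any]
ClusterInput = Union[str, Callable[[Row], Any]]


def flatten(value: Any) -> Iterator[Any]:
    """Yield leaf values from arbitrarily nested lists."""
    if isinstance(value, list):
        for item in value:
            yield from flatten(item)
    else:
        yield value


def texts_to_cluster(rows: List[Row], cluster_input: ClusterInput) -> List[Any]:
    """Collect the values that would be handed to a clustering step."""
    if callable(cluster_input):
        get = cluster_input  # a lambda computes the value from the whole row
    else:
        get = lambda row: row[cluster_input]  # a string selects a field by name
    return [text for row in rows for text in flatten(get(row))]


rows = [
    {"question": "What is DuckDB?", "turns": [["hi", "hello"], ["bye"]]},
    {"question": "What is Lilac?", "turns": [["thanks"]]},
]

# Lambda input: derive the text to cluster from the row.
print(texts_to_cluster(rows, lambda row: row["question"].lower()))
# Field input with repeated values: nested lists are flattened first.
print(texts_to_cluster(rows, "turns"))
```

With a function as input, the text to cluster no longer has to exist as a stored field. It can be computed on the fly, for example by normalizing case or joining several fields.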
Lilac Garden
Other Changes
- Move the import of .env.local in publish_pip to the top of the file. by @nsthorat in #1039
- fix: migrate embeddings by azure openai to openai > 1.0.0 by @dechantoine in #1053
- Streamline lilac deployment by @brilee in #1057
- Add a notebook for working with concepts from python. by @nsthorat in #1055
Full Changelog: v0.2.2...v0.2.3