Skip to content

Releases: databricks/lilac

v0.1.11

26 Oct 20:09
Compare
Choose a tag to compare

What's Changed

  • Swap in new embedding-chunked splitter algorithm by @brilee in #791
  • Add the language model reference concept to Lilac. by @nsthorat in #797
  • Improve ergonomics of map(). by @nsthorat in #795

Other Changes

  • Fix a flaky test by @dsmilkov in #792
  • Add pdb-compatible script and documentation by @brilee in #794
  • Switch splitter to new algorithm, fixing last blocking bug. by @brilee in #796
  • Add more examples to the language model reference. by @nsthorat in #798

Full Changelog: v0.1.10...v0.1.11

v0.1.10

24 Oct 17:18
Compare
Choose a tag to compare

What's Changed

  • Add the simplest dataset.map() by @nsthorat in #769
  • Add sampling to our ParquetSource by @dsmilkov in #773
  • Add intelligent sampling in ParquetSource by @dsmilkov in #778
  • Add include_labels and exclude_labels when exporting data via dataset.to_* by @dsmilkov in #768
  • Improve the Export Dataset modal dialog by @dsmilkov in #775
  • Allow searching by pre-computed concepts in the searchbox. by @nsthorat in #783
  • Add spacy embedding-clustering splitter by @brilee in #784

Other Changes

  • Add notebook exploring chunking algorithms by @brilee in #771
  • Remove TextSplitterSignal, fixing related tests. by @brilee in #774
  • Fix the concept labeler with the new preview concept key. by @nsthorat in #780
  • Improve export preview text. by @nsthorat in #782
  • Upgrade ruff version and fix associated new linter errors by @brilee in #777

New Contributors

Full Changelog: v0.1.9...v0.1.10

v0.1.9

12 Oct 21:52
Compare
Choose a tag to compare

What's Changed

Other Changes

Full Changelog: v0.1.8...v0.1.9

v0.1.8

12 Oct 21:49
Compare
Choose a tag to compare

What's Changed

Other Changes

Full Changelog: v0.1.7...v0.1.8

v0.1.7

12 Oct 01:19
Compare
Choose a tag to compare

What's Changed

Other Changes

  • Fix port on HF demo since we use app_port: 5432 by @dsmilkov in #759

Full Changelog: v0.1.6...v0.1.7

v0.1.6

11 Oct 15:49
Compare
Choose a tag to compare

Features

Bug fixes / Other

  • Fix duplicate stats requests by @dsmilkov in #754
  • Add CLI prompts for token in deploy script by @nsthorat in #755
  • Add docker deploy instructions in dev.md and multi-platform build by @dsmilkov in #753

Full Changelog: v0.1.15...v0.1.6

v0.1.15

04 Oct 22:34
Compare
Choose a tag to compare

Features

Bug fixes / other changes

Full Changelog: v0.1.4...v0.1.15

v0.1.4

29 Sep 18:30
Compare
Choose a tag to compare

What's Changed

Other Changes

  • Add --hf_token and dont require it when we have public datasets, or we set load on start server. by @nsthorat in #727

Full Changelog: v0.1.3...v0.1.4

v0.1.3

29 Sep 14:00
Compare
Choose a tag to compare

You can now deploy Lilac to a HuggingFace dataset with just a few lines of Python, or from the CLI:

Deploy a single configuration object, and have it load entirely on the space:

ll.deploy_config(
  hf_space='nsthorat-lilac/nikhil-demo',
  # Create the space if it doesn't exist.
  create_space=True,
  config=ll.Config(datasets=[
    ll.DatasetConfig(
      namespace='local',
      name='glue_ax',
      source=ll.HuggingFaceSource(dataset_name='glue', config_name='ax'))
  ]))

Deploy a Lilac project you've loaded locally:

ll.deploy_project(
  hf_space='nsthorat-lilac/nikhil-project-demo',
  project_dir='./data',
  datasets=['local/glue_ax'],  # This is optional. If not defined, uploads all datasets.
  # Create the space if it doesn't exist.
  create_space=True)

Or via the CLI:

lilac deploy-project --project_dir='~/my_project'

Features

Other Changes

  • Improve the schema view to be less overwhelming. by @nsthorat in #726

Demo

  • Add the textbook quality programming dataset to the demo. Clean up old datasets. by @nsthorat in #723

Full Changelog: v0.1.2...v0.1.3

v0.1.2

27 Sep 18:31
Compare
Choose a tag to compare

Internal change

  • Migrate the span values to use __span__ and make the code back-compat by @dsmilkov in #718

Full Changelog: v0.1.1...v0.1.2