feat(RFC): Adds altair.datasets #3631

Draft: wants to merge 159 commits into base: main

Commits on Oct 2, 2024

  1. wip

    dangotbanned committed Oct 2, 2024
    7933771

Commits on Oct 4, 2024

  1. b30081e
  2. refactor: Make version accessible via data.source_tag

    - Allow quickly switching between version tags
    
    #3150 (comment)
    dangotbanned committed Oct 4, 2024
    279586b
  3. 32150ad
  4. f1d18a2
  5. 4d3c550
  6. 7e65841

Commits on Oct 5, 2024

  1. 05773af

Commits on Oct 6, 2024

  1. 4fff80a

Commits on Oct 7, 2024

  1. 3a284a5
  2. 22a5039
  3. feat: Support env var VEGA_GITHUB_TOKEN

    Not required for these requests, but may be helpful to avoid limits
    dangotbanned committed Oct 7, 2024
    a618ffc
  4. feat: Add support for multi-version metadata

    As an example, I've added the 5 most recent versions, for comparison against the latest
    dangotbanned committed Oct 7, 2024
    1792340
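
The `VEGA_GITHUB_TOKEN` commit above says the token is optional and only helps with request limits. A minimal sketch of that idea follows, assuming the token is sent as a bearer token; the helper name `github_headers` and the `Accept` value are illustrative assumptions, not code from this PR.

```python
import os


def github_headers(env=None):
    """Build request headers, adding auth only when VEGA_GITHUB_TOKEN is set.

    Hypothetical helper: the name and the exact header set are assumptions.
    """
    env = os.environ if env is None else env
    headers = {"Accept": "application/vnd.github+json"}
    token = env.get("VEGA_GITHUB_TOKEN")
    if token:
        # Authenticated requests are given a higher rate limit by the GitHub API.
        headers["Authorization"] = f"Bearer {token}"
    return headers
```

Passing a plain `dict` as `env` keeps the helper easy to test without touching the real environment.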

Commits on Oct 8, 2024

  1. fa2c9e7
  2. 24cd7d7
  3. feat: Adds refresh_tags

    - Basic mechanism for discovering new versions
    - Tries to minimise the number and total size of requests
    dangotbanned committed Oct 8, 2024
    7dd461f
  4. feat(DRAFT): Adds url_from

    Experimenting with querying the url cache w/ expressions
    dangotbanned committed Oct 8, 2024
    9768495
  5. c38c235

Commits on Oct 9, 2024

  1. a22cc8a

Commits on Oct 11, 2024

  1. feat: Major GitHub rewrite, handle rate limiting

    - `metadata_full.parquet` stores **all known** file metadata
    - `GitHub.refresh()` to maintain integrity in a safe manner
    - Roughly 3000 rows
    - Single release: **9kb** vs 46 releases: **21kb**
    dangotbanned committed Oct 11, 2024
    1181860
  2. 31eeb20
  3. 511a845
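
The "handle rate limiting" commit above does not show how the limit is detected. One common approach, sketched here under that assumption, is to read the `x-ratelimit-remaining` and `x-ratelimit-reset` response headers that the GitHub REST API returns. The function name `seconds_until_reset` is hypothetical, and header names are assumed to be lowercased already.

```python
import time


def seconds_until_reset(headers, now=None):
    """Return how long to wait before the next request, in seconds.

    Returns 0.0 while requests remain in the current window.
    Hypothetical helper; not taken from the PR.
    """
    now = time.time() if now is None else now
    remaining = int(headers.get("x-ratelimit-remaining", "1"))
    if remaining > 0:
        return 0.0
    # The reset header is a UTC epoch timestamp in seconds.
    reset_at = int(headers.get("x-ratelimit-reset", "0"))
    return max(0.0, reset_at - now)
```

A caller could `time.sleep()` for the returned duration before retrying, which avoids hammering the API once the window is exhausted.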

Commits on Oct 12, 2024

  1. c76cfd4

Commits on Oct 13, 2024

  1. d3f0497

Commits on Oct 24, 2024

  1. 1b3390b
  2. a770ba9
  3. docs(typing): Add WorkInProgress marker to data(...)

    - Still undecided exactly how this functionality should work
    - Need to resolve `npm` tags != `gh` tags issue as well
    dangotbanned committed Oct 24, 2024
    686a485

Commits on Oct 25, 2024

  1. ba4491d

Commits on Oct 29, 2024

  1. 1a4e107

Commits on Nov 5, 2024

  1. 989b9b7
  2. 0bbf2e9

Commits on Nov 6, 2024

  1. 9c386e2
  2. 1937f2b
  3. 66fa6d1
  4. 937aa01
  5. 21b2edd
  6. 6527305
  7. 336eeca
  8. refactor: Rename metadata_full -> metadata

    Suffix was only added due to *now-removed* test files
    dangotbanned committed Nov 6, 2024
    225be0a
  9. refactor: tools.vendor_datasets -> tools.datasets package

    Will be following up with some more splitting into composite modules
    dangotbanned committed Nov 6, 2024
    e91baab
  10. 7782925
  11. bc86ca1
  12. refactor: Remove write_schema from _Npm, _GitHub

    Handled in `Application` now
    dangotbanned committed Nov 6, 2024
    a6f5645
  13. refactor: Rename, split _Npm, _GitHub into own modules

    `tools.datasets.npm` will later be performing the requests that are in `Dataset.__call__` currently
    dangotbanned committed Nov 6, 2024
    07a8342
  14. refactor: Move DataLoader.__call__ -> DataLoader.url()

    - `data.name()` -> `data(name)`
    - `data.name.url` -> `data.url(name)`
    dangotbanned committed Nov 6, 2024
    b89e6dc
  15. 7b0fe29
  16. 572d069
  17. 07dcc0b
  18. d8f3791

Commits on Nov 7, 2024

  1. refactor: Replace name_js, name_py with dataset_name

    Since we're just using strings, there is no need for 2 forms of the name.
    The legacy package needed this for `__getattr__` access with valid identifiers
    dangotbanned committed Nov 7, 2024
    4642a23
  2. fix: Remove invalid semver.sort op

    I think this was added in error, since the schema of the file never had `semver` columns
    
    Only noticed the bug when doing a full rebuild
    dangotbanned committed Nov 7, 2024
    65f87fc
  3. 6349b0f
  4. refactor: Move public interface to _io

    Temporary home, see module docstring
    dangotbanned committed Nov 7, 2024
    f1d610c
  5. c4ef112
  6. refactor: Split Reader._url_from into url, _query

    - Much more generic now in what it can be used for
    - For the caching, I'll need more columns than just `"url_npm"`
    - `"url_github"` contains a hash
    dangotbanned committed Nov 7, 2024
    eb876eb
  7. feat(DRAFT): Adds GitHubUrl.BLOBS

    - Common prefix to all rows in `metadata[url_github]`
    - Stripping this leaves only `sha`
    - For **2800** rows, there are only **109** unique hashes, so these can be used to reduce cache size
    dangotbanned committed Nov 7, 2024
    661a385
  8. 22dcb17
  9. 669df02
  10. 2051410
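
The `GitHubUrl.BLOBS` commit above describes stripping a common URL prefix so that only the `sha` remains, then relying on the small number of unique hashes. A rough sketch of that step follows. The prefix value shown is an assumption based on the GitHub git-blobs API URL shape, and `unique_shas` is a hypothetical name; neither is confirmed by the PR.

```python
# Assumed prefix, for illustration only; the PR's actual constant may differ.
BLOBS = "https://api.github.com/repos/vega/vega-datasets/git/blobs/"


def unique_shas(urls, prefix=BLOBS):
    """Strip the shared prefix from each URL and return the distinct hashes."""
    # str.removeprefix requires Python 3.9+.
    return {url.removeprefix(prefix) for url in urls}


# Made-up hashes: three rows, but only two distinct files.
rows = [BLOBS + "aaa111", BLOBS + "bbb222", BLOBS + "aaa111"]
shas = unique_shas(rows)
```

Keying a cache on the hash rather than on the full row means unchanged files shared across releases are stored once.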

Commits on Nov 8, 2024

  1. 0ea4e21
  2. a2e9baa
  3. c8a1429
  4. 279fea9
  5. chore: Remove duckdb comment

    Not planning to support this anymore, requires `fsspec` which isn't in `dev`
    
    ```
    InvalidInputException
    Traceback (most recent call last)
    Cell In[6], line 5
           3 with duck._reader._opener.open(url) as f:
           4     fn = duck._reader._read_fn['.json']
    ----> 5     thing = fn(f.read())
    
    InvalidInputException: Invalid Input Error: This operation could not be completed because required module 'fsspec' is not installed"
    ```
    dangotbanned committed Nov 8, 2024
    7d6c7ca
  6. ci(typing): Add pyarrow-stubs to dev dependencies

    Will put this in another PR, but need it here for IDE support
    dangotbanned committed Nov 8, 2024
    0bb4210
  7. 8984425
  8. refactor: Split datasets into public/private packages

    - `tools.datasets`: Building & updating metadata file(s), generating annotations
    - `altair.datasets`: Consuming metadata, remote & cached dataset management
    dangotbanned committed Nov 8, 2024
    9d062c8
  9. a17d674
  10. 69a619c
  11. refactor: Remove unimplemented tag="latest"

    Since `metadata.parquet` is sorted, this was already the behavior when not providing a tag
    dangotbanned committed Nov 8, 2024
    a259b10

Commits on Nov 9, 2024

  1. feat: Rename _datasets_dir, make configurable, add docs

    Still on the fence about `Loader.cache_dir` vs `Loader.cache`
    dangotbanned committed Nov 9, 2024
    88968c8
  2. b987308
  3. 4a2a2e0
  4. docs: Add basic example to Loader class

    Also incorporates changes from previous commit into `__repr__`
    4a2a2e0
    dangotbanned committed Nov 9, 2024
    e6dd27e
  5. 2a7bc4f
  6. c572180
  7. 9ab9463
  8. dd3edd6
  9. 146cb50

Commits on Nov 10, 2024

  1. 94ad0d1
  2. 4093383
  3. 76cdd45
  4. test: Adds test_datasets

    Initially quite basic; still need to add more parametrization and test caching
    dangotbanned committed Nov 10, 2024
    bb7bc17
  5. ebc1bfa
  6. fe0ae88
  7. 7089f2a

Commits on Nov 11, 2024

  1. refactor: Tidy up changes from last 4 commits

    - Rename and properly document "file-like object" handling
      - Also made a bit clearer what is being called and when
    - Use a more granular approach to skipping in `@backends`
      - Previously, everything was skipped regardless of whether it required `pyarrow`
      - Now, `polars`, `pandas` **always** run - with `pandas` expected to fail
    - I had to clean up `skip_requires_pyarrow` to make it compatible with `pytest.param`
      - It now has a runtime check for whether it received a `MarkDecorator`, instead of just a callable
    
    bb7bc17, ebc1bfa, fe0ae88, 7089f2a
    dangotbanned committed Nov 11, 2024
    e1290d4
  2. refactor: Rework _readers.py

    - Moved `_Reader._metadata` -> module-level constant `_METADATA`.
      - It was never modified and is based on the relative directory of this module
    - Generally improved the readability with more method-chaining (less assignment)
    - Renamed, improved doc `_filter_reduce` -> `_parse_predicates_constraints`
    dangotbanned committed Nov 11, 2024
    9d88e1b
  3. 60d39f5
  4. d6f0e45
  5. b7d57a0
  6. 5c2e581
  7. docs: Finish _Reader, fill parameters of Loader.__call__

    Still need examples for `Loader.__call__`
    dangotbanned committed Nov 11, 2024
    b70aef8
  8. refactor: Rename backend -> backend_name, get_backend -> backend

    `get_` was the wrong term since it isn't a free operation
    dangotbanned committed Nov 11, 2024
    403b787

Commits on Nov 12, 2024

  1. 3fbc759
  2. test: Remove pandas fallback for pyarrow

    There are enough alternatives here, it only added complexity
    dangotbanned committed Nov 12, 2024
    4f5b4de
  3. test: Adds test_all_datasets

    Disabled by default, since there are 74 datasets
    dangotbanned committed Nov 12, 2024
    69a72b6
  4. refactor: Remove _Reader._response

    Can't reproduce the original issue that led to adding this.
    All backends are supporting `HTTPResponse` directly
    dangotbanned committed Nov 12, 2024
    08101cc
  5. fix: Correctly handle no remote connection

    Previously, `Path.touch()` appeared to be a cache-hit - despite being an empty file.
    - Fixes that bug
    - Adds tests
    dangotbanned committed Nov 12, 2024
    90428a6
  6. 8ad78c1
  7. feat: Update to v2.10.0, fix tag inconsistency

    - Noticed one branch that missed the join to `npm`
      - Moved the join to `.tags()` and added a doc
    - https://github.com/vega/vega-datasets/releases/tag/v2.10.0
    dangotbanned committed Nov 12, 2024
    e650454
  8. 72296b0
  9. ca1b500
  10. ci: Introduce datasets refresh to generate_schema_wrapper

    Unrelated to schema, but needs to hook in somewhere
    dangotbanned committed Nov 12, 2024
    5bd70d1
  11. 012f98b
  12. bc0f42c
  13. revert: Remove comment

    dangotbanned committed Nov 12, 2024
    5e677c0
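
The "Correctly handle no remote connection" fix above concerns an empty file left by `Path.touch()` being mistaken for a cache hit. The sketch below shows one way to guard against that by also checking the file size. `is_cache_hit` is a hypothetical helper, not the PR's implementation.

```python
import tempfile
from pathlib import Path


def is_cache_hit(path):
    """Treat a file as cached only if it exists and is non-empty."""
    return path.is_file() and path.stat().st_size > 0


with tempfile.TemporaryDirectory() as tmp:
    fp = Path(tmp) / "cars.json"
    fp.touch()  # simulates a download that never wrote any bytes
    empty_hit = is_cache_hit(fp)
    fp.write_text("[]")
    filled_hit = is_cache_hit(fp)
```

Checking only `path.exists()` would report a hit in both cases, which is the failure mode the commit message describes.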

Commits on Nov 13, 2024

  1. a99d2c9
  2. 7e6da39
  3. b49e679
  4. fix: Ensure latest [tag] appears first

    When updating from `v2.9.0` -> `v2.10.0`, new tags were appended to the bottom.
    This invalidated an assumption in `Loader.(dataset|url)` that the first result is the latest
    dangotbanned committed Nov 13, 2024
    7a14394
  5. refactor: Misc models.py updates

    - Remove unused `ParsedTreesResponse`
    - Align more of the doc style
    - Rename `ReParsedTag` -> `SemVerTag`
    dangotbanned committed Nov 13, 2024
    99f823e
  6. dcef1d9
  7. test: Fix @datasets_debug selection

    Wasn't being recognised by `-m not datasets_debug` and always ran
    dangotbanned committed Nov 13, 2024
    173f3d6
  8. 3f5a805
  9. 4fc8446
  10. 882af33
  11. 9e9deeb
  12. 88d4491
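
The "Ensure latest [tag] appears first" fix above depends on tags being ordered by version rather than by insertion or plain string order. A small sketch of numeric ordering follows, assuming tags always look like `vMAJOR.MINOR.PATCH`; pre-release suffixes are not handled, and `sort_tags_latest_first` is a hypothetical name.

```python
import re

_TAG = re.compile(r"v(\d+)\.(\d+)\.(\d+)$")


def sort_tags_latest_first(tags):
    """Sort tags like 'v2.10.0' numerically, newest first."""

    def key(tag):
        match = _TAG.match(tag)
        if match is None:
            raise ValueError(f"unsupported tag: {tag!r}")
        # Compare as integers so that 10 sorts above 9.
        return tuple(int(part) for part in match.groups())

    return sorted(tags, key=key, reverse=True)


ordered = sort_tags_latest_first(["v2.9.0", "v2.10.0", "v2.8.1"])
naive = sorted(["v2.9.0", "v2.10.0"], reverse=True)
```

Here `ordered` starts with `"v2.10.0"`, while the plain string sort in `naive` puts `"v2.9.0"` first because `"9"` compares greater than `"1"`.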

Commits on Nov 15, 2024

  1. ebc8dec
  2. f2823b4

Commits on Nov 16, 2024

  1. fa5bea8
  2. 95582df
  3. feat: Adds _pl_read_json_roundtrip

    First mentioned in #3631 (comment)
    
    Addresses most of the `polars` part of #3631 (comment)
    dangotbanned committed Nov 16, 2024
    dc4a230

Commits on Nov 17, 2024

  1. 7ddb2a8

Commits on Nov 18, 2024

  1. 9544d9b
  2. 7b3a89e

Commits on Nov 20, 2024

  1. feat(DRAFT): Adds altair.datasets.url

    A dataframe package is still currently required.
    Can later be adapted to fit the requirements of (#3631 (comment)).
    
    Related:
    - #3631 (comment)
    - #3631 (comment)
    - #3150 (reply in thread)
    
    @mattijn, @joelostblom
    dangotbanned committed Nov 20, 2024
    c835c13
  2. 0817ff8
  3. e01fdd7
  4. 0c5195e

Commits on Nov 21, 2024

  1. 5595d90
  2. 9f62151
  3. revert: Remove `"polars[pyarrow]"` backend

    Partially related to #3631 (comment)
    
    After some thought, this backend didn't add support for any unique dependency configs.
    I've only ever used `use_pyarrow=True` for `pl.DataFrame.write_parquet` to resolve an issue with invalid headers in `"polars<1.0.0;>=0.19.0"`
    dangotbanned committed Nov 21, 2024
    1bd4552
  4. test: Add a complex xfail for test_load_call

    Doesn't happen in CI, still unclear why the import within `pandas` breaks under these conditions.
    Have tried multiple combinations of `pytest.MonkeyPatch`, hard imports, but had no luck in fixing the bug
    dangotbanned committed Nov 21, 2024
    11da9c8

Commits on Nov 22, 2024

  1. refactor: Renaming/recomposing _readers.py

    The next commits benefit from having functionality decoupled from `_Reader.query`.
    Mainly, keeping things lazy and not raising a user-facing error
    dangotbanned committed Nov 22, 2024
    694ada0
  2. build: Generate VERSION_LATEST

    Simplifies logic that relies on enum/categoricals that may not be recognised as ordered
    dangotbanned committed Nov 22, 2024
    6f41c7e
  3. feat: Adds _cache.py for UrlCache, DatasetCache

    Docs to follow
    dangotbanned committed Nov 22, 2024
    88d06a6
  4. a0d2df4
  5. f21b52b
  6. de03046
  7. e7974d9

Commits on Nov 23, 2024

  1. 8ba48a9

Commits on Nov 24, 2024

  1. 9d97096
  2. a698de9
  3. c907dc5
  4. revert: Remove _readers._filter

    Feature has been adopted upstream in narwhals-dev/narwhals#1417
    dangotbanned committed Nov 24, 2024
    a3b38c4
  5. a6c5096
  6. 71423ea
  7. docs: Finish Loader.cache

    Not using doctest style here, none of these return anything but I want them hinted at
    dangotbanned committed Nov 24, 2024
    7dd9c18
  8. refactor(typing): Use Mapping instead of dict

    Mutability is not needed.
    Also see #3573
    dangotbanned committed Nov 24, 2024
    a982759
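
The `Mapping` instead of `dict` commit above rests on the point that mutability is not needed. A short illustration of what that annotation buys follows: a function typed against `Mapping` accepts any read-only mapping, not only a `dict`. The function `describe` and its inputs are made up for this example.

```python
from collections.abc import Mapping
from types import MappingProxyType


def describe(options: Mapping[str, str]) -> str:
    """Read-only consumer: needs lookup and iteration, never mutation."""
    return ", ".join(f"{key}={value}" for key, value in sorted(options.items()))


# A read-only view is a valid Mapping, although it is not a dict.
frozen = MappingProxyType({"suffix": ".json", "tag": "v2.10.0"})
summary = describe(frozen)
```

A type checker would also flag any attempt to assign into `options` inside `describe`, since `Mapping` declares no `__setitem__`.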

Commits on Nov 30, 2024

  1. d20e9c1

Commits on Dec 2, 2024

  1. feat(DRAFT): Utilize datapackage schemas in pandas backends

    Provides a generalized solution to `pd.read_(csv|json)` requiring the names of date columns to attempt parsing.
    cc @joelostblom
    
    The solution is possible in large part thanks to vega/vega-datasets#631
    
    #3631 (comment)
    dangotbanned committed Dec 2, 2024
    909e7d0
  2. d93fda1
  3. 9274284
  4. 8e232b8
  5. 9330895
  6. feat: Use _pl_read_json_roundtrip instead of pl.read_json for `pyarrow`
    
    Provides better dtype inference
    dangotbanned committed Dec 2, 2024
    caf534d
  7. 75bf2ba
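
The datapackage commit in this group notes that `pd.read_(csv|json)` need the names of date columns before they attempt parsing. A hedged sketch of how those names might be pulled from a datapackage-style resource follows, assuming the usual `schema.fields` layout with `name` and `type` keys. The `resource` dict and the `date_columns` helper are invented for illustration and are not taken from the PR.

```python
def date_columns(resource):
    """Return names of fields typed as date or datetime in a resource schema."""
    fields = resource.get("schema", {}).get("fields", [])
    return [f["name"] for f in fields if f.get("type") in {"date", "datetime"}]


# Made-up resource entry, shaped like a datapackage descriptor.
resource = {
    "name": "example.csv",
    "schema": {
        "fields": [
            {"name": "date", "type": "date"},
            {"name": "price", "type": "number"},
        ]
    },
}
columns = date_columns(resource)
# With pandas installed, the names could then be passed along, for example:
#   pd.read_csv(path, parse_dates=columns)
#   pd.read_json(path, convert_dates=columns)
```

Deriving the list from metadata avoids hard-coding date column names per dataset.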