Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Saving tables as parquet #40

Open
wants to merge 9 commits into
base: master
Choose a base branch
from
Open

Saving tables as parquet #40

wants to merge 9 commits into from

Conversation

siranipour
Copy link
Contributor

@siranipour siranipour commented Feb 10, 2021

Requires pandas 1.2.0 to save multi index df's

also needs the columns to be changed to str but that should be fine. Other than that, it works really nicely

Also requires pyarrow dependency. Which pandas looks for before looking for fastparquet

p.s it'd be nice to do editable installs, feels like im coding with c++ using this flint thing

@Zaharid
Copy link
Contributor

Zaharid commented Feb 10, 2021

Cool. Would be good to have command line options controlling this.

also needs the columns to be changed to str but that should be fine. Other than that, it works really nicely

We may want to make sure that various vp actions only trade in strings, e.g. for the replica numbers.

p.s it'd be nice to do editable installs, feels like im coding with c++ using this flint thing

flit install --symlink

should work.

@Zaharid
Copy link
Contributor

Zaharid commented Feb 18, 2021

I think I rather have this fail on non string column names than it being converted silently. Ultimately we would like this to be able to round trip. Ideally for any dataframe but we will have to live with the constrains here.

@Zaharid
Copy link
Contributor

Zaharid commented Feb 18, 2021

Looks good from a quick look.

@siranipour
Copy link
Contributor Author

Think we can merge this boss?

@siranipour
Copy link
Contributor Author

I'm interested to see if the VP test will pass if we do

@siranipour
Copy link
Contributor Author

bump

@wilsonmr
Copy link
Contributor

wilsonmr commented Mar 24, 2021

Just to understand correctly. If in the environment I fix it to save CSVs still then (once the optional dependencies is a thing) this will just be the old behaviour?

EDIT: I like the look of this but for another project which uses reportengine I want to have a get out of jail free card if this causes some issues.

@siranipour
Copy link
Contributor Author

Yes in principle if you do app runcard.yaml --table-formats csv, it should just be doing what it used to. And once the optional install is in place it will be installing the same dependencies too (except for pandas >= 1.2.0)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants