replace default parquet engine #71

Open
wants to merge 5 commits into base: main

Conversation

@gnzsnz gnzsnz commented Mar 22, 2024

  • deprecate Fastparquet
  • enable Pyarrow as default engine

Existing stores that use fastparquet will require a manual update to metadata.json:

"engine": "fastparquet" --> "engine": "pyarrow"
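
The manual edit described above could also be scripted. The following is a minimal sketch, not part of this PR: the helper name `switch_engine` is hypothetical, and it assumes the metadata file is plain JSON with a top-level "engine" key, as the line above suggests.

```python
import json
from pathlib import Path


def switch_engine(metadata_path, old="fastparquet", new="pyarrow"):
    """Rewrite the "engine" entry of one metadata file; return True if it changed.

    Hypothetical helper: assumes the file is a JSON object with a
    top-level "engine" key.
    """
    path = Path(metadata_path)
    metadata = json.loads(path.read_text(encoding="utf-8"))
    if metadata.get("engine") != old:
        return False  # nothing to migrate, leave the file untouched
    metadata["engine"] = new
    path.write_text(json.dumps(metadata, ensure_ascii=False), encoding="utf-8")
    return True
```

It would be called once per affected file, for example `switch_engine("/path/to/store/metadata.json")`, where the path is a placeholder for wherever the store keeps its metadata.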

Changes to be committed:
modified: CHANGELOG.rst
modified: README.rst
modified: examples/pystore-tutorial.ipynb
modified: meta.yaml
modified: pystore/collection.py
modified: pystore/item.py
modified: pystore/store.py
modified: requirements.txt
modified: setup.py

gnzsnz added 4 commits May 5, 2024 20:44
- Rename the metadata.json file to pystore_metadata.json, as the old name clashes with pyarrow
- Fix append. Dask does not support a parquet write with overwrite on the same file.
  Added support for append by writing to a temporary directory.
- Remove the chunksize parameter, as it is not used
- Fix overwrite in collection.write
- Update requirements: only dask[dataframe] is needed
- Update CHANGELOG: state the change to metadata.json required to avoid conflicts with pyarrow
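
The temp-directory approach mentioned for append can be illustrated generically. This is a sketch under stated assumptions, not the code in this PR: `replace_dir_via_temp` and `write_fn` are hypothetical names, and in PyStore's case `write_fn` would presumably be the step that writes the combined (existing plus new) data as parquet.

```python
import shutil
import tempfile
from pathlib import Path


def replace_dir_via_temp(target, write_fn):
    """Build a new copy of `target` in a temp directory, then swap it in.

    Hypothetical helper: `write_fn(tmp_path)` must write the complete new
    dataset into `tmp_path`.
    """
    target = Path(target)
    target.parent.mkdir(parents=True, exist_ok=True)
    # Create the temp directory next to the target so the final rename
    # stays on the same filesystem.
    tmp = Path(tempfile.mkdtemp(dir=target.parent, prefix=target.name + ".tmp-"))
    try:
        # write_fn can still read from `target` here, because nothing
        # has been overwritten yet.
        write_fn(tmp)
        if target.exists():
            shutil.rmtree(target)
        tmp.rename(target)
    except BaseException:
        # Clean up the partial copy and re-raise the original error.
        shutil.rmtree(tmp, ignore_errors=True)
        raise
    return target
```

The swap at the end is not atomic: a failure between removing the old directory and renaming the new one would lose the old data, so a real implementation may want a backup step.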