Backport data source to pyspark 3 #11

Merged: 12 commits merged into main on May 29, 2025

lhoestq (Member) commented May 22, 2025

useful for demos, since spark 4 is not out yet (sooooon)

TODO:

  • reader
  • writer

Backport:

  • 3.5
  • 3.4
  • 3.3
  • 3.2 (doesn't have mapInArrow)

@lhoestq lhoestq marked this pull request as ready for review May 27, 2025 16:52
lhoestq (Member, Author) commented May 27, 2025

I also updated the README a bit, this is ready to merge :)

@lhoestq lhoestq requested a review from allisonwang-db May 27, 2025 17:43
wengh (Collaborator) commented May 28, 2025

btw pyspark 4.0.0 showed up on PyPI https://pypi.org/project/pyspark/ but it's not on the official website yet (?)

allisonwang-db (Collaborator) left a comment

@lhoestq lhoestq merged commit c0fbd4c into main May 29, 2025