Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

filename column inaccessible with pandas backend and parquet #427

Closed
praateekmahajan opened this issue Dec 13, 2024 · 0 comments · Fixed by #449
Closed

filename column inaccessible with pandas backend and parquet #427

praateekmahajan opened this issue Dec 13, 2024 · 0 comments · Fixed by #449
Assignees
Labels
bug Something isn't working

Comments

@praateekmahajan
Copy link
Collaborator

Describe the bug

When add_filename=True you can't access the column name because I believe it's a reserved name. Changing it to file_name or path will solve it.

Steps/Code to reproduce bug

Please list minimal steps or code snippet for us to be able to reproduce the bug.

A helpful guide on on how to craft a minimal bug report http://matthewrocklin.com/blog/work/2018/02/28/minimal-bug-reports.

Expected behavior

A clear and concise description of what you expected to happen.

Environment overview (please complete the following information)

  • Environment location: [Bare-metal, Docker, Cloud(specify cloud provider - AWS, Azure, GCP, Collab)]
  • Method of NeMo-Curator install: [pip install or from source]. Please specify exact commands you used to install.
  • If method of install is [Docker], provide docker pull & docker run commands used

Environment details

If NVIDIA docker image is used you don't need to specify these.
Otherwise, please provide:

  • OS version
  • Dask version
  • Python version

Additional context

Add any other context about the problem here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant