-
Notifications
You must be signed in to change notification settings - Fork 414
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Preserve input data types #149
Conversation
output heterogeneous data types
Tests failed on README.rst, line 187 for 2 reasons
|
The only failing check now is number 2. With permission, I would like to edit the test so that the output columns of |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for your contribution! I was going to comment requesting changes in your PR, but I did them myself in https://github.com/scikit-learn-contrib/sklearn-pandas/pull/153/files. If everything looks good I'm gonna merge that one.
@@ -504,3 +509,4 @@ Other contributors: | |||
* Ritesh Agrawal (@ragrawal) | |||
* Vitaley Zaretskey (@vzaretsk) | |||
* Zac Stewart (@zacstewart) | |||
* Timothy Sweetser (@hacktuarial) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please preserve the alphabetic order. :)
Looks great! Thanks for reviewing. |
As discussed in #138, when using
DataFrameMapper
withdefault=None
, the current behavior is to create anp.array
with the unselected columns. This has the undesired side effect of casting them to a common data type. This PR preserves the data types of unselected columns whendefault=None, input_df=True, output_df=True
.