-
Notifications
You must be signed in to change notification settings - Fork 327
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
improve numeric type inference in xsv stats #22
Comments
Here is the output I see:
Notice that the type inferred for the
I expect there is room for |
Hi Burnt, Yes, I had a feeling that how it interpreted the data may affect the outcome of it's results. How did you change it from unicode to integer? Thanks! |
I fixed your data:
|
Hi, So correcting that one field made it read all records as integer. Got it, thanks! |
Yes. There's some middle ground here, but we have to be careful with the heuristics we choose. There's no need to close this issue. Improving |
Hi Brunt,
I apologize about that and I look forward to any improvements you may bring forth. Thanks! |
What about being able to force the type -- no need to implement dangerous heuristics, leave the responsibility to the user. The only thing to implement would be to be able to interpret the plain text as number, however it is formatted. |
A few years later -- I often get 90% of the way with xsv slice / search and then want to end with a Thanks for a cool tool :) |
Hello,
Sample data here:
http://www.sharecsv.com/s/9096d32f98aa0ac671a1cca16fa43be8/SalesJan2009.csv
Running
xsv stats SalesJan2009.csv --everything | xsv table
Why does it say the price max is 1200 and the min is 800? The max is actually 7500 and I see a record for 250.
Thanks!
The text was updated successfully, but these errors were encountered: