-
Notifications
You must be signed in to change notification settings - Fork 22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Converter for route_id #62
Comments
Thanks for reporting the issue. This does appear to be a bug. I will take a closer look in the next week or so.
You seem to have a good understanding of the internals of partridge. If you would like to implement your suggestion I'd gladly review it and work with you to get it merged. |
An intermediate step could be to detect this case and raise a descriptive error. @TheFrok what do you think about that as a next step? |
I started working on it. After I gave it some thought I decided that since conversion function can't always be reversed it might not be a option to "reconvert" the column back to I tried to push my changes but I don't have the right permission, so if you could please help me with that :) |
@TheFrok I recommend using the workflow described here: https://help.github.com/en/github/collaborating-with-issues-and-pull-requests/creating-a-pull-request-from-a-fork. You can fork this repo into your account, push a branch to your repo, and create a pull request into this repo. |
Cool, I'll do that. |
I need to refresh my memory on some of the codebase. Let me take a closer look this week and make some recommendations. Thanks for your patience. |
Testing some ideas on a new branch. It's a work in progress. https://github.com/remix/partridge/compare/dw/fix-pruning-dtype-mismatch |
You can take a look at this - master...TheFrok:id-column-converter . |
@invisiblefunnel I tried to use that, and it worked for me except for the fact that I got an error when tried to prune int8 with int16. But that suppose simpler to fix :) |
@TheFrok I prefer your approach because it doesn't interfere with filtering/pruning! Can you make a pull request? I will review and leave comments later this week. |
Description
I tried to change the types of the
_id
columns (i.e.route_id
) in some table from dtype object to numeric, to lower the memory usage. I did that by adding a converter to the default config.It went fine at, but the DataFrames came back empty. I looked into that a little bit and I think it is because the read_file method does the
prune
part before the type conversion, causing the comparison of object column (the column in the current table) with numeric column (from the dependency table that is type converted).I'm not sure what would be the right solution for that, maybe changing both columns to object before comparison.
What I Did
The text was updated successfully, but these errors were encountered: