-
Notifications
You must be signed in to change notification settings - Fork 603
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
csvsort seems to hang on large files/sort jobs #157
Comments
Yep. It's the type inference--it has to buffer the whole file into memory, iterate over it to figure out what the type is, iterate over it again to coerce to the correct type and then finally sort all that data. It's crazy slow. Not sure what to do about it. |
Bummer. I wonder how CSVFix handles it, and how the performance is. |
How about an option to skip type coercion? This would be similar to specifying that all columns are "text" when opening in LibreOffice. |
You can now pass |
I have a rather large (42m) CSV. I'm sitting here looking crazy waiting while csvsort has been running for over 10 minutes trying to sort the file. To perform the same sort in Excel or LibreOffice is slow, but still less than a minute.
The text was updated successfully, but these errors were encountered: