writecsv performance #3483

Closed
ViralBShah opened this issue Jun 21, 2013 · 3 comments
Labels
domain:io Involving the I/O subsystem: libuv, read, write, etc. performance Must go faster

Comments

@ViralBShah
Member

Now, readcsv is a whole lot faster, but writecsv is really slow. Perhaps the same kinds of issues as #3350.

@JeffBezanson
Sponsor Member

Just tried a 435 MB CSV of random numbers; reading it took 11 sec and writing it back out took 7 sec. Can you give a case where it is slow?

@ViralBShah
Member Author

Try the file in my home directory on julia.mit.edu. Read it and then write the result back.

ViralBShah pushed a commit that referenced this issue Jun 22, 2013
buffered writedlm. substring print invokes write. fixes #3483
@ViralBShah
Member Author

Thanks @tanmaykm. Now, readcsv is as fast as DataFrames.readtable for UTF-8 strings, and writecsv is significantly faster: 4x on my test file.
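
For readers wondering what "buffered writedlm" amounts to, here is a minimal sketch of the general idea, assuming the speedup comes from formatting cells into an in-memory buffer and writing to the file once rather than once per cell. This is not the code from the referenced commit; the function name `write_delimited_buffered` and its `delim` keyword are hypothetical and exist only for illustration.

```julia
# Hypothetical helper, not part of Base: format the whole matrix into an
# in-memory buffer, then hand the bytes to the file in a single write.
function write_delimited_buffered(path::AbstractString, A::AbstractMatrix; delim::Char = ',')
    buf = IOBuffer()
    nrows, ncols = size(A)
    for i in 1:nrows
        for j in 1:ncols
            print(buf, A[i, j])            # format the cell into memory
            j < ncols && print(buf, delim) # delimiter between cells only
        end
        print(buf, '\n')                   # end of row
    end
    open(path, "w") do io
        write(io, take!(buf))              # one large write instead of many small ones
    end
end
```

For very large matrices, a real implementation would likely flush the buffer periodically instead of holding the entire output in memory; the sketch skips that to keep the idea visible.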
