DataFrame.WriteCsv should use UTF8 encoding #5645

Closed
pgovind opened this issue Dec 1, 2020 · 1 comment
Assignees
Labels
Microsoft.Data.Analysis All DataFrame related issues and PRs

Comments

@pgovind

pgovind commented Dec 1, 2020

DataFrame.WriteCsv currently uses ASCII encoding. Prototyping a switch to UTF8 suggests that the WriteCsvNoHeader unit test will fail. A possible reason is the presence of UTF8-BOM bytes in the output. This needs to be investigated.
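To make the BOM hypothesis concrete, here is a minimal sketch that uses only `System.IO` and `System.Text`. It does not call `DataFrame.WriteCsv` or reproduce the actual unit test; `BomDemo` and `WriteWith` are hypothetical names chosen for illustration. It shows that a `StreamWriter` created with `Encoding.UTF8` prepends three preamble bytes, while ASCII and a BOM-less `UTF8Encoding` do not, so a byte-for-byte comparison against ASCII output would plausibly break.

```csharp
using System;
using System.IO;
using System.Text;

public static class BomDemo
{
    // Hypothetical helper: writes text through a StreamWriter with the
    // given encoding and returns the raw bytes that reached the stream.
    public static byte[] WriteWith(Encoding encoding, string text)
    {
        using var stream = new MemoryStream();
        using (var writer = new StreamWriter(stream, encoding, bufferSize: 1024, leaveOpen: true))
        {
            writer.Write(text);
        }
        return stream.ToArray();
    }

    public static void Main()
    {
        const string csv = "1,2\n";

        byte[] ascii = WriteWith(Encoding.ASCII, csv);
        byte[] utf8WithBom = WriteWith(Encoding.UTF8, csv);
        byte[] utf8NoBom = WriteWith(new UTF8Encoding(encoderShouldEmitUTF8Identifier: false), csv);

        Console.WriteLine(BitConverter.ToString(ascii));       // 31-2C-32-0A
        Console.WriteLine(BitConverter.ToString(utf8WithBom)); // EF-BB-BF-31-2C-32-0A
        Console.WriteLine(BitConverter.ToString(utf8NoBom));   // 31-2C-32-0A
    }
}
```

If the BOM does turn out to be the cause, one option would be to use a `UTF8Encoding` constructed with `encoderShouldEmitUTF8Identifier: false`; whether that is the right default for the library is a separate design question.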

@pgovind pgovind self-assigned this Dec 1, 2020
@pgovind pgovind transferred this issue from dotnet/corefxlab Mar 6, 2021
@pgovind pgovind added the Microsoft.Data.Analysis All DataFrame related issues and PRs label Mar 6, 2021
@luisquintanilla
Contributor

Can't repro in version 0.20.0-preview.22313.1. You can use the encoding parameter to provide UTF8 encoding. However, when the DataFrame is saved to a file, values that contain the separator aren't quoted. This causes issues similar to #5647.
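The quoting problem described above can be illustrated with a small sketch. `CsvQuoting`, `QuoteField`, and `JoinRow` are hypothetical helpers, not part of Microsoft.Data.Analysis; they show the common CSV convention (as in RFC 4180) of wrapping a field in double quotes when it contains the separator, a double quote, or a line break, and doubling any embedded quotes.

```csharp
using System;
using System.Linq;

public static class CsvQuoting
{
    // Hypothetical helper (not a library API): quotes a field only when it
    // contains the separator, a double quote, or a line break.
    public static string QuoteField(string value, char separator = ',')
    {
        bool needsQuotes = value.IndexOf(separator) >= 0
            || value.IndexOf('"') >= 0
            || value.IndexOf('\n') >= 0
            || value.IndexOf('\r') >= 0;

        if (!needsQuotes)
        {
            return value;
        }

        // Embedded double quotes are escaped by doubling them.
        return "\"" + value.Replace("\"", "\"\"") + "\"";
    }

    // Joins already-stringified fields into one CSV row.
    public static string JoinRow(string[] fields, char separator = ',')
    {
        return string.Join(separator.ToString(), fields.Select(f => QuoteField(f, separator)));
    }

    public static void Main()
    {
        Console.WriteLine(JoinRow(new[] { "Seattle, WA", "42" }));
        // prints: "Seattle, WA",42
    }
}
```

Without the quotes, the row above would read back as three columns instead of two, which is the kind of failure the comment refers to.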

@ghost ghost locked as resolved and limited conversation to collaborators Aug 31, 2022

2 participants