-
-
Notifications
You must be signed in to change notification settings - Fork 8k
Unexpected Automatic Date Conversion #2196
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
To be clear, this ultimately boils down to how V8 (Chrome/Node) handles dates: new Date("Mayslanding, NJ 08234"); // Thu May 01 8234 00:00:00 in your timezone We try to correct for it in if(s.toLowerCase().match(/\b(jan|feb|mar|apr|may|jun|jul|aug|sep|oct|nov|dec)\b/)) return o;
if(s.toLowerCase().match(/\b(january|february|march|april|may|june|july|august|september|october|november|december)\b/)) return o; |
I'll take a closer look at those and put together a PR when I get a chance. |
I am also seeing this when creating an excel file in my project. In our case, the data is a string "1/$1.99" It gets turned into a date with some of the numbers but some randomness as well. 1/2/1999 |
@jbull328, just to confirm, you're seeing that behavior on creation/output of a document with that data? I am not seeing it on input in |
Correct @matthew-macgregor we did see that on output. I ended up abandoning using the module. |
@jbull328 If this is showing up when you write a CSV, that's Excel's automatic conversion. For example, consider the CSV
If you write that as plaintext to a file and open in Excel, it will interpret those as dates: In any case that's an issue unrelated to this one. |
I want to confirm that the date parsing strategy of sheetjs is a bit too aggressive for my taste. Effectively I can only use There are dozens of issues on this topic. If I switch on Is there something in between? So a config that will parse numbers if possible, and leaves the rest as string? |
@flaushi it's aggressive because V8 (chrome/node) is aggressive. |
Moving this to #1300 |
0.18.1 fixes this issue. To verify in NodeJS: $ for v in 0.17.4 0.17.5 0.18.0 0.18.1; do npm i "xlsx@$v"; node -pe 'var XLSX = require("xlsx"); [XLSX.version, XLSX.readFile("t.csv").Sheets.Sheet1.A1]'; done [ '0.17.4', { t: 'n', v: 2313448, w: '1/1/34' } ]
[ '0.17.5', { t: 'n', v: 2313448, w: '1/1/34' } ]
[ '0.18.0', { t: 'n', v: 2313448, w: '1/1/34' } ]
[ '0.18.1', { t: 's', v: 'Januaryville, NJ 08234' } ] |
Issue Overview
One of our systems which uses
sheetjs
for parsing csv/xlsx data produced an unexpected result this week. The data which caused the issue was"Mayslanding, NJ 08234"
, which the library coerced to adate
as{ t: 'n', v: 2313568, w: '5/1/34' }
. My hunch is that the code which attempts to guess if a cell might be a date is too lax given some inputs.This happens when
raw: false
, which is the default. We have worked around it by simply settingraw: true
and doing our own type conversions. Still, this unexpected result seems likely to trip up others using the default setting.I am willing to volunteer to submit a PR to fix this issue but would like some feedback from a maintainer before writing any code. My guess is that any changes to this date conversion code could very easily cause other unexpected side effects, so advice would be appreciated.
Steps to Reproduce:
Given the (somewhat fabricated) csv data below, the following code will produce the result:
CSV input:
Unexpected result:
The text was updated successfully, but these errors were encountered: