Designate header row when using `from_xls` or `from_xlsx` #7

rdmurphy · 2016-03-28T00:15:01Z

In our tx_salaries repo, we use csvkit (and by extension, now agate-excel!) to prep Excel files for processing. An extra option we also have — after the conversion has happened — is to peel off rows until we hit the header row. We always know where that header row is, so we can pass that index in and be good to go.

When I was doing some tests with agate-excel directly, I quickly hit that wall. It gets a little funky if an Excel spreadsheet has some useless extra rows before the data.

That sound like something that'd be worth having as part of the package? I imagine it could be another option to be passed into the from_ commands.

table = agate.Table.from_xlsx('i_dont_want_your_extra_rows.xlsx', header_row=4)

The biggest decision to make? Do you use zero-based indexing, or start at 1 like Excel? 😬 (We forget what we picked constantly.)

The text was updated successfully, but these errors were encountered:

jpmckinney · 2017-01-04T18:59:27Z

Related wireservice/csvkit#669

jpmckinney · 2017-01-27T23:16:43Z

Also wireservice/csvkit#336

Add skip_lines like Table.from_csv, closes #7

jpmckinney closed this as completed in 09161b3 Jan 28, 2017

jpmckinney pushed a commit that referenced this issue Jan 28, 2017

Merge pull request #18 from wireservice/skip_lines

9b68d3c

Add skip_lines like Table.from_csv, closes #7

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Designate header row when using `from_xls` or `from_xlsx` #7

Designate header row when using `from_xls` or `from_xlsx` #7

rdmurphy commented Mar 28, 2016

jpmckinney commented Jan 4, 2017

jpmckinney commented Jan 27, 2017

Designate header row when using from_xls or from_xlsx #7

Designate header row when using from_xls or from_xlsx #7

Comments

rdmurphy commented Mar 28, 2016

jpmckinney commented Jan 4, 2017

jpmckinney commented Jan 27, 2017

Designate header row when using `from_xls` or `from_xlsx` #7

Designate header row when using `from_xls` or `from_xlsx` #7