Plaintext draft parsing fails to extract document date and title with long author lists #5731

jennifer-richards · 2023-05-31T21:05:40Z

Describe the issue

On a draft where the document creation date appears after line 15, as happens with long author lists, the _stripheaders() helper method ends the first page just before the date. PlaintextDraft then uses this paginated output to extract (among other things) the title and creation date, assuming that both fields appear on the first page of the draft. As a result, neither can be extracted when the author list is long.

This can be fixed by modifying PlaintextDraft to consider the first two pages instead of just the first page when extracting these fields.

Alternatively, _stripheaders() could be changed, but its current behavior is quite intentional, so I'm worried that a change might have other consequences.
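To illustrate the first option (searching the first two pages rather than only the first), here is a minimal sketch. It is not the datatracker's actual code: `find_creation_date`, `DATE_RE`, and the `max_pages` parameter are hypothetical names, the date pattern is a simplified assumption, and `pages` stands in for whatever list of page strings the paginated output provides.

```python
import re

# Hypothetical, simplified date pattern (an assumption for this sketch):
# matches "31 May 2023", "May 31, 2023", or "May 2023".
MONTHS = (
    r"(?:January|February|March|April|May|June|July|"
    r"August|September|October|November|December)"
)
DATE_RE = re.compile(
    rf"\b(?:(\d{{1,2}})\s+)?({MONTHS})\s+(?:(\d{{1,2}}),\s+)?(\d{{4}})\b"
)


def find_creation_date(pages, max_pages=2):
    """Return the first date-like string in the first `max_pages` pages.

    `pages` is assumed to be a list of strings, one per page. Returns
    None when no date-like string is found in the searched pages.
    """
    # Join the leading pages so a date pushed past the first page
    # break by a long author list is still inside the searched text.
    head = "\n".join(pages[:max_pages])
    match = DATE_RE.search(head)
    return match.group(0) if match else None


# Usage: a long author list pushes the date onto the second "page".
page_one = "Network Working Group" + "".join(
    f"\n{'':50}A. Author{i}" for i in range(1, 16)
)
page_two = f"{'':50}31 May 2023\n\n\n        An Example Title\n"
pages = [page_one, page_two]

first_page_only = find_creation_date(pages, max_pages=1)
first_two_pages = find_creation_date(pages, max_pages=2)
```

With only the first page searched, the lookup comes back empty; widening the window to two pages finds the date without touching how the pages are split, which is why this option looks less risky than changing _stripheaders().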

Code of Conduct

I agree to follow the IETF's Code of Conduct


jennifer-richards added the bug, medium, and component: submit/ labels May 31, 2023

jennifer-richards mentioned this issue Jun 1, 2023

feat: Extract document creation date from XML draft #5733

Merged

rjsparks added the accepted label Jun 15, 2023

jennifer-richards mentioned this issue Jul 11, 2023

fix: send the whole txt submission to the DraftParser #5956

Merged

rjsparks mentioned this issue Jul 13, 2023

I-D Submission fails with "no date" when date is present #3571

Open
