Error when using AUTO with recursive depth issue #102

@pramitchoudhary

I'm running into the error below when parsing the attached file with this call:

result = parse(".../Lalayan_vs_Garland.pdf", parser_type="AUTO", page_nums=1, pages_per_split=1)

Error:

2025-06-06 12:34:54.736 | DEBUG    | lexoid.core.utils:router:587 - Using STATIC_PARSE for PDF without images.
2025-06-06 12:34:54.737 | DEBUG    | lexoid.api:parse_chunk:61 - Auto-detected parser type: ParserType.STATIC_PARSE
2025-06-06 12:34:54.737 | DEBUG    | lexoid.api:parse_chunk:67 - Using static parser
---------------------------------------------------------------------------
RecursionError                            Traceback (most recent call last)
File ~/.jupyter_venv/lib/python3.11/site-packages/pdfplumber/utils/pdfinternals.py:78, in resolve_all(x)
     77 try:
---> 78     return resolve_all(resolved)
     79 except RecursionError as e:
...
MalformedPDFException: maximum recursion depth exceeded while calling a Python object
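
The traceback shows a recursive `resolve_all` that catches `RecursionError` and surfaces it as a "malformed" exception. The toy example below reproduces that pattern on a self-referencing list. It is only an illustrative sketch: `MalformedStructureError` and this `resolve_all` are hypothetical stand-ins, not pdfplumber's code, and a reference cycle is just one plausible trigger, not a confirmed cause for this file.

```python
class MalformedStructureError(Exception):
    """Hypothetical stand-in for a parser's 'malformed input' exception."""


def resolve_all(x):
    """Recursively copy nested lists (toy version, not pdfplumber's code).

    A structure that contains itself never bottoms out, so Python
    eventually raises RecursionError, which is re-raised here as a
    domain-specific error.
    """
    try:
        if isinstance(x, list):
            return [resolve_all(item) for item in x]
        return x
    except RecursionError as e:
        raise MalformedStructureError(str(e)) from e


# A list that contains itself: the simplest possible reference cycle.
cyclic = []
cyclic.append(cyclic)

try:
    resolve_all(cyclic)
except MalformedStructureError as exc:
    print(f"Malformed input: {exc}")
```

If the attached PDF contains objects that reference each other in a similar way, that could explain why only this version of the document fails.
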

  • The same file parses fine in LLM_PARSE mode.
  • A different version of the same document parses without error.
  • I'm not sure whether the attached file is corrupted, since the other version was read without problems.
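
Since LLM_PARSE works for this file, a possible stopgap is to retry with it when AUTO fails. The helper below is a minimal sketch under stated assumptions: `parse_with_fallback` is a hypothetical name, it assumes `parser_type="LLM_PARSE"` is accepted as a string the same way `"AUTO"` is, and it catches `Exception` broadly because the exact exception class to import is not confirmed here.

```python
def parse_with_fallback(parse_fn, path, fallback_type="LLM_PARSE", **kwargs):
    """Try AUTO first; on any failure, retry once with the fallback parser.

    parse_fn is expected to behave like lexoid's parse(path, parser_type=..., ...).
    This wrapper is a hypothetical helper, not part of lexoid.
    """
    try:
        return parse_fn(path, parser_type="AUTO", **kwargs)
    except Exception as exc:  # broad on purpose: exact exception type is an assumption
        print(f"AUTO failed with {type(exc).__name__}; retrying with {fallback_type}")
        return parse_fn(path, parser_type=fallback_type, **kwargs)


# Usage (assumes lexoid is installed and exposes parse in lexoid.api):
# from lexoid.api import parse
# result = parse_with_fallback(parse, ".../Lalayan_vs_Garland.pdf",
#                              page_nums=1, pages_per_split=1)
```

This only hides the failure; the static parser would still need a fix to handle this file directly.
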

Lalayan_vs_Garland.pdf

Labels

bug: Something isn't working
