Skip to content

issue in "text_only_sample" #9

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
Jaiaggarwaaaaal opened this issue Sep 9, 2024 · 0 comments
Open

issue in "text_only_sample" #9

Jaiaggarwaaaaal opened this issue Sep 9, 2024 · 0 comments

Comments

@Jaiaggarwaaaaal
Copy link

Jaiaggarwaaaaal commented Sep 9, 2024

This issue is for a:

 [ ] bug report -> please search issues before submitting
 [ ] feature request
 [ ] documentation issue or request
 [x] regression (a behavior that used to work and stopped in a new release)

Minimal steps to reproduce

When I run main.py in the api folder, the port connects successfully. However, when I execute main.py from the eval folder, it throws a "500 Internal Server Error"a nd the terminal output in the api directory shows the following error:
AttributeError: 'lxml.etree._ProcessingInstruction' object has no attribute 'is_phrasing'

Any log messages given by the failure

AttributeError: 'lxml.etree._ProcessingInstruction' object has no attribute 'is_phrasing'

Screenshot 2024-09-09 at 6 09 51 PM

Expected/desired behavior

I updated the unstructured library to version 0.15.9 due to issues with the old version, when running the requirements file in the api folder. However, after updating, the text-only sample raw files fail to process and throw the error, while the vision_and_text_sample raw files work correctly and produce output in the output folder.

OS

macOS

@Jaiaggarwaaaaal Jaiaggarwaaaaal changed the title issue in running eval folder's data file "text_only_sample" issue in "text_only_sample" Sep 9, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant