Skip to content

Commit

Permalink
fix bug in pdf parser (infiniflow#986)
Browse files Browse the repository at this point in the history
### What problem does this PR solve?

infiniflow#963 

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
  • Loading branch information
KevinHuSh authored May 30, 2024
1 parent 8d0b2d4 commit a5b03c7
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion deepdoc/parser/pdf_parser.py
Original file line number Diff line number Diff line change
Expand Up @@ -392,7 +392,7 @@ def _naive_vertical_merge(self):
b["text"].strip()[-1] in ",;:'\",、‘“;:-",
len(b["text"].strip()) > 1 and b["text"].strip(
)[-2] in ",;:'\",‘“、;:",
b_["text"].strip()[0] in "。;?!?”)),,、:",
b_["text"].strip() and b_["text"].strip()[0] in "。;?!?”)),,、:",
]
# features for not concating
feats = [
Expand Down

0 comments on commit a5b03c7

Please sign in to comment.