Skip to content

Commit

Permalink
Add proces graph to docs (#500)
Browse files Browse the repository at this point in the history
  • Loading branch information
bosd authored Apr 8, 2023
1 parent 86152bf commit 61f2cb2
Showing 1 changed file with 49 additions and 0 deletions.
49 changes: 49 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -31,6 +31,55 @@ Go from PDF files to this:
{'date': (2014, 8, 3), 'invoice_number': '42183017', 'amount': 4.11, 'desc': 'Invoice 42183017 from Amazon Web Services'}
{'date': (2015, 1, 28), 'invoice_number': '12429647', 'amount': 101.0, 'desc': 'Invoice 12429647 from Envato'}

```mermaid
flowchart LR
InvoiceFile[fa:fa-file-invoice Invoicefile\n\npdf\nimage\ntext] --> Input-module(Input Module\n\npdftotext\ntext\npdfminer\npdfplumber\ntesseract\ngvision)
Input-module --> |Extracted Text| C{keyword\nmatching}
Invoice-Templates[(fa:fa-file-lines Invoice Templates)] --> C{keyword\nmatching}
C --> |Extracted Text + fa:fa-file-circle-check Template| E(Template Processing\n apply options from template\nremove accents, replaces etc...)
E --> |Optimized String|Plugins&Parsers(Call plugins + parsers)
subgraph Plugins&Parsers
direction BT
tables[fa:fa-table tables] ~~~ lines[fa:fa-grip-lines lines]
lines ~~~ regex[fa:fa-code regex]
regex ~~~ static[fa:fa-check static]
end
Plugins&Parsers --> |output| result[result\nfa:fa-file-csv,\njson,\nXML]
click Invoice-Templates https://github.com/invoice-x/invoice2data/blob/master/TUTORIAL.md
click result https://github.com/invoice-x/invoice2data#usage
click Input-module https://github.com/invoice-x/invoice2data#installation-of-input-modules
click E https://github.com/invoice-x/invoice2data/blob/master/TUTORIAL.md#options
click tables https://github.com/invoice-x/invoice2data/blob/master/TUTORIAL.md#tables
click lines https://github.com/invoice-x/invoice2data/blob/master/TUTORIAL.md#lines
click regex https://github.com/invoice-x/invoice2data/blob/master/TUTORIAL.md#regex
click static https://github.com/invoice-x/invoice2data/blob/master/TUTORIAL.md#parser-static
```

## Installation

1. Install pdftotext
Expand Down

0 comments on commit 61f2cb2

Please sign in to comment.