This script will check an imap folder for unread emails. Any unread email that does not have an attachment will be converted to a pdf and then emailed to the address you specify. The script is run at a configurable interval.
This was built to integrate with paperless-ng which works with pdf attachements. However, I get many documents that are html only, so I wanted them converted to pdf for storage in paperless-ng.
The following parameters are used (defaults in parentheses):
IMAP_URL
IMAP_USERNAME
IMAP_PASSWORD
SMTP_USERNAME
(optional) uses imap username if not providedSMTP_PASSWORD
(optional) uses imap password if not providedIMAP_FOLDER
Which folder to watch for unread emailsSMTP_URL
MAIL_SENDER
: Address the mail with pdf should be sent fromMAIL_DESTINATION
: Where to send the resulting pdfSMTP_PORT
: (587)SMTP_TLS
: (True)INTER_RUN_INTERVAL
: Time in seconds that the system should wait between running the scriptPRINT_FAILED_MSG
: Flag to control printing of error messagesHOSTS
: Semicolon separated list of hosts that should be added to /etc/hosts to prevent dns lookup failuresWKHTMLTOPDF_OPTIONS
: Python dict (json) representation of wkhtmltopdf_options that can be passed to the used pdfkit libraryMAIL_MESSAGE_FLAG
: Flag to apply to email after processing.
Must be one of: SEEN (default), ANSWERED, FLAGGED, UNFLAGGED, DELETEDIMAP_FILTER
: Criteria to use when searching for mail to be processed. If no value is provided, a suitable value is determined based on theMAIL_MESSAGE_FLAG
. See imap-tools search criteria documentation for how to specify the filter. This should be in the text format (e.g.(TEXT "hello" NEW)
rather thanAND(text="hello", new=True)
)
This image is stored in the github registry, so you can use it without downloading this code repository.
The image address is ghcr.io/rob-luke/emails-html-to-pdf:latest
.
So to use it in a docker-compose it would be something like...
version: "3.8"
services:
email2pdf:
image: ghcr.io/rob-luke/emails-html-to-pdf:latest
container_name: email2pdf
environment:
- IMAP_URL=imap.provider.com
- [email protected]
- IMAP_PASSWORD=randompassword
- IMAP_FOLDER=Paperless
- SMTP_URL=smtp.provider.com
- [email protected]
- [email protected]
- INTER_RUN_INTERVAL=600
- HOSTS=127.0.0.1 tracking.paypal.com
- WKHTMLTOPDF_OPTIONS={"load-media-error-handling":"ignore"}
Open the docker-compose file and enter your details in the environment. This will run the script every minute.
docker-compose up -d
Or if you prefer you can run the script manually by running these commands.
poetry install
poetry run src/main.py
- try adding
127.0.0.1 tracking.paypal.com
to theHOSTS
env (check for missing domain in error log) - add
{"load-media-error-handling":"ignore"}
asWKHTMLTOPDF_OPTIONS
option (could be the tracking pixel that is not beeing loaded - append
"enable-local-file-access":true
or"load-error-handling":"ignore"
toWKHTMLTOPDF_OPTIONS
if you get afile://...
error - add
127.0.0.1 true
to theHOSTS
env if you get ahttp:///true/...
error
The recommended editor for development is either IntelliJ or Visual Studio Code
For Visual Studio Code, it is recommended to use the devcontainer included in the repository. With the Remote - Containers extension installed, you should be prompted to open the devcontainer when opening the folder.
For debugging, copy the env.example
file and rename it to just env
. Then edit the variables inside
to the required values for testing. These will be automatically configured when launching via either the
debug menu or by pressing F5. The env
file is included in the gitignore.
Formatting issues will cause the github build to fail. To fix formatting issues in your script, open the file and run the "Format Document" command.