Crawl and download images using Selenium + PhantomJS Using python3 and PyQt5
- Supported Search Engine: Google, Bing, Baidu
- Keywords input from keyboard, or input from line seperated keywords list file for batch process.
- Download image using customizable number of threads.
- Fully supported conditional search (eg. filetype:, site:).
- Switch for Google safe mode.
- Proxy configuration (socks, http).
- CMD and GUI ways of using are provided.
- Windows prebuilt executable release from release page.
Download Latest version of Python3.5 installer from here
Download latest version of PyQt5 install from here
Official phantomjs prebuilt executable can be downloaded from here
Then copy phantomjs.exe to ${project_directory}/bin/
pip3.exe install -r requirements.txt
pip3.exe install pyinstaller
mkdir bin
copy the downloaded phantomjs.exe from 3.1.3 into ./bin folder.
pyinstaller image_downloader_gui.spec
The bundle will be built in ./dist folder.
apt-get install python3-pip python3-pyqt5 pyqt5-dev-tools
- For PC users
Official phantomjs prebuilt executable can be downloaded from here
[Warning]: PhantomJS installed from ubuntu source by apt-get do not work in this project.
- For Raspberry Pi Users
Unofficial phantomjs prebuilt executable or .deb for raspberry pi can be downloaded from here
Add the path of phantomjs executable to $PATH, or simply copy it to /usr/local/bin/.
pip3 install -r requirements.txt
usage: image_downloader.py [-h] [--engine {Google,Bing,Baidu}]
[--max-number MAX_NUMBER]
[--num-threads NUM_THREADS] [--timeout TIMEOUT]
[--output OUTPUT] [--safe-mode] [--face-only]
[--proxy_http PROXY_HTTP]
[--proxy_socks5 PROXY_SOCKS5]
keywords