hiris1228/ocr_test

Solanaceae OCR test set

All images of preserved Solanaceae specimens come from GBIF (the Global Biodiversity Information Facility).

Description

  • Load the file url_taxon_gbifID.csv to obtain the image_url, taxon, and gbif_id values.
  • The raw data JSON files, which follow the Darwin Core (DwC) standard, are stored in separate folders named after each taxon.
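A minimal sketch of loading the index file, using only the Python standard library. It assumes the CSV has a header row whose column names are exactly image_url, taxon, and gbif_id; the helper names load_specimens and group_by_taxon are hypothetical and not part of this repository, and the sample rows are made-up placeholders rather than real GBIF records.

```python
import csv
import io


def load_specimens(csv_file):
    """Read an open CSV file into a list of dicts keyed by column name."""
    return list(csv.DictReader(csv_file))


def group_by_taxon(rows):
    """Group rows by the (assumed) 'taxon' column."""
    groups = {}
    for row in rows:
        groups.setdefault(row["taxon"], []).append(row)
    return groups


# Placeholder data standing in for the real file (values are invented).
sample = io.StringIO(
    "image_url,taxon,gbif_id\n"
    "https://example.org/a.jpg,Taxon A,1\n"
    "https://example.org/b.jpg,Taxon A,2\n"
    "https://example.org/c.jpg,Taxon B,3\n"
)
rows = load_specimens(sample)
for taxon, items in group_by_taxon(rows).items():
    print(taxon, len(items))

# With the real file, something like this should work:
# with open("url_taxon_gbifID.csv", newline="", encoding="utf-8") as f:
#     rows = load_specimens(f)
```

Note that csv.DictReader returns every value as a string, so gbif_id needs an explicit int() conversion if numeric IDs are wanted.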

OCR Test Script

pytesseract

  • To run the notebook, install the required packages, including pytesseract, PIL (distributed as the Pillow package), and thefuzz.
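The three packages can be combined roughly as follows. This is an illustrative sketch and not necessarily what the notebook does: the function names normalize, ocr_image, and similarity are hypothetical, the choice of fuzz.token_set_ratio as the score is an assumption, and pytesseract additionally needs the Tesseract OCR engine installed on the system.

```python
import re


def normalize(text):
    """Collapse runs of whitespace and lowercase the text so that
    line breaks and casing do not dominate the comparison."""
    return re.sub(r"\s+", " ", text).strip().lower()


def ocr_image(path):
    """Run Tesseract on one image file and return the recognized text."""
    # Imported here so normalize() stays usable without the OCR stack.
    from PIL import Image
    import pytesseract

    with Image.open(path) as img:
        return pytesseract.image_to_string(img)


def similarity(ocr_text, reference):
    """Return a 0-100 fuzzy match score between OCR output and a reference string."""
    from thefuzz import fuzz

    return fuzz.token_set_ratio(normalize(ocr_text), normalize(reference))


# Hypothetical usage (the file name and reference string are placeholders):
# text = ocr_image("specimen.jpg")
# print(similarity(text, "Solanum nigrum L."))
```

token_set_ratio ignores word order and repeated tokens, which tends to suit label text that Tesseract reads in an unpredictable order; fuzz.ratio or fuzz.partial_ratio are stricter alternatives.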
