OCR

Optical character recognition program based on deep learning.

How to use

Clone this repository with git clone [email protected]:GabRayz/OCR.git.
Compile the project by running the command make in the project folder.
Execute the program with ./ocr.

Graphical User Interface

The character recognition program can be used through a GUI. Use ./ocr to launch it. Drag and drop an image in the GUI to process it. The extracted text is shown on the left side. The result can then be saved into a txt file.

Command Line Interface

The CLI offers more functionalities. Type ./ocr help to list available commands. Type ./ocr {command} to see the usage of the command.

./ocr write_dataset: Creates a new learning dataset from a directory containing images to be segmented. Each image must contain the following string : 0123456789abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ'()-_.,?!:;.
./ocr learn: Trains a new neural network from a given dataset. The so created network can be saved.
./ocr read: Reads an image using the given neural network. The extracted text is displayed and saved into a txt file.

Dependencies

SDL2 : SDL2 is used to make the Graphical User Interface.
Image Magick 7 : Import, resize, rotate images.
Hunspell 7 : Hunspell is a spell checker and morphological analyzer designed for languages with rich morphology and complex word compounding and character encoding.

Features

Import images
Preprocessing :
- Grayscale
- Noise Reduction
- Contrast Enhancement
- Binarization (Otsu's Method)
- Rotation Detection (Hough's Transformation)
- Rotation Correction
Segmentation :
- Paragraphs & Lines Detection (X-Y Cut)
- Characters Detection (Connected Component Labeling CCL)
Recognition :
- Neural Network Loading
- Neural Network Recognition
Postprocessing :
- Uppercase removal
- SpellCheck (Hunspell)
Display (Graphical User Interface)

Collaborators

Gabriel Rayzal : github.com/GabRayz
Tony Heng : github.com/TonyHg
Nathan Cabasso : github.com/Vardiak

Name		Name	Last commit message	Last commit date
Latest commit History 190 Commits
Dictionnary		Dictionnary
Font		Font
Img		Img
dataset/images		dataset/images
save		save
.gitignore		.gitignore
AUTHORS		AUTHORS
Makefile		Makefile
README.md		README.md
ccl.c		ccl.c
ccl.h		ccl.h
dataset.c		dataset.c
dataset.h		dataset.h
hough.c		hough.c
hough.h		hough.h
image.c		image.c
image.h		image.h
linkedlist.c		linkedlist.c
linkedlist.h		linkedlist.h
main.c		main.c
main.h		main.h
matrix.c		matrix.c
matrix.h		matrix.h
neuralnetwork.c		neuralnetwork.c
neuralnetwork.h		neuralnetwork.h
segmentation.c		segmentation.c
segmentation.h		segmentation.h
spellcheck.c		spellcheck.c
spellcheck.h		spellcheck.h
window.c		window.c
window.h		window.h

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OCR

How to use

Graphical User Interface

Command Line Interface

Dependencies

Features

Collaborators

About

Releases

Packages

Languages

Vardiak/OCR

Folders and files

Latest commit

History

Repository files navigation

OCR

How to use

Graphical User Interface

Command Line Interface

Dependencies

Features

Collaborators

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages