Tesseract OCR – Machine Learning
Tesseract OCR is a library and engine for optical character recognition. Version 4.0 has a greater facility for neural network training. The Tesseract Wiki is a good place to start. The Tesseract V4.0 neural network in particular implements an LSTM engine.