Tesseract OCR – Machine Learning

Tesseract OCR is a library and engine for optical character recognition. Version 4.0 has a greater facility for neural network training. The Tesseract Wiki is a good place to start. The Tesseract V4.0 neural network in particular implements an LSTM engine.

DeepSpeech – Machine Learning

DeepSpeech Speech Recognition Machine Learning These are notes to the project, which seem to me worth pursuing. Having recently seen a number of AWS re:invent videos on Vision and Language Machine Learning tools at Amazon, I have ML-envy. Time to start a project, but while I wait for the Amazon Transcribe and Amazon Translate to ... Read more