Choose files or enter remote file URL
Choose OCR software
Choose language or script
Choose file format to save
The default OCR software is Tesseract-OCR 5, a neural-network (LSTM) based OCR engine that supports more than 100 languages. However, Tesseract-OCR cannot convert a scanned PDF document to an editable Word document, so if you need this function, change the OCR software option to "ExtendedOCR". If the target format is set to txt, the recognized text is displayed in a text editor.
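For readers who want to reproduce a similar run locally, here is a minimal sketch of how a language and a target format map onto a Tesseract command line. It is not this site's code: the function names `build_tesseract_command` and `run_ocr` are hypothetical, and it assumes the `tesseract` executable and the requested language data are installed. It only covers the output formats Tesseract writes itself (txt, pdf, hocr), and rejects others such as Word output.

```python
import shutil
import subprocess
from pathlib import Path

# Output formats Tesseract can write directly, selected by config name on the CLI.
TESSERACT_CONFIGS = {"txt": "txt", "pdf": "pdf", "hocr": "hocr"}


def build_tesseract_command(image_path, output_base, lang="eng", target="txt"):
    """Return the argument list for one Tesseract CLI run (hypothetical helper).

    Raises ValueError for formats Tesseract does not write itself, e.g. docx.
    """
    if target not in TESSERACT_CONFIGS:
        raise ValueError(
            f"Tesseract cannot write {target!r}; choose another OCR engine"
        )
    # CLI shape: tesseract <image> <outputbase> -l <lang> <config>
    return [
        "tesseract",
        str(image_path),
        str(output_base),
        "-l",
        lang,
        TESSERACT_CONFIGS[target],
    ]


def run_ocr(image_path, output_base, lang="eng", target="txt"):
    """Run Tesseract if it is installed and return the expected output path."""
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract executable not found on PATH")
    cmd = build_tesseract_command(image_path, output_base, lang, target)
    subprocess.run(cmd, check=True)
    # Tesseract appends the format extension to the output base name.
    return Path(f"{output_base}.{target}")
```

For example, `run_ocr("scan.png", "result", lang="deu", target="txt")` would write `result.txt` for a German-language image, assuming the `deu` language data is available.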
Different OCR software may recognize different text from the same image, so this online OCR program is designed to be open to all kinds of open-source OCR software. More OCR software will be tested and deployed later.