#

tesseract

Here are 1,085 public repositories matching this topic...

pymupdf / PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

python pdf font data-science ocr tesseract epub mupdf text-processing pdf-documents extract-data table-extraction text-shaping xps pymupdf

Updated May 28, 2024
Python

GpScanner

goksenpasli / GpScanner

Twain Scanner Application

pdf scanner wpf tesseract udf eyp win10 twain tarayici win7 win11

Updated May 28, 2024
C#

CCExtractor / ccextractor

CCExtractor - Official version maintained by the core team

c rust image ocr video image-processing tesseract subtitles tesseract-ocr dvb teletext hacktoberfest cea-608 cea-708 hacktoberfest2021

Updated May 28, 2024
C

jankstar / pydocu

fastapi server for classification of documents and extraction of data

transformers tesseract torch data-extraction document-classification parsing-library bert fastapi

Updated May 28, 2024
Python

SkeathyTomas / genshin_artifact_auxiliary

A Genshin Impact artifact rater sticking upon artifacts inside the game window. 刻晴办公桌 | 原神 | 圣遗物评分。集成在游戏窗口之上的原神圣遗物导出、评分工具，无需游戏内外来回切换对比，游戏中快速计算与查阅结果。

python ocr tesseract paddleocr genshin-impact pyside6 rapidocr

Updated May 28, 2024
Python

hamidurrk / epaper-scraper

Web scraper for extracting data from online newspapers

python tesseract asynchronous-programming sqlite3 lxml webscraping cuda-toolkit selenium-python beautifulsoup4 dataminig

Updated May 28, 2024
Python

SubhamTyagi / android-ocr

Tesseract based OCR for android

android ocr foss tesseract reader fdroid image-reader ocr-android ocr-recognition ocr-text-reader math-ocr

Updated May 28, 2024
Java

OSS-DocumentScanner

Akylas / OSS-DocumentScanner

Android document document scanning app

android pdf opencv scanner image-processing tesseract document document-scanner document-scanner-app document-scanning document-scan document-scan-to-text zxingcpp

Updated May 28, 2024
C++

tesseract-ocr / tesseract

Tesseract Open Source OCR Engine (main repository)

machine-learning ocr tesseract lstm tesseract-ocr hacktoberfest ocr-engine

Updated May 28, 2024
C++

scribeocr / scribeocr

Web interface for recognizing text, proofreading OCR, and creating fully-digitized documents.

ocr abbyy tesseract proofreading

Updated May 28, 2024
JavaScript

sivakumar-mahalingam / fastmrz

⚡Extracting the Machine Readable Zone (MRZ) from passport or any document images.

python opencv ocr tesseract passport text-recognition tesseract-ocr mrz opencv-python identity-document mrz-scanner passport-mrz

Updated May 27, 2024
Python

shelfio / aws-lambda-tesseract

6 MB Tesseract (with English training data) to fit inside AWS Lambda

nodejs ocr aws-lambda serverless npm-package tesseract node-module optical-character-recognition

Updated May 27, 2024
Shell

Franky1 / Tesseract-OCR-5-Docker

Docker Image with latest Tesseract OCR Version 5.x.x built from sources

docker ocr tesseract tesseract-ocr tesseract-5

Updated May 27, 2024
Python

GerHobbelt / mupdf

mupdf mirror/clone + extra work done / extra tooling. Geared for use with Qiqqa.

pdf ocr tesseract mupdf qiqqa

Updated May 26, 2024
C++

stscoundrel / old-danish-dictionary-builder

Build "Dictionary of the Old Danish Language" into easier-to-use data formats

kotlin python typescript spring-boot tesseract medieval-studies danish-language medieval-languages old-danish otto-kalkar

Updated May 26, 2024
Python

danpla / dpscreenocr

Program to recognize text on screen

ocr tesseract tesseract-ocr

Updated May 26, 2024
C++

GerHobbelt / qiqqa-open-source

The open-sourced version of the award-winning Qiqqa research management tool for Windows (a bleeding edge dev fork) ・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・ ☞☞☞ File any issues you find in the main repo issue tracker at https://github.com/jimmejardine/qiqqa-open-source/issues

metadata pdf tesseract citations mupdf document-classification meta-analysis document-management qiqqa

Updated May 25, 2024
TeX

OCRmyPDF

ocrmypdf / OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

python pdf ocr image-processing tesseract

Updated May 25, 2024
Python

GauravSingh9356 / J.A.R.V.I.S

Personal Assistant built using python libraries. It does almost anything which includes sending emails, Optical Text Recognition, Dynamic News Reporting at any time with API integration, Todo list generator, Opens any website with just a voice command, Plays Music, Wikipedia searching, Dictionary with Intelligent Sensing i.e. auto spell checking…

Updated May 25, 2024
Python

a943512 / PyAibote

Python package for doing RPA

python opencv cross-platform tesseract sikuli rpa tagui

Updated May 25, 2024
Python

Improve this page

Add a description, image, and links to the tesseract topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the tesseract topic, visit your repo's landing page and select "manage topics."