This repository contains a collection of layout analysis and text recogntion models.
This project is maintained by JKamlah
Dive in and explore the collection of models!
model | OCR engine | Type of model | Description | Default model |
---|---|---|---|---|
German print | Kraken | Text recognition | Kraken model for german prints trained from several datasets. See https://github.com/UB-Mannheim/kraken/wiki/Training-German-Print | Download |
German print | Tesseract | Text recognition | OCR model for german prints trained from several datasets. Best model variant for Tesseract. See https://github.com/UB-Mannheim/kraken/wiki/Training-German-Print | Download |
German print | Tesseract | Text recognition | OCR model for german prints trained from several datasets. Fast model variant for Tesseract. See https://github.com/UB-Mannheim/kraken/wiki/Training-German-Print | Download |
German newspapers | Kraken | Text recognition | Kraken model with kraken topology for german newspapers trained from several datasets. See https://github.com/UB-Mannheim/kraken/wiki/Training-German-Print | Download |
German newspapers | Kraken | Text recognition | Kraken model with sgd topology for german newspapers trained from several datasets. See https://github.com/UB-Mannheim/kraken/wiki/Training-German-Print | Download |
German newspapers | Kraken | Text recognition | Kraken model with htr+ topology for german newspapers trained from several datasets. See https://github.com/UB-Mannheim/kraken/wiki/Training-German-Print | Download |
German newspapers | Kraken | Text recognition | Kraken model with htru topology for german newspapers trained from several datasets. See https://github.com/UB-Mannheim/kraken/wiki/Training-German-Print | Download |
German newspapers | Kraken | Text recognition | Kraken model with gpt topology for german newspapers trained from several datasets. See https://github.com/UB-Mannheim/kraken/wiki/Training-German-Print | Download |
German newspapers | Kraken | Text recognition | Kraken (default) model for german newspapers trained from several datasets. See https://github.com/UB-Mannheim/kraken/wiki/Training-German-Print | Download |
German newspapers | Tesseract | Text recognition | OCR model for german newspapers trained from several datasets. Best model variant for Tesseract. See https://github.com/UB-Mannheim/kraken/wiki/Training-german-newspapers | Download |
German newspapers | Tesseract | Text recognition | OCR model for german newspapers trained from several datasets. Fast model variant for Tesseract. See https://github.com/UB-Mannheim/kraken/wiki/Training-german-newspapers | Download |
UBMA Segmentation | Kraken | Layout analysis | Kraken segmentation model for a wide range of materials. | Download |
Historical Reports 2col | Kraken | Layout analysis | A Kraken segmentation model for 2 column layout. | Download |