Skip to Main Content

Course & Subject Guides

Optical Character Recognition (OCR) @ Pitt

This LibGuide introduces users to optical character recognition (OCR), outlines OCR best practices, provides information and resources for OCR tools, and links to example OCR projects.

HTR Tools

Introductory content

Handwritten text recognition (HTR) is similar to OCR in that machine learning is used to generate transcriptions of documents, however HTR is using machine learning to transcribe handwritten documents instead of printed documents. There are different HTR tools, programs, and programming packages available for different types of HTR projects. In this LibGuide, they're organized into three categories: business, personal, and archival.

  • Archival HTR tools are used by libraries, archives, museums, and government institutions to make their digitized collections of handwritten documents searchable, as well as by researchers studying handwritten manuscripts.
  • Business HTR tools are built for businesses who use handwritten information written on physical documents in their work to be transcribed to computer text to increase access to that information and/or store it in databases. Businesses that use HTR are insurance companies, banks, and healthcare companies.
  • Personal HTR tools are available via apps on smartphones and computers. Individuals use Personal HTR to generate transcriptions of handwritten documents usually written themselves. Students may use HTR to generate text transcriptions of their handwritten class notes to study or share online. Personal HTR tools are useful for community archiving and family archiving.

Archival Tools

 

Transkribus is a software program that allows users to load documents into the program and create HTR models to generate transcriptions using PyLaia and HTR+ engines. Transkribus Expert Client is the software available for download to operate on your desktop, and Transkribus Lite is the online version of Transkribus with the same abilities to load documents, run line segmentation, transcribe ground truth, create models, and use premade models, with added capability of collaboration of multiple users working on the same document collection.

 

Business Tools

 

Handwriting OCR is one of a suite of features in SS&C Chorus Document Automation from Vivado, an AI company which engineers software for business such as banks, hospitals, and insurance agencies. SS&C Chorus Document Automation is designed for businesses to transfer information contained in handwritten documents from their clients and employees to a digital platform such as a database to easier access and assess that information in aggregate.

Personal Tools

 

Live Text is a feature from Apple IOS 15, available for use in Apple iPhones and and iPads. Live Text is HTR/OCR that can be used directly in your camera app and available in photos. If you take a picture of a sign with text on it, Live Text can detect and transcribe the text so that it is copy-and-paste-able directly from the camera app before the picture is taken or the photo in your camera roll. Live Text has some success with handwritten text as well, depending on how clear the text is written and how uniform the letters are to standardized characters.

 

Google Lens is a feature of the Google app available for Apple and Android devices. Google Lens can be used with your device's camera or in photos to extract typed or handwritten text. That extracted text can be Google-searched from the Google Lens app or copy-and-pasted.