Okapi is a new CLI tool built on ripgrep that enables bulk text editing by allowing users to find and replace patterns in context across multiple files. It was developed to address OCR errors (scannos) in large-scale digitization projects, particularly for historical government documents. The tool integrates with text editors to provide more controlled corrections than simple regex replacements.
Background
ripgrep is a popular high-performance command-line search tool, while OCR (Optical Character Recognition) often produces errors called 'scannos' when digitizing historical documents. Traditional regex replacements lack context awareness for accurate corrections.
- Source
- Lobsters
- Published
- Mar 30, 2026 at 03:14 PM
- Score
- 5.0 / 10