33 min

#6 - OCR Part 1: teaching digital machines to read paper documents Fat Tailed Thoughts

    • Technology

Bank statements, credit card statements, and tax forms all contain valuable data, but it's trapped on paper and in PDFs. We humans recognize the ink patters them as letters, but they contain no instructions for the computer. Optical Character Recognition (OCR) is how machines learn to read.

We explore the mechanics of OCR - the scale of the paper problem in financial services and why paper-based data is so difficult for computers to extract. We look at how accuracy statistics for machines can be misleading and why that results in people - lots of people - staying involved in the digitization process.

This week's conversation is a prelude to the next where we'll look at OCR startups and the tremendous business opportunities they're starting to unlock.

Check out this week's letter for the full story. Follow @FatTailThoughts on Twitter and your co-hosts @KleeBeard and @StevenDickens3 for more content.

Bank statements, credit card statements, and tax forms all contain valuable data, but it's trapped on paper and in PDFs. We humans recognize the ink patters them as letters, but they contain no instructions for the computer. Optical Character Recognition (OCR) is how machines learn to read.

We explore the mechanics of OCR - the scale of the paper problem in financial services and why paper-based data is so difficult for computers to extract. We look at how accuracy statistics for machines can be misleading and why that results in people - lots of people - staying involved in the digitization process.

This week's conversation is a prelude to the next where we'll look at OCR startups and the tremendous business opportunities they're starting to unlock.

Check out this week's letter for the full story. Follow @FatTailThoughts on Twitter and your co-hosts @KleeBeard and @StevenDickens3 for more content.

33 min

Top Podcasts In Technology

No Priors: Artificial Intelligence | Technology | Startups
Conviction | Pod People
Lex Fridman Podcast
Lex Fridman
All-In with Chamath, Jason, Sacks & Friedberg
All-In Podcast, LLC
Hard Fork
The New York Times
Acquired
Ben Gilbert and David Rosenthal
The Neuron: AI Explained
The Neuron