Digital Research Workflow Series: Optical Character Recognition for Research Efficiency
This workshop will familiarize learners with the basics of optical character recognition (OCR) and digital corpus creation. Learners will receive a walkthrough of at least one OCR program (TBD, but suggestions welcome), which they will use to OCR a document of their choosing (a sample document will also be made available). The workshop will also cover cleaning up corpora to make them machine readable, with a demonstration of software such as Notepad++ for this purpose.
The Digital Research Workflow Series is hosted by the University Libraries Teaching and Learning Unit and Open & Digital Scholarship Services and the Center for Research Data and Digital Scholarship.
- Thursday, November 8, 2018
- 1:00pm - 1:50pm