10-23-2024, 10:49 PM
Xavier (Windows XP, Office 2016)
Advanced Beginner
 
Join Date: Jul 2023
Posts: 67
PDF looks crap when OCR'd - help!

I've been working on a very large project, part of which involves converting court documents into editable Word documents (RTF files).

I've read "Cleaning up Text Pasted from Websites, E-mails, PDFs etc"
https://www.msofficeforums.com/word/...-pdfs-etc.html but it hasn't helped.

I've never had problems like this when OCR-ing PDFs before. If someone could take a look at the attached files and point me to somewhere that explains how to automate at least part of the clean-up of the Word docs, I would be really grateful.

I've attached the Word docs as .docx files, but the ones I am working with are RTF files. Unfortunately, the file uploader doesn't accept RTF files.

I've tried removing column and section breaks, and selecting everything and pressing Ctrl+Spacebar, which are tricks I've been taught in the past. But my Word doc still looks like a dog's breakfast.
Attached Files
File Type: docx Chamberlain 2nd Inquest for forum.docx (91.7 KB, 6 views)
File Type: pdf Chamberlain 2nd Inquest test for forum.pdf (226.7 KB, 10 views)