Quote:
Originally Posted by dnc
I'm actually collecting the page position of the bookmark and the cell widths and heights. This is then used to OCR just those areas after the forms are filled out and scanned in.
|
Huh? Why would you scan, then OCR something that is already text?? Sounds bizarre to me. Why not simply extract the required content directly?
Even so, what fumei says is relevant - you can interrogate the bookmarks directly (without selecting them) to get all you need to know about where they are, including where their cells are on the page and the cell dimensions, and all without ever looking at anything else in the document.