I need to collect the main headers from the document, convert it to Word, and add bookmarks to those headers automatically. For this purpose it is better to do the mining in the Word format rather than in the PDF, because the data is much more structured and the text properties I need (color, font, size, etc.) are easier to access. Unfortunately, Adobe Acrobat has no settings that change how the headers are handled. Only a few very basic options are available: Include Comments, Include Images, Recognize text where needed, and Retain Flowing Text / Retain Page Layout.
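To make the goal concrete, here is a rough sketch of the kind of mining I have in mind, written against the raw XML of the converted file (`word/document.xml` inside the .docx archive) using only the Python standard library. It rests on several assumptions: a "header" is any paragraph whose text runs all carry a directly set font size at or above a threshold, the size is stored on the runs rather than inherited from a style, and the names `bookmark_headers`, `run_size`, and `hdr_N` are made up for illustration.

```python
import xml.etree.ElementTree as ET

# WordprocessingML namespace used by word/document.xml inside a .docx file.
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
ET.register_namespace("w", W_NS)


def w(name):
    """Qualify a tag or attribute name with the WordprocessingML namespace."""
    return f"{{{W_NS}}}{name}"


def run_size(run):
    """Return the run's directly set font size in half-points, or 0 if unset."""
    rpr = run.find(w("rPr"))
    sz = rpr.find(w("sz")) if rpr is not None else None
    return int(sz.get(w("val"))) if sz is not None else 0


def bookmark_headers(document_xml, min_half_points=28):
    """Bookmark paragraphs whose text runs all meet the size threshold.

    Assumption: a "header" is any paragraph set entirely in a large font
    (w:sz is in half-points, so 28 means 14 pt).
    Returns (new_xml, [(bookmark_name, header_text), ...]).
    """
    root = ET.fromstring(document_xml)
    # Continue numbering after any bookmark ids already in the document.
    used = [int(b.get(w("id"))) for b in root.iter(w("bookmarkStart"))]
    next_id = max(used, default=-1) + 1
    headers = []
    # Snapshot the paragraphs first, since the loop modifies the tree.
    for para in list(root.iter(w("p"))):
        runs = [r for r in para.findall(w("r")) if r.find(w("t")) is not None]
        if not runs or min(run_size(r) for r in runs) < min_half_points:
            continue
        text = "".join(t.text or "" for r in runs for t in r.findall(w("t")))
        name = f"hdr_{next_id}"  # hypothetical naming scheme
        start = ET.Element(
            w("bookmarkStart"), {w("id"): str(next_id), w("name"): name}
        )
        end = ET.Element(w("bookmarkEnd"), {w("id"): str(next_id)})
        # w:pPr, when present, has to stay the first child of the paragraph.
        para.insert(1 if para[0].tag == w("pPr") else 0, start)
        para.append(end)
        headers.append((name, text))
        next_id += 1
    return ET.tostring(root, encoding="unicode"), headers
```

Passing the contents of `word/document.xml` to `bookmark_headers` returns the modified XML together with the list of collected headers. Color or font name could be checked the same way through `w:color` and `w:rFonts` in the run properties. One caveat: ElementTree renames namespace prefixes it has not been told about, which may matter for real Word files, so a prefix-preserving library such as lxml or python-docx may be the safer choice for writing the result back into the .docx.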