This is one of those 'if I was going there I wouldn't have started from here' types of reply.
If this project is yet to start, it would make things far simpler if you used content controls in the template that creates the documents with the data you want to recover. You may then find
Insert Content Control Add-In and
Extract data from forms useful
If you already have the documents, then you need to supply a sample document that shows exactly how it is formatted in order to establish the best way forward.
Personally I would extract to Excel and then use mail merge to create the documents with the extracted texts.