To give more context: it is a document library with half a million documents, and they contain more errors than any machine could make (I have read some of them... ouch!).

It is not the core of the project, but it is very important for the rest of it.
Any pointers on where to look for help with creating that macro?
Thank you very much!