Cpprep
The cpprep program is run during content preparation, right after guiprep has generated the files in the text folder.
I use cpprep to automatically correct the kinds of errors that might get past a proofer, before the files are sent up to dpscans. It also does other preprocessing that should make the proofers' work more accurate. It removes the <i>, <b> and <sc> markup from each .txt file, so that the proofing rounds can deal with the words and the later formatting rounds can worry about the markup. It looks for and corrects typical punctuation scannos, such as spaced double quotes and the digit 1 used where the letter "l" belongs in contractions. It also looks for words hyphenated across a page break. When it finds one, it inserts an asterisk at the end of the first page and at the start of the following page. Finally, it attempts to determine whether a new paragraph begins at the top of a page. If it is fairly certain that one does, it inserts a blank line at the top of that page.
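The markup removal and punctuation fixes described above can be pictured with a short Python sketch. This is not cpprep's own code: the function name clean_page and every regular expression in it are assumptions made for illustration, and they only handle the unambiguous cases (a spaced quote at the very start or end of a line, and a digit 1 inside an 'll ending).

```python
import re

# Hypothetical patterns for illustration; cpprep's real rules may differ.

# Opening or closing <i>, <b> and <sc> tags, in any letter case.
TAG_RE = re.compile(r"</?(?:i|b|sc)>", re.IGNORECASE)

# A double quote at the start of a line followed by a stray space.
OPEN_QUOTE_RE = re.compile(r'^" (?=\S)', re.MULTILINE)

# A double quote at the end of a line preceded by a stray space.
CLOSE_QUOTE_RE = re.compile(r'(?<=\S) "$', re.MULTILINE)

# A letter and an apostrophe followed by "1l", "l1" or "11" (meant to be "ll").
ONE_FOR_L_RE = re.compile(r"(?<=[A-Za-z]')(?:1l|l1|11)\b")


def clean_page(text: str) -> str:
    """Return the page text with inline markup and a few scannos removed."""
    text = TAG_RE.sub("", text)           # drop <i>, <b>, <sc> markup
    text = OPEN_QUOTE_RE.sub('"', text)   # '" Hello' becomes '"Hello'
    text = CLOSE_QUOTE_RE.sub('"', text)  # 'here, "' becomes 'here,"'
    text = ONE_FOR_L_RE.sub("ll", text)   # "I'1l" becomes "I'll"
    return text


if __name__ == "__main__":
    sample = '<i>Hamlet</i> was late.\n" I\'1l wait here, "\nshe said.'
    print(clean_page(sample))
```

A quote with a space on both sides in the middle of a line is left alone in this sketch, because it is not clear from the text alone whether it opens or closes the quotation.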
A log of the changes made is generated as cpprep.log, and the original files are backed up in the _textback_ folder.
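As a rough illustration of how the cross-page hyphen marking, the backup and the log could fit together, here is a minimal Python sketch. It is not how cpprep itself is written. The function name mark_hyphen_breaks, the one-file-per-page layout sorted by file name, the exact placement of the asterisks (after the hyphen and before the first word of the next page) and the log wording are all assumptions.

```python
import shutil
from pathlib import Path


def mark_hyphen_breaks(text_dir, backup_dir, log_path):
    """Mark words hyphenated across page files; return the log entries.

    Hypothetical sketch: assumes one .txt file per page, ordered by name.
    """
    text_dir = Path(text_dir)
    backup_dir = Path(backup_dir)
    backup_dir.mkdir(parents=True, exist_ok=True)

    pages = sorted(text_dir.glob("*.txt"))

    # Copy every original page to the backup folder before changing anything.
    for page in pages:
        shutil.copy2(page, backup_dir / page.name)

    log_lines = []
    # Walk over each pair of consecutive pages.
    for current, following in zip(pages, pages[1:]):
        first = current.read_text(encoding="utf-8").rstrip()
        second = following.read_text(encoding="utf-8")

        # A single trailing hyphen suggests a broken word; a trailing "--"
        # is treated as a dash and skipped (an assumption of this sketch).
        if first.endswith("-") and not first.endswith("--") and second.strip():
            current.write_text(first + "*\n", encoding="utf-8")
            following.write_text("*" + second.lstrip(), encoding="utf-8")
            log_lines.append(
                f"{current.name} -> {following.name}: marked hyphen break"
            )

    # Write the log, one entry per line (the format is made up here).
    log_text = "\n".join(log_lines) + ("\n" if log_lines else "")
    Path(log_path).write_text(log_text, encoding="utf-8")
    return log_lines
```

Backing up all pages first, before any file is rewritten, means the originals stay recoverable even if a later step goes wrong partway through.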