Cpprep

From DPWiki
Jump to navigation Jump to search
Replacement filing cabinet.svg Attention

This page has been kept for Archival and Historical Purposes and does not reflect the latest information and documentation regarding DP. Please see the Official Documentation for the latest information, or ask around on the Forums. Thank you.

Exquisite-khelpcenter.png Note

The script described below was an aid for text preparers, but is no longer available. The description has been kept as a possible idea for the future.

The cpprep program is used during content preparation right after guiprep has been used to generate the files in the text folder.

I use cpprep to make several automatic corrections of the kind of things that might get past a proofer before sending it up to dpscans. It also does other preprocessing that should make the proofer's work more accurate. It removes HTML markup for <i> <b> and <sc> from each .txt file, allowing the proofing to deal with the words and the later formatting round to worry about the markup. It looks for and corrects typical punctuation scannos, such as spaced double quotes, the number 1 used where the letter "l" should be in contractions, etc. It also looks for hyphen breaks across pages. If found, it inserts an asterisk at the end of the first page and at the start of the following page. Finally, it attempts to determine when there is a paragraph break at the top of a page. If it's fairly certain there is, it inserts a blank line at the top of the page where the new paragraph starts.

A log of the changes made is generated as cpprep.log, and the original files are backed-up in the _textback_ folder.