User:Theshriek/my process for CP and PM

From DPWiki

My process for CP/PM:

I use this process for CPing because I do not have ABBYY FineReader (or any other OCR software).

Steps:

  • On TIA (The Internet Archive), download the ABBYY gz file for the book.
  • Open Guiguts and go to File > Content Providing > Import TIA Abbyy OCR file. Find the file you downloaded and save it as a .txt file.
  • Run Tools > Basic Fixup
  • From the Content Providing menu, select the following, in order:
  • 1. Run Dehyphenator
  • 2. Filter File
  • 3. Fix Common English Scannos
  • 4. Add [Blank Page] to Empty Pages
  • 5. Remove Headers/Footers
  • Once the master text file is the way you want it, use the GG feature File > Content Providing > Export as Prep Text Files to split it into a separate text file for each page. Export into a subfolder called "textw".
  • Delete all the blank pages at the start and end
  • Use IrfanView batch rename to rename the files in textw, and put the renamed files in the text folder.
  • Follow steps 10-11 of User:Monicas wicked stepmother/PM process
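
If IrfanView is not available, the rename step above could also be done with a short script. The following is only a rough sketch, not part of my actual process: the function name renumber_pages is made up for illustration, and it assumes that the exported file names sort in page order and that the target names should be zero-padded numbers such as 001.txt. Check the naming your project actually needs before relying on it.

```python
import shutil
from pathlib import Path


def renumber_pages(src_dir, dst_dir, width=3, start=1):
    """Copy every .txt file from src_dir into dst_dir under sequential names.

    Assumptions (not from the original process): the files in src_dir sort
    alphabetically in page order, and the wanted names look like 001.txt.
    """
    src = Path(src_dir)
    dst = Path(dst_dir)
    dst.mkdir(parents=True, exist_ok=True)

    # Only page text files are considered; anything else is left alone.
    pages = sorted(
        p for p in src.iterdir()
        if p.is_file() and p.suffix.lower() == ".txt"
    )

    new_names = []
    for number, page in enumerate(pages, start=start):
        target = dst / f"{number:0{width}d}.txt"
        # Copy rather than move, so the textw folder stays as a backup.
        shutil.copyfile(page, target)
        new_names.append(target.name)
    return new_names
```

Calling renumber_pages("textw", "text") from the project folder, after the blank pages at the start and end have been deleted, would copy the remaining pages into the text folder with consecutive numbers. The originals in textw are left untouched, so a wrong numbering can simply be redone.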