User:Theshriek/my process for CP and PM

From DPWiki

My process for CP/PM:

I use this process for CPing because I do not have ABBYY finereader (or any other OCR software).

Next, I use the steps given to me by user Netsirk021.

  • On TIA download the ABBYY gz file and unzip to the work folder.
  • Open Guiguts and go to File > Content Providing > Import TIA Abbyy OCR file. Find the file you downloaded and save it as a .txt file.
  • Run Tools > Basic Fixup (with pretty much everything checked, but it's up to you how much you want to
  • Tools > Remove End of Line Spaces.
  • Find and replace on two single quotes (to replace with a double quote) and ^, to just delete altogether.
  • Find and replace double quote plus space. Replace with just a double quote.
  • Once you have the text file as a whole where you want it, you should then use the GG feature in File > Content Providing > Export as Prep Text Files into a subfolder called "textw" to split the master text file into separate text files for each page.
  • Delete all the blank pages at the start and end
  • Run Guiprep. Go to the Change Directory tab and make sure you are on the folder where your textw sub-folder is The first time you use Guiprep, you should set up the options you want on the Select Options tab. Once you do this, it'll save that for the future. Followed the recommendations on the DP Wiki about this (under the Select Options header here: https://www.pgdp.net/wiki/Guiprep_Insta ... tart_Guide)
  • When you've got your options selected and you're on the right directory, go to the Process Text tab and check off which things you want to run. For the most part, I do Dehypenization (see bullet below), Rename Txt Files, Filter Files, Fix Common Scannos, and Fix Zero Byte Files.
  • For dephyphenization: Run the dehyphenate routine; Look in the "text" folder that Guiprep will have created for dehyphenated versions
  • Go into the Headers & Footers tab to Get the Headers (and then later the Footers). Then you can select the ones you want to delete and the Remove Selected button will get rid of them for you.
  • Follow steps 10-17 of User:Monicas wicked stepmother/PM process