User:Theshriek/my process for CP and PM
Jump to navigation
Jump to search
My process for CP/PM:
I use this process for CPing because I do not have ABBYY finereader (or any other OCR software).
- Follow steps 1-6 of User:Monicas wicked stepmother/PM process
- Follow step 8 of User:Monicas wicked stepmother/PM process
Next:
- On TIA download the ABBYY gz file.
- Open Guiguts and go to File > Content Providing > Import TIA Abbyy OCR file. Find the file you downloaded and save it as a .txt file.
- Run Tools > Basic Fixup
- From the Content Providing menu select the follow:
- 1. Run Dehyphenator
- 2. Filter File
- 3. Fix Common English Scannos
- 4. Add [Blank Page] to Empty Pages
- 5. Remove Headers/Footers
- Once you have the text file as a whole where you want it, you should then use the GG feature in File > Content Providing > Export as Prep Text Files into a subfolder called "textw" to split the master text file into separate text files for each page.
- Delete all the blank pages at the start and end
- Use IrfanView batch rename to for textw and put in text folder.
- Follow steps 10-11 of User:Monicas wicked stepmother/PM process