User:Katt83

My PP Checklist

Here's my checklist. It's different, because I use ppgprep and then ppg. They automatically rewrap, deal with page separators, and generate both text and html from a single source. And ppgprep usually automatically renumbers and moves the footnotes/illustrations.

1   Download
1.a     text and images
1.b     Read discussion board
2   Footnotes (only if really screwed up)
2.a     1st pass, Check Footnotes
2.b     Footnote Count
2.c     Step Thru
2.d     Index
2.e     Landing Zones
2.f     Move to end-ch(text) and eof (html)
3   Pre-Post Process
3.a     Illustrations (move to top of paragraphs)
3.b     do_preproc <book> (gw2dic, ggprep, ppgprep, pppsmq)
3.c     Smart Quotes (ppsmq)
3.c.1       Resolve *ERR, *Sync
3.c.5       Resolve tildes
3.d     Fixup->Convert Windows CP to Unicode
3.e     Add header to src file
4   Sequential Inspection
4.a     Check italic, bold
4.b     Greek, etc.
4.c     Sidenotes
4.d     Mark xrefs with {}
4.e     Search for (free asterisks) (?<!/)\*(?!/)
4.f     \n\n\n
4.g     Check/Change subscripts/superscripts
4.h     Run Fixup (all opts)
5   Super-Duper Checking (only when truly paranoid or feeling fastidious)
5.a     Read Book
5.b     Save map of italic/bold from Gutwrench
5.c     Scan zoomed-in images for tb, italics, bold, etc.
5.d     Count paras per pg vs. gutwrench map
5.e     Run GutAxe
5.f     Mark spaced initials with \
5.g     Mark eop dashes with nowrap
5.h     no-wrap mdashes at end of paragraphs
5.i     emphasized text (<f>)
5.j     Note any oö?
6   Formatting Text
6.a     Front Matter
6.b     TOC
6.c     LOI
6.d     Index
6.e     any advertisements
6.f     ASCII Tables
6.g     Check TOC, LOI, and indices for accuracy
7   Word Frequency Checks
7.a     Char Cnts
7.a.1       check 1-time-only chars
7.a.2       equal counts (), [], {}
7.b     Caps Check
7.c     Accents
7.d     Emdashes
7.e     Lower
7.f     Upper
8   Scanno Checks
8.a     Jeebies
8.b     Stealth Scannos
8.b.1       eng-common
8.b.2       misspelled
8.b.3       regex
8.b.4       scannos3
8.c     Gutcheck
8.d     Spellcheck
9   Transcriber's Section
9.a     Transcriber's section in both text and html versions
9.b     Italics/bold warning in text version
9.c     List Corrections (maybe)
9.d     Spellcheck Section
10  Put out for Smooth Reading
10.a        ppspell
10.b        pptext
10.c        Zip and upload
10.d        Send out Thank-yous!
10.e        Process comments
11  Process Hi-Res Images
11.a        Number complete
11.c        Rotate
11.d        Crop (close as possible, set expand from ctr, fixed aspect, +5mm)
11.e        Set Levels
11.f        Color Selection (tolerance=16, anti-aliased, not contiguous)
11.f.1      Contract/Shrink Selection by 3 pixels
11.f.2      Add specks to selection (use lasso)
11.f.3      Feather selection by 2 pixels
11.f.4      Delete selection (make it white with DEL key)
11.h        Re-size Procedure
11.h.3      Unsharp mask (Filters > Enhance > Unsharp mask). Start off with radius 1.5, amount .75, threshold 8 and see how it looks.
11.i        Set Levels
11.j        Change color depth to 3-10 (posterize)
11.k        Save as B/W as png (full compression); color as jpeg
11.l        Set width/height for all pics
11.m        optipng or jpegoptim the pics
11.n        Thumbnails (maybe)
12  HTML
12.a        Check <title>
12.b        Title pages
12.c        Illustrations
12.d        Adverts
12.e        Tables
12.f        Indices
12.g        Sidenotes
12.h        Check chapters (h2/h3) and <p>s
12.i        Check blockquotes
12.j        Check poems
12.k        Fix fractions
12.l        Pick proper type for tbs
12.m        Update transliterations and weird characters
13  HTML Checks
13.a        Check all pf h sections for issues (quotes, dashes) (including imports)
13.b        Find orphaned markup in HTML dialog
13.c        Link Checker
13.d        Tidy
13.e        WC3 Validators
13.f        Hand-check each link
13.g        Skim whole HTML
14  PPV Checks
14.a        Compare src to orig.txt (maybe)
14.b        ppvimage
14.c        pphtml
14.d        pptext
14.e        ppspell (one last time)
14.f        READ the transcriber notes!!!
14.g        Check notes.txt one more time
15  Final Text Checking
15.a        Check word freq char counts one last time
15.b        Gutcheck again
15.c        Skim the Text again
15.d        Create a Unicode version, if needed
16  Check HTML (at the END)
16.a        Skim the HTML again
16.b        Load HTML in ie and opera
16.c        Check word freq char counts one last time
16.d        Find orphaned markup in HTML dialog
16.f        Tidy
16.g        WC3 Validator
16.h        WC3 CSS Validator
16.i        WDG HTML Validator
17  Upload
17.a        Zip files to be uploaded
17.a.1      bookname.txt (NO bin)
17.a.2      bookname.html
17.a.3      images folder (NO thumbnails/other non-images)
17.b        e-mail: Add dp-post@pgdp.net
17.c        Title properly capitalized
17.d        Subtitle
17.e        Check other periodical entries, if needed
17.f        Author's name spelling
17.g        all editors, illustrators, etc.
17.h        Copy/Edit Credit Line (CP, OCR, PM, images, PP)
17.i        Preview

Home-Grown DP Tools

gwmap2_18.zip gwmap2_19.zip	Ruby	gwmap2: Generates reference map of formatting, for final, quick PP-scan of images. (or F2, I'm flexible) Marks any errors it finds (blanks, misspelled footnotes, etc.) with *. To get command-line help: ruby gwmap2.rb -h
imgclean.zip	Ruby	imgclean: My update to Rfrank's tool to crop as much whitespace as possible from around png images. Takes as input the directory and a starting width and height (in pixels), but adjusts them in or out to encompass all of the text. Outputs the cropped pngs to a new directory, trimmed, and makes a copy of the original with a box showing the cropping to another new directory, overlay. Requires Ruby 1.8.6, RMagick, and ImageMagick. See http://github.com/rmagick/rmagick and, for Windows, http://rmagick.rubyforge.org/install-faq.html#win for more info.
imgsplit.zip	Ruby	imgsplit: Splits png images into 2 columns lengthwise, if possible. Takes as input the directory and a starting width and height (in pixels) of the minimum allowed columnar separator, and moves it around in the middle of the page until it finds a blank area that matches. Outputs the split pngs to a new directory, split, and makes a copy of the original with a box showing the column-separator to a new directory, split_debug. Requires Ruby 1.8.6, RMagick, and ImageMagick. See http://github.com/rmagick/rmagick and, for Windows, http://rmagick.rubyforge.org/install-faq.html#win for more info.
ggprep.zip pptxt.zip pphtml.zip ppvimage.zip	Perl	Tools that I think are vital to PPing. Use ggprep immediately after downloading a new PP-project. Right before uploading, run pptxt, pphtml, and ppvimage as final sanity tests. Most initially written by rfrank (exception is dcwilson's ppvimage), with major and/or minor tweaks by yours truly. Run them with -h to get help on their command-line options.
ppgprep_18.zip ppgprep_19.zip	Ruby	ppgprep: Pre-processes a text, converting it to suitable starting-input to ppg. Optionally moves illustrations and renumbers and/or moves footnotes. Run after ggprep.
trnote_perl.zip trnote_rb.zip	Perl Ruby	trnote: Tags and collects <ins> corrections at end of html. Choices of perl or Ruby. Ruby versions also work with ppg .src files.

Note: Most Ruby tools work with either Ruby 1.8.6 or Ruby 1.9. Please make sure you run the proper versions. Run Ruby -v to determine your installation.

Other Notes

See Merry's Museum for my current big project.

Contents

My PP Checklist

Home-Grown DP Tools

Other Notes

Navigation menu