User:Frau Sma/My PP Checklist

From DPWiki

Disclaimer (2012–02–11): This list is terribly outdated and does not reflect my current PPing process at all. I'm just leaving it here in case anyone might still be interested in the regexes it lists, or anything like that.

Note that this is my personal checklist. It is not meant as instruction for anyone but myself, and it should not be taken as the way to PP(V). It also doesn't contain everything I do when PP(V)ing, because there are things I don't need to remind myself about.

Things to Do Before Splitting

  1. read project comments and project thread
  2. make sure file separators end in a hyphen
    -----File(.*)[^-]$
  3. check proofer names for problematic characters
    -----File: [a-z0-9]+\.png---.*[.@-].*[^-]\-+$
  4. sequential inspection
  5. /* */ and /# #/ --> /P P/ etc. (Rewrap Markers)
  6. *
  7. orphaned brackets and markup
  8. fixup, EOL spaces
  9. replace <tb>s
  10. front matter in /X X/ or /F F/
  11. page breaks
  12. word frequency
  13. English: jeebies
  14. gutcheck
  15. spell check
  16. sidenotes?
  17. footnotes (Footnotes)
  18. poetry line numbers??
  19. balanced markup
    \<(\w+)>\n?[^<]+<(?!/\1>)
  20. malformed markup
    <\w+/>|<[^!\w/]|[^-\w"]>|,/?\w>|</?\w\.|</?>|[^<]/\w+>|</?\w+[^\w> ]
  21. duplicate words?
    [^a-zA-ZäöüÄÖÜßáéíóúÁÉÍÓÚàèìòùÀÈÌÒÙøæØÆ]([a-zA-ZäöüÄÖÜßáéíóúÁÉÍÓÚàèìòùÀÈÌÒÙøæØÆ'-]+)[ \n]+\1[^a-zA-ZäöüÄÖÜßáéíóúÁÉÍÓÚàèìòùÀÈÌÒÙøæØÆ]
  22. non-breaking spaces (abbreviations, ellipses, dashes)
  23. only vulgar fractions? if so, replace
    [0-9]/[0-9]
    ½ ¼ ¾
  24. SPLIT
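
The separator check in step 2 can be sketched outside the editor as follows. This is only an illustration in Python: the function name and the sample text (including the proofer names) are made up, and only the pattern itself comes from the list above.

```python
import re

# Pattern from step 2, unchanged. MULTILINE lets "$" match at the end of each line.
BAD_SEPARATOR = re.compile(r"-----File(.*)[^-]$", re.MULTILINE)

def separators_missing_final_hyphen(text):
    """Return 1-based line numbers of separator lines that do not end in '-'."""
    return [text.count("\n", 0, m.start()) + 1
            for m in BAD_SEPARATOR.finditer(text)]

# Hypothetical sample: the second separator has lost its trailing hyphens.
sample = (
    "-----File: 001.png---\\alice\\bob\\--------\n"
    "First page text.\n"
    "-----File: 002.png---\\alice\\bob\\\n"
    "Second page text.\n"
)
print(separators_missing_final_hyphen(sample))  # prints [3]
```

One caveat of this sketch: in Python, [^-] also matches a newline, so a separator that is directly followed by an empty line is reported even when it does end in a hyphen.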

Text Version

  1. superscripts / subscripts?
  2. [] markup for non-Latin-1 characters?
  3. convert markup
  4. German: replace >< with single quotes or `´
  5. front matter
  6. indent footnotes 1
    (\[Footnote([^[]|\n)+\])
    /###\n$1\n#/
  7. double-check for footnotes containing []
  8. tidy up footnotes
  9. indent footnotes 2
    /###
    /#[2]
  10. English: check for dangling dashes
  11. rewrap, remove end-of-line spaces
  12. German: spaced dashes
    (--[^  «]|[^  »]--)
  13. fix no-wrap sections
  14. center thought breaks
  15. remove rewrap markers
  16. add TN, corrections, rewrap appropriately
  17. transliterations
  18. replace ~ with non-breaking space
  19. gutcheck
  20. short lines?
    ^[^\n]{1,54}\n[^\s]
  21. Unicode characters?
    [\x{0100}-\x{ffffff}]
  22. Non-ASCII characters?
    [\x7F-\x{0100}]
  23. blank line at end of file?
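
The short-line check in step 20 could look roughly like this in Python. It is a sketch, not the exact search from the list: the trailing \n[^\s] is turned into a lookahead so that two short lines in a row are both reported, and the function name and sample text are invented for the example.

```python
import re

# Variant of the step 20 pattern: a line of 1 to 54 characters that is
# directly followed by a line starting with a non-whitespace character.
# The lookahead keeps the next line unconsumed for the following match.
SHORT_LINE = re.compile(r"^[^\n]{1,54}(?=\n\S)", re.MULTILINE)

def short_lines(text):
    """Return lines that look too short to be properly wrapped."""
    return [m.group(0) for m in SHORT_LINE.finditer(text)]

# Hypothetical sample: only the second line is short in mid-paragraph.
text = (
    "This line is long enough to count as a fully wrapped line of text.\n"
    "A short line.\n"
    "The paragraph continues here after the short line, which is suspicious.\n"
    "\n"
    "Last line.\n"
)
print(short_lines(text))  # prints ['A short line.']
```

The last line of a paragraph is not reported, because the line after it is empty or missing.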

HTML Version

  1. custom labels
  2. rewrap poetry
  3. no-wrap markup --> /F F/ etc.?
  4. move footnotes
  5. change quotes to smart quotes (ppsmq)
    "(([^"]|\n)+)"
    [**LQU]$1[**RQU]
  6. replace single quotes
    ‘’ ([**LSQU][**RSQU])   ›‹
  7. replace ampersands with [**AMP]
  8. autogenerate
  9. replace ~ with &nbsp;
    [**LQU] with &ldquo;
    [**RQU] with &rdquo;
    [**LSQU] with &lsquo;
    [**RSQU] with &rsquo;
    [**AMP] with &amp;
    [oe] with &oelig;
    other characters in [] notation
  10. lang, xml:lang
  11. fix page numbers 1
    <span class="pagenum"><a name="Page_([^"]+)" id="Page_\1">\[Pg ([^\]]+)\]</a></span>
    <a class="pagenum" name="Page_$1" title="$1"> </a>
  12. fix page numbers 2
    (<a class="pagenum" name="Page_([^"]+)" title="\2"> </a>)</p>\n\n<p>
    </p>\n\n<p>$1\n
  13. fix page numbers 3
    ([^>])(<a class="pagenum" name="Page_([^"]+)" title="\3"> </a>)\n
    $1\n$2
  14. fix footnotes
    <a name="([^"]+)" id="\1"></a><a href="([^"]+)"><span class="label">\[([0-9A-Z]+)\]</span>
    <a name="$1" href="$2" class="label">[$3]
  15. fix footnote markers
    <a name="([^"]+)" id="\1"></a><a href=
    <a name="$1" href=
  16. replace ... with &hellip;
  17. German: &ndash; instead of &mdash;
  18. nowrap for en-dashes, ellipses at end of speech/paragraph
  19. en-dashes in ranges of numbers
  20. <i>, <em>, <cite>, …
  21. margin at top and bottom of text
  22. fractions (some options here)
  23. superscripts / subscripts?
  24. table <caption>s (weird display in FF)
  25. check whether any special formatting (e.g. drop caps) is needed
  26. corrections, TN
    ([^ ]+)\[\*\* ([^\]]+)\]
    <ins title="$1">$2</ins>
  27. check for
    </([^>]+)><\1>
  28. fix up CSS for printing
  29. remove unused CSS
  30. validator, tidy, link checker
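
Steps 5, 7 and 9 work with placeholders: quotes and ampersands are first marked with [**...] tokens and only later turned into entities, so a real ampersand in the text is never confused with the ampersand of an entity. A rough Python sketch of that idea follows. The function names and the sample line are made up; the pattern is simplified to "([^"]+)" because a negated class already matches newlines in Python.

```python
import re

# Placeholder-to-entity table taken from step 9.
PLACEHOLDERS = {
    "[**LQU]": "&ldquo;",
    "[**RQU]": "&rdquo;",
    "[**LSQU]": "&lsquo;",
    "[**RSQU]": "&rsquo;",
    "[**AMP]": "&amp;",
    "[oe]": "&oelig;",
    "~": "&nbsp;",
}

def mark_double_quotes(text):
    """Step 5: wrap each straight-quoted passage in quote placeholders."""
    return re.sub(r'"([^"]+)"', r"[**LQU]\1[**RQU]", text)

def mark_ampersands(text):
    """Step 7: protect literal ampersands before any entity is introduced."""
    return text.replace("&", "[**AMP]")

def expand_placeholders(text):
    """Step 9: turn every placeholder into its HTML entity."""
    for placeholder, entity in PLACEHOLDERS.items():
        text = text.replace(placeholder, entity)
    return text

line = '"Fish & chips,~please."'
marked = mark_ampersands(mark_double_quotes(line))
print(expand_placeholders(marked))
# prints &ldquo;Fish &amp; chips,&nbsp;please.&rdquo;
```

Like the search in step 5, this pairs quotes blindly, so an unbalanced quote mark shifts every pair after it and still needs a manual check.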

Things to Check Before Uploading

  1. diff F2 / text
  2. diff text / HTML
  3. DOS line breaks
  4. file permissions
  5. no additional files
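
For step 3, a small Python helper can count the line endings in a file. This is a sketch that assumes the item means confirming that every line ends in CR LF; the function name is hypothetical.

```python
def line_ending_counts(data: bytes) -> dict:
    """Count CR LF pairs, bare LF and bare CR in raw file contents."""
    crlf = data.count(b"\r\n")
    return {
        "crlf": crlf,
        "lf_only": data.count(b"\n") - crlf,
        "cr_only": data.count(b"\r") - crlf,
    }

print(line_ending_counts(b"one\r\ntwo\nthree\r"))
# prints {'crlf': 1, 'lf_only': 1, 'cr_only': 1}
```

The file has to be read in binary mode (for example open(path, "rb").read()), because text mode may translate line endings before they can be counted. Under the assumption above, a file passes when lf_only and cr_only are both 0.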