User:Frau Sma/My PP Checklist
Jump to navigation
Jump to search
Disclaimer (2012–02–11): This list is terribly outdated and does not reflect my current PPing process at all. I'm just leaving it here in case anyone might still be interested in the regexes it lists, or anything like that.
Note that this is my personal checklist. It is not meant as instructional to anyone but myself, and it should not be taken as the way to PP(V). It also doesn't contain everything I do when PP(V)ing because there are things that I don't need to remind myself about.
Things to Do Before Splitting
- read project comments and project thread
- make sure file separators end in a hyphen
-----File(.*)[^-]$
- check proofer names for problematic characters
-----File: [a-z0-9]+\.png---.*[.@-].*[^-]\-+$
- sequential inspection
- /* */ und /# #/ --> /P P/ etc. (Rewrap Markers)
- *
- orphaned brackets and markup
- fixup, EOL spaces
- replace <tb>s
- front matter in /X X/ or /F F/
- page breaks
- word frequency
- English: jeebies
- gutcheck
- spell check
- sidenotes?
- footnotes (Footnotes)
- poetry line numbers??
- balanced markup
\<(\w+)>\n?[^<]+<(?!/\1>)
- malformed markup
<\w+/>|<[^!\w/]|[^-\w"]>|,/?\w>|</?\w\.|</?>|[^<]/\w+>|</?\w+[^\w> ]
- duplicate words?
[^a-zA-ZäöüÄÖÜßáéíóúÁÉÍÓÚàèìòùÀÈÌÒÙøæØÆ]([a-zA-ZäöüÄÖÜßáéíóúÁÉÍÓÚàèìòùÀÈÌÒÙøæØÆ'-]+)[ \n]+\1[^a-zA-ZäöüÄÖÜßáéíóúÁÉÍÓÚàèìòùÀÈÌÒÙøæØÆ]
- non-breaking spaces (abbreviations, ellipses, dashes)
- only vulgar fractions? if so, replace
[0-9]/[0-9]
½ ¼ ¾
- SPLIT
Text Version
- superscripts / subscripts?
- [] markup for non-Latin-1 characters?
- convert markup
- German: replace >< with single quotes or `´
- front matter
- indent footnotes 1
(\[Footnote([^[]|\n)+\])
/###\n$1\n#/
- double-check for footnotes containing []
- tidy up footnotes
- indent footnotes 2
/###
/#[2]
- English: check for dangling dashes
- rewrap, remove end-of-line spaces
- German: spaced dashes
(--[^ «]|[^ »]--)
- fix no-wrap sections
- center thought breaks
- remove rewrap markers
- add TN, corrections, rewrap appropriately
- transliterations
- replace ~ with non-breaking space
- gutcheck
- short lines?
^[^\n]{1,54}\n[^\s]
- Unicode characters?
[\x{0100}-\x{ffffff}]
- Non-ASCII characters?
[\x7F-\x{0100}]
- blank line at end of file?
HTML Version
- custom labels
- rewrap poetry
- no-wrap markup --> /F F/ etc.?
- move footnotes
- change quotes to smart quotes (ppsmq)
"(([^"]|\n)+)"
[**LQU]$1[**RQU]
- replace single quotes
‘’ ([**LSQU][**RSQU]) ›‹
- replace ampersands with [**AMP]
- autogenerate
- replace ~ with
[**LQU] with “
[**RQU] with ”
[**LSQU] with ‘
[**RSQU] with ’
[**AMP] with &
[oe] with œ
other characters in [] notation - lang, xml:lang
- fix page numbers 1
<span class="pagenum"><a name="Page_([^"]+)" id="Page_\1">\[Pg ([^\]]+)\]</a></span>
<a class="pagenum" name="Page_$1" title="$1"> </a>
- fix page numbers 2
(<a class="pagenum" name="Page_([^"]+)" title="\2"> </a>)</p>\n\n<p>
</p>\n\n<p>$1\n
- fix page numbers 3
([^>])(<a class="pagenum" name="Page_([^"]+)" title="\3"> </a>)\n
$1\n$2
- fix footnotes
<a name="([^"]+)" id="\1"></a><a href="([^"]+)"><span class="label">\[([0-9A-Z]+)\]</span>
<a name="$1" href="$2" class="label">[$3]
- fix footnote markers
<a name="([^"]+)" id="\1"></a><a href=
<a name="$1" href=
- replace ... with …
- German: – instead of —
- nowrap for en-dashes, ellipses at end of speech/paragraph
- en-dashes in ranges of numbers
- <i>, <em>, <cite>, …
- margin at top and bottom of text
- fractions (some options here)
- superscripts / subscripts?
- table <caption>s (weird display in FF)
- check whether any special formatting (e.g. drop caps) is needed
- corrections, TN
([^ ]+)\[\*\* ([^\]]+)\]
<ins title="$1">$2</ins>
- check for
</([^>]+)><\1>
- fix up CSS for printing
- remove unused CSS
- validator, tidy, link checker
Things to Check Before Uploading
- diff F2 / text
- diff text / HTML
- DOS line breaks
- file permissions
- no additional files