User:Chapka

From DPWiki
Jump to navigation Jump to search
Links

Notes on stage directions--work this up into something?

Just a note to PP. There are at least four types of stage directions in this book:

1. Stage directions set as part of a line, between two brackets and in italics.

2. Centred stage directions on their own line, used for entrances.

3. Stage directions set after a line--right justified at the end of a line or on the next line if there isn't room--with an opening, but no closing bracket, and in italics.

4. Stage directions set after a line as #2, but in small Roman type and with brackets before and after.

1 and 2 are straightforward, but with 3 and 4 the printer has typeset them as:

  • right-justified if they fit on a single line; but
  • with a hanging indent if they run to two or more lines.

You can see an example of both styles of #3 on [url=https://www.pgdp.net/c/tools/page_browser.php?project=projectID675f597912174&imagefile=104.png&mode=imageText&round_id=F1]this page[/url], and you can see the different treatment of #4 [url=page_browser.php?project=projectID675f597912174&imagefile=159.png]here[/url] and [url=https://www.pgdp.net/c/tools/page_browser.php?project=projectID675f597912174&imagefile=148.png&mode=imageText&round_id=F1]here[/url].

Because of the Guidelines, these are going to come through the rounds with a mix of formatting; six spaces if it's right-justified at the end of a line of dialogue, /*nowrap*/ if it's right-justified on its own line, and /#rewrap#/ if it's long enough that there's a hanging indent; but this is an artefact of the page width and not necessarily something it makes sense to follow in PP.

Proposed revision to Content Providing FAQ

I propose that we replace the current section: ABBYY FineReader Scanning Instructions with a new section at the same level as "OCR":

Preparing Illustration Images

Many books have illustrations within the text, and we like to include them in our HTML versions. This means the CP or PM must provide images of these illustrations and include them in the project at the time it is uploaded. Please provide separate illustrations, in .jpg or .png format and in higher quality than the page images themselves.

What to provide

In addition to capturing the text for OCR, you need to provide high-quality images of:

  • Any illustrations in the book, including drop capitals and decorations. You do not need to provide images of simple horizontal lines, but anything more complex should be included to give the postprocessor the option to include it in the finished book.
  • The book's cover must be included if it includes any text or decoration. It's good practice to include it even if it doesn't, to give the postprocessor the widest range of options for creating the final cover. Including the back cover can also be helpful, especially if the front cover includes library stickers or other markings.
  • If the cover is blank, please also include a high quality image of the book's title page as an option for creating an ebook cover.

Processing high quality images

If you are harvesting images from online, provide the highest quality images available.

If you are scanning the book yourself, scan illustrations should be scanned at a sufficient resolution to capture fine detail. While it may not be needed now, it is important if the book is to be reprinted or screen technology improves. Generally speaking, 300 DPI is adequate for line art, continuous tone, and descreened images; screened images often require 600 DPI to avoid moire effects.

Do not use ABBYY FineReader for scanning images; it processes images in several ways that are effective on text, but unacceptable for illustrations. Color images should be scanned or harvested in color.

Crop each illustration, but leave some space around it to allow the postprocessor to rotate and clean up the illustration. Do not feel that you need to provide clean rotated images in perfect, ready to post format; this can be done by the PP. If you do wish to clean them up, many PPs appreciate this, but please leave them larger than you think the PP will need. This allows the PP to resize them to the way they like it.

Proposed revision to Release Queues page

A proposed revision to the Special Day section of the Release Queues page to clarify when Special Days begin and end.

Current Version

When do Special Day queues start?

All special day queues start at the server time of 8am of the day before the "open" date for that queue. For example the St. Valentine's Day special day queue starts at the server time of 8am 13 February.

Handling Special Day queues

Special Day projects are handled differently in several ways.

  • For P1:
    • A project marked with a special day code won't release until the day (no matter the genre) but it will release on that day, subject to the limits in the queue definition which can restrict the number of projects active at the same time. The special day processing allows PMs to prep books ahead of time but be sure that they won't release until the designated time. If any projects remain in the queue at the end of the day, a PF or Squirrel will usually push them into P1 manually.
    • Because we are a global community, special "days" are defined to start when that date first comes along anywhere in the world (effectively when New Zealand hits that date) and to close when the last of our community ends the day (effectively at midnight in Hawaii). So each "day" runs for well over 24 hrs. That is why you may see projects releasing into P1 at a time that appears to be a day before or a day after the designated date.
  • For P2, projects can release either through the genre queues or through the special day queues, which ever comes up first. That is, in P1, books marked for a special day are held until that day arrives. In P2, they are not held back, but rather are pushed forward if a special day happens along.
  • P3 handles special day projects similarly to P2, except that the special day queues release only one book at a time.
  • We currently have no Special Day queues defined and active in F1 and F2, and projects release only through the normal queues defined in those rounds.
  • As mentioned above, the usual limits based on the number of projects an author or PM may have in a round are bypassed for projects marked with a special day code (except "Children's Book Week" or "Type In Project").

Proposed Revision

When do Special Day queues start?

Because we are a global community, special day queues start when that date first comes along anywhere in the world, and last until the day ends for the last of our community.

In practical terms, this means all special days currently begin at 8 am server time the day before the "open" date, and last until 6 am server time the day after.

For example, the St. Valentine's Day special day queue starts at the server time of 8 am on the 13th of February, and ends at 6 am on the 15th. This means the queue is open as long as it's Valentine's Day somewhere in the world.

Handling Special Day queues

Special Day projects are handled differently in several ways.

  • For P1, a project marked with a special day code won't release until the day (no matter the genre) but it will release on that day, subject to the limits in the queue definition which can restrict the number of projects active at the same time. The special day processing allows PMs to prep books ahead of time but be sure that they won't release until the designated time. If any projects remain in the queue at the end of the day, a PF or Squirrel will usually push them into P1 manually.
  • For P2, projects can release either through the genre queues or through the special day queues, whichever comes up first. That is, in P1, books marked for a special day are held until that day arrives. In P2, they are not held back, but rather are pushed forward if a special day happens along.
  • P3 handles special day projects similarly to P2, except that the special day queues release only one book at a time.
  • We currently have no Special Day queues defined and active in F1 or F2, and projects release only through the normal queues defined in those rounds.
  • As mentioned above, the usual limits based on the number of projects an author or PM may have in a round are bypassed for projects marked with a special day code (except "Children's Book Week" or "Type In Project").

Sources for Scan Harvesting

These revisions (to the current version) return the material removed during earlier revisions. Most of this will be in the new section, "Notes on Approved Sources," to keep it separate from the main body of the page, which is unapproved and defunct sources; there are also a few additional entries for the "Defunct" list. In most cases I only removed these entries in the first place because they're duplicative of the notes on the official non-Wiki source list. The only entry I didn't restore from the original page was Google Books, since the only thing there were links to the (now defunct) scan harvesting project.

I did make Changes to some entries:

  • Removed links to defunct coordination project pages like this one for Early Canadiana Online. These pages have mostly not been updated since 2008 or so and rely on externally hosted project pages that no longer exist.
  • Removed "No permission" tag from items listed on the official Image Sources list (Weaving archive, UMich, Hearth)
  • Noted that Million Books Project is listed under TIA and University of Michigan Historical Math Collection is included under UMDL entry in source list

Notes on Approved Sources (new section above Unapproved list)

Early Canadiana Online
(From the PP forum) "We have received official permission from the people who run the www.canadiana.org site to use their scans. They would like to be acknowledged in the credits line for all books that come from their scans, which we have agreed to do."
Case Western Reserve University Preservation Department (English)
"You are very welcome to use our on-line book collection. We would appreciate receiving copies of the proofed texts. All books we have in our collection are in public domain." Some books have been done, and some are cleared but not completed.
HEARTH Project (English)
Page images and text of Home economics books and journals from 1850-1950. Random samplings show characteristic OCR errors, indicating that the text has probably had only light proofing, if any.
Million Books Project (General) (Mainly English)
Hosted by The Internet Archive; choose TIA as image source. A large number of books available, however be warned that quality control is poor. Check that the book has all pages available and properly scanned. Also, do not trust the dates posted, check against the title and verso of the book.
Nietz Full-Text Collection (English)
140 school textbooks from the 19th century.
On-Line Digital Archive of Documents on Weaving and Related Topics PDFs at Arizona
:!: some modern material :!:
The University of Michigan Historical Mathematics Collection
Choose UMDL as Image Source.
University of Wisconsin
Browse the UW Digital Collections (See Image Sources script); many projects are hosted here, including:
  • African Studies Collection
  • Belgian-American Research Collection
  • Digital Library for the Decorative Arts and Material Culture
  • Ecology and Natural Resources Collection
  • Foreign Relations of the United States
  • Historical Primary Sources
  • History of Science and Technology
  • The University of Wisconsin Collection
  • Wisconsin Pioneer Experience
The Internet Archive
This page links both to Internet Archive's Wayback Machine, and their Archive search for other media. The search for books is below the Wayback Machine search. You may restrict your search to Texts only by clicking on the appropriate icon.

Add to Defunct List

Library of Congress Digitization Project (English)
(7 million pages!). Be aware that some of these link to other sites (such as Making of America)
Digital Information Organization in Japan
Links to a number of Japanese digital libraries.
Internet Public Library
Domain now owned by term paper writing service.

Books We Want

Author Title Genre Language Scan Requests Notes
Young, Ella Wonder Smith and His Son, The Children's English Unknown 2024
Snedeker, Caroline Downright Dency Children's English Unknown 2024
Woolf, Virginia Orlando Novel English Unknown 2024
Sullivan, Mark Our Times History English Yes 2024 6 vols.
Brinckle, Gertrude, et al. Howard Pyle: A Record... Art English Yes 1 2024
Barrie, J. M. Peter Pan (play) Drama English Yes 2024
Hecht, Ben, et al. Front Page, The Drama English Yes 2024
Rackham, Arthur Arthur Rackham's Book of Pictures Art English Yes 2024
American Institute of Child Life Young Folks' Directory Reference English Yes 2024 List of books, etc.
Godolphin, Mary Aesop's Fables in Words of One Syllable Children's English Yes 2024
Wells, Richard Manners, culture and dress... English Yes 2024
Ebers, G. Egypt: Descriptive, Historical, and Picturesque Travel English Yes 2024
Boston Public Library List of Books for Boys and Girls Reference English Yes 2024
Weaver, Louise, et al. One Thousand Ways to Please a Family English Yes 2024
Hewins, Caroline Books for Boys and Girls Reference English Yes 2024
New York... Catalogue of Books for Public School Libraries Reference English Yes 2024
Carnegie Library of Pittsburgh Catalogue of Books in the Children's Department... Reference English Yes 2024

Revision of Post-Processing Policy re: Cover Pages

This is a proposed revision of the page DP Official Documentation:PP and PPV/Post-Processing Policy re: Cover Pages, which is transcluded in several other official and unofficial documentation pages.


Only Use Materials from the Project

It is DP policy that, when we create a new cover for a coverless book, we confine ourselves to images that are found within the book itself.

In creating your cover, you may use:

  • Any image from the project itself
  • If necessary, an image of the same book from another source, but only if you can confirm that it is exactly the same edition used in the project
  • Any of the blank book cover images from this wiki
  • Original elements consisting of plain text, block colors or gradients and simple shapes, with an appropriate disclaimer

You may not use images from any other source, including:

  • Images from another edition of the same book, even if they appear to be identical to the images in the project
  • Images from other books or sources, even if they are in the public domain
  • Original art more complex than a simple box or a block of text

If in doubt, please ask for advice in this forum thread.

Public Domain Dedication

PG requires that where a new cover is created, a Transcriber's Note should include the following sentence:

New original cover art included with this eBook is granted to the public domain.

Note that the wording of this statement has changed. Historically, this disclaimer was sometimes added directly to the cover image, but it is now recommended to include it in the TN instead.

A disclaimer is not necessary if a book's title page is used as a cover, since it is part of the original book.

Below are translations of the disclaimer for books in other languges, as of 15 April 2023. Note that they may need updating.

French La page de couverture, créée expressément pour cette version électronique, a été placée dans le domaine public.
Spanish El transcriptor ha creado la imagen de la cubierta y la sitúa en el dominio público.
German German copyright law differs from US copyright law. The following is a direct translation of the English statement and will place the work in the public domain in the US only:
Das Umschlagbild wurde vom Bearbeiter gestaltet und in die Public Domain eingebracht.

The German status of "gemeinfrei" is slightly different than the US concept of "public domain"; to achieve this, use the text:

Das Umschlagbild wurde vom Bearbeiter geschaffen. Ein Urheberrecht wird nicht geltend gemacht. Das Bild darf von jedermann unbeschränkt genutzt werden.

Orphaned but Useful

Index to the LOFE

How to Use This Index

This Index is not intended to replace reading and studying the Formatting Guidelines or the Library of Formatting Examples, but to help reviewers who are already familiar with the formatting guidelines to quickly find specific sections of the LOFE. Some rules are only mentioned once, and it's not always easy to remember where they come up.

Many of the examples in the LOFE are repeated multiple times in different sections, sometimes with different commentary. Each such page will normally be linked only once for each topic.

Out of line markup is listed here as /*nowrap*/ or /#rewrap#/; "block quotation" here means a set-off quote that may use either markup. The LOFE itself uses the older terminology, where the /#rewrap#/ tags are referred to as "Block Quotes".

Where there is a overview of a topic, the page containing the overview will be listed first and in boldface.

Advertisements

Asterism

Generally: 125-09A/B, 125-35A-E

Back Matter

see Front and Back Matter

Boldface see Inline Markup

Chapters see Headings

Continued

Correspondence

Drama

Drop Capitals

Ellipsis

End, The

Endnotes see Footnotes

Font change (<f>) tag (move to/link from inline?)

  • Distinguishing Fraktur from Antiqua: 104-04A

Footnotes

Front and Back Matter (break out?)

Gesperrt see Inline Markup

Greek

Headings

Horizontal Rules see Thought Breaks

Illustrations

Index

Initials

  • Spacing of: 126-09A (Is this more of a proofing issue?)

Inline Markup

Italics

101:; for more see inline markup.

Letters see Correspondence

Major Divisions see...

Music

See generally... See also ads (or move from?)

Numbers

see also...

Out of Line Markup

Paragraphs

Periodicals

Plays

Poetry

see Poetry under Out of Line Markup

Quotations

See also Punctuation:Quotation Marks under Inline Markup

Sidenotes

Signatures see Correspondence

Small Caps see Inline Markup

Tables

Thought Breaks

see also Headings->Sections->Ambiguous->...


Working on proofreading examples

New Format

Dash-example1.PNG

First Topic--Second Topic--Third Topic--Fourth
topic--Fifth Topic--Sixth
Topic--Final Topic

In Forumpost:232654 acunning40 wrote: "...later on during post-processing the line breaks will be moved around automatically, and at that point the end of the line is equivalent to a space. We normally remove spaces around em-dashes like--this, so that means we also can't leave an em-dash at the start or end of a line, since that's the same as a space."

"The exceptions to this are at the beginning or end of a paragraph, or a line of poetry. In those cases the text won't be joined to the text before or after, so it's fine to leave the dash at the start or end of the line. In regular paragraph text where it all flows together, though, the dashes should always be "clothed"--have a non-space character on both sides."


Old Format

At line and page ends
1. Dash-example1.PNG In Forumpost:232654 acunning40 wrote: "...later on during post-processing the line breaks will be moved around automatically, and at that point the end of the line is equivalent to a space. We normally remove spaces around em-dashes like--this, so that means we also can't leave an em-dash at the start or end of a line, since that's the same as a space."

"The exceptions to this are at the beginning or end of a paragraph, or a line of poetry. In those cases the text won't be joined to the text before or after, so it's fine to leave the dash at the start or end of the line. In regular paragraph text where it all flows together, though, the dashes should always be "clothed"--have a non-space character on both sides."

First Topic--Second Topic--Third Topic--Fourth

topic--Fifth Topic--Sixth
Topic--Final Topic

2. Mark started to respond, but was cut off by the whistle.

"I think it's almost time for — {End of Page here}

Guideline for end-of-page hyphenation and dashes:
Proofread end-of-page hyphens or em-dashes by leaving the hyphen or em-dash at the end of the last line, and mark it with a * after the hyphen or dash.
Mark started to respond, but was cut off by the whistle.

"I think it's almost time for--*

3. Dash-example2.PNG Forum advice has no clear consensus, except to leave a proofer's [**note] and discuss the matter in the relevant project forum. The issue is that the dash appears to end a broken off sentence, rather than merely signaling a pause or parenthetical insertion, as discussed in the guidelines examples.

In forumpost:281992 laurawisewell says "Interesting. It certainly looks like the space in the image is much bigger than around the other dashes on the page, so it seems intentional. I'd probably close it up as per guidelines but leave a [**note] for the PPer."

In response grumbuskin says "I have come across a few of these. I do not close up the space, but do leave a [**Intentional space?] note for the PPer" and t-bonham adds "It doesn't matter much whether you leave that gap or close it up .... In proofing rounds, you can just concentrate on getting the text correct, and leave problems like this for the formatters or PPer's to take care of."

Garweyne writes later in the thread "I add that removing the gaps in PP is quite easy, while introducing them requires to look at the images. A space near a dash will never go unnoticed, since any PP and WW software looks at them. A comment makes sure that a following proofer does not remove them, or if he does, that the PPer will be warned anyway."

Why, in lots of the books, nowadays, the girls themselves

cling to the men in a close embrace, or put their
mouths tenderly to theirs-- [** space?]Well, of course, it sounds
rather disgusting, but in your own earlier books, I'm
sure there's more of it--of passion. Isn't there?


or


Why, in lots of the books, nowadays, the girls themselves
cling to the men in a close embrace, or put their
mouths tenderly to theirs--[** space?]Well, of course, it sounds
rather disgusting, but in your own earlier books, I'm
sure there's more of it--of passion. Isn't there?



Notes on Metrical Drama

Metrical drama raises X issues not often found in other texts.

  1. Speaker names. There are many ways to indicate which character is speaking.
  2. Stage directions. There are multiple ways of placing and formatting stage directions.
  3. Dropped lines. Where one line is indented as far as the end of the previous line. Also known as "stichomythia."

On their own, these can all be handled relatively easily, but their interactions can be tricky to represent in HTML.