What happens at PG

From DPWiki
Jump to navigation Jump to search

Some DP volunteers have expressed concern that interaction with Project Gutenberg is not clear and transparent. It does help to remember that although DP was created to produce texts for PG, they are still two separate organizations. Here is an attempt to describe what happens for an average book that is processed through DP and posted to PG. Note that this process has evolved over years, and may change more in the future.

Copyright clearance. (PG)
A volunteer submits scanned images of the title page and verso on the PGLAF Copyright clearance website. The information typed in at this point is what will end up in the catalog, if no one edits it further. Copyright clearance must be completed before project is created at DP.
Project preparation. (DP)
A Content Provider prepares page images, ocr text, etc. A PM createes the project page, and releases it into the rounds.
Proofreading. (DP)
Project goes through proofreading and formatting rounds.
Post-processing. (DP)
PP produces final set of files. (Often plain text, html and images.)
File upload. (PG)
PPV (or PP with DU privileges) uploads the completed files via PGLAF upload form. (For more detail see the discussion thread: Guide to Direct Uploading (DU) and Posting to PG)
Posting to the main website. (PG)
A white-washer takes these files, checks copyright clearance, and runs some automated procedures to look for problems and check conformance with PG standards. If everything looks good, the WW gets the next free PG number, adds the PG header and footer, and posts the files.
Notification. (PG)
A "posted" note is then sent to the Posted mailing list and all email addresses provided by the volunteer who uploaded the file.
Project page update. (DP)
One of the Squirrels who monitor dp-post sees the posted note, changes the status of the DP project, and fills in its PG number.
Automated processes. (PG)
Alternate file formats are automatically generated. (This includes mobile formats like ePub, and html, if no html has been submitted.) A catalog record is also automatically generated, based on the metadata in the PG header.
Cataloging (PG)
A catalog editor may review the catalog record, and fix any problems that occurred, edit for consistency with other records, and add additional information (such as author birth and death dates or subject headings).

Distributed Proofreaders - What Happens At PG.png