User:Srjfoo/temp

From DPWiki
Jump to navigation Jump to search

Purpose of this draft: disambiguate/simplify references to duplicates and clearances.

Selecting a book

If you don't already have a book in mind, you'll need to find one. Some things to take into consideration:

Possible sources for books to CP

  • Personal scans from books you own, have borrowed (with the owner's permission) or have checked out of the library.
  • Online scan sources
  • Check the "Books I'd Like to See" topic for the current year to see if something that's been requested appeals to you.

If you plan to also be the Project Manager, be sure to pick a project on which you will enjoy working, because you will be shepherding this project through up to 5 Rounds. This may take several years from start to finish, but much of this time can be spent waiting for your project to be released in a round.

If you want the project to go through the system quickly, pick a popular genre; watch which release queues are moving fast, as this changes regularly.

If you choose to get a book from one of the on-line book archives, please follow the individual site guidelines regarding acceptable use and protocol. We don't want to be bad neighbors. It is considered good form to credit the source of the scan when the text is submitted to Project Gutenberg, so make sure the PM knows its source.

Projects that are part of another publication

If the project is a portion of another publication and the entire publication is no longer under copyright restrictions, please do not separate out the smaller portion of the publication as a separate project.

Check for completeness

At this juncture, regardless of the source, you should check the book for completeness before going any further. See the Project completeness checklist (For purposes of this draft: Project completeness checklist) for a list of the types of things to check.

Digital Library of India/Public Library of India scansets

Some scansets from the Internet Archive that were provided by "Digital Library of India"/"Public Library of India" have incorrect publication information so that works appear to be in the public domain in the US when they aren't. If you are considering using one of these scansets, please verify publication information independently with other sources. If you can't confirm that the publication is Public Domain in the US, please don't run the book. For more information, please read this forum thread.

Difficulty

This applies both to the difficulty of preparing the project and the difficulty for the proofreaders and formatters.

If you're doing your own scanning, see Scan/Download images for more information. Narrow gutters and a large number of illustrations involve more time in preparation, and more time for the Post-Processor, but may not be terribly difficult for proofreaders and formatters. Pages with lots of dense text, tables, inline formatting, sub- and/or superscripts, poetry or block-quotes, but with a good sized gutter may make it easy to scan, but more difficult for proofreaders and formatters. See Prepare the Illustrations for more on preparing illustrations.

Is it in the Public Domain?

DP works only on projects that are in the Public Domain as determined by a clearance from Project Gutenberg. For the current year, (2025), usually that means it was published in or before 1929. See clearances, below, for more detailed information.

On 1 January 2026 books first published in 1930 enter the Public Domain in the US. Project Gutenberg is now clearing books in anticipation of that day, but please don't load projects onto DP's server until they are fully in the public domain.

Each year on 1 January, there will be an update in the public domain publication year. For information about handling projects that enter the public domain on 1 January, please read the Handling projects entering Public Domain on 1 January section of the Project Managing FAQ.

Check for Duplicates

Make sure the book is not already at PG, or underway at DP: PG's In Progress Search form searches the PG clearance and posted databases by title and/or author so you can find out whether a book is already at PG or whether a clearance for it has been requested, and when, and whether the clearance has been approved. The clearances searched go back to 2004; the posted titles search covers the whole collection, including those cleared before 2004. You should also use DP's [[dp:tools/search.php|Project Search] to check whether a book is in progress at DP as well as DP Canada's search (which also has the option to search fadedpage).

"Cleared" Status means someone has requested and received copyright clearance, but has not yet finished the project. If this clearance is several years old, it has probably (though not certainly) been abandoned.

After you get clearance, you will get an e-mail along with the other clearance holder, letting each of you know that the other is working on it. You can then communicate with them to find out if they are working on it, or if you are free to begin processing it.

Some projects, most notably periodicals and multi-volume editions, will have "blanket clearances." This does not mean that the person who requested the original clearance has all of the volumes ready to scan! Most of these clearances are associated with DP in some way, so if an Überproject doesn't exist for the periodical/set you have (where the PM will often list the volumes they have available), you can post in the Content Provider's forum to find out who's working on what.

If the project says "Posted" it has been posted to Project Gutenberg with the accompanying ebook number. It is a good idea to look at all of these lists by author and separately by title.

Running a project that is already in PG.

Even if a book is already in PG, it may be worth processing again. This will require some legwork to determine, so be sure you feel strongly about the book before pursuing this. PG welcomes different editions, illustrated versions, different translations, etc. In addition, many of the older ebooks have more errors than we would find acceptable today and reprocessing them through DP may be the best way to change that. If the book has a PG number under 10,000 then it probably doesn't have an illustrated version and might be a good candidate for an upgrade.

Below is a list of reasons you might provide an existing PG project through DP. You will need a copyright clearance for each of these cases. For reworks, the PM should put a note into the Project Comments section explaining why the project is being redone. If you are a CP only, then you should include details of why the project is being redone in a text file attached in the project zip file.

Basic upgrade
You have the same text version, there are no illustrations, and the PG version is riddled with errors: Be sure to let PG know, when you upload the final version, that this is a revision of an existing ebook, based on a paper copy in hand. If there are only a few problems, submit them via the PG errata process.
Illustration Upgrade
You have the same text version, but there are illustrations and they are not present in the PG version: Same as the basic upgrade except that you'll be submitting an illustrated html version.
Different Translation
PG will treat this as a completely different ebook and welcomes them. There are already at least half a dozen translations of the Iliad, for example, and more are always welcome.
Different Edition
Some books were published in very different editions. Where this is the case, PG welcomes them as separate ebooks. You will have to document the fact that your edition has significant differences from the version that is already in PG.

Selecting a Project

Which book you pick is up to you. The only requirement is that it be copyright clearable (discussion below). It is best if it is something in which you have interest. Chances are that you will find others who will work on it as well.

Finding a book.

There are several ways to find a project to CP (Content Provide). You can search the library, buy from a local bookstore, raid your own bookshelves, ask a friend, pull them out of the trash, or find projects that are already scanned at some of the many on-line sources for scans. Be sure to pick a project on which you will enjoy working, because you will be shepherding this project through up to 5 Rounds (if you choose to be the project manager), and until the project is posted. This may take several years from start to finish, but much of this time can be spent waiting for your project to be released in a round.

If you want the project to go through the system quickly, pick a popular genre; watch which release queues are moving fast, as this changes regularly.

If you choose to get a book from one of the on-line book archives, please follow the individual site guidelines regarding acceptable use and protocol. We don't want to be bad neighbors. It is considered good form to credit the source of the scan when the text is submitted to Project Gutenberg, so make sure the PM knows its source.

Projects that are part of another publication

If the project is a portion of another publication and the entire publication is no longer under copyright restrictions, please do not separate out the smaller portion of the publication as a separate project.

Check for completeness

At this juncture, regardless of the source, you should check the book for completeness before going any further. See the Project completeness checklist for a list of the types of things to check.

Digital Library of India/Public Library of India scansets

Some scansets from the Internet Archive that were provided by "Digital Library of India"/"Public Library of India" have incorrect publication information so that works appear to be in the public domain in the US when they aren't. If you are considering using one of these scansets, please verify publication information independently with other sources. If you can't confirm that the publication is Public Domain in the US, please don't run the book. For more information, please read this forum thread.

Difficulty.

Some things can make the project harder than others. The amount of time you wish to spend on this should be considered. Check the inner margin (gutter) of the book. The wider this is, the easier it will be to scan, and the fewer extra measures you'll need to take in OCR and answering forum questions. This does not mean that you should not work with books that have a narrow gutter, just that they will be much harder. Projects with a lot of illustrations are also harder and more time-consuming. This will be discussed more under Scan/Download images and Prepare the Illustrations.

Copyrights and clearances.

Do a preliminary check to see if it is clearable. For the current year (2025), usually that means it was published in or before 1929. See clearances, below, for more information.

On 1 January 2026 books first published in 1930 enter the Public Domain in the US. Project Gutenberg is now clearing books in anticipation of that day, but please don't load projects onto DP's server until they are fully in the public domain.

Each year on 1 January, there will be an update in the public domain publication year. For information about handling projects that enter the public domain on 1 January, please read the Handling projects entering Public Domain on 1 January section of the Project Managing FAQ.

Check for Duplicate Projects

Make sure the book is not underway or already at PG: PG's In Progress Search form searches the PG clearance and posted databases by title and/or author so you can find out whether a book is already at PG or whether a clearance for it has been requested, and when, and whether the clearance has been approved. These clearance listings searched go back to 2004. You should also use DP's Project Search to check whether a book is in progress at DP as well as DP Canada's search (which also has the option to search fadedpage).

"Cleared" Status means someone has requested and received copyright clearance, but has not yet finished the project. If this clearance is several years old, it has probably (though not certainly) been abandoned.

After you get clearance, you will get an e-mail along with the other clearance holder, letting each of you know that the other is working on it. You can then communicate with them to find out if they are working on it, or if you are free to begin processing it.

Some projects, most notably periodicals and multi-volume editions, will have "blanket clearances." This does not mean that the person who requested the original clearance has all of the volumes ready to scan! Most of these clearances are associated with DP in some way, so if an Überproject doesn't exist for the periodical/set you have (where the PM will often list the volumes they have available), you can post in the Content Provider's forum to find out who's working on what.

If the project says "Posted" it has been posted to Project Gutenberg with the accompanying ebook number. It is a good idea to look at all of these lists by author and separately by title.

Running a project that is already in PG.

Even if a book is already in PG, it may be worth processing again. This will require some legwork to determine, so be sure you feel strongly about the book before pursuing this. PG welcomes different editions, illustrated versions, different translations, etc. In addition, many of the older ebooks have more errors than we would find acceptable today and reprocessing them through DP may be the best way to change that. If the book has a PG number under 10,000 then it probably doesn't have an illustrated version and might be a good candidate for an upgrade.

Below is a list of reasons you might provide an existing PG project through DP. You will need a copyright clearance for each of these cases. For reworks, the PM should put a note into the Project Comments section explaining why the project is being redone. If you are a CP only, then you should include details of why the project is being redone in a text file attached in the project zip file.

Basic upgrade
You have the same text version, there are no illustrations, and the PG version is riddled with errors: Be sure to let PG know, when you upload the final version, that this is a revision of an existing ebook, based on a paper copy in hand. If there are only a few problems, submit them via the PG errata process.
Illustration Upgrade
You have the same text version, but there are illustrations and they are not present in the PG version: Same as the basic upgrade except that you'll be submitting an illustrated html version.
Different Translation
PG will treat this as a completely different ebook and welcomes them. There are already at least half a dozen translations of the Iliad, for example, and more are always welcome.
Different Edition
Some books were published in very different editions. Where this is the case, PG welcomes them as separate ebooks. You will have to document the fact that your edition has significant differences from the version that is already in PG.

Get a clearance

You have obtained a book, and have decided that it is both clearable and not already in PG or in progress, or you have a book you think is clearable and need to find out for sure. In both cases, it is time to ask the experts.

You will need to have scans of the Title Page and Verso (the back of the title page), also known as the TP&V. You may need scans of other material as well, such as an inscription on the fly-leaf, in order to establish date.

Check for Duplicates

The clearance team does not check for duplicate clearances. In addition, having a clearance does not mean you "own" that title for some period of time.

Before you move forward with a clearance request, please make sure that the book isn’t already in progress at common proofreading sites:

  • PG checks for cleared, posted, in progress at either DP or DPCanada or posted to Faded Page.
  • David's List: Clearances more than five years old roll off; new clearances may not make it on the list for a month or two, but in general, this is a good place to check and see if a book has been cleared within the last five years.
  • In progress at DP, DP Canada. Note that the extended search at DP Canada, linked to above, can also search their FadedPage display site.

If your search string doesn't find anything, please try variations. Sometimes projects are duplicated because the search string was too restrictive. Common words may find too many results, but the shortest string that will uniquely identify a title is usually best. If the title is extremely common ("Poems", for example), try searching for both title and author.

Copyright Clearance.

Copyright clearance is a process by which Project Gutenberg determines if a book is in the public domain according to the copyright laws of the United States. Project Gutenberg maintains a set of Rules that are used to determine if a book is clearable. This DP site operates under U.S. law; if you cannot obtain a clearance, your book cannot be processed through this site.

Please read PG's copyright clearance rules for details.

If your book is not clearable under PG's rules, but the author and everyone else associated with the book (i.e., the illustrator, editor, translator) has been deceased for at least 70 years, you may wish to send the book to one of our sister sites, DP Canada.

As of June, 2018, Project Gutenberg is approving clearances for books that will become Public Domain as of January 1, 2019 and January 1, 2020. Please do not upload projects that are not in the public domain in the US as of the time you upload the project files. See this post.

Create a PGLAF Account.

The next step is to set up an account at PGLAF (the branch of PG that handles clearances). If you have Direct Uploading or PPV access, you already have a PGLAF account.

To create a new account, browse to PGLAF and read the welcome page. This contains a lot of useful information on the clearance process, and a number of useful links. Next, Click the New username link, and fill out the form. Be sure that the email address you enter is valid and is checked regularly; this is the address where posted notices and clearance notifications go, and also where you will be contacted if a conflict occurs.

Submit a Clearance Request.

After completing the registration process, log in, and select "Submit a New Clearance Request". A large form will appear; most of the information required should be available directly from the title page of your book. If not, you will have to do some research. Document any findings in the field provided; be sure to list the source of any information not found on your book's title page and verso page (the page immediately following the title page). If a date is listed twice in different contexts (separate publication date and copyright date, for example) enter it twice. Remember when attaching images that they should be small in size (100k is a reasonable maximum; most should be smaller), but the smallest text should still be legible. Multi-volume works can be cleared in a single clearance request if the dates are the same, or if you provide the earliest and latest title and verso.

Checking the Clearance Registration Form details

Before you submit your clearance request, it is important to carefully review the title page scan of your book and the publication information and verify that the information you have entered on the form is correct. Please ensure that:

  • Author field contains data
  • Author's name is spelled correctly
  • All authors listed on title page are listed in upload form
  • Any illustrator, translator, editor, etc., listed on title page is listed in upload form
  • Title is complete and all words are spelled correctly
  • Title is appropriately capitalized
  • If English, the title and subtitle should be capitalized using Sentence case.
  Example: The story of the little red hen
  • For titles in languages other than English (LOTE), please follow the capitalization used on the book's Title Page. However, if the title there is fully capitalized, please use the conventions for title capitalization common in the book's language (If in doubt, you may refer to the capitalization used for books in that language within the catalogs of major libraries such as the Library of Congress or WorldCat).
  • Subtitle listed on title page is listed in upload form
  • If you are uploading a periodical, please check the Project Gutenberg for previously posted issues of that periodical and follow the title formatting they used.
  • If your project is part of a multiple volume set, state the number, i.e. English History, (Vol. 2/6).

Note: It is very important that all the information you enter in the clearance registration form match the details of the project for which you are requesting clearance: The uploaded item MUST match the clearance. This certainly includes all the publication metadata (publisher, location and date). If the information does not match, then Project Gutenberg should not accept the upload of your project once it completes post-processing.

Types of Clearances.

There are several types of clearances. The most common is rule 1, but some others are used on occasion. Project Gutenberg only clears based on the United States Copyright Laws. However, if you would like a detailed discussion of copyrights in other countries, visit The Online Books Page.

Wait.

All that is left now is to wait for the results of your request. Basic clearances using the standard rules are usually processed within several weeks. Rule 6 clearances, which require more research, usually take longer (and may require further research on your part before it clears). You must receive the clearance before loading the project onto DP.

You may get a response that says NOT OK. A reason for the denial of the clearance will always be given. Be sure to check that reason, since technical difficulties such as corrupted files can easily generate this response. Feel free to resubmit your clearance request after correcting whatever problem was noted.



temp draft stuff:

Mac Keyboard input for special characters

(click 'expand' to open)
Portions of the below information have been included in Typing Greek. Preserving the complete version for possible future use.

Using the keyboard to input characters with diacriticals

On newer Macs, if you simply hold down a character that can take a diacritical known to macOS, and pause, you should see a popup similar to what you might see on a phone or tablet, and can simply choose the diacritical you want. This is by far the easiest option if it's available to you and the character you want is available.

Using the keyboard to input characters with diacriticals via dead-keys

If you can't use the above method, but would still like to type characters with diacriticals, this is the easiest option. If you have the Input menu enabled, choose "Show Keyboard Viewer". The following screenshot was taken from a Mac running Catalina, but the keyboard viewer has looked similar for a long time, now.

Keyviewer mac.png

Press and hold the Option/Alt key, and you should see something like the following if you have the ABC Extended keyboard selected.

Keyviewer mac latin diacriticals.png

The ABC Extended keyboard is recommended because it has a larger number of diacriticals available than the U.S. keyboard. If you're using the U.S. keyboard, go to System Preferences->Keyboard. Under the Keyboard tab, select "Show keyboard and emoji viewers in menu bar. Under the Input Sources tab, click on the + to add a keyboard, and choose ABC Extended.

The keys that are outlined in orange show the diacritical that is associated with those keys when the option key is held down. If you press the e key with the option key held down, and the let up the keys, you should see the following:

Keyviewer mac latin diacriticals allowed.png

Simply press the key for the letter you want, and you've typed the letter.


Using the keyboard to input Greek directly on a Mac

Chances are that if you want to type Greek directly on your Mac keyboard for DP, you will be most likely to be using the Polytonic Greek characters. If you've never enabled additional keyboards before, be aware that with each new version of the operating system, there may be small differences in exactly where to look. As of macOS 10.15 (Catalina), go to System Preferences->Keyboard. Under the Keyboard tab, select "Show keyboard and emoji viewers in menu bar. Under the Input Sources tab, click on the + to add a keyboard, and choose Greek - Polytonic from the list. If it's not already chosen, click the checkbox for "Show Input menu in menu bar".

In the menu bar, click on the menu that shows which keyboard you're using. The top section shows the keyboards you have enabled. The next section down allows you to bring up the Emoji & Symbols Character Viewer and the Keyboard Viewer. If you open the Keyboard Viewer, and choose the Polytonic Greek keyboard, the display should look similar to:

Keyviewer mac greek diacriticals.png

The orange outlines are around the keys that are considered dead-keys: type the key that shows the diacritical combination you want, followed by the letter that you need that takes that diacritical. The above keyboard shows the unshifted marks -- where there's a breathing mark, it's smooth. The following keyboard shows the shifted marks -- where there's a breathing mark, it's rough.

Keyviewer mac greek diacriticals shifted.png

Once you release the appropriate mark, the keys on the keyboard with an orange outline will show those characters that are eligible to take that mark. The following example shows the letters (unshifted) that can take a rough breathing mark with a circumflex.

Keyviewer mac greek diacriticals allowed.png


The iota subscript and breve are available by holding down the option key [no screenshot].

Missing pages

The following is/was a draft. I need to look through it and see if it's irrelevant at this point in time, or should be reviewed by the other squirrels and added elsewhere, but not right now (so I'm hiding it behind a collapsible section).

(click 'expand' to open)

General

Note that missing illustration files should be treated separately, instead of as part of a missing pages (MP) project. See Add or replace illustration images for a project on this page.

If you have found a matching online edition, please include a link in your email to db-req.

[**mention MP finders]

PMs

  • Once you have located the missing pages, and verified that they are from the same edition as the main project, create the proofing images, a.k.a. page scans, and txt files for the missing pages. Number them so that they will end up in the correct place in the main project after a lexical sort. For example, if there are two pages missing between current pngs 012 and 013, label the missing pages 012a and 012b.
  • If the project has left P1, create a new project for the missing pages. If you are the PM for the main project, the easiest way to create the new project is to use the "Clone this project" link on the project page. The new project should have the same author, clearance, etc. as the main project. Make sure that the credit line reflects the correct image source. The title should be the same, with the addition of "[missing pages]" at the end, but without any status information (such as rounds repeated or skipped). E.g., the MP project for "An Interesting Book {R} {P1->P1} {P3 skipped}" would be "An Interesting Book [missing pages]". Load the missing pages into the project as you would for a regular project.
  • Put the MP project into P1 Unavailable. Do NOT release it yourself!
  • Send an email to db-req including your dp username, the project title and projectID of the missing pages project, and the project title and projectID of the main project into which it is to be merged. It is useful if you include the current state of the main project. You should also identify the source of the scans and, if using an online source, provide a link.

Here's a template you can use for your email:

 Subject: MP project: <number of pages> pp. Needs <rounds needed>

 projectID############# - <Project Name> [missing pages]
   to be merged into
 projectID############# - <full Project Name>
   current state: <state of original project>
 Image source: <source name> <link>
 
 Thanks,
 
 <yourname> (<dp-username>)

Example:

 Subject: MP project: 3 pp. Needs P1, P2, F1
 
 projectID123abc3456yoi - An Interesting Book [missing pages]
   to be merged into
 projectID543sdfig456rt - An Interesting Book {R} {P3 skipped}
   current state: F2 Unavailable
 Image source: TIA www.archive.org/details/interestingbk01
 
 Thanks,
 
 not a squirrel (notasquirrel)
  • Db-req will push the MP project through until it is in a compatible state, and then merge the missing pages into the main project.

Please do not try to push the project through yourself.

Db-req can push the MP project through exactly the same rounds as the main project, which simplifies the merge steps required, especially if the main project repeated and/or skipped any rounds, or is too old to have been through P2.

Note that the MP project has to go through the rounds, even if you don't need the proofed output.

If a project is still in P1 when the missing pages are discovered, the PM can add the pages without squirrel assistance. See the PM FAQ section about missing pages. Note that this covers only adding missing pages while a project is still in P1. If the project needs to be re-numbered, that will still need to be done by a squirrel.

If you aren't able to locate scans from a matching edition, or simply don't have the time to prep the MP project, db-req still needs to know! Send an email to db-req, including the project title and projectID, a list of the missing pages, and a link to the scans (if you were able to locate them).

PPers

If you are the PPer of a project with an active PM, you should always contact the PM first, and ask them to take care of the missing pages. If it is not clear how active a PM is, ask them to contact you within a week or two, and say that you will contact db-req if they do not respond within that time. If you don't hear back within the specified time, mention in your email to db-req that you attempted to contact the PM, but did not get a response. Be aware that the PM may just be on vacation, and not checking messages frequently.

If the PM is known to be inactive (from previous experience), the above step can be skipped.

If you have found a matching edition online that can be used as a source for the missing images, please include that information in your email.

Here's a template you can use for your email:

 Subject: MP project: <number of pages> pp.
 
 I'm the PPer for: 
 projectID############# - <full Project Name>
 
 Missing pages xx - yy. PM contacted on <date>, but has not 
 responded in <not less than a week>. Matching edition 
 found at <image source>: <link> (if applicable).
 
 Thanks,
 
 <yourname> (<dp-username>)

Example:

 Subject: MP project: 2 pp.
 
 I'm the PPer for:
 projectID543sdfig456rt - An Interesting Book {R} {P3 skipped}
 
 Missing pages numbered 17 and 18 between 021.png and 022.png. 
 PM contacted on 23 July, but has not responded in the last two weeks. 
 Matching edition found at TIA: www.archive.org/details/interestingbk02
 
 Thanks,
 
 not a squirrel (notasquirrel)

Db-req will run the MP project, and will let you know when it is done. Please do not upload for PPV or to Project Gutenberg until after you receive notification that the missing pages have been added to the project.



Adding translations

(click 'expand' to open)
Note -- French translation rolled out 2017-04-25

All translations should be done on the test site.

If you feel comfortable updating the PO file for your locale directly, and are the only translator for the locale, there should be no need to coordinate with others. However, it's a good idea to recruit others to help you fine-tune the translations.

If you are the only person who is doing the translating, and don't feel comfortable updating the PO file yourself, let the site Translation Coordinator know.

If you are one of multiple translators, there should be one person coordinating the translation effort for your locale. Preferably, the coordinator for the locale will feel comfortable with updating the PO file, but again, if not, contact the site Translation Coordinator.

Using poedit or some form of a poeditor (if you are familiar with the process and have a preferred editor) as part of the process mades it much easier to view the whole message, and to avoid mistakes in the markup in the PO file. PO files are text files, and fully editable with a text editor, but long strings that are broken over several lines can be confusing to translate.

The PO file contains *all* strings in the DP code that have been marked up for translation, even those of lower priority that non-admin users will never see, so it's important to have a way to tell where the strings to be translated come from.

The Loco online PO editor is perfectly acceptable to use, but has no easy way to tell where the string comes from. Its advantage is that you don't have to install anything on your computer.

Poedit has to be installed on your computer, and has a means to tell where the strings come from in the code. It's available for Windows, Mac and Linux.

If there's not enough context for you to determine the correct translation (this can be especially problematical for very short strings), please ask!

Words needing context differentiation

Also known as disambiguation. Where the strings are longer, and unique, context is usually not a problem. Where the word is by itself, or with maybe one other word, and the single word or word in combination is used in multiple places, it can be a problem.

Words identified in DP Code Text Localization page:

  • Completed
  • Available
  • None -- resolved all instances of "None" with initial cap.
  • none -- standalone, not capitalized
  • all
  • Bad
  • Unknown
  • Edit -- should be generally translated to mean "modify" or "modification", not "edition"

As of 11:42, 16 April 2017 (EDT), most of the above still need to be investigated.

String length

For a lot of things, the difference between the length of the original English and the length of the translation doesn't matter. In a few cases, it really does.

Graphs.
The length of the translation of the caption/graph header must not exceed the width of the graph plus label text. It won't wrap; it'll be truncated.
Labels.
Last Edit of Project Info
Dernière Édition des Infos du Projet
Last State Change
Dernière Modification de l'État du projet
The way things are currently configured, row labels adjust according to the content. Perhaps a better place to look is at the column labels for the round pages. It works, but it's ugly -- so should that be a consideration? (Column labels may also appear too long in English.)

Random Rules

Mostly not translated. How will this be handled?

Notes from lvl's pages

Going through lvl's localization related pages and making notes of comments that seem most relevant to the current effort. User:Lvl/Devel/I18n TODO list, User:Lvl/Devel/Distributed gettext fixing effort. He recognized the need for context disambiguation, but at that time we couldn't install a recent enough version of gettext() to use it. He's got a lot of stuff that specifically talks about the code end of things which I'm not qualified to comment on, so my notes here will have to do specifically with things that translators need to be aware of, or that I need to touch base with Casey about.

  • Need for short names in UI

Recognized the need for random rule translation. Not sure I understand his proposal, though.

Also commented about translating emails.

  • 403 and 404 text not translated. Upon investigation, these are not within the c directory, but at the same level as. The page headers have the _() markup, but are not included in the POT. The rest of the text on the page doesn't have the markup at all. Another one that might be useful to translate is down-index.php, which is at the same level (directly in htdocs).
  • Comments about jpgraph and collisions between gettext() and the image cache -- I'm assuming this has been resolved, since the graph text seems to be being translated.
  • Translator icon in the stats -- According to lvl: "This does not need a database change, merely three lines added in stats/include/member.inc, function showMbrRoles()". (also needs an icon) (is this something we want to do?)

Prioritization

Where a directory contains a mixture of priorities, relevant files will be listed separate.y, but otherwise, directories/subdirectories are listed.

High priority

  • activity_hub.php and default.php
  • anything in the accounts directory that has translatable text.
  • tools/proofers -- anything in this directory that has translatable text should probably be treanslated
  • faq/faq_central.php -- lvl classified as high priority. It's not currently translated that I can see. It's a PHP page that is a mixture of PHP and HTML. Since the "Help" link in the navbar is to this page, it probably ought to be translated. Most of what's listed there is now in the DP Official Documentation in the wiki, but as long as there's such a high-profile link to this document, it probably should be translated. (Casey will work on this.)
  • Other files in the faq/ directory: font_sample.php, pophelp/prefs/* and pophelp/teams/*, privacy, prooffacehelp.php, simple_proof_rules.php (it only outputs the header, but the header is not currently marked up for translation), site_progress_snapshot_legend.php (not currently marked up for translation)
  • userprefs.php
  • Anything in pinc that has translatable text should probably be translated.
  • in the tools directory: change_sr_commitment.php, changestate.php, pool.php, request_access.php, set_project_event_subs.php, set_project_holds.php and upload_text.php
  • in the tools/project_manager/ directory: diff.php, detail_legend.inc, generate_post_files.php, page_detail.php,
  • in the tools/post-proofers/ directory: smooth_reading.php
  • stats/members/, stats/teams/ (probably most stuff that's in stats/?)

Medium priority

Not as important as the high-priority stuff that might affect new users from the beginning, but should be done.

  • anything in the quiz/ directory
  • anything in the tools/post-proofers/ directory that hasn't already been translated.

Low Priority

faq/pophelp/project_manager/*? -- the only two files there are HTML files so may need to be refactored. Both appear to be pretty old. Either I'm totally inept at finding things, or these files are (no longer) referenced from the code?


  • anything in the tools/project_manager/ directory that isn't translated elsewhere.


Very Low Priority

  • tools/authors/ -- most of what's displayed via listing.php and index.html (which appears to forward to listing.php?) is translated elsewhere.

Ignore

May not need to be translated at all. At this point in time, it's pretty important for anyone in an administrative position to be able to communicate effectively in English. The only things that should be in this category are those that are seen only by squirrels and translators, or code that is not currently implemented.

  • Anything in tools/site_admin or locale
  • Anything in sample (not currently in use)
  • tools/proofers/md_*, tools/proofers/hiero/* (currently unused code)

Referenced articles

User:Lvl/Devel/Distributed gettext fixing effort
It's fairly out-dated, but has some useful information.
User:Lvl/Devel/I18n TODO list
Also outdated, but has useful information. I haven't studied it closely enough and correlated it with the current effort.
DP Code Text Localization
Listed here to keep track of it.
Site Translation
Compare this with the Translation FAQ document once the updated version is rolled out to TEST and PROD: https://www.pgdp.net/c/faq/translate.php
User:Acunning40/translation
More relevant to translating stuff that's kept in the wiki, and may need to be updated, but has some useful comments.