Talk:Harvesting Americana claims
djvu
No result if I search for "djvu". How do you proceed with the djvu format? --Keichwa 11:47, 26 August 2006 (PDT)
I am not sure I understand the question. If the book has a djvu edition available it will be listed. Some don't have one available. De2164 09:54, 27 August 2006 (PDT)
- My question is: how do you convert a djvu image file into OCR'able scans? The answer could be:
:ddjvu -format=pgm -page $i -black *.djvu - | pnmtopng -comp 2
The wiki lacks this info.--Keichwa 13:59, 27 August 2006 (PDT)
This page is not about background information, it is only meant to maintain claiming status of harvested books. The background info, including a paragraph about djvu format can be found in the Harvesting/Internet Archive "American Libraries" article, also linked to from the top of this article. --Sigal 15:05, 27 August 2006 (PDT)
- Yes, thanks for the pointer. I want us to start a dedicated djvu article. --Keichwa 09:15, 28 August 2006 (PDT)
Separate Page for PG texts
I can see this page getting unweildy as claimed items are added and texts into PG get tagged as such. I propose creating a "Texts in PG" sub-page and such items be moved there, e.g., Harvesting_Americana_claims/Texts_in_PG. What do you think? -- vls 19:45, 15 September 2006 (PDT)
I say good idea 17:26, 16 September 2006 (PDT) De2164 17:27, 16 September 2006 (PDT)
- I agree. Any technical reason why we can't do this (i.e., would the script to create the project status page have to be updated? Caw 16:11, 27 January 2007 (PST)
- The script should be updated, but it is well within my capabilities. I think we need to make a bigger change though. TIA are adding hundreds of books a week and the current status page (which I didn't upload yet) is well above 10MB even without the claimed and posted items. I'm looking for a creative idea how to avoid the status page becoming useless. Maybe we better take this discussion to the forum where we may get more responses. --Sigal 05:06, 28 January 2007 (PST)
- If we are going to take a more formal step towards organizing the claim system, I feel it needs to be a on-line database instead that all claims regardless of location is generated.
- User
- location
- file name
- Status
- examining
- claim
- in rounds
- posted to PG
- rejected (with a reason field. Bad, missing pages, etc.)
- returned to available
If the cumbersome Dave's clearance list data could be merged would be better yet. De2164 05:33, 28 January 2007 (PST)