User:Rossolson/RTF Recapture
Jump to navigation
Jump to search
Goal: Clean Up
Make the eBooks that only have RTF files compatible with more modern HTML workflows.
Phase 1: Identify candidate eBooks.
(Found via Google Search)
- Vaknin
- https://www.gutenberg.org/ebooks/4742 - TrendSiters Digital Content, Vaknin
- https://www.gutenberg.org/ebooks/8421 - The First Book of Factoids, Vaknin
- https://www.gutenberg.org/ebooks/28363 - MindGames: Short Fiction about Bizarre Mental Health Disorders , Vaknin
- https://www.gutenberg.org/ebooks/8218 - Wars and Empire, Vaknin
- https://www.gutenberg.org/ebooks/11261 - Cyclopedia of Philosophy, Vaknin
- https://www.gutenberg.org/ebooks/12701 - The Suffering of Being Kafka, Vaknin
- (Likely more of the 29 eBooks from Vaknin will have similar issues.)
- https://www.gutenberg.org/ebooks/5330 - Rhyme and Reason; a Compilation of Verses, Rhymes and Senses, Dom (HTML has incorrect line breaks and formatting issues.)
- https://www.gutenberg.org/ebooks/5766 - Praetor's Lunch, Dom (More line-break weirdness.)
- https://www.gutenberg.org/ebooks/27460 - Il Vanzeli di Mateo (HTML has character conversion issues.)
- https://www.gutenberg.org/ebooks/22746 - The Copy/South Dossier, Story
Phase 2: Conversion Script
Create RTF to HTML conversion tool that is specific to these particular RTF files.
Phase 3: Evaluate
Evaluate conversions for acceptability.
Phase 4: Release
Release conversions.
Other issues:
- Stray files: https://www.gutenberg.org/ebooks/2401