Talk:A personal View of PMing

From DPWiki
Jump to navigation Jump to search

Irfan View and Black

Regarding this: "I now go and check that IrfanView hasn't decided to change any blank page images into solid black ones. I have no idea why it should do this, but please check them after every IrfanView run." I haven't run across this yet, but I looked into settings (Options->Properties/Settings), and I found under 'Viewing' that the Main Window Color is set as black. This might have something to do with the blank pages turning to black. (I remember I had to change the background color in XnView from black to white too.) So maybe next time someone runs into black problem, they could try changing this option, re-running batch and seeing it fixes the issue. -- camomiletea 00:14, 6 October 2011 (PDT)

There are also a few other settings using black... so it's not that simple ;) Under 'Browsing/editing' - Background color for cut. -- camomiletea 00:20, 6 October 2011 (PDT)

BEL Characters in ABByy output.

I bought ABBYY v12 at the end of 2014 at a reduced price. I find that it not only inserts tabs, but also "bell" characters (x07). I don't know if there is a way to stop ABByy from generating them, but there is an easy way to get rid of them. Similar to the tab removal technique, use Notepad++ "find in files". However this will use a regular expression rather than extended search mode. The regular expression: \x07 . I replace it with nothing.

Directory structure

While the Scan Tailor default is a sub-folder called /out, I always change that to a folder under work called "st_out", which means I can instantly see all the processes under "work". I normally download the raw jp2 scan set, not the processed jp2. On a few occasions I've encountered a cropped out caption in the processed jp2 set, and I don't think the smaller download size is worth the possible pain of finding something missing (or worse, somebody else finding something missing). Because I'm using unprocessed files, I run Scan Tailor before the OCR, meaning that the fr_out files are converted to pngs. I still use the raw tifs (from the raw jp2) for the illustration images.

Running Scan Tailor on the raw images only needs one extra step. Orientation: select the first page and rotate upright, select Scope: every other page. Then select the second page, which should be rotated the other way upright, select Scope: every other page again. Now you have upright images in ST.

I go into more detail about my PM processes where they differ from yours here.

Thank you so much for this wiki page - I couldn't have starting PMing without it.--Monicas wicked stepmother (talk) 05:33, 13 April 2015 (EDT)