<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
</head>
<body text="#000000" bgcolor="#FFFFFF">
<p>I'll do it. I have already converted it, so I'll just give it a
read and fix up the graphics.</p>
<p>Mike<br>
</p>
<br>
<div class="moz-cite-prefix">On 7/6/2017 1:51 PM, Rob Kam wrote:<br>
</div>
<blockquote type="cite"
cite="mid:1737955646.10682208.1499363477927@mail.yahoo.com">
<div style="color:#000; background-color:#fff; font-family:lucida
console, sans-serif;font-size:16px">
<div id="yui_3_16_0_ym19_1_1499355715660_7157"><span
id="yui_3_16_0_ym19_1_1499355715660_7156">Thanks for the
challenge Bernie but no thanks. I don't have the patience to
correct the OCR.<br>
<br>
Rob</span></div>
<div class="qtdSeparateBR"
id="yui_3_16_0_ym19_1_1499355715660_7155"><br>
</div>
<div class="yahoo_quoted"
id="yui_3_16_0_ym19_1_1499355715660_7036" style="display:
block;">
<div style="font-family: lucida console, sans-serif;
font-size: 16px;" id="yui_3_16_0_ym19_1_1499355715660_7035">
<div style="font-family: HelveticaNeue, Helvetica Neue,
Helvetica, Arial, Lucida Grande, Sans-Serif; font-size:
16px;" id="yui_3_16_0_ym19_1_1499355715660_7034">
<div dir="ltr" id="yui_3_16_0_ym19_1_1499355715660_7153">
<font id="yui_3_16_0_ym19_1_1499355715660_7152" size="2"
face="Arial">
<hr id="yui_3_16_0_ym19_1_1499355715660_7154" size="1">
<b id="yui_3_16_0_ym19_1_1499355715660_7151"><span
style="font-weight:bold;"
id="yui_3_16_0_ym19_1_1499355715660_7150">From:</span></b>
Bernard Arthur Hutchins Jr <a class="moz-txt-link-rfc2396E" href="mailto:bah13@cornell.edu">&lt;bah13@cornell.edu&gt;</a><br>
<b><span style="font-weight: bold;">To:</span></b> Rob
Kam <a class="moz-txt-link-rfc2396E" href="mailto:robkam@ymail.com">&lt;robkam@ymail.com&gt;</a> <br>
<b><span style="font-weight: bold;">Cc:</span></b>
<a class="moz-txt-link-rfc2396E" href="mailto:synth-diy@synth-diy.org">"synth-diy@synth-diy.org"</a>
<a class="moz-txt-link-rfc2396E" href="mailto:synth-diy@synth-diy.org">&lt;synth-diy@synth-diy.org&gt;</a><br>
<b><span style="font-weight: bold;">Sent:</span></b>
Thursday, 6 July 2017, 18:30<br>
<b><span style="font-weight: bold;">Subject:</span></b>
Re: [sdiy] Can anyone OCR the AN23.PDF File Here?<br>
</font> </div>
<div class="y_msg_container"
id="yui_3_16_0_ym19_1_1499355715660_7033"><br>
<div id="yiv0105428593">
<style type="text/css"><!--#yiv0105428593 P {margin-top:0;margin-bottom:0;}--></style>
<div dir="ltr"
id="yui_3_16_0_ym19_1_1499355715660_7032">
<div id="yiv0105428593divtagdefaultwrapper"
style="font-size:10pt;color:rgb(0, 0,
0);font-family:Arial, Helvetica, sans-serif,
EmojiFont, 'Apple Color Emoji', 'Segoe UI Emoji',
NotoColorEmoji, 'Segoe UI Symbol', 'Android
Emoji', EmojiSymbols;" dir="ltr">
<div id="yui_3_16_0_ym19_1_1499355715660_7161">Thanks
Rob -</div>
<div id="yui_3_16_0_ym19_1_1499355715660_7162"><br>
</div>
<div id="yui_3_16_0_ym19_1_1499355715660_7163">True
- the equations are now usable, but
slightly more blurred than my original PDF.
Likewise, the figures are now OK but of slightly
lower quality, which does NOT matter much for
hand drawings. </div>
<div id="yui_3_16_0_ym19_1_1499355715660_7305"><br>
</div>
<div id="yui_3_16_0_ym19_1_1499355715660_7306">I
did note a lot of OCR misreads in the text.
A careful proofread of the text took me 18
minutes and turned up 25 errors, some not at all
obscure; for about 13 of them I had to look at
the original to see what they were supposed to
be. (One was hard to detect since it
substituted an Rf for an Ri, a disaster.) A
full proofread/correction would take at least
30 minutes (188 eight-hour days for 6000 pages).
And I wrote this! Almost certainly a
volunteer would have more trouble and miss
errors.</div>
<div id="yui_3_16_0_ym19_1_1499355715660_7236"><br>
</div>
<div id="yui_3_16_0_ym19_1_1499355715660_7235">In
the spirit of no good deed going unpunished,
Rob, let me put you on the spot. Take your scan,
find and fix the 25 errors. Let us know how
easy/hard this was and the time it took, and
show your results. </div>
<div id="yui_3_16_0_ym19_1_1499355715660_7147"><br>
</div>
<div id="yui_3_16_0_ym19_1_1499355715660_7042">I
will post the "solution" to the "find the
errors" challenge this evening if I get the chance.</div>
<div id="yui_3_16_0_ym19_1_1499355715660_7031"><br>
</div>
<div id="yui_3_16_0_ym19_1_1499355715660_7309">Since
there is no improvement in the
figures/equations, and the text is a serious
downgrade, tell me again (anyone) why an
OCR/ebook is a good idea here. </div>
<div id="yui_3_16_0_ym19_1_1499355715660_7310"><br>
</div>
<div id="yui_3_16_0_ym19_1_1499355715660_7311">Bernie</div>
<br>
<br>
<div style="color:rgb(0, 0, 0);"
id="yui_3_16_0_ym19_1_1499355715660_7230">
<hr tabindex="-1"
style="display:inline-block;width:98%;"
id="yui_3_16_0_ym19_1_1499355715660_7312">
<div id="yiv0105428593divRplyFwdMsg" dir="ltr"><font
style="font-size:11pt;"
id="yui_3_16_0_ym19_1_1499355715660_7313"
face="Calibri, sans-serif" color="#000000"><b>From:</b>
Rob Kam <a class="moz-txt-link-rfc2396E" href="mailto:robkam@ymail.com">&lt;robkam@ymail.com&gt;</a><br>
<b>Sent:</b> Thursday, July 6, 2017 7:24 AM<br>
<b>To:</b> Bernard Arthur Hutchins Jr<br>
<b>Cc:</b> <a class="moz-txt-link-abbreviated" href="mailto:synth-diy@synth-diy.org">synth-diy@synth-diy.org</a><br>
<b>Subject:</b> RE: [sdiy] Can anyone OCR
the AN23.PDF File Here?</font>
<div id="yui_3_16_0_ym19_1_1499355715660_7314"> </div>
</div>
<div id="yui_3_16_0_ym19_1_1499355715660_7229">
<div class="yiv0105428593WordSection1"
id="yui_3_16_0_ym19_1_1499355715660_7228">
<div class="yiv0105428593MsoNormal"
id="yui_3_16_0_ym19_1_1499355715660_7227"><span
style="font-size:11.0pt;color:#1F497D;"
id="yui_3_16_0_ym19_1_1499355715660_7226">There’s a second attempt at
<a rel="nofollow" target="_blank"
href="http://www.sdiy.info/AN23b.rtf"
id="yiv0105428593LPlnk943095"
moz-do-not-send="true"><span
style="font-size:12.0pt;color:#1F497D;text-decoration:none;">http://www.sdiy.info/AN23b.rtf</span></a>
converting the equations to images
instead (while still manually tweaking
the OCR). It took six minutes to do from
the scan/PDF, and the text still needs
comparing and correcting against the
original.</span></div>
<div class="yiv0105428593MsoNormal"
id="yui_3_16_0_ym19_1_1499355715660_7315"><span
style="font-size:11.0pt;color:#1F497D;"> </span></div>
<div class="yiv0105428593MsoNormal"
id="yui_3_16_0_ym19_1_1499355715660_7317"><span
style="font-size:11.0pt;color:#1F497D;"
id="yui_3_16_0_ym19_1_1499355715660_7316">There are already experts in
this sort of project at Archive.org, who
have been doing this for a number of
years:
<a rel="nofollow" target="_blank"
href="https://archive.org/details/texts&tab=about"
id="yiv0105428593LPlnk701899"
moz-do-not-send="true">
<span
style="font-size:12.0pt;color:#1F497D;text-decoration:none;"
id="yui_3_16_0_ym19_1_1499355715660_7318">https://archive.org/details/texts&tab=about</span></a>
</span></div>
<br>
<br>
<br>
To put my two cents in, the synth DIY
community should see whether they are able
to raise the funds to compensate (against
unsold hardcopy,
<span class="yiv0105428593SpellE">ebooks</span>
etc.) for releasing <span
class="yiv0105428593SpellE">Electronotes</span>
under a non-commercial Creative Commons
licence
<a class="moz-txt-link-freetext" href="https://creativecommons.org/licenses/by-nc/2.0/uk/">https://creativecommons.org/licenses/by-nc/2.0/uk/</a>
<div class="yiv0105428593MsoNormal"><span
style="font-size:11.0pt;color:#1F497D;"> </span></div>
<div class="yiv0105428593MsoNormal"><span
style="font-size:11.0pt;color:#1F497D;">Rob</span></div>
<div class="yiv0105428593MsoNormal"><span
style="font-size:11.0pt;color:#1F497D;"> </span></div>
<div>
<div style="border:none;border-top:solid
#E1E1E1 1.0pt;padding:3.0pt 0cm 0cm
0cm;">
<div class="yiv0105428593MsoNormal"><b><span
style="font-size:11.0pt;"
lang="EN-US">From:</span></b><span
style="font-size:11.0pt;"
lang="EN-US"> Bernard Arthur
Hutchins Jr
[<a class="moz-txt-link-freetext" href="mailto:bah13@cornell.edu">mailto:bah13@cornell.edu</a>]
<br>
<b>Sent:</b> 06 July 2017 01:42<br>
<b>To:</b> Rob Kam
<a class="moz-txt-link-rfc2396E" href="mailto:robkam@ymail.com">&lt;robkam@ymail.com&gt;</a>;
<a class="moz-txt-link-abbreviated" href="mailto:mskala@ansuz.sooke.bc.ca">mskala@ansuz.sooke.bc.ca</a><br>
<b>Cc:</b> <a class="moz-txt-link-abbreviated" href="mailto:synth-diy@synth-diy.org">synth-diy@synth-diy.org</a><br>
<b>Subject:</b> Re: [sdiy] Can
anyone OCR the AN23.PDF File Here?</span></div>
</div>
</div>
<div class="yiv0105428593MsoNormal"> </div>
<div id="yiv0105428593divtagdefaultwrapper"
style="font-family:Arial, Helvetica,
sans-serif, EmojiFont, 'Apple Color
Emoji', 'Segoe UI Emoji', NotoColorEmoji,
'Segoe UI Symbol', 'Android Emoji',
EmojiSymbols, EmojiFont, 'Apple Color
Emoji', 'Segoe UI Emoji', NotoColorEmoji,
'Segoe UI Symbol', 'Android Emoji',
EmojiSymbols;">
<div><span
style="font-size:10.0pt;color:black;"> </span></div>
<div class="yiv0105428593MsoNormal"><span
style="font-size:10.0pt;color:black;">Thanks
Rob -
</span></div>
<div>
<div class="yiv0105428593MsoNormal"><span
style="font-size:10.0pt;color:black;"> </span></div>
</div>
<div>
<div class="yiv0105428593MsoNormal"><span
style="font-size:10.0pt;color:black;">But manual identification and 5
minutes/page is no good for the
small improvement. That is still months of
8-hour days to do 6000 pages. My
PDF is still much better already.
The equations are still unusable.
It makes the same text errors,
apparently. Why not just say it
can't do this? It wasn't
intended to. </span></div>
</div>
<div>
<div class="yiv0105428593MsoNormal"><span
style="font-size:10.0pt;color:black;"> </span></div>
</div>
<div>
<div class="yiv0105428593MsoNormal"><span
style="font-size:10.0pt;color:black;">Thanks for trying - useful data
point! </span></div>
</div>
<div>
<div class="yiv0105428593MsoNormal"><span
style="font-size:10.0pt;color:black;"> </span></div>
</div>
<div>
<div class="yiv0105428593MsoNormal"
style="margin-bottom:12.0pt;"><span
style="font-size:10.0pt;color:black;">Bernie</span></div>
<div>
<div class="yiv0105428593MsoNormal"
style="text-align:center;"
align="center"><span
style="font-size:10.0pt;color:black;">
<hr size="2" align="center"
width="98%">
</span></div>
<div id="yiv0105428593divRplyFwdMsg">
<div class="yiv0105428593MsoNormal"
style=""><b><span
style="font-size:11.0pt;color:black;">From:</span></b><span
style="font-size:11.0pt;color:black;"> Rob Kam &lt;</span><a
rel="nofollow"
ymailto="mailto:robkam@ymail.com"
target="_blank"
href="mailto:robkam@ymail.com"
moz-do-not-send="true"><span
style="font-size:11.0pt;">robkam@ymail.com</span></a><span
style="font-size:11.0pt;color:black;">&gt;<br>
<b>Sent:</b> Wednesday, July 5,
2017 6:47 PM<br>
<b>To:</b> Bernard Arthur
Hutchins Jr; </span><a
rel="nofollow"
ymailto="mailto:mskala@ansuz.sooke.bc.ca"
target="_blank"
href="mailto:mskala@ansuz.sooke.bc.ca"
moz-do-not-send="true"><span
style="font-size:11.0pt;">mskala@ansuz.sooke.bc.ca</span></a><span
style="font-size:11.0pt;color:black;"><br>
<b>Cc:</b> </span><a
rel="nofollow"
ymailto="mailto:synth-diy@synth-diy.org"
target="_blank"
href="mailto:synth-diy@synth-diy.org"
moz-do-not-send="true"><span
style="font-size:11.0pt;">synth-diy@synth-diy.org</span></a><span
style="font-size:11.0pt;color:black;"><br>
<b>Subject:</b> RE: [sdiy] Can
anyone OCR the AN23.PDF File
Here?</span><span
style="font-size:10.0pt;color:black;">
</span></div>
<div>
<div
class="yiv0105428593MsoNormal"><span
style="font-size:10.0pt;color:black;"> </span></div>
</div>
</div>
<div>
<div>
<div
class="yiv0105428593MsoNormal"
style=""><span
style="font-size:11.0pt;color:#1F497D;">Hi
Bernie,</span><span
style="color:black;"></span></div>
<div
class="yiv0105428593MsoNormal"
style=""><span
style="font-size:11.0pt;color:#1F497D;"><br>
At </span><a rel="nofollow"
target="_blank"
href="http://www.sdiy.info/AN23.rtf"
id="yiv0105428593LPlnk309394"
moz-do-not-send="true"><span
style="font-size:11.0pt;">http://www.sdiy.info/AN23.rtf</span></a><span
style="font-size:11.0pt;color:#1F497D;"> this took 10 minutes to OCR
with </span><a rel="nofollow"
target="_blank"
href="https://www.google.co.uk/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=rja&uact=8&ved=0ahUKEwiZhc6ZmPPUAhVG6RQKHRHpA1UQFggoMAA&url=http%3A%2F%2Fwww.abbyy.com%2Fen-gb%2Fsupport%2Ffinereader-12%2F&usg=AFQjCNHLOjsz219pjjTDqDytG2Cpm9N90w"
moz-do-not-send="true"><span
style="font-size:11.0pt;color:#1F497D;text-decoration:none;">ABBYY
FineReader 12</span></a><span
style="font-size:11.0pt;color:#1F497D;">, first manually identifying
areas of text vs. images.
Obviously it still needs
further corrections.
<br>
<br>
Rob</span><span
style="color:black;"> </span></div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
<br>
<br>
</div>
</div>
</div>
</div>
</div>
<br>
<fieldset class="mimeAttachmentHeader"></fieldset>
<br>
<pre wrap="">_______________________________________________
Synth-diy mailing list
<a class="moz-txt-link-abbreviated" href="mailto:Synth-diy@synth-diy.org">Synth-diy@synth-diy.org</a>
<a class="moz-txt-link-freetext" href="http://synth-diy.org/mailman/listinfo/synth-diy">http://synth-diy.org/mailman/listinfo/synth-diy</a>
</pre>
</blockquote>
<br>
</body>
</html>