<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=Windows-1252">
<style type="text/css" style="display:none;"><!-- P {margin-top:0;margin-bottom:0;} --></style>
</head>
<body dir="ltr">
<div id="divtagdefaultwrapper" dir="ltr" style="font-size: 10pt; color: rgb(0, 0, 0); font-family: Arial, Helvetica, sans-serif, EmojiFont, 'Apple Color Emoji', 'Segoe UI Emoji', NotoColorEmoji, 'Segoe UI Symbol', 'Android Emoji', EmojiSymbols;">
<p>Thanks Rob -</p>
<p><br>
</p>
<p>Really makes my point, and I guess I should not rely on volunteers! I don't blame you one bit - just does not work.</p>
<p><br>
</p>
<p>I expect no one else wants to try either. If anyone does, don't look at the crib below until after you try. The errors are located and circled in red.</p>
<p><br>
</p>
<p><a href="http://electronotes.netfirms.com/AN23Rob.PDF" class="OWAAutoLink" id="LPlnk587417" previewremoved="true">http://electronotes.netfirms.com/AN23Rob.PDF</a><br>
</p>
<p><br>
</p>
<p>Please all, let's agree that the OCR issue is bogus as applied here.</p>
<p><br>
</p>
<p>Bernie</p>
<br>
<br>
<div style="color:rgb(0,0,0)">
<hr tabindex="-1" style="display:inline-block; width:98%">
<div id="divRplyFwdMsg" dir="ltr"><font face="Calibri, sans-serif" color="#000000" style="font-size:11pt"><b>From:</b> Rob Kam &lt;robkam@ymail.com&gt;<br>
<b>Sent:</b> Thursday, July 6, 2017 1:51 PM<br>
<b>To:</b> Bernard Arthur Hutchins Jr<br>
<b>Cc:</b> synth-diy@synth-diy.org<br>
<b>Subject:</b> Re: [sdiy] Can anyone OCR the AN23.PDF File Here?</font>
<div> </div>
</div>
<div>
<div style="color:#000; background-color:#fff; font-family:lucida console,sans-serif; font-size:16px">
<div id="yui_3_16_0_ym19_1_1499355715660_7157"><span id="yui_3_16_0_ym19_1_1499355715660_7156">Thanks for the challenge Bernie but no thanks. I don't have the patience to correct the OCR.<br>
<br>
Rob</span></div>
<div class="qtdSeparateBR" id="yui_3_16_0_ym19_1_1499355715660_7155"><br>
</div>
<div class="yahoo_quoted" id="yui_3_16_0_ym19_1_1499355715660_7036" style="display:block">
<div id="yui_3_16_0_ym19_1_1499355715660_7035" style="font-family:lucida console,sans-serif; font-size:16px">
<div id="yui_3_16_0_ym19_1_1499355715660_7034" style="font-family:HelveticaNeue,Helvetica Neue,Helvetica,Arial,Lucida Grande,Sans-Serif; font-size:16px">
<div dir="ltr" id="yui_3_16_0_ym19_1_1499355715660_7153"><font size="2" face="Arial" id="yui_3_16_0_ym19_1_1499355715660_7152">
<hr size="1" id="yui_3_16_0_ym19_1_1499355715660_7154">
<b id="yui_3_16_0_ym19_1_1499355715660_7151"><span id="yui_3_16_0_ym19_1_1499355715660_7150" style="font-weight:bold">From:</span></b> Bernard Arthur Hutchins Jr &lt;bah13@cornell.edu&gt;<br>
<b><span style="font-weight:bold">To:</span></b> Rob Kam &lt;robkam@ymail.com&gt; <br>
<b><span style="font-weight:bold">Cc:</span></b> "synth-diy@synth-diy.org" &lt;synth-diy@synth-diy.org&gt;<br>
<b><span style="font-weight:bold">Sent:</span></b> Thursday, 6 July 2017, 18:30<br>
<b><span style="font-weight:bold">Subject:</span></b> Re: [sdiy] Can anyone OCR the AN23.PDF File Here?<br>
</font></div>
<div class="y_msg_container" id="yui_3_16_0_ym19_1_1499355715660_7033"><br>
<div id="yiv0105428593">
<div dir="ltr" id="yui_3_16_0_ym19_1_1499355715660_7032">
<div id="yiv0105428593divtagdefaultwrapper" dir="ltr" style="font-size: 10pt; color: rgb(0, 0, 0); font-family: Arial, Helvetica, sans-serif, EmojiFont, 'Apple Color Emoji', 'Segoe UI Emoji', NotoColorEmoji, 'Segoe UI Symbol', 'Android Emoji', EmojiSymbols, EmojiFont, 'Apple Color Emoji', 'Segoe UI Emoji', NotoColorEmoji, 'Segoe UI Symbol', 'Android Emoji', EmojiSymbols;">
<div id="yui_3_16_0_ym19_1_1499355715660_7161">Thanks Rob -</div>
<div id="yui_3_16_0_ym19_1_1499355715660_7162"><br>
</div>
<div id="yui_3_16_0_ym19_1_1499355715660_7163">True - the equations are now usable, but slightly more blurred than my original PDF. Likewise, the figures are now OK but of slightly lower quality, which does NOT matter much for hand drawings. </div>
<div id="yui_3_16_0_ym19_1_1499355715660_7305"><br>
</div>
<div id="yui_3_16_0_ym19_1_1499355715660_7306">I did note a lot of OCR misreads in the text. A careful proofing of the text took me 18 minutes and turned up 25 errors, some not at all obscure, and for about 13 of them I had to look at the original to see what
they were supposed to be. (One was hard to detect since it substituted an Rf for an Ri, a disaster.) A full proofread/correction would take at least 30 minutes (188 eight-hour days for 6000 pages). And I wrote this! Almost certainly a volunteer would
have more trouble and miss errors.</div>
<div id="yui_3_16_0_ym19_1_1499355715660_7236"><br>
</div>
<div id="yui_3_16_0_ym19_1_1499355715660_7235">In the spirit of no good deed going unpunished, Rob, let me put you on the spot. Take your scan, find and fix the 25 errors. Let us know how easy/hard this was and the time it took, and show your results. </div>
<div id="yui_3_16_0_ym19_1_1499355715660_7147"><br>
</div>
<div id="yui_3_16_0_ym19_1_1499355715660_7042">I will post the "solution" to the "find the errors" this evening if I get the chance.</div>
<div id="yui_3_16_0_ym19_1_1499355715660_7031"><br>
</div>
<div id="yui_3_16_0_ym19_1_1499355715660_7309">Since there is no improvement in the figures/equations, and the text is a serious downgrade, tell me again (anyone) why an OCR/ebook is a good idea here. </div>
<div id="yui_3_16_0_ym19_1_1499355715660_7310"><br>
</div>
<div id="yui_3_16_0_ym19_1_1499355715660_7311">Bernie</div>
<br>
<br>
<div id="yui_3_16_0_ym19_1_1499355715660_7230" style="color:rgb(0,0,0)">
<hr tabindex="-1" id="yui_3_16_0_ym19_1_1499355715660_7312" style="display:inline-block; width:98%">
<div id="yiv0105428593divRplyFwdMsg" dir="ltr"><font face="Calibri, sans-serif" color="#000000" id="yui_3_16_0_ym19_1_1499355715660_7313" style="font-size:11pt"><b>From:</b> Rob Kam &lt;robkam@ymail.com&gt;<br>
<b>Sent:</b> Thursday, July 6, 2017 7:24 AM<br>
<b>To:</b> Bernard Arthur Hutchins Jr<br>
<b>Cc:</b> synth-diy@synth-diy.org<br>
<b>Subject:</b> RE: [sdiy] Can anyone OCR the AN23.PDF File Here?</font>
<div id="yui_3_16_0_ym19_1_1499355715660_7314"> </div>
</div>
<div id="yui_3_16_0_ym19_1_1499355715660_7229">
<div class="yiv0105428593WordSection1" id="yui_3_16_0_ym19_1_1499355715660_7228">
<div class="yiv0105428593MsoNormal" id="yui_3_16_0_ym19_1_1499355715660_7227"><span id="yui_3_16_0_ym19_1_1499355715660_7226" style="font-size:11.0pt; color:#1F497D">There’s a second attempt at
<a rel="nofollow" target="_blank" href="http://www.sdiy.info/AN23b.rtf" id="LPlnk72125" previewremoved="true">
<span style="font-size:12.0pt; color:#1F497D; text-decoration:none">http://www.sdiy.info/AN23b.rtf</span></a> converting the equations to images instead (and still manually tweaking the OCR). It took six minutes to do from the scan/PDF, and the text still needs
comparing and correcting against the original.</span></div>
<div class="yiv0105428593MsoNormal" id="yui_3_16_0_ym19_1_1499355715660_7315"><span style="font-size:11.0pt; color:#1F497D"> </span></div>
<div class="yiv0105428593MsoNormal" id="yui_3_16_0_ym19_1_1499355715660_7317"><span id="yui_3_16_0_ym19_1_1499355715660_7316" style="font-size:11.0pt; color:#1F497D">There are already experts at this sort of project at Archive.org, who have been doing this
for a number of years <a rel="nofollow" target="_blank" href="https://archive.org/details/texts&tab=about" id="LPlnk637705" previewremoved="true">
<span id="yui_3_16_0_ym19_1_1499355715660_7318" style="font-size:12.0pt; color:#1F497D; text-decoration:none">https://archive.org/details/texts&tab=about</span></a>
</span>
<div id="LPBorder_GT_14993670741300.17156451145104" style="margin-bottom:20px; overflow:auto; width:100%; text-indent:0px">
<table id="LPContainer_14993670741240.8647753949174561" cellspacing="0" style="width:90%; overflow:auto; padding-top:20px; padding-bottom:20px; margin-top:20px; border-top-width:1px; border-top-style:dotted; border-top-color:rgb(200,200,200); border-bottom-width:1px; border-bottom-style:dotted; border-bottom-color:rgb(200,200,200); background-color:rgb(255,255,255)">
<tbody>
<tr valign="top" style="border-spacing:0px">
<td id="TextCell_14993670741260.7731319830910077" colspan="2" style="vertical-align: top; padding: 0px; display: table-cell; position: relative;">
<div id="LPRemovePreviewContainer_14993670741260.20258585323731526"></div>
<div id="LPTitle_14993670741270.29065428092860457" style="top:0px; color:rgb(179,27,27); font-weight:normal; font-size:21px; font-family:wf_segoe-ui_light,'Segoe UI Light','Segoe WP Light','Segoe UI','Segoe WP',Tahoma,Arial,sans-serif; line-height:21px">
<a id="LPUrlAnchor_14993670741280.27192376971136123" href="https://archive.org/details/texts&tab=about" target="_blank" style="text-decoration:none">Free Books : Download & Streaming : eBooks and Texts ...</a></div>
<div id="LPMetadata_14993670741280.7021146677640517" style="margin:10px 0px 16px; color:rgb(102,102,102); font-weight:normal; font-family:wf_segoe-ui_normal,'Segoe UI','Segoe WP',Tahoma,Arial,sans-serif; font-size:14px; line-height:14px">
archive.org</div>
<div id="LPDescription_14993670741290.10828915176730747" style="display:block; color:rgb(102,102,102); font-weight:normal; font-family:wf_segoe-ui_normal,'Segoe UI','Segoe WP',Tahoma,Arial,sans-serif; font-size:14px; line-height:20px; max-height:100px; overflow:hidden">
The Internet Archive offers over 12,000,000 freely downloadable books and texts. There is also a collection of 550,000 modern eBooks that may be borrowed by anyone ...</div>
</td>
</tr>
</tbody>
</table>
</div>
<br>
</div>
<br>
<br>
<br>
To put my two cents in, the synth DIY community should see whether they are able to raise the funds to compensate (against unsold hardcopy,
<span class="yiv0105428593SpellE">ebooks</span> etc.) for releasing <span class="yiv0105428593SpellE">
Electronotes</span> under a non-commercial Creative Commons licence https://creativecommons.org/licenses/by-nc/2.0/uk/
<div></div>
<div class="yiv0105428593MsoNormal"><span style="font-size:11.0pt; color:#1F497D"> </span></div>
<div class="yiv0105428593MsoNormal"><span style="font-size:11.0pt; color:#1F497D">Rob</span></div>
<div class="yiv0105428593MsoNormal"><span style="font-size:11.0pt; color:#1F497D"> </span></div>
<div>
<div style="border:none; border-top:solid #E1E1E1 1.0pt; padding:3.0pt 0cm 0cm 0cm">
<div class="yiv0105428593MsoNormal"><b><span lang="EN-US" style="font-size:11.0pt">From:</span></b><span lang="EN-US" style="font-size:11.0pt"> Bernard Arthur Hutchins Jr [mailto:bah13@cornell.edu]
<br>
<b>Sent:</b> 06 July 2017 01:42<br>
<b>To:</b> Rob Kam &lt;robkam@ymail.com&gt;; mskala@ansuz.sooke.bc.ca<br>
<b>Cc:</b> synth-diy@synth-diy.org<br>
<b>Subject:</b> Re: [sdiy] Can anyone OCR the AN23.PDF File Here?</span></div>
</div>
</div>
<div class="yiv0105428593MsoNormal"> </div>
<div id="yiv0105428593divtagdefaultwrapper" style="font-family: Arial, Helvetica, sans-serif, EmojiFont, 'Apple Color Emoji', 'Segoe UI Emoji', NotoColorEmoji, 'Segoe UI Symbol', 'Android Emoji', EmojiSymbols, EmojiFont, 'Apple Color Emoji', 'Segoe UI Emoji', NotoColorEmoji, 'Segoe UI Symbol', 'Android Emoji', EmojiSymbols, EmojiFont, 'Apple Color Emoji', 'Segoe UI Emoji', NotoColorEmoji, 'Segoe UI Symbol', 'Android Emoji', EmojiSymbols;">
<div><span style="font-size:10.0pt; color:black"> </span></div>
<div class="yiv0105428593MsoNormal"><span style="font-size:10.0pt; color:black">Thanks Rob -
</span></div>
<div>
<div class="yiv0105428593MsoNormal"><span style="font-size:10.0pt; color:black"> </span></div>
</div>
<div>
<div class="yiv0105428593MsoNormal"><span style="font-size:10.0pt; color:black">But manual identification and 5 minutes/page are no good for the small improvement. That is still months of 8-hour days to do 6000 pages. My PDF is still much better already. The equations
are still unusable. It makes the same text errors, apparently. Why not just say it just can't do this? It wasn't intended to. </span></div>
</div>
<div>
<div class="yiv0105428593MsoNormal"><span style="font-size:10.0pt; color:black"> </span></div>
</div>
<div>
<div class="yiv0105428593MsoNormal"><span style="font-size:10.0pt; color:black">Thanks for trying - useful data point! </span></div>
</div>
<div>
<div class="yiv0105428593MsoNormal"><span style="font-size:10.0pt; color:black"> </span></div>
</div>
<div>
<div class="yiv0105428593MsoNormal" style="margin-bottom:12.0pt"><span style="font-size:10.0pt; color:black">Bernie</span></div>
<div>
<div class="yiv0105428593MsoNormal" align="center" style="text-align:center"><span style="font-size:10.0pt; color:black">
<hr size="2" width="98%" align="center">
</span></div>
<div id="yiv0105428593divRplyFwdMsg">
<div class="yiv0105428593MsoNormal" style=""><b><span style="font-size:11.0pt; color:black">From:</span></b><span style="font-size:11.0pt; color:black"> Rob Kam &lt;</span><a rel="nofollow" target="_blank" href="mailto:robkam@ymail.com"><span style="font-size:11.0pt">robkam@ymail.com</span></a><span style="font-size:11.0pt; color:black">&gt;<br>
<b>Sent:</b> Wednesday, July 5, 2017 6:47 PM<br>
<b>To:</b> Bernard Arthur Hutchins Jr; </span><a rel="nofollow" target="_blank" href="mailto:mskala@ansuz.sooke.bc.ca"><span style="font-size:11.0pt">mskala@ansuz.sooke.bc.ca</span></a><span style="font-size:11.0pt; color:black"><br>
<b>Cc:</b> </span><a rel="nofollow" target="_blank" href="mailto:synth-diy@synth-diy.org"><span style="font-size:11.0pt">synth-diy@synth-diy.org</span></a><span style="font-size:11.0pt; color:black"><br>
<b>Subject:</b> RE: [sdiy] Can anyone OCR the AN23.PDF File Here?</span><span style="font-size:10.0pt; color:black">
</span></div>
<div>
<div class="yiv0105428593MsoNormal"><span style="font-size:10.0pt; color:black"> </span></div>
</div>
</div>
<div>
<div>
<div class="yiv0105428593MsoNormal" style=""><span style="font-size:11.0pt; color:#1F497D">Hi Bernie,</span><span style="color:black"></span></div>
<div class="yiv0105428593MsoNormal" style=""><span style="font-size:11.0pt; color:#1F497D"><br>
At </span><a rel="nofollow" target="_blank" href="http://www.sdiy.info/AN23.rtf" id="yiv0105428593LPlnk309394"><span style="font-size:11.0pt">http://www.sdiy.info/AN23.rtf</span></a><span style="font-size:11.0pt; color:#1F497D"> this took 10 minutes to OCR
with </span><a rel="nofollow" target="_blank" href="https://www.google.co.uk/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=rja&uact=8&ved=0ahUKEwiZhc6ZmPPUAhVG6RQKHRHpA1UQFggoMAA&url=http%3A%2F%2Fwww.abbyy.com%2Fen-gb%2Fsupport%2Ffinereader-12%2F&usg=AFQjCNHLOjsz219pjjTDqDytG2Cpm9N90w"><span style="font-size:11.0pt; color:#1F497D; text-decoration:none">ABBYY
FineReader 12</span></a><span style="font-size:11.0pt; color:#1F497D">, first manually identifying areas of text vs. images. Obviously it still needs further corrections.
<br>
<br>
Rob</span><span style="color:black"> </span></div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
<br>
<br>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</body>
</html>