<div dir="ltr"><p>After many messages, I don't feel it's really clear what the goals are. What would Bernie find acceptable, what would be great, and what would be perfect? I understand the need for professional work with attention to detail, but at the same time, clearly this is not a viable solution given its cost. Others suggested a group effort, but this does not seem to be an option for Bernie. So what is? Better OCR at $0 cost using a professional scanner at $0 cost? </p><p>The OCR on the main text doesn't seem hard to me, even using just commercially available OCR programs. And that is really all you want to OCR. Humans will still understand equations and schematics, so why try to OCR/re-make those?</p><p>b</p><div class="gmail_quote"><div dir="ltr">On Wed, Jul 5, 2017, 20:44 Bernard Arthur Hutchins Jr <<a href="mailto:bah13@cornell.edu" target="_blank">bah13@cornell.edu</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<div dir="ltr">
<div id="m_8681260506899796249m_-7766039601560400767divtagdefaultwrapper" style="font-size:10pt;color:#000000;font-family:Arial,Helvetica,sans-serif" dir="ltr">
<p><br>
</p>
Thanks Rob -
<div><br>
</div>
<div>But manual identification at 5 minutes/page is no good for such a small improvement. That is still months of 8-hour days to do 6000 pages. My PDF is already much better. The equations are still unusable. It apparently makes the same text errors. Why
not just say it can't do this? It wasn't intended to. </div>
<div><br>
</div>
<div>Thanks for trying - useful data point! </div>
<div><br>
</div>
<div>Bernie<br>
<br>
<div style="color:rgb(0,0,0)">
<hr style="display:inline-block;width:98%">
<div id="m_8681260506899796249m_-7766039601560400767divRplyFwdMsg" dir="ltr"><font face="Calibri, sans-serif" color="#000000" style="font-size:11pt"><b>From:</b> Rob Kam <<a href="mailto:robkam@ymail.com" target="_blank">robkam@ymail.com</a>><br>
<b>Sent:</b> Wednesday, July 5, 2017 6:47 PM<br>
<b>To:</b> Bernard Arthur Hutchins Jr; <a href="mailto:mskala@ansuz.sooke.bc.ca" target="_blank">mskala@ansuz.sooke.bc.ca</a><br>
<b>Cc:</b> <a href="mailto:synth-diy@synth-diy.org" target="_blank">synth-diy@synth-diy.org</a><br>
<b>Subject:</b> RE: [sdiy] Can anyone OCR the AN23.PDF File Here?</font>
<div> </div>
</div>
<div>
<div class="m_8681260506899796249m_-7766039601560400767WordSection1">
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1f497d">Hi Bernie,</span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1f497d"><br>
At <a href="http://www.sdiy.info/AN23.rtf" id="m_8681260506899796249m_-7766039601560400767LPlnk309394" target="_blank">
http://www.sdiy.info/AN23.rtf</a> this took 10 minutes to OCR with <a href="https://www.google.co.uk/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=rja&uact=8&ved=0ahUKEwiZhc6ZmPPUAhVG6RQKHRHpA1UQFggoMAA&url=http%3A%2F%2Fwww.abbyy.com%2Fen-gb%2Fsupport%2Ffinereader-12%2F&usg=AFQjCNHLOjsz219pjjTDqDytG2Cpm9N90w" target="_blank">
<span style="color:#1f497d;text-decoration:none">ABBYY FineReader 12</span></a>, first manually identifying areas of text vs. images. Obviously it still needs further corrections.
<br>
<br>
Rob</span><span> </span></p>
</div>
</div>
</div>
</div>
</div>
</div>
</blockquote></div></div>