Mudcat Café Message

sj

User Name	Thread Name	Subject	Posted
JohnInKansas	Tech: Dragon Naturally Speaking (22)	RE: Tech: Dragon Naturally Speaking	29 Jan 14
I've been doing quite a lot of scanning of textbooks to pdf. Since the scanner produces an image, OCR is necessary to convert it to "searchable text," which is one of the real advantages of having it digital. The ABBYY OCR that came with my scanners does a much better job than previous others, but still leaves quite a few "clinkers." The converter provided with the scanners for conversion from image to pdf is from Nuance, the makers of DNS, and also includes the ability to convert between .jpg, .pdf, Word, and a couple of other formats. Conversion accuracy is heavily dependent on the accuracy of the OCR for scanned documents. Best results probably are obtained when the starting point is "plain text," either Word documents or .txt, with "formatted text" (.rtf) fairly close. In a very few cases I've made the pdf from .jpg, converted .pdf to .txt, corrected the errors, and then made a new .pdf from the corrected text; but that's only justified for "very important documents" that just don't come out right straight from the .jpg scan to .pdf. I haven't messed with text-to-voice or voice-to-text, so only know what I've seen in the comic books (called advertisements - or more often "marvelous special offers" - by the authors). John

Post to this Thread -

Back to the Main Forum Page

By clicking on the User Name, you will requery the forum for that user. You will see everything that he or she has posted with that Mudcat name.

By clicking on the Thread Name, you will be sent to the Forum on that thread as if you selected it from the main Mudcat Forum page.

By clicking on the Subject, you will also go to the thread as if you selected it from the original Forum page, but also go directly to that particular message.

By clicking on the Date (Posted), you will dig out every message posted that day.

Try it all, you will see.