Help get this topic noticed by sharing it on Twitter Twitter, Facebook Facebook, or email.
happy I’m excited
Alex Lunnon (Technical Product Manager) December 11, 2011 22:09

Clearer OCR User Interface



Editable Text vs Searchable Text

I'm not sure what the difference is between the two options, however I'm sure most users wouldn't think to look here after trying to convert to word and getting a bitmap.

When pressing OCR, there should be a window appearing to ask which of the two we'd like to do. Most users won't bother to look into the options.
3 people like
this idea
+1
Reply

  • Lance Flores
    I agree with you. Searched the help file and couldn't identify the difference between Searchable and Editable functions. Seems logical if you should be able to ocr a pdf or scan and ask for both, as Adobe will create an editable and searchable document. I would always want to be able to edit a searchable document; often to do a global search and replace.
  • (some HTML allowed)
    How does this make you feel?
    Add Image
    I'm

    e.g. happy, confident, thankful, excited sad, anxious, confused, frustrated indifferent, undecided, unconcerned kidding, amused, unsure, silly

  • loopysan
    Got here from here: http://gsfn.us/t/2mybk

    I found out the hard way also, converting a document expecting actual text, not a bitmap image. It was frustrating, actually, to have to figure out the difference. I would suggest at the least, forget the combo drop down box and use radio buttons so that the 2 options are "out there" and add a popup help item to it, something like that.

    In reference to my specific issue, there should be an option to "force" an OCR in spite of there being text objects already in the document. A variety of reasons exists why a user would want the OCR engine to parse over a document again, especially if a page is re-scanned or updated. Right now, the logic of the OCR process is very limited, if not "broken." Why did I pay extra for this feature when I cannot use it but once in a document if saved after an OCR run?

    Also, again, it would sweet if the optimize function/remove objects included text objects/OCR data. That would strip a document of all previous OCR data. Seems like a very useful function to me. : )
  • (some HTML allowed)
    How does this make you feel?
    Add Image
    I'm

    e.g. happy, confident, thankful, excited sad, anxious, confused, frustrated indifferent, undecided, unconcerned kidding, amused, unsure, silly

  • Patricia Lane
    Hi Alex,

    At your request, I am copying one of my comments from this thread http://community.nitropdf.com/nitropd...
    ***
    Morning Alex,
    Thanks for jumping in. Your post helped.The output missed quite some bits though, which means one has to be very careful and compare original and editable file line by line.
    Also, there's got to be a more efficient way of going about this. Here is what I had to go through (about 15 minutes for one page!):
    1. Scan page to pdf, name and save file.
    2. Open pdf in Nitro, click on other/extract images
    3. Find file no2, right click to create PDF
    4. Click on OCR to convert to editable text
    5. Click on convert to Word.
    *Then check output carefully*.

    Have to admit AbbyFineReaderPro does a better and easier job than Nitro for things like this...
    ***
    If Nitro could come up with an easy, reliable and fully editable PDF to Word conversion module, you'd be a huge hit with translators around the world. We often have to work with clients' PDF files (often having to scan them ourselves) and need a converter that will let us work (with or without CAT tools) on the extracted document to translate it and format it as close to its source text version as possible.

    Thanks.
    Patricia
  • (some HTML allowed)
    How does this make you feel?
    Add Image
    I'm

    e.g. happy, confident, thankful, excited sad, anxious, confused, frustrated indifferent, undecided, unconcerned kidding, amused, unsure, silly