Register

If this is your first visit, please click the Sign Up now button to begin the process of creating your account so you can begin posting on our forums! The Sign Up process will only take up about a minute of two of your time.

Results 1 to 5 of 5
  1. #1
    Junior Member
    Join Date
    Jan 2011
    Posts
    9
    Member #
    26451
    Liked
    1 times
    Hi,

    I have a bunch of loose paper documents I would like to scan as .doc files. I have a Canoscan 5600 scanner. Does anyone know of a good way of doing this? I have an image of 1 of the documents I'd like to scan as a document http://entirefilingbankruptcy.com/fedexemptions.html - I would like to have this table but as a document to paste into the page with the correct formating.

    I've tried using office document imaging, which allows me to copy the text from my document as a scanned .tiff image, except for the fact I lose the layout of the text from the document.

    Any ideas? Cheers & thanks!

  2.  

  3. #2
    Senior Member Webzarus's Avatar
    Join Date
    May 2011
    Location
    South Carolina Coast
    Posts
    3,322
    Member #
    27709
    Liked
    770 times
    Nope... Not gonna happen... When you take something from an image, there is not a single program I know of that will create MS formats...

    OCR is meant for text only, but I've never found a single thing that will scan and convert an image of a document and save the original formatting... Believe me, I've tried.

    Scan and save to PDF or scan directly to PDF preserves the format, but its still in image format, once you OCR it and save or export to word, all formatting is lost.

    I have several thousand patents that a client wants scanned and converted to word... But not gonna happen, with current technology ( that I'm aware of ).

  4. #3
    Unpaid WDF Intern TheGAME1264's Avatar
    Join Date
    Dec 2002
    Location
    Not from USA
    Posts
    14,483
    Member #
    425
    Liked
    2783 times
    Not to mention that OCR is at best about 95% accurate....even when something is MICR-encoded, like say a check. That means you're going to spend a whole bunch of time fixing errors and tracking errors and running into characters like @$*!(#)@!*!#! throughout your document.

    Scanning technology is just not there yet.
    If I've helped you out in any way, please pay it forward. My wife and I are walking for Autism Speaks. Please donate, and thanks.

    If someone helped you out, be sure to "Like" their post and/or help them in kind. The "Like" link is on the bottom right of each post, beside the "Share" link.

    My stuff (well, some of it): My bowling alley site | Canadian Postal Code Info (beta)

  5. #4
    Senior Member Webzarus's Avatar
    Join Date
    May 2011
    Location
    South Carolina Coast
    Posts
    3,322
    Member #
    27709
    Liked
    770 times
    I use a high end Minolta scanner, scan directly to COMPACT PDF, document is perfect in every way, selectable text, graphics, ect ... But when you "export" to word ( a real option in adobe ) the only problems it has is symbols as the symbol tables in word are different than symbol tables anywhere else, it still trashes the format, unless you're using standard letter ( left align ) formatting which no one does.

    So even high end equipment and high end software is useless for scanning and converting to word.

    Someone told me ( I haven't had time to try ), but they said if I Export to word, and open it with open office, it will display the formatting correctly, then save it out of there as a word document.

    Not sure in want to waste my time, cause I'm sure MS has done something to fix that too .

  6. #5
    Junior Member
    Join Date
    Jan 2011
    Posts
    9
    Member #
    26451
    Liked
    1 times
    Unfortunately that's what I figured. OK thanks anyways for the replies guys!


Remove Ads

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •  
All times are GMT -6. The time now is 11:18 PM.
Powered by vBulletin® Version 4.2.3
Copyright © 2019 vBulletin Solutions, Inc. All rights reserved.
vBulletin Skin By: PurevB.com