Document Scanning - Beauty and the Beast!

Posted on January 16th, 2009 in Computers by chris

[Photo: HP Scanjet 5590 scanner]

Like all would-be record-keeping obsessives, I aspire to an office devoid of papers and filing cabinets, with all my records at my fingertips on the computer, carefully filed by subject and date, and with metadata that lets me locate any of them instantly in a search. Of course this aspiration has yet to materialise; my desire to keep all documents (letters, bills, statements, drawings, ideas, pieces torn from newspapers and magazines, etc.) means that the piles of paper awaiting scanning constantly dominate my desk.

For the last ten years I have used a series of Hewlett Packard flatbed scanners with an ADF (automatic document feeder) built into the lid. HP make good hardware and the image quality is usually excellent, but I’ve always had the feeling that the ADF in the lid was an afterthought, a means of getting the scanner to do something it wasn’t originally intended to do. The first photo here shows my HP Scanjet 5590, purchased in around 2003. In theory I should be able to drop a stack of documents into the feeder tray, press a button, and then do something else. The reality is different. Documents that have previously been folded frequently misfeed or jam, or two sheets are dragged in together. The 5590 supports duplex scanning, in which both sides of the page are scanned, but this involves three slow passes of the paper through the feeder, any of which may result in a paper jam. Each pass requires the document to make a 180-degree U-turn around a roller. If the paper jams there is no easy way to expose the paper path to retrieve it. Even if the paper doesn’t jam, the scan rate is s-l-o-w. Working through piles of paper documents is a tedious process that consists mainly of waiting for the scanner and checking for misfeeds.

[Photo: Fujitsu ScanSnap and HP Scanjet scanners]

For most of the decade in which I have been digitizing my paper documents I used a Windows PC, and my document management software was ScanSoft PaperPort. Like most applications it has features that are really cool and others that are frustrating. On the whole it was fit for my purposes and enabled me easily to scan my documents into PDFs and store those PDFs in a deep hierarchical arrangement of folders. PaperPort is a PC-only application so when I migrated to the Apple Mac I looked for an alternative. I settled upon Yep, which describes itself as iPhoto for PDFs (iPhoto is the Mac’s built-in repository for photographs). I may post a separate article devoted to Yep because it is a really nice application. Suffice to say that Yep enables me to scan, organise, and later find the PDFs that encapsulate my life. On the Yep support forum the subject of scanners has been raised several times, and the model that received repeated praise was the Fujitsu ScanSnap S510M (the M suffix denotes the Mac-specific variant of a scanner that is also available for the PC). After a particularly frustrating afternoon with my HP Scanjet I decided I would never conquer the rising piles of paper documents unless I used a better scanner.

It’s pleasing and easy to do research on the ScanSnap S510M scanner because nobody seems to have much bad to say about it. It seems to provide the features that everyone wants from a document scanner: speed, reliability, and simplicity. The online reviews, plus comments in forums and on websites such as Amazon, were all positive. One of the strong themes that came through was the bundled software, which integrates well with both the scanner and the Mac. This clinched it. I placed my order and the scanner was delivered the following day.

[Photo: Fujitsu ScanSnap scanner]

The Fujitsu ScanSnap S510M scanner is one of those items, like the iMac, where the act of unpacking the box and setting it up is a pleasure. Everything is neatly packed, there’s a printed list of what to expect in the box, and there’s even a paper quick-start guide, a rarity these days when a PDF is usually included on the installation CD.

With the ScanSnap there are three CDs to install: Adobe Acrobat 8 Professional, ABBYY FineReader for ScanSnap (a special version of the OCR program for this scanner), and finally the ScanSnap Manager, the software that controls the scanning process from the Mac. My one disappointment was that the documentation refers only to compatibility with Mac OS X version 10.4, aka Tiger, whereas I have been running version 10.5 (Leopard) for over a year. I needn’t have worried. Everything worked perfectly. I knew the S510M was slightly out of date, its case colour having been intended to harmonise with the older white Mac whereas my more recent iMac is grey aluminium rather than white. No matter. The ScanSnap still looks pleasing enough sitting on the desk beside my iMac.

My first thought was how small it is. The ScanSnap is tiny compared with my HP Scanjet flatbed, and the photographs on this page don’t adequately convey that. The ScanSnap is about the size of a loaf of bread; certainly smaller than our electric toaster. The outer lid hinges up to become the document feeder tray, and opening this automatically turns the power on, a blue LED clearly showing when it’s powered up. Inside is a separate articulated flap that folds out to form the paper catcher, although this is arguably unnecessary when the scanner sits on the desk; the two control buttons (power and scan) have been cleverly positioned to be easily operable when the catch tray is still folded closed.

[Photo: Fujitsu ScanSnap scanner]

In use, everything about the ScanSnap is exactly what I’ve always wanted. It works just like I want my document scanner to work. The warm-up time is negligible, maybe a second or two. The speed of document feeding is impressive; I never have the feeling that I’m waiting for the scanner in the way I did with the Scanjet. There’s an option for the scanner to ignore blank pages. By default both sides of the page are scanned simultaneously (although you can turn this off if you wish). Documents can be saved as JPGs or PDFs, although the PDFs at this stage are strictly images; you can choose whether each page (or every n pages) is saved as a separate PDF file, or whether a batch of pages is saved in a single PDF file. The ScanSnap Manager software offers to process each file using the ABBYY FineReader OCR application, which turns the PDF into one with text content that can be searched. Alternatively, scanned PDFs can be dragged onto the ABBYY icon in the dock for processing at a later time. With auto-sizing, every document, from a business card to a full letter or A4 page, is scanned precisely. Because a batch placed into the feeder tray may contain documents of many different sizes, I’ve noticed that smaller documents (e.g. till receipts or snippets torn from a newspaper) twist diagonally as they are scanned. No problem, the ScanSnap’s software automatically recognises this and corrects the resulting image. Very impressive. In the very rare event of a misfeed, the cover of the scanner can be flipped open to reveal the entire paper path. No more poking about with a letter knife as I used to have to do with the HP Scanjet’s document feeder.

Drawbacks? Well, arguably I’ll still need to keep a flatbed scanner for the rare times when I want to scan from a book or magazine, but I can envisage the Scanjet gathering dust while the ScanSnap sees daily use. Cost is another; the ScanSnap wasn’t cheap, but good tools often aren’t. Despite the price I wish I’d bought a ScanSnap ages ago.
