
"What is PDF Extraction?"

By Rowan Hanna
PDF Store Support Team
Issue 22 for 2005

Quite simply, PDF extraction refers to the extraction or conversion of PDF data into a re-usable form. While some editing is possible natively within Acrobat, and even more can be done with the likes of PitStop Professional and ARTS PDF ImageWorks, serious editing should be done in the original source applications. What, then, can be done for files requiring significant re-work when the source files are either unavailable or unusable?
The answer, of course, is to extract the relevant data from the PDF file. Assuming that the PDF file in question is not secured, your task will be the relatively simple one of choosing which of the available tools will work best for you. From Acrobat 6, it's possible to convert entire PDF documents into a number of native image and other formats using the Save As command, while various tools enable users to extract selected objects from PDF files.
Depending on your exact needs, this method may be inefficient because of the volume of files involved, or you may have exacting requirements that are not immediately met by Acrobat's native extraction functionality. Or perhaps you don't own Acrobat at all. Over the coming weeks, I'll address specific situations, citing relevant tools, so stay tuned!