TextFromPDF is a text extraction tool for Windows XP/2000 that automates the conversion of Adobe PDF documents to text files. The PDFs may be on local drives, on network drives, or on the internet.
PDF files are great for exchanging formatted documents between people who don't use the same software. But sometimes we need to take the text out of a PDF file and use it in web pages, word processing documents, PowerPoint presentations, desktop publishing software, search and indexing applications, or content management systems.
TextFromPDF provides access to the text content in PDF documents without requiring any Adobe product. The extracted content is saved to text files, where it can be easily searched, archived, repurposed, and managed. A console version is included for script or batch file execution.
What matters most to you: speed or accuracy? TextFromPDF offers both, using multiple extraction engines optimized for different purposes.
Need speed?
Boasting the fastest text extractor available on the market, TextFromPDF's "simple formatting" option can process hundreds of PDFs in minutes. This option uses an extraction engine written in assembly language, which can run at speeds not normally attainable by code written in other languages.
Need accuracy?
If faithful reproduction of the original PDF layout is required, TextFromPDF can provide highly accurate conversion results. This extraction engine has been refined over many years to produce text files that are as close to the original PDF layout as possible.