Text Extraction
KT Text Filters
Extracts text content from numerous file types: DOC, RTF, HTM, HTML, PDF, XLS, XML, PPT, HLP, TXT, etc. Text extraction produces either plain text (trial version) or Unicode (purchased version) output and is normally required for information retrieval
TEXTfromPDF
TEXTfromPDF is a text extraction tool for WinXP/2000 that automates the conversion of Adobe PDF documents to text files. It gives a company access to the text content in PDF documents without requiring any Adobe product. The extracted content is saved to
PDF2Text Pilot
A PDF-to-text converter that extracts text from a batch of PDF files. PDF2Text Pilot is an open-source tool, so software developers can use the code as an example of solving a text extraction task. Command-line operation is supported.
PDF Extractor SDK v.2.20.415
PDF Extractor SDK allows developers to convert PDF to text or XML, extract images from PDF, convert PDF tables into CSV for Excel, and extract information about PDF files through .NET or ActiveX interfaces. No additional software is required.
iFunia PDF2Text for Mac v.2.0.0
iFunia PDF2Text for Mac is a text extraction tool that helps Mac users extract and reuse unformatted text from PDF documents. Because the program supports batch and selected conversion, you can convert multiple PDF files to .txt at once.
IndexSWF Pro
The IndexSWF Pro plug-in provides extended support for Flash files in Explorer and in Google Desktop Search. The plug-in can be used with: - Google Desktop Search to provide advanced indexing of Flash (SWF and FLV) files: - plain and HTML
Miraplacid Text Driver Terminal Edition
Miraplacid Text Driver (Text Printer Driver) extracts text from documents and saves it to file, copies to Clipboard, uploads to a server or emails. Text Output can be formatted as plain text or text with layout, previewed and saved in Unicode or specified
Miraplacid Text Driver
Miraplacid Text Driver 2007 for Windows 2000/XP/2003/Vista extracts text from any printable document. Accounting professionals, medical insurance companies, health care providers and many others use it to extract text from all kinds of document formats
PCLTool SDK
PCLTool SDK is a collection of tools to convert all levels of complex HP PCL text, raster and vector files into bitmap formats (TIF, in-memory DIB, PCX, DCX, BMP, XPS and PNG) or vector formats (in-memory GDI, .PDF, WMF and EMF) with matching TrueType
eePDF Image PDF to Word OCR Converter v.2.0
eePDF Image PDF to Word OCR Converter batch-converts scanned Adobe PDF documents and images to Word files. It converts scanned PDF to Word without requiring any Adobe product. PDF to Word keeps layout in convers
Scanned PDF to XML OCR Converter v.2.0
eePDF Scanned PDF to XML OCR Converter batch-converts scanned Adobe PDF documents to XML files. It converts scanned PDF to XML without requiring any Adobe product.
E-PDF To Word Converter
PDF To Word Converter is a PDF conversion tool that batch-converts Adobe PDF documents to Word files. It supports Win98, ME, NT, 2000, XP and 2003 systems and converts PDF to Word without requiring any Adobe product.
MiniPDF PDF To Word Converter
PDF To Word Converter is a PDF conversion tool that batch-converts Adobe PDF documents to Word files. It supports Win98, ME, NT, 2000, XP and 2003 systems and converts PDF to Word without requiring any Adobe product.
PDF-Analyzer
PDF-Analyzer is a tool that extracts all attributes from PDF files. You can use it from the Explorer context menu or as a stand-alone application. You can see all attributes/properties of a selected PDF file. The document information, e.g. titles, topic
Bytescout Document SDK for .NET v.1.00.88
Generate Word documents (DOC, DOCX) in ASP.NET, Visual Basic .NET and C# without MS Word installed. Bytescout Document SDK is a 100% managed .NET (1.1, 2.0 and higher) library for writing, reading and modifying documents (DOC, DOCX).