Freeware Listing: Text Extraction
- teXtracta
- License: Freeware

Extracts textual content from standard Windows documents such as images, PDFs, and Word documents, and renders it as unformatted text. Ideal for import into databases, search engines, etc. Runs on most Windows systems.
- Publisher: teXtracta
- Date: 06-07-2011
- Size: 309 KB
- Platform: Win2000, Windows Server, Windows Vista, WinOther
- Web Text eXtraction and analysis Tools
- License: Freeware

Web Textual eXtraction Tools: a C++ parallel web crawler, noun phrase identification, multilingual part-of-speech tagging, Tarjan's algorithm, co-relationship mappings...
Web Text eXtraction and analysis Tools License - GNU General Public License (GPL).
- Publisher: Wtxt
- Date:
- Platform: WinOther
- BioEval
- License: Freeware

BioEval is a web-based evaluation platform for one or multiple biomedical text extraction systems and for the collaborative creation of gold standards. It handles evaluation of the extraction of one to many related entities, plus supporting evidence and normalization.
BioEval License - Academic Free License (AFL).
- Publisher: Bioeval
- Date:
- Platform: WinOther
- textkit4j
- License: Freeware

Provides a set of tools for processing text, such as text extraction and classification. Planned classification implementations include Bayesian and statistical (N-gram) classifiers.
textkit4j License - GNU General Public License (GPL).
- Publisher: Textkit4j
- Date:
- Size: 18 KB
- Platform: Linux, Mac OS X, WinOther
- TextMarker
- License: Freeware

The TextMarker system is a rule-based tool designed for information extraction and text processing tasks. The comprehensible rule language can be easily extended and supports several scripting features. TextMarker uses DLTK and UIMA.
- Publisher: Peter Klügl
- Date:
- Platform: WinOther
- Detexter
- License: Freeware

Detexter was designed as a simple and useful application that allows the user to extract text from PDF files.
Detexter uses the PDFBox library for its text extraction. The application is written in Java.
- Publisher: Stephen J Haggai
- Date:
- Platform: WinOther
- AXPDF PDF to TEXT Converter
- License: Freeware

AXPDF PDF to Text Converter is a stand-alone program for routine PDF-to-text conversion. It converts PDFs directly to ASCII text files. There is no need to install a PDF reader or Adobe Reader on the PC, and no user training is needed. Click the CONVERT button to get the extracted text file. Double-clicking the text file launches Windows Notepad so you can view and modify the content. You no longer need to reformat or retype long documents manually.
- Publisher: autodwg.com
- Date:
- Size: 6912 KB
- Platform: Win2000, Windows Server, WinOther, WinVista
- Miraplacid Text Driver SDK TE
- License: Freeware

Miraplacid Text Driver SDK generates a virtual printer driver with all the functionality of Miraplacid Text Driver Terminal Server Edition. You can customize it and embed it into your software. With a driver generated by Miraplacid Text Driver SDK TE, you can save the extracted information as plain text, formatted text, XML, or RSS for later processing, in all installed code pages and Unicode. After installation on your PC, the generated driver appears as a new virtual printer in your system. All documents you print to this "printer" are accessible from your software via a COM interface.
- Publisher: Miraplacid
- Date: 31-03-2018
- Size: 6759 KB
- Platform: Win7 x32, Win7 x64, WinOther, WinVista, WinVista x64, WinXP, Other
- Miraplacid Text Driver SDK
- License: Freeware

Miraplacid Text Driver SDK generates a virtual printer driver with all the functionality of Miraplacid Text Driver. You can customize it and embed it into your software. With a driver generated by Miraplacid Text Driver SDK, you can save the extracted information as plain text, formatted text, XML, or RSS for later processing, in all installed code pages and Unicode. After installation on your PC, the generated driver appears as a new virtual printer in your system. All documents you print to this "printer" are accessible from your software via a COM interface. You can browse through the document page by page, and get and modify the extracted text.
- Publisher: Miraplacid
- Date: 31-03-2018
- Size: 6752 KB
- Platform: Win7 x32, Win7 x64, WinOther, WinVista, WinVista x64, WinXP, Other
- Corrupt Office Salvager
- License: Shareware

This program extracts text and data from damaged or corrupted Microsoft Office and Open Office 2.x and 3.x files that can no longer be opened, with the extensions .doc, .docx, .xls, .xlsx, .ppt, .pptx, .odt, .ods, and .odp. It may also handle the template and macro variants of these formats, such as .dot, .xlt, and .pps, if they are renamed to the corresponding extensions listed above. It may succeed where MS Office or Open Office fails to salvage text. It can also attempt to recover formatting, but only for Open Office files. At this time there is no facility for recovering anything but basic formatting for MS Office files through the text extraction described above.
- Publisher: S2 Services
- Date:
- Size: 49725 KB
- Platform: Win2000, Win7 x32, Win7 x64, WinOther, WinVista, WinVista x64
- YKConverter
- License: Shareware

YKConverter is a utility that tries to extract the text from documents in various formats (HTML, Word, PDF, PowerPoint, Excel).
YKConverter can then save the text as UTF-8 encoded text for subsequent content analysis.
- Publisher: Will Lowe
- Date:
- Platform: WinOther
- HTML Parser
- License: Freeware

HTML Parser is a Java library designed for HTML transformation or extraction. It features filters, custom tags, visitors, and easy-to-use JavaBeans, and is described as a robust, fast, and well-tested package.
The two fundamental use cases handled by the parser are extraction and transformation (the synthesis use case, where HTML pages are created from scratch, is better handled by other tools closer to the source of data).
In general, to use HTML Parser you need to be able to write code in the Java programming language.
- Publisher: Derrick Oswald
- Date:
- Platform: WinOther
- TextCaptureX
- License: Shareware

TextCaptureX is a COM library that allows screen text extraction in Windows applications. It is accessible from any COM-aware programming language. You can use it to extract text from any application that doesn't provide a communication API, in order to feed another program. You can also use it to extract text from legacy systems, file directories, status bar messages, Windows error messages, and more. TextCaptureX is not OCR-based, so it is fast, which makes it convenient to embed into dictionaries or translation tools. For example, your customer captures any text on screen with a hotkey or a mouse click, even when copy/paste is not available, and your dictionary pops up with the text already translated or explained.
- Publisher: deskperience.com
- Date:
- Size: 8663 KB
- Platform: WinOther
- Smart OCR:Text Miner
- License: Freeware

Smart OCR:Text Miner lets you convert images to text, extract text from PDFs, edit and share text, perform text mining, and save the results for later sharing, all in one process. It helps you extract useful data from unstructured text obtained from OCR or another rich text source. The text mining not only extracts data from the OCRed text but can also extract page text and title information from any web page.
Text/Data Mining Function:
Beyond OCR, the application provides text mining: you can first OCR a page of rich text and then run text extraction, concept extraction, entity extraction, keyword extraction, and sentiment extraction on it.
- Publisher: Appstyle Pte Ltd
- Date:
- Size: 12288 KB
- Platform: Android, WinMobile
- All Free OCR
- License: Freeware

All Free OCR is a document management aid for companies and individual users. It extracts text from images, scanned papers, and scanned PDF documents to eliminate the need for retyping. Because it runs locally, you no longer have to wait for an online OCR service over a slow internet connection: select the input image and All Free OCR does the rest. It recognizes text and characters in scanned PDF documents, photographs, faxes, and digital camera images, and saves the result as editable and searchable text, such as DOC and TXT.
- Publisher: AllFreeVideoSoft
- Date: 20-08-2014
- Size: 7134 KB
- Platform: Win2000, Win7 x32, Win7 x64, Windows Server, Windows Vista, WinOther, WinVista, WinVista x64
- mini Scan to Excel OCR Converter
- License: Freeware

mini Scan to Excel OCR Converter converts scanned PDF files, normal PDF files, and scanned image files to editable Excel documents. It batch-converts scanned documents to editable MS Excel documents, so you can reuse tables and spreadsheets from PDF files in Microsoft Excel, OpenOffice, Google Docs, and WordPerfect Office.
mini Scan to Excel OCR Converter includes a Document Imaging (Scan and Edit Documents) application with scanning and editing features.
mini Scan to Excel OCR Converter supports following conversion options:
1.
- Publisher: miniPDF.com, Inc.
- Date:
- Size: 12994 KB
- Platform: WinOther
- AZ TGA to PDF Converter
- License: Shareware

AZ TGA to PDF Converter is an affordable, easy-to-use image-to-PDF converter that turns your photos, scans, faxes, and drawings into PDF documents. Just add images in TGA format and click the "Make PDF" button, and the software converts them directly to a PDF file. You can convert up to 2000 images at a time.
The application lets you specify all conversion parameters in detail, or you can let the software set them up automatically.
- Publisher: A-Z PDF Inc
- Date:
- Size: 1894 KB
- Platform: WinOther
- AZ WMF to PDF Converter
- License: Shareware

AZ WMF to PDF Converter is a quick and easy-to-use PDF converter for turning batches of images and photos in WMF format into PDF documents with one click. You can add up to 2000 images to the conversion list. The application can set up all necessary parameters automatically.
- Publisher: A-Z PDF Inc
- Date:
- Size: 1894 KB
- Platform: WinOther
- AZ PCX to PDF Converter
- License: Demo

AZ PCX to PDF Converter is a quick and easy-to-use PDF converter for turning batches of images and photos in PCX format into PDF documents with one click. A conversion task can include up to 2000 images. The application lets you specify all conversion parameters in detail, or you can let the software set them up automatically.
- Publisher: A-Z PDF Inc
- Date:
- Size: 1894 KB
- Platform: WinOther
- HexDump32
- License: Shareware

HexDump32 is a small Win32 utility which opens up a file and displays its data in hexadecimal and ASCII format.
Note: HexDump32 is not an editor. The binary contents of files can be viewed but not modified.
The dump has the segment offset on the left, hex values in the center, and the ASCII interpretation on the right. Characters that are not readily displayable (outside ASCII 32 - 126) are shown as a dot in the ASCII column.
You can copy to the Windows clipboard from the display frame of the dump by selecting the text and using the Ctrl-C keyboard shortcut.
- Publisher: Salty Brine Software
- Date:
- Size: 1648 KB
- Platform: Win2000, Windows 7, WinOther, WinServer, WinVista
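The dump layout described for HexDump32 (offset on the left, hex values in the center, ASCII on the right, with a dot for bytes outside 32 - 126) can be illustrated with a minimal Python sketch. This is not HexDump32's own code; the function name `hex_dump`, the 16-byte row width, and the 8-digit offset are assumptions made for illustration only.

```python
def hex_dump(data: bytes, width: int = 16) -> str:
    """Format bytes as offset, hex values, and ASCII columns (illustrative only)."""
    lines = []
    for offset in range(0, len(data), width):
        chunk = data[offset:offset + width]
        # Center column: two uppercase hex digits per byte, space separated.
        hex_part = " ".join(f"{b:02X}" for b in chunk)
        # Right column: printable ASCII (32..126) as-is, everything else as a dot.
        ascii_part = "".join(chr(b) if 32 <= b <= 126 else "." for b in chunk)
        # Pad the hex column so the ASCII column lines up on a short last row.
        lines.append(f"{offset:08X}  {hex_part:<{width * 3 - 1}}  {ascii_part}")
    return "\n".join(lines)


if __name__ == "__main__":
    print(hex_dump(b"Text Extraction\x00\x01\x02 sample"))
```

Like the viewer described above, the sketch only reads and formats the bytes; it never modifies them.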