all software/

Text Extractor and Caterpillar - Html Extractor Software for Windows

Downloads and Reviews 1-10 of 10
Simply extract HTML and Text from any webpage
Extract,collect and manage text/html from web
Extract plain text directly from PDF files
Extract text from PDF file
$19.5 - Iconico

If you're frustrated that a website has unselectable text or if you need to view some HTML that's been encrypted then this application is a must. HTML Text Extractor is the simplest and easiest way to view and save the HTML and Text from any webpage.



$19.95 - WebTextExtractor

Tired of select-copy-paste from internet web pages to collect material? Web Text Extractor can help you! It can easily extract,collect,store and manage text/html from web pages,filter invisible characters and give you a clean text when necessary!



$59.95 - Retsina Software Solutions

PDF plain text extractor(P2T) extract plain text from pdf file without any help of PDF SDK, it deals with the raw file directly and focus on text extracting. P2T supports Chinese/Korean/Japanese/All European languages well.



$23 - A-PDF.com

A-PDF Text Extractor is an utility designed to extract text from Adobe PDF files for use in other applications. The program is a standalone application. No Adobe Acrobat needed.



 
HTML Text Extractor and Integrator

Caterpillar - HTML Extractor
$34 - Stormdance

Caterpillar rapidly extracts all the text requiring translation from multiple web pages to a single output file - so you can process/translate the extracted text segments in any way you like using other software. Then with another click of the mouse, your processed text is integrated back into the web pages.

By generating a single output file containing all the text requiring translation Caterpillar provides a simple way to incorporate web page localisation into your existing translation work flow.

Caterpillar is being used successfully with translation memory software such as WordFast - enabling you to translate web sites in the familiar surroundings of MS Word.



 
Converts PDF to Text

TEXTfromPDF
$44.95 - Salty Brine Software

TEXTfromPDF is a text extraction tool for WinXP/2000 that automates the conversion of Adobe PDF documents to text files. The PDFs may be on local drives, network drives, or on the Internet.

PDF files are great for exchanging formatted documents between people who don't use the same software. But sometimes we need to be able to take the text out of a PDF file and use it in Web pages, word processing documents, PowerPoint presentations, desktop publishing software, search and indexing applications, or in content management systems.

TEXTfromPDF provides access to the text content in PDF documents without requiring any Adobe product. The extracted content is saved to text files where it can be easily searched, archived, repurposed, and managed. A console version is included for script or batch file execution.

What matters most to you? Speed or Accuracy? TEXTfromPDF offers both using multiple extraction engines optimized for different purposes.

Need Speed?
Boasting the fastest text extractor available on the market, TEXTfromPDF's "Simple Formatting" option can process hundreds of PDFs in minutes. This option utilizes an extraction engine written in Assembly Language. Programming code written in this language executes at blazing speeds not normally attainable by code written in other languages.

Accuracy?
If faithful reproduction of the original PDF layout is required, TEXTfromPDF can provide amazingly accurate conversion results. This extraction engine has been refined over many years to produce text files that are as close to the original PDF layout as possible.



A powerful All-In-One file viewer and manager

FileSee
$29.95 - Filesee.com

FileSee is a powerful All-In-One file viewer. It is a tool that helps you to view files quickly. FileSee is a combination of a file manager, a text file viewer, an image viewer, a pdf viewer, a flash player, a video player, a mp3 player, a midi player, a hex file editor, a html viewer, a zip file decompresser, a filename batch renamer and manager, a desktop searching engine, a dll viewer, a wav to mp3 converter, a pdf text extractor.... So, FileSee is an All-In-One file viewer and a powerful file manager! FileSee's interface is similar to Windows Explorer. Supported file types include: txt, html, htm, mht, shtml, shtm, pdf, swf, jpg, jpeg, gif, tif, tiff, bmp, psd, vsd, png, ico, wmf, wmf, tga, pcx, wbmp, jp2, jpc, pgx, pnm, ras, j2k, midi, mid, mp3, wav, avi, mpg, mpeg, wma, wmv, asf, zip, rar, cab, gzip, jar, tar, bh, lha, zoo, ace, arc, arj, fxd, fxr, fxm, xls, xl, xlt, ppt, pps, pot, doc, dot, exe, dll, ocx, ax, rm, ram, ra, rmvb, rp, rpm, rt, wpl, wmx, wmd, wmz, wax, wvx, cda, rmi, aif, aifc, aiff, au, snd, dvr-ms, mpe, mlv, mp2, mpv2, mp2v, mpa, mov, qt, etc.


A powerful link extractor utility.

Web Link Extractor
$49.95 - Web Data Miner

Web Link Extractor is a powerful link extractor utility.
It extracts Link Text and Link URL from the web pages you specify.
And put the result into a Text/CSV file that you can open with NotePad or Excel.

Key Features:

* Easy fuzzy matching for URL
You may use URL fuzzy matching,
for example: http: //www.yahoo.com/groups/*/member/*.htm

* Filters for Link URL or Link Text
You may specify the min length of link text
You may specify the keywords that the link URL must include or exclude
You may specify the keywords that the link text must include or exclude

* Support Next Pages
It may craw the pages with Next Page link one by one until the last page

* Output Link URL or Link Text
The output data could be link URL,link text or both of them.
The output file format could be *.txt or *.csv
You can browse the data by NotePad or Excel

* Task Reuse
You may save your task parameters to a file and load it next time.


It searches for passwords in binary files.

Words extractor
$0 - Cubic Design

Words Extractor is a universal hacking tool that extracts (human) text from binary (machine) files. Is suitable for many purposes like finding a cheat in a game, finding hidden text or passwords in a file (exe, bin, dll), recovering corrupted documents (like Word, RTF), checking against suspicious software etc...

This program can be use virtually with any file in your computer. You can use it to separate the string that contains human text/words by binary code. Virtually it has an infinite number of usages.
This software was certified as free of spyware, free of addware, free of viruses by Softpedia

Example of usage:
1. Let's suppose you have a new game and you want to know find the cheats for this game. Drag and drop the file (for example 'Game.exe') in Words Extractor and press START. The program will remove all machine code (binary code) and will reveal only the text strings. Among this strings it is possible to find your cheats or even more information about the underneath of that game (for example comments of the programmers who made the game, path where files are kept, hints, possible messages and error messages for user or beta testers, etc).

2. Another use will be to recover a corrupted document. Let us suppose you had an important document on a floppy/CD/flash, stick and it became corrupted and you cannot open it any more. Drag your document in Words Extractor and it will separate the text in that document from the binary code. You will be able to recover in this way your text but you will lose the binary information like pictures, embedded objects or formatting.

3. Another use will be to check against a suspicious program. Let's suppose you just received an email with a program attached from a friend but at a second thought it seems that actually your friend didn?t personally sent that message to you but it was automatically send by a virus running in your friend's computer.
Drag and drop the suspicious program in Words Extractor...


Extract plain text from one or more PDF files

Midas Extractor
$14.95 - Surefire Software

Midas Extractor extracts plain text from Adobe PDF files and creates text files with the extracted content. Select multiple files or folders for text extraction with easy drag and drop or from file open type dialogs. Double click on summary list of extracted text files to view the text. Midas Extractor does not need any other PDF or Adobe tools to run.


© 2007-2008 Software Institute

Software Institute periodically updates pricing and product information from third-party sources,
so some information may be slightly out-of-date. You should confirm all information before relying on it.