



Build a custom web spider / web crawler using web data extraction / screen scraping technology. Use the web extract for web data mining of contact lists, product catalogs, government databases, real estate listings, or build a custom email extractor.
Download websites FAST! and navigate offline. Speed your navigation. Save time and money. Resume broken downloads. Uses project files to store list of websites.
Create proper Google and Yahoo sitemaps of the highest quality and accuracy with push-button ease. Unique and powerful web spider crawls your entire site beforehand, making absolutely certain there are no "dead ends" for web spiders.
Eliminate cut and paste! Web spider / web crawler using web data extraction / screen scraping technology. Use the web extract for web data mining of contact lists, product catalogs, govt. databases, real estate listings. Our easiest product yet!

AmiPic Lite
$0 - ALTom Soft
Have you ever tried to find interesting pictures, movies, mp3 or other files on Internet? If you do, you know how difficult it is! AmiPic is set of software tools designed primarily for boosting your web search experience.
AmiPic is fully customizable web spider, pictures finder, Internet search and download tool. It finds and downloads pictures, movies,mp3, other files and web pages matching your interests. It has file splitter, popup blocker and detailed text search in saved documents. When saving web pages on hard disk it automatically removes unsafe HTML fragments,such as scripts and applets. You can add new search engines manually without program modification. Numerous partial download options speed-up web sites processing. Some its features are listed below:
- Web pages and files search and download
- Custom search engines scripting lets you add and modify search engines manually
- Advanced download options to speed-up files retrieval
- Your own off-line search engine finds text in saved documents
- Ability to resume broken downloads
- Ability to block unwanted content sites on-the-fly
- Favorite sites for high priority processing

HTML Link Validator
$35 - Lithops Software
HTML Link Validator is a professional tool for checking web pages for broken links, on a web server or on your local computer. Whether a novice or an experienced webmaster, you should always test your web site for broken links. There are many reasons why a link may be broken. The file name may be typed incorrectly, or, as often happens, the resource (file) to which a link refers becomes unavailable. Case errors are also possible. For example, the link to the file "image.gif" while the real filename is "image.GIF". Fortunately, unlike some other validators, HTML Link Validator keeps case awareness in mind. It can catch most types of errors and gives you a full report on all links found in a particular web page or on a whole web site. It validates Internet Explorer and Netscape Navigator Favorites, Internet shortcuts, and lists of links; it can also validate links directly from MS Access databases. It can validate thousands of HTML documents at once. When validating pages and sites which are on a web server, anywhere on the Internet, Validator acts as a fully automated, multi-threaded, link-following web spider, and gives you a full report on links found in all scanned files and pages. You need only specify a starting address. If your files are located on your local computer, just double-click on the folder with your HTML files, and Validator will find all HTMLs in this folder and all nested folders, create a file list, mark the files with errors, display links in a convenient format, and allow you to edit them. It can even find and validate all HTMLs on your hard drive at the same time. Several report types are available (List of Bad Links, List of Redirected Links, etc.) in TXT, HTML, or Access format. HTML Link Validator can also create various file lists and find unused (orphaned) files on a web site. Unlike some other validators, HTML Link Validator has no limitation on number of links and pages it can validate.

dtSearch Web with Spider single-server
$999 - dtSearch Corp.
dtSearch Reviews
* "The most powerful document search tool on the market"-- Wired Magazine
* "dtSearch ... leads the market" -- Network Computing
* "Finding a virtual needle in a digital haystack is now much easier" -- MS OfficePRO
* "Blindingly fast" -- Computer Forensics: Incident Response Essentials
* "A powerful arsenal of search tools" -- The New York Times
* "Super fast, super-reliable" -- The Wall Street Journal
* "Covers all data sources ... powerful Web-based engines" -- eWEEK
* "Searches at blazing speeds" -- Computer Reseller News Test Center
See www.dtsearch.com for hundreds more reviews and case studies.
INSTANTLY SEARCH TERABYTES OF TEXT
The dtSearch product line can instantly search terabytes of text across a desktop, network, Internet or Intranet site. dtSearch products also serve as tools for publishing, with instant text searching, large document collections to Web sites or CD/DVDs. General features include:
* over two dozen indexed, unindexed, fielded and full-text search options
*highlights hits in HTML, XML and PDF while displaying embedded links, formatting and images
* converts other file types - word processor, database, spreadsheet, email and full-text of email attachments, ZIP, Unicode, etc. - to HTML for display with highlighted hits
* built-in Spider adds local or remote web sites (including dynamically-generated content) to your searchable database
Products include: dtSearch Desktop with Spider; dtSearch Network with Spider; dtSearch Web with Spider; dtSearch Publish for CD/DVDs; dtSearch Text Retrieval Engine for Win & .NET; dtSearch Text Retrieval Engine for Linux.
See www.dtsearch.com for hundreds of reviews and developer case studies, and to download fully-functional evaluations.
dtSearch is the Smart Choice for Text Retrieval® since 1991.

RafaBot
$49 - Spadix Software
Bulk website downloader. Download website from a starting URL, search engine results or web dirs.
RafaBot is a high-speed, multi-threading, large scale web spidering robot. An ideal tool for bulk website download. It can download website from a starting URL, search engine results or web dirs and able to follow external links. It can process either a single website or many thousands of websites in one session.
RafaBot saves all downloaded files in user hard drive in native format and user can view / research / extract them off-line without being connected to the Internet. It has numerous filters to restrict download like - URL filter, date modified, text, file type, file size, etc. It allows user-selectable recursion levels, retrieval threads, timeout, proxy support and accesses password-protected sites.

Chilkat Spider ActiveX
$0 - Chilkat Software, Inc.
Crawl a website with this free ActiveX spidering component. Advanced features include caching, "avoid" patterns, and robots.txt compliance.

dtSearch Desktop with Spider
$199 - dtSearch Corp.
dtSearch Reviews
* "The most powerful document search tool on the market"-- Wired Magazine
* "dtSearch ... leads the market" -- Network Computing
* "Finding a virtual needle in a digital haystack is now much easier" -- MS OfficePRO
* "Blindingly fast" -- Computer Forensics: Incident Response Essentials
* "A powerful arsenal of search tools" -- The New York Times
* "Super fast, super-reliable" -- The Wall Street Journal
* "Covers all data sources ... powerful Web-based engines" -- eWEEK
* "Searches at blazing speeds" -- Computer Reseller News Test Center
See www.dtsearch.com for hundreds more reviews and case studies.
INSTANTLY SEARCH TERABYTES OF TEXT
The dtSearch product line can instantly search terabytes of text across a desktop, network, Internet or Intranet site. dtSearch products also serve as tools for publishing, with instant text searching, large document collections to Web sites or CD/DVDs. General features include:
* over two dozen indexed, unindexed, fielded and full-text search options
*highlights hits in HTML, XML and PDF while displaying embedded links, formatting and images
* converts other file types -- word processor, database, spreadsheet, email and full-text of email attachments, ZIP, Unicode, etc. -- to HTML for display with highlighted hits
* built-in Spider adds local or remote web sites (including dynamically-generated content) to your searchable database
Products include: dtSearch Desktop with Spider; dtSearch Network with Spider; dtSearch Web with Spider; dtSearch Publish for CD/DVDs; dtSearch Text Retrieval Engine for Win & .NET; dtSearch Text Retrieval Engine for Linux.
See www.dtsearch.com for hundreds of reviews and developer case studies, and to download fully-functional evaluations.
dtSearch is the Smart Choice for Text Retrieval® since 1991.
© 2007-2008 Software Institute
Software Institute periodically updates pricing and product information from third-party sources,
so some information may be slightly out-of-date. You should confirm all information before relying on it.