Freeware Listing: Web Data Mining
- WebHarvest - web data extraction tool
- License: Freeware

Web data extraction (web data mining, web scraping) tool. It leverages well proved XML and text processing techologies in order to easely extract useful data from arbitrary web pages..
- Publisher: web-harvest.sourceforge.net
- Date: 27-10-2012
- Size: 6255 KB
- Platform: WinOther
- Web Data Grid
- License: Freeware

Web Data Grid is a component written in javascript and must be used in a html page to show a dinamic data grid. Its most important objective is to separate the data from the way to present them into the page: this purpose is reached loading the data from a special file in an analogous mode to that XML do, even if the tecnology utilized is different. Its principal features are: - division of table in several data-pages; - visualized values filtering; - job division between client and server; For large size databases it's possible to activate server-side functionality: the records number transferred to the browser correspond to those visualized for every page only; sorting and filtering operations are executed from server and works with the same previous point data trasferred division on the base of the records number for page.
- Publisher: Augusto Fiastrelli
- Date: 07-04-2013
- Size: 246 KB
- Platform: JavaScript, Scripts
- Microsoft SQL Server 2012 Data Mining Add-ins for Microsoft Office 2010
- License: Shareware

Microsoft SQL Server 2012 Data Mining Add-ins for Microsoft Office 2010 (Data Mining Add-ins) helps you take advantage of SQL Server 2012 predictive analytics in Office Excel 2010cand Office Visio 2010. The download includes the following components:
Table Analysis Tools for Excel: This add-in provides easy-to-use tasks that leverage SQL Server 2012 data mining models within Excel 2010 using either your spreadsheet data or external data accessible through your SQL Server 2012 Analysis Services instance.
Data Mining Client for Excel: By using this add-in, you can create, test, explore, and manage data mining models within Excel 2010 using either your spreadsheet data or external data accessible through your SQL Server 2012 Analysis Services instance.
- Publisher: Microsoft
- Date:
- Platform: Windows 7, WinServer, WinVista
- Web Data Shark!
- License: Shareware

Web Data Shark! is a simple and easy to use application that can find contacts and e-mail addresses from different search engines, enabling you to send bulk e-mails with just a few clicks.
The application comes with optional proxy support and provides high extraction speed. The usage is simple: just select the search engine, choose the keyword and the location and let the application do the rest!
.
- Publisher: Instant Leads
- Date:
- Platform: Win7 x64, Windows 7, WinOther, WinVista, WinVista x64
- WebScamin
- License: Shareware

WebScamin also contain ExpressionMiner plugin which enable users to build and evaluate custom-expression(including regular expression). Obtained results(such as prices, dates, phone numbers...) can be sorted and exported to csv file for further processing in MS Office or OpenOffice.
WebScamin uses a plugin architecture for the user defined web data mining and analyzing. In the future .NET developers can extend functionality of WebScamin by making their own plugins..
- Publisher: Jospin | Jozef B+itora
- Date:
- Size: 2938 KB
- Platform: Windows 7, Windows 8, WinOther, WinVista
- Web Scraper Plus+: Web Spider Edition
- License: Shareware

Build a custom web spider / web crawler using web data extraction / screen scraping technology. Use the web extract for web data mining of contact lists, product catalogs, government databases, real estate listings, or build a custom email extractor. Creating your own web grabber that can screen scrape the data to a database or Excel has never been easier. The website spider / crawlers can even sort and filter the webpages while spidering. Web Scraper Plus+: Web Spider Edition is the leading web data harvesting application. With Web Scraper Plus+, you can login to a secure website --> submit a search form --> crawl the results --> and scrape sections and fields of resulting html pages to rows and columns in your favorite spreadsheet or database.
- Publisher: velocityscape.com
- Date:
- Size: 52520 KB
- Platform: Win2000, Windows Server, WinOther
- Concurrent Versions Data Mining System
- License: Freeware

The Concurrent Versions Data Mining System (CVDMS) is a Web application designed to provide data mining of CVS repositories in the form of statistics and visualizations.
Concurrent Versions Data Mining System License - GNU General Public License (GPL).
- Publisher: Cvdms
- Date:
- Platform: WinOther
- SmarterStats Free Edition
- License: Freeware

SmarterStats is the perfect solution for providing intuitive web site statistics. SmarterStats application is the perfect solution for providing intuitive web site statistics. With intelligently organized reports, a distributed architecture, and no database requirements, SmarterStats provides an enterprise-level application without the enterprise-level price.SmarterStats is an application that essentially takes the information stored in the log files created by a web server and protrays that information in a graphical, easy-to-read format.It is also the most advanced web analytic software currently available, allowing system administrators to decrease the overall cost of providing web analytics to end users, and in most cases it will end up saving companies over 50% of the costs of competing products.
- Publisher: smartertools.com
- Date: 05-06-2009
- Size: 8335 KB
- Platform: WinOther
- CRMExplorer
- License: Freeware

CRMExplorer helps the SugarCRM user to get more transparency on all CRM objects and their interconnections: what we call structural visualization and network analysis. Using our innovative visualization methods such as DependencyGraph, TreeMap or ListView, you can browse and explore your SugarCRM data in a completely new way. Easy to use, flexible and highly interactive. 1) Get a complete overview of your SugarCRM data objects and connections. 2) Advanced visualizations for SugarCRM data. 3) Web-based and interactive user interface with Adobe Flash/Flex. 4) Data mining and ad-hoc analysis.
- Publisher: Orpheus GmbH
- Date: 02-09-2011
- Size: 69421 KB
- Platform: Win7 x32, Win7 x64, Windows Server, WinOther, WinVista, WinVista x64
- Vietspider Web Data Extractor
- License: Freeware

The web crawler is a program that automatically traverses the web by downloading the pages and following the links from page to page. A general purpose of web crawler is to download any web page that can be accessed through the links.
This process is called web crawling or spidering. Many sites, in particular search engines, use spidering as a means of providing up-to-date data. Web crawlers are mainly used to create a copy of all the visited pages for later processing by a search engine that will index the downloaded pages to provide fast searches. Crawlers can also be used for automating maintenance tasks on a website, such as checking links or validating HTML code.
- Publisher: VietSpider
- Date: 02-03-2011
- Size: 50000 KB
- Platform: Linux, Win2000, Windows 7, WinOther, WinVista
- Java Based Data Mining Library
- License: Freeware

The goal of this project is to provide java based libraries for core data mining algorithms. Most of the free implementations on the web are not robust/mature/scalable. This project aims at providing robust code that scales well for huge data sets.
Java Based Data Mining Library License - GNU Library or Lesser General Public License (LGPL).
- Publisher: Jminelib
- Date:
- Platform: WinOther
- XPath Scraper Basic
- License: Shareware

XScraper is a data mining software and XPath expression tester, especially useful in seo. Helpful when comes to scrape urls and other data from websites. Free version is available with basic functions: testing XPath expressions and for manual data scraping. Quick guide: in first textbox put website address (with homepage in second box put XPath expression (starting with // or /). Choose type of data TEXT for getting text, HREF for getting links. Last thing you need to do is to run scraper (push the green button). And done! Application available in Polish and English.
.
- Publisher: TechFormator
- Date:
- Platform: Win7 x32, Win7 x64, WinOther, WinVista, WinVista x64
- Ferda Data Miner x64
- License: Freeware

Ferda is a user friendly data mining tool. It is a modular distributed multiplatform framework based on Internet Communications Engine. Ferda is very powerful in working with association rules.
Data mining is the process of extracting patterns from data thus being a very important process in information analysis.
.
- Publisher: Martin Ralbovsky
- Date:
- Size: 13004 KB
- Platform: Win7 x64, WinOther, WinVista x64
- Ferda Data Miner
- License: Freeware

Ferda is a user friendly data mining tool. It is a modular distributed multiplatform framework based on Internet Communications Engine. Ferda is very powerful in working with association rules.
Data mining is the process of extracting patterns from data thus being a very important process in information analysis.
.
- Publisher: Martin Ralbovsky
- Date:
- Size: 13004 KB
- Platform: Windows 7, WinOther, WinVista
- AnySubmitter Free
- License: Freeware

AnySubmitter is a powerful web automation software. It is a All-In-One SEO tool for webmasters. With AnySubmitter, You can do web data extracting, web scraping, web content publishing, and any web automaticion task. Features: 1) Support Captcha breaking AnySubmitter Integrated decaptcher API that can automatically input the captcha field for you and save your time. 2) Support Content Spinning AnySubmitter Intergrated the best spinner API that can generated uniqued content for you, even the content you extracted from any other websiet. 3) Support Custom Submission Content with AnySubmitter you may design yourself submission content for your submission task through the XML file.
- Publisher: AnySubmitter
- Date: 01-06-2011
- Size: 10479 KB
- Platform: Win2000, Win7 x32, Windows Server, WinOther, WinVista
- IRobotSoft
- License: Freeware

IRobot is the Internet-robot engine for Visual Web automation and Web scraping. Completely automatic Web recording. Create your own IRobot Web robot agents to automate everything on the Web. Support Web data integration and computation. Share your Web experience with your friends. Batch form submissions. IRobot is all you need for Web computing. Test Web interfaces with our powerful automation engine.
Features of IRobot include:
1. Multi-thread robots surfing;
2. Internet robot recording;
3. Automated Web extraction, computation;
4. Batch data submission from databases;
5.
- Publisher: IRobotSoft
- Date: 23-08-2011
- Size: 1837 KB
- Platform: Win2000, Windows Server, WinOther
- WebExtractor360
- License: Freeware

WebExtractor360 is a free and open source web data extractor. It allows you to extract Images, Phrases, HTML Headers, HTML Tables, URLs (Links), URLs (Keywords), Emails, Phone, Fax and ANY other information on the web by specifying a Regular Expression. The web extractor software starts by crawling the specified web URL or any local file resource. All data that maps to the Match (Regular Expression) field will be returned as a result. Upon completion of the matching process for the specified URL, the crawler will continue to process other URLs that the specified URL links to. The entire process is repeated until the Maximun URL has been reached or there are no more URLs to process.
- Publisher: ConnectCode Pte Ltd
- Date: 09-11-2012
- Size: 767 KB
- Platform: WinOther
- Gait-CAD (Data Mining for MATLAB)
- License: Freeware

The Matlab toolbox Gait-CAD is designed for the visualization and analysis of time series and features with a special focus to data mining problems including classification, regression, and clustering..
- Publisher: gait-cad.sourceforge.net
- Date: 27-10-2012
- Size: 13437 KB
- Platform: WinOther
- Java Data Mining Package
- License: Freeware

The Java Data Mining Package (JDMP) is a library that provides methods for analyzing data with the help of machine learning algorithms (e.g. clustering, classification, graphical models, neural networks, Bayesian networks, text processing, optimization)..
- Publisher: jdmp.org
- Date: 16-07-2012
- Size: 1823 KB
- Platform: Linux, Mac OS X, WinOther
- RapidMiner -- Data Mining, ETL, OLAP, BI
- License: Freeware

No 1 in Business Analytics: Data Mining, Predictive Analytics, ETL, Reporting, Dashboards in One Tool. 1000+ methods: data mining, business intelligence, ETL, data mining, data analysis + Weka + R, forecasting, visualization, business intelligence.
- Publisher: rapid-i.com
- Date: 03-06-2012
- Size: 38605 KB
- Platform: Linux, Mac OS X, Unix, WinOther
Web Data Mining: Freeware | All














