Software Listing: Data Clustering
- High Dimensional Data Clustering (HDDC)
- License: Shareware

The High Dimensional Data Clustering (HDDC) toolbox contains an efficient unsupervised classifiers for high-dimensional data. This classifier is based on Gaussian models adapted for high-dimensional data. Reference: C. Bouveyron, S. Girard and C. Schmid, High-Dimensional Data Clustering, Computational Statistics and Data Analysis, to appear, 2007.
- Publisher: Charles Bouveyron
- Date: 06-02-2013
- Size: 51 KB
- Platform: Matlab, Scripts
- S-SOFM Toolbox
- License: Freeware
- Price: 0.00

The Spherical Self-Organizing Feature Maps is a contemporary technique for data clustering and visualization. The main advantages they offer are the following: 1. Smooth training 2. Implementation in arbitrary dimension without additional computational cost. 3. Data visualization in arbitrary dimensions This toolbox contains a set of functions and a GUI which can be used to create glyphs from spherical Self-Organizing Feature Maps (SOFMs). It is freely distributable for educational and research purposes. Example: % loads the data to be visualized load henon-1024-4.mat %loads the S-SOFM structure load c4-24.
- Publisher: Alexandros Leontitsis
- Date: 13-04-2013
- Size: 6328 KB
- Platform: Matlab, Scripts
- TTCL
- License: Freeware
- Price: 0.00

The Template Clustering Library: a C++ template-based library for data clustering tasks
TTCL License - GNU Library or Lesser General Public License version 3.0 (LGPLv3).
- Publisher: Ttcl
- Date:
- Platform: WinOther
- NeuroXL Clusterizer
- License: Shareware
- Price: 99.95

Clustering is known as best process of grouping the comments of similar kind in small size groups into bigger population. It even has extensive application in analytics of the business. The queries facing by businesses is that how to manage great sums of accessible information in significant structures. Or smash the huge heterogeneous populace in small size of harmonized and organized groups. The Analysis of cluster is an analytical tool of the data analysis which aims at cataloging dissimilar objects in diverse groups in the manner which the level of connection among 2 objects is max in the case if they belong to similar group as well as least if not.
- Publisher: OLSOFT
- Date: 26-07-2014
- Size: 2331 KB
- Platform: Win7 x32, Win7 x64, Windows 8, WinOther, WinVista
- Simulation of Ant Based Clustering Algorithm Based on Cemetery Organization (Lumer&Faeita Method)
- License: Shareware

Many ant species exhibit the behavior of clustering corpses to form cemeteries. Each ant seems to behave individually, moving randomly in space while picking up or depositing (dropping) corpses. The decision to pick up or drop a corpse is based on local information of the ant's current position. Lumer and Faeita applied this concepts in data clustering. In this code the process of moving, picking up/dropping patterns, etc in the algorithm is shown. After Generating sample patterns and setting the parameters you can run the process. For generating sample data, the number of clusters, features and patterns should be selected.
- Publisher: Heidar Rastiveis
- Date: 06-01-2013
- Size: 113 KB
- Platform: Matlab, Scripts
- Gene Expression Data Analysis Studio
- License: Freeware
- Price: 0.00

GEDAS is a software to perform microarray data analysis with friendly user interface and convenient data display. Currently some commonly used data clustering algorithms have been implemented in this software.
Gene Expression Data Analysis Studio License - GNU General Public License (GPL).
- Publisher: Gedas
- Date:
- Platform: WinOther
- NMath Stats
- License: Shareware
- Price: 495.00

NMath Stats is a .NET statistics library that provides functions for statistical computation and biostatistics, including descriptive statistics, probability distributions, combinatorial functions, multiple linear regression, hypothesis testing, analysis of variance, multivariate statistics, partial least squares, and nonnegative matrix factorization.
NMath Stats is a .NET statistics library containing classes for data manipulation, statistical computation, and biostatistics. Product features include:
A data frame class for holding data of various types (numeric, string, boolean, datetime, and generic), with methods for appending, inserting, removing, sorting, and permuting rows and columns.
- Publisher: CenterSpace Software
- Date:
- Size: 15052 KB
- Platform: Win7 x64, Windows 7, WinOther, WinVista, WinVista x64
- Event Tracking Software
- License: Shareware
- Price: 160.00

The event log file contains event messages along with their information such as event type, severity of the event, event status, event ID, event time and date and others. These Event Logs get modified by adding the event messages into them. There is a system component called Log Client that edits event messages for event logging. Event Logs generated by various system components are a very important as they contain valuable information which have been generated in real time. This information becomes important in the context of big enterprises where there is a huge network of computers and there is a great need of event tracking software.
- Publisher: Event Tracking
- Date: 02-08-2012
- Size: 43725 KB
- Platform: WinOther
- Iris data set clustering
- License: Shareware

Iris data set clustering using partitional algorithm. Concepts like loading text document and plotting of 4 Dimensional data with the fourth dimension as the intensity of colour of the plot. I used K means algorithm to update the centres from where we calculate the euclidean distance of the other points and group them after certain number of iterations. The code is well commented..... Plz rate the file...
- Publisher: Yella
- Date: 25-04-2013
- Size: 1085 KB
- Platform: Matlab, Scripts
- Clustering data and searching optimal cutoff employing VIF criterion
- License: Shareware

Performs hierarchical clustering of data using specified method and seraches for optimal cutoff empoying VIF criterion suggested in "Okada Y. et al - Detection of Cluster Boundary in Microarray Data by Reference to MIPS Functional Catalogue Database (2001)". Namely, it searches cutoff where groups are independent. The techinque uses an econometric approach of verifying that variables in multiple regression are linearly independent: if all the diagonal elements of inverse correlation matrix of data are less than VIF (as rule of thumb VIF=10). Searching procedure is the variaition of bisection method, so it's complexity is log(n) at most.
- Publisher: Denis
- Date: 23-06-2013
- Size: 10 KB
- Platform: Matlab, Scripts
- Gene Clustering Application
- License: Shareware

The Gene Clustering Application was developed as an accessible and handy graphical instrument that allows you to perform hierarchical clustering of gene expression data.
Gene Clustering Application is a tool that was developed with the help of the Java programming language and can run on multiple platforms.
.
- Publisher: Mark Weindling
- Date:
- Platform: WinOther
- Gait-CAD (Data Mining for MATLAB)
- License: Freeware
- Price: 0.00

The Matlab toolbox Gait-CAD is designed for the visualization and analysis of time series and features with a special focus to data mining problems including classification, regression, and clustering..
- Publisher: gait-cad.sourceforge.net
- Date: 27-10-2012
- Size: 13437 KB
- Platform: WinOther
- MATLAB spectral clustering package
- License: Freeware
- Price: 0.00

A MATLAB spectral clustering package to handle large data sets (200,000 RCV1 data) on a 4GB memory general machine. We implement various ways of approximating the dense similarity matrix, including nearest neighbors and the Nystrom method..
- Publisher: alumni.cs.ucsb.edu
- Date: 07-10-2012
- Size: 15437 KB
- Platform: Mac OS X, Unix, WinOther
- Multi-way Distributional Clustering
- License: Freeware
- Price: 0.00

Machine learning toolkit for unsupervised and semi-supervised clustering that demonstrates excellent results on real-world data (see Bekkerman et al. ICML-2005 and ECML-2006)..
- Publisher: comraf.sourceforge.net
- Date: 03-07-2012
- Size: 19 KB
- Platform: Linux, Unix
- Tungsten Clustering and Replication
- License: Freeware
- Price: 0.00

Tungsten is a family of open source technologies for database clustering and replication. Tungsten includes replication, management, SQL routing, and proxying that improve database availability, protect data, and raise application throughput..
- Publisher: tungsten.sourceforge.net
- Date: 23-08-2012
- Size: 27471 KB
- Platform: WinOther
- Clustering and Data Analysis Toolbox
- License: Freeware
- Price: 0.00

Nowadays due to the yearly multiplying data comes always the claim for useful methods and algorithms that make the processing of these data easier. For the solution of this problem data mining tools come into existence, to which the clustering algorithms belong. At the Department of Process Engineering of the University of Veszprem much research has been done on the clustering algorithms, many articles, publications and an MSc theme were published dealing with this topic. To unite all these information and knowledge a "Clustering and Data Analysis Toolbox" was needful. The purpose of this work was to compile a continuously extensible, standard tool, which is useful for any MATLAB user for one's aim.
- Publisher: Janos Abonyi
- Date: 23-01-2013
- Size: 2099 KB
- Platform: Matlab, Scripts
- Dp algorithm
- License: Shareware

a kind of usefull clustering algorithm that is better than kmeans and ward hierarchical clustering algorithms in some data sets.
- Publisher: liu chen
- Date: 14-05-2013
- Size: 164 KB
- Platform: Matlab, Scripts
- GVM Java Library
- License: Shareware

GVM, otherwise known as Greedy Variance Minimization was developed as a spatial clustering algorithm that can cope with extremely large data sets.
Now you can make use of this handy package to further improve your development process.
.
- Publisher: Tom Gibara
- Date:
- Platform: WinOther
- NeuroXL Package
- License: Shareware
- Price: 149.95

NeuroXL Package is a neural network toolkit for Microsoft Excel. It consists of NeuroXL Predictor and NeuroXL Clusterizer. NeuroXL Predictor add-in is a neural network forecasting tool that quickly and accurately solves forecasting and estimation problems in Microsoft Excel. It is designed from the ground-up to aid experts in solving real-world forecasting problems. NeuroXL Predictor interface is easy-to-use and does not require advanced knowledge of neural networks, and is integrated seamlessly with Microsoft Excel. Five transmission functions are now available to choose: Threshold, Hyperbolic tangent, Zero-based log-sigmoid, Log-sigmoid and Bipolar sigmoid.
- Publisher: OLSOFT
- Date: 26-07-2014
- Size: 4033 KB
- Platform: Win7 x32, Win7 x64, Windows 8, WinOther, WinVista
- Restorer2000 Data Recovery Software
- License: Demo
- Price: 29.99

Restorer2000 is a powerful data recovery and undelete software available on the market. It allows you to undelete a file, unerase, unformat and recover data from an NTFS, FAT partition even if it's damaged or deleted. It's a quick and easy data recovery solution. Drive Image creation is an important feature for data recovery from bad drives. Restorer 2000 is optimal alternative to high expensive data recovery service for readable hard disks..
- Publisher: BitMart Data Recovery Software, Inc.
- Date: 15-01-2003
- Size: 1350 KB
- Platform: WinOther









