
Top : Computers : Software : Databases : Data Mining :
Public Domain Software
Websites
A Microsoft Access application designed to provide tools to explore your databases with graphs and queries. It is also a quick way to generate/prototype Access Graphs without running the Wizards.
site exerpt
Data Mining Access book will show you how and why you need to combine options, menus, workgroup security and the operating system to protect your database. Graf-FX Explore your data with this versatile graphing and data mining shareware tool. The Toolshed Fully...A public-domain software package for the interactive visual exploration of multivariate data sets. It is available on all UNIX platforms which support XR4 or higher. The current version of the software (3.1) supports scatterplots, star glyphs, parallel coordinates, and dimensional stacking.
site exerpt
XmdvTool Home Page: Overview It is available on all major UNIX/LINUX/MAC and Window platforms. XmdvTool is developed based on OpenGL and Tcl/Tk. It supports four methods for displaying flat form data and hierarchically clustered data: Scatterplots Star Glyphs Parallel Coordinates Dimensional Stacking XmdvTool also...The VisDB has been developed to support the exploration of large database. The VisDB system implements several visual data mining techniques, allowing an exploration of large databases (up-to about one million data values).
site exerpt
VisDB homepage B has been developed to support the exploration of large database. The VisDB system implements several visual data mining techniques, allowing an exploration of large databases (up-to about one million data values The techniques supported by the system are Pixel-oriented...Model based clustering and discriminant analysis, including hierarchical clustering and EM. Developed at University of Washington.
site exerpt
Model-Based Clustering Software Fortran and interfaced to the S-PLUS commercial software package and the R language. Use of the MCLUST software requires a license agreement. NOTE: the function hc for hierarchical clustering in MCLUST supersedes the function mclust in commercial S-PLUS 2002 version...Uses the Minimum Message Length (MML) principle to do mixture modeling. Mixture modeling concerns modeling a statistical distribution by a mixture of other distributions, and is also known as unsupervised concept learning in Artificial Intelligence. Links to related research papers and software.
http://www.cs.monash.edu.au/~dld/Snob.html
Software suite which has algorithms for association rules, building classifiers, and clustering data from relational database products using JDBC. References to related articles, and research papers.
site exerpt
research The main working area of our group lies in the Information Systems, specially in the framework of the Databases, as well as the handling of imprecise information in such environments. The use of tools coming from the fields of Artificial...A Software system for data mining based on rough set theory. GUI based operation on MS Windows platforms, with a wide variety of algorithms. Information on features, documentation, utilities, and upcoming releases.
site exerpt
ROSETTA Redirecting to http rosetta.lcb.uu.se...A multi-lingual toolkit for various decision tree algorithms with C++ libraries. Available for free download for a variety of platforms.
site exerpt
ISoft Products; Data Mining Softwares. AC2 K compliance consulting training order AC2 tests all possible combinations of database fields to find the criteria that answer the question you put to your data best and ranks all relevant criteria along a decision tree Byte Magazine Find an...A decision tree tool that automatically sifts large, complex databases, searching for and isolating significant patterns and relationships. Offers free limited capability demo for download, product features, applications, user feedback, and associated books.
site exerpt
Salford Systems Please click here to continue if you are not redirected within three seconds....Includes source code, related research papers and associated work.
site exerpt
FDEP Home Page The program includes three algorithms for computing functional dependencies from relations: a) simple top-down algorithm, b) bottom-up algorithm, and c) bi-directional algorithm. Source code FDEP is implemented in GNU C (version 2.7.2.3 FDEP distribution contains the source code, man pages,...Client-server Java based data mining software for mining association rules. Developed at University of Massachusetts.
site exerpt
ARMiner Info Index Miner has been written in Java and it is distributed under the GNU General Public License. ARMiner has been developped at UMass/Boston as a Software Engineering project in Spring 2000. Last ARMiner Server version is 1.0a (12/05/2001 Last ARMiner Client...Environment for statistical computing and dynamic graphics based on Lisp. Contains contributed code and submission instructions.
http://lib.stat.cmu.edu/xlispstat/
MLC++ is a standard C++ library for supervised machine learning, with back-end and front-end tools for data mining tasks like Decision Trees, and Clustering. Information on legal issues, mailing lists, history, standards, platform support, and download instructions.
site exerpt
SGI MLC Home Page C classes for supervised machine learning. The MLC utilities were created using the library. MLC up to version 1.3.X) was developed at Stanford University and was public domain; that version is still distributed as such by SGI. SGI MLC V2.0...Collection of standalone data mining programs, available as scripts or Java programs. Contains information for downloading, installation, data preparation, and operating instructions. Supplementary to the book titled Predictive Data Mining - A Practical Guide.
site exerpt
Data-Miner Software Kit All examples are for the iris data set, a sample of 150 cases that is included in the package. DMSK Java Version Take advantage of the latest software technology with a Java version of DMSK. Run DMSK as a JDK1.1...A freely available software toolkit for clustering low- and high-dimensional data sets. It is well-suited for clustering data sets arising in many areas including information retrieval, customer purchasing transactions, science, and biology.
http://www.cs.umn.edu/~karypis/cluto
Open Source creation of a data mining C++ procedure library. Initially focused on mining generalised association rules and generalised sequential patterns
site exerpt
QuickMiner Miner project is to build a C library for data mining with an emphasis on speed. This is a critical component of the CodeWeb system. Initial work will concentrate on mining generalised association rules and generalised sequential patterns. These techniques,...Written in Python, the toolbox handles caching of database queries and parallelism within a collection of independent queries. Our toolbox provides a number of routines for basic data mining tasks on top of which the user can add more functions - mainly domain and data collection dependent - for complex and time consuming data mining tasks. GNU/GPL. From the Computer Sciences Laboratory of The Australian National University
http://cslab.anu.edu.au/ml/dm/dm_software.html
By David Chickering at Microsoft Research. The WinMine Toolkit is a set of tools for Windows 2000/NT/XP that allow you to build statistical models from data. The majority of the tools are command-line executables that can be run in scripts.
site exerpt
WinMine Toolkit Home Page Toolkit is a set of tools for Windows 2000/NT/XP that allow you to build statistical models from data. The majority of the tools are command-line executables that can be run in scripts. Click here or on the icon to see...Includes source code, related papers and associated projects.
site exerpt
MDEP Home page Program for the discovery of multivalued dependencies from relations I.Savnik, P.A.Flach Introduction Dependencies between attributes of a database relation express the presence of structure in that relation. In particular, the existence of a multivalued dependency X Y in a relation...An unsupervised Bayesian classification system that seeks a maximum posterior probability classification.
http://ic-www.arc.nasa.gov/ic/pr...toclass/autoclass-c-program.html
Open source software for extraction and reporting using a powerful template tool. Deft combines declarative concepts of SQL with all of Perl's features. Requires Linux and Perl
site exerpt
defindit.com Declarative programming, templates, data analysis, and consulting Additional open source software Deft Downloads at SourceForge Noah Healy Declarative Data Mining, and Templated Report Generation. API documentation Implementation details Introduction to Deft concepts Template loop control (with illustrations) Template description (older) Deft is table oriented declarative programming. Deft...A small Excel based freeware to build Classification Tree models in Excel. Uses C4.5 algorithm. Very easy to learn and use - but capability is limited.
site exerpt
Classification Tree in Excel If you are interested in tree based clasification models here is an application in Excel which lets you build such a model in Excel. Classification Tree Decision Tree in Excel 314 KB in Zipped format. 800 KB when unzipped A...Modelling tool that analyzes data generating classification, regression or class probability prediction models.
site exerpt
Shih Shih focuses on predictive modeling through the software Shih Data Miner, shortly know as Shih. Shih builds models in an easy way. Shih runs on any platform, it's simple and is addressed to the end user, not necessarily a statistician....Data Mining applications developed with Visual Basic or the .NET Framework by Kingsley Tagbo, including Naive Bayes Classifiers. Site provides public domain data mining applications with source code and online documentation. The latest release as of October 2002 is 'Visual Basic Data Mining With Naive Bayes' and '.NET Data Mining With Naive Bayes'.
http://www.visual-basic-data-mining.net
Platform- and data-source-independent library for embedded data mining based on the CWM/OMG and other data mining standards. XELOPES-Java algorithms: SVMs, market basket analysis, sequence analysis, decision trees, cluster analysis, multidimensional grouping. XELOPES-C++ algorithms: SVMs, decision trees. [GPL]
site exerpt
prudsys AG Mining standards and can be combined with all prudsys products. A trial version of the XELOPES Library is available for download Download) Application fields Integration of prudsys models into user applications All Data Mining products developed by prudsys (as well...A freely available software toolkit for finding frequent patterns in diverse datasets. It contains highly efficient algorithms for finding patterns in transactional, sequential, and graph datasets.
http://www.cs.umn.edu/~karypis/pafi/
Windows software tool that induces rules using the PNC2 cluster algorithm. An integrated parameter tuning component allows an easy adjustment of the algorithm's behaviour to the given problem without any further knowledge. [Gnu GPL]
site exerpt
The PNC2 Rule Induction System Index System is a free machine learning software tool, that automatically induces rules from your data using the newly invented PNC2 Cluster Algorithm. An integrated parameter tuning component allows an easy adjustment of the algorithm's behaviour to your particular data without...Frequent itemset and association rule mining implementations (C++) such as Apriori, Eclat and FP-growth.
http://www.adrem.ua.ac.be/~goethals/software/index.html
GPL C/C++ software for data analysis of discrete data using principal/independent component methods. Examples are DPCA, LDA, GaP (like PLSI and NMF). Targetted at text, with MPI and multithreading.
http://cosco.hiit.fi/search/MPCA/
Source code for program for creation of hierarchical classification trees. Information about implementations, documentation, and related research papers.
site exerpt
ECOBWEB concept formation program B concept formation program ECOBWEB is a concept formation program for the creation of hierarchical classification trees. It implements several extensions to Fisher's COBWEB program. In particular, it can work well with numeric attributes, it can perform simple constructive induction,...