Software
Software Links
- TiMBL: Tilburg Memory Based Learner TiMBL is a program implementing several Memory-Based Learning techniques (these learners store a representation of the training set explicitly and classify new cases by extrapolation from the most similar stored cases). [Free for educational or non-commercial research purposes]
- BETSY - Bayesian Essay Test Scoring sYstem BETSY is a freeware Windows-based program that classifies text based on trained material. Designed for automated essay scoring, BETSY can be applied to any text classification task.
- Meta-MEME v2.0.1 Software toolkit for building and using motif-based hidden Markov models of DNA and proteins - from the Univ. of California-San Diego.
- SAM: Sequence Alignment and Modeling System SAM is a collection of tools for creating and using HMMs for biological sequences. Free license for academic and nonprofit use.
- Weka 3 - Open Source Machine Learning Software in Java Suite that implements decision trees and tables, rule learners, Naive Bayes, support vector machines, voted perceptrons, and multi-layer perceptrons. Meta schemes include bagging, stacking, and boosting. [Free under GPL]
- HMMER - Profile hidden Markov models for biological sequence analysis Sean Eddy's tools to build HMMs from multiple alignments, align sequences to HMMs, calculate e-scores, and other tools.
- SUBDUE: Knowledge Discovery in Structural Databases The program discovers interesting and repetitive subgraphs in a labeled graph representation using the minimum description length principle. Applications to molecular biology. [Free]
- LNKnet Pattern Classification Software LNKnet is a software package developed at MIT Lincoln Laboratory which integrates more than 20 neural network, statistical, and machine learning classification, clustering, and feature selection algorithms into a modular software package. [Public domain license]
- MIX Software for learning Mixture Distributions. Commercial license.
- LIBSVM - A Library for Support Vector Machines LIBSVM is an integrated tool implementing support vector classification, regression, and distribution estimation. C++ and Java sources available. [Free]
- The RIPPER rule learner A system for learning rules from data. [Free for research purposes]
- The MEME/MAST system MEME (Multiple EM for Motif Elicitation) is a program for discovering motifs (highly conserved regions) in groups of related DNA or protein sequences. MAST (Motif Alignment and Search Tool) is a tool for searching biological sequence databases for sequences that contain one or more of a group of known motifs.
- Software Packages for Graphical Models / Bayesian Networks A list of software packages for Graphical Models / Bayesian Networks. Some have learning capabilities.
- An AI learning system A description of an AI system, with a demonstration program and Delphi source code available.
- The BUGS Project - Bayesian inference Using Gibbs Sampling BUGS is a piece of computer software for the Bayesian analysis of complex statistical models using Markov chain Monte Carlo (MCMC) methods. Software for several platforms is available for downloading, and all the manuals are also available online or in downloadable formats.
- EM algorithm for Mixture models Shotaro Akaho's implementation of the EM algorithm for modeling mixtures of Gaussians (in Java). Free. An extended version is available from the author.
- Machine learning programs by Peter Clark QM: Guiding inductive learning with a Qualitative Model. LPE: Lazy Partial Evaluation. CN2: Rule induction from examples. Free.
- Bayes Net Toolbox for Matlab Supports several inference algorithms (e.g. junction tree, Pearl, variable elimination) and learning algorithms (e.g. EM, MCMC, structure learning). Allows simulation of static and dynamic networks, including HMMs, IOHMMs, and Kalman filters.
- The CHILL empirical parser acquisition system CHILL is a general approach to the problem of inducing natural language parsers. It uses an annotated corpus, and produces a parser by using ILP for inducing the rules that control the actions of a shift-reduce parser. [Free]
- AutoClass AutoClass takes a database of cases described by a combination of real and discrete valued attributes, and automatically finds the natural classes in that data. It can be seen as a Naive Bayes classifier where the class node is hidden. [Free]
- Incremental Decision Tree Induction An algorithm that incrementally constructs decision trees from labeled examples. [Free for individual research purposes]
- The R Project for Statistical Computing R, also known as `GNU S', is a free system for statistical computation and graphics similar to S. It provides a wide variety of statistical and graphical techniques (linear and nonlinear modelling, statistical tests, time series analysis, classification, clustering, etc.).
- C4.5 and FOIL Home page of R. Quinlan. FTP links to FOIL (inductive logic programming) and C4.5 (learning decision trees).
- SVM-Light: Support Vector Machine Software Training Software for Large-Scale SVMs. [Free for non-commercial use]
- MLC++ MLC++ is a library of C++ classes for supervised machine learning. Provided by SGI. [Free for internal research purposes]
- The NEITHER Theory Revision System A propositional theory refinement system that modifies an incomplete or incorrect rule base to make it consistent with a set of input training examples. [Free]
- Pfam A large collection of multiple sequence alignments and trained hidden Markov models covering many common protein domains.
- The `Bow' Toolkit for statistical language modeling, text retrieval, classification, and clustering Bow (or libbow) is a library of C code useful for writing statistical text analysis, language modeling and information retrieval programs. The current distribution includes the library, as well as front-ends for document classification (rainbow), document retrieval (arrow) and document clustering (crossbow). [Free]
- HMM and other statistical programs This page offers an implementation of Hidden Markov Models and an application to part-of-speech tagging. Also available are multivariate hypothesis testing software for Gaussian data and TRUEVIZ, a groundtruth/metadata editing and visualizing toolkit for OCR.
- FastMix FastMix generates Gaussian mixture models for large datasets using efficient EM clustering algorithms. [Free]
- WHIRL Distribution Page A Word-based Information Representation Language by W. Cohen [Free for non-commercial purposes]
- Statistical Decision Trees A program for inducing Bayesian decision trees. Applications to speech. [Free]
- PRODIGY system An architecture for planning and learning. [Free]
- SNoW SNoW is a learning architecture specifically tailored for learning in very high-dimensional feature spaces. The current release uses sparse variations of Winnow, Perceptron, and Naive Bayes. [Free for personal academic and research purposes]
- Machine Learning Packages from the CMU Artificial Intelligence Repository Links to external FTP sites. Systems include ACCEL, CLASSWEB, FOCL, FOIL, GOLEM, INDEX, MILES, MOBAL, OC1, Occamn, PEBLS, RWM.
- WinMine Toolkit Tools for learning dependency networks or Bayesian networks from data. [Free]
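
The TiMBL entry above describes memory-based learning as storing the training set explicitly and classifying new cases from the most similar stored cases. A minimal sketch of that idea follows, assuming symbolic features, a simple overlap distance, and majority voting among the k nearest cases. The names `overlap_distance` and `MemoryBasedClassifier` are hypothetical and chosen for illustration; this is not TiMBL's implementation or API.

```python
from collections import Counter


def overlap_distance(a, b):
    """Count the feature positions where two equal-length cases differ."""
    return sum(1 for x, y in zip(a, b) if x != y)


class MemoryBasedClassifier:
    """Illustrative k-nearest-neighbour learner (hypothetical, not TiMBL)."""

    def __init__(self, k=1):
        self.k = k
        self.memory = []  # stored (case, label) pairs

    def fit(self, cases, labels):
        # "Training" only stores the examples; no model is abstracted from them.
        self.memory = list(zip(cases, labels))
        return self

    def predict(self, case):
        # Rank stored cases by distance to the new case and keep the k closest.
        nearest = sorted(
            self.memory, key=lambda item: overlap_distance(item[0], case)
        )[: self.k]
        # Majority vote over the labels of the nearest stored cases.
        votes = Counter(label for _, label in nearest)
        return votes.most_common(1)[0][0]


# Toy data invented for this example.
cases = [("sunny", "hot"), ("sunny", "mild"), ("rainy", "mild"), ("rainy", "cool")]
labels = ["no", "no", "yes", "yes"]

clf = MemoryBasedClassifier(k=3).fit(cases, labels)
print(clf.predict(("rainy", "hot")))
```

All the work happens at prediction time, which is why such learners are often called lazy: fitting is just storage, and the choice of distance function and k determines how the stored cases are extrapolated.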