RapidMiner (formerly YALE) provides an easy to use extension and plugin mechanism
that makes it possible to integrate new operators and adapt RapidMiner to your
personal requirements.
RapidMiner is usable without graphical user interface (command line version).
Furthermore, a clean Java API allows the usage of single operators or complete
data mining processes or machine learning experiments directly from your Java
applications.
This turns RapidMiner into a library or interpreter for complex data mining tasks.
The Word Vector Tool (WVTool) is a flexible Java library for statistical language modeling.
It especially supports the creation of word vector representations of text documents in the
vector space model (each document is represented by the terms it contains).
The vector space model is the point of departure for many text processing applications
(e.g. web mining, text classification or information retrieval).
The classification of high dimensional data like time series often requires
the efficient extraction of meaningful features.
This plugin provides virtually all methods which are known today for the
feature extraction from value series.
A special highlight is the completely automatic feature extraction by means
of genetic programming.
The Distributed Data Mining (DM) Plugin for RapidMiner (formerly YALE) provides a
framework and basic functionality for simulations of distributed data mining scenarios.
It includes flexible networking pattern and distribution schemes of examples to nodes
as well as interactive visualizations.
The data stream plugin for RapidMiner (formerly YALE)
provides operators for simulating data streams from one or
several data sets, for simulating concept drifts, and for
handling concept drifts on data streams with simulated or
real-world concept drift.