RapidMiner (formerly YALE) is a flexible Java environment for knowledge discovery in databases, machine learning, and data mining. Many nestable learning and preprocessing operators (including Weka) are provided. It features an XML-based graphical user interface, a plugin mechanism, and high-dimensional plotting, and provides an easy-to-use extension mechanism that makes it possible to integrate new operators and adapt the system to your personal requirements. A command line version is also included.
| Tags | education Information Management Scientific/Engineering Artificial Intelligence Adaptive Technologies Office/Business Database |
|---|---|
| Licenses | AGPL |
| Operating Systems | OS Independent Windows Windows Unix |
| Implementation | Java |
Recent releases


Changes: This release provides 35 new operators, more than 30 bugfixes, and many new features. New Preprocessing Options: removal of duplicates, nominal value splits, data set union and superset, and others. New Learning Schemes: linear and quadratic discriminant analysis, fast large margin, and others. New Process Possibilities: FileIterator, MacroConstruction, SetData, ExceptionHandling, and others. Improvements include better handling of time zones, helper operators for attribute renaming, and others.


Changes: This release focuses on the most requested data analysis, ETL, and BI requirements. It provides more than 50 new operators and a lot of new features including upgraded data pivotings, new aggregation functions, and date and time handling. The usage of function-based attribute construction was simplified, and optimized wizards or new visualizations, including zooming and panning, were added. More powerful processes are now possible due to enhanced macros and the new result storage mechanism. This release also provides fixes for more than 30 bugs.


Changes: This is a minor bugfix and feature release. In particular, two errors that caused problems in the new parameter optimization wizard for string and integer parameters were fixed. New features include a new 64-bit version for Windows x64 systems, multiple groupings for aggregations, better support for date and time columns, the grouping and ungrouping of models, and an improved attribute subset preprocessing. Additionally, the runtime of several analysis schemes was drastically reduced, in some cases up to a factor of 13.


Changes: This is a major bugfix and feature release. The new features include a 64-bit version for Windows x64 systems, an improved and more comprehensive documentation of all operators inside the program, two different icon sets and look and feels, and a new and improved plotting facility including different shapes for plotter points. The most important preprocessing operators can now produce both views and materialized data changes, and several new preprocessing models were added. New wizards for parameter optimization were added. About 80 bugs were fixed.


Changes: This release is mainly a bugfix release. In particular, two errors which caused unusually high memory consumption for some data mining processes were fixed. More precise data representations are also used again, since too many rounding errors occurred. In total, about additional 30 bugs were fixed compared to version 4.1beta and it is therefore highly recommended to change to this version instead. Additionally, some improvements for plotting and some new operators were added.