com.rapidminer.operator.preprocessing.discretization
Class MinimalEntropyDiscretization

java.lang.Object
  extended by com.rapidminer.operator.Operator
      extended by com.rapidminer.operator.preprocessing.PreprocessingOperator
          extended by com.rapidminer.operator.preprocessing.discretization.MinimalEntropyDiscretization
All Implemented Interfaces:
ConfigurationListener, PreviewListener, ParameterHandler, LoggingHandler

public class MinimalEntropyDiscretization
extends PreprocessingOperator

This operator discretizes all numeric attributes in the dataset into nominal attributes. The discretization is performed by selecting the bin boundary that minimizes the entropy of the induced partitions. The method is then applied recursively to both new partitions until the stopping criterion is reached. For details, see a) Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning (Fayyad, Irani) and b) Supervised and Unsupervised Discretization of Continuous Features (Dougherty, Kohavi, Sahami). All special attributes, including the label, are skipped.

Please note that this operator automatically removes all attributes with only one range, i.e. attributes that are not actually discretized because the entropy criterion is not fulfilled. This behavior can be controlled by the remove_useless parameter.
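
The recursive procedure described above can be illustrated with a small standalone sketch. It is not the operator's source code: the class EntropyDiscretizationSketch and all of its methods are hypothetical, and it assumes that the stopping criterion is the MDL criterion from the cited Fayyad and Irani paper, which this page does not state explicitly. An attribute for which no cut point is accepted ends up with a single range, which is the "useless" case mentioned above.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration only; not part of the RapidMiner API.
public class EntropyDiscretizationSketch {

    static double log2(double x) {
        return Math.log(x) / Math.log(2.0);
    }

    /** Counts how often each class index occurs in labels[from..to). */
    static int[] classCounts(int[] labels, int from, int to, int numClasses) {
        int[] counts = new int[numClasses];
        for (int i = from; i < to; i++) counts[labels[i]]++;
        return counts;
    }

    /** Entropy in bits of a class distribution given as counts. */
    static double entropy(int[] counts) {
        int n = 0;
        for (int c : counts) n += c;
        double h = 0.0;
        for (int c : counts) {
            if (c > 0) {
                double p = (double) c / n;
                h -= p * log2(p);
            }
        }
        return h;
    }

    /** Number of classes that actually occur. */
    static int distinct(int[] counts) {
        int k = 0;
        for (int c : counts) if (c > 0) k++;
        return k;
    }

    /** Recursively collects cut points for v[from..to); v must be sorted ascending, y aligned with v. */
    static void split(double[] v, int[] y, int from, int to, int numClasses, List<Double> cuts) {
        int n = to - from;
        int best = -1;
        double bestE = Double.POSITIVE_INFINITY;
        for (int i = from + 1; i < to; i++) {
            if (v[i] == v[i - 1]) continue; // a boundary must separate two distinct values
            // Weighted average entropy of the two partitions induced by cutting before index i.
            double e = ((i - from) * entropy(classCounts(y, from, i, numClasses))
                    + (to - i) * entropy(classCounts(y, i, to, numClasses))) / n;
            if (e < bestE) { bestE = e; best = i; }
        }
        if (best < 0) return; // fewer than two distinct values: nothing to split

        int[] all = classCounts(y, from, to, numClasses);
        int[] left = classCounts(y, from, best, numClasses);
        int[] right = classCounts(y, best, to, numClasses);
        double gain = entropy(all) - bestE;
        // Assumed stopping criterion (Fayyad and Irani MDL): accept the cut only if
        // gain > (log2(n - 1) + delta) / n.
        double delta = log2(Math.pow(3, distinct(all)) - 2)
                - (distinct(all) * entropy(all)
                   - distinct(left) * entropy(left)
                   - distinct(right) * entropy(right));
        if (gain <= (log2(n - 1) + delta) / n) return;

        split(v, y, from, best, numClasses, cuts);
        cuts.add((v[best - 1] + v[best]) / 2.0); // midpoint between neighbouring values
        split(v, y, best, to, numClasses, cuts);
    }

    /** Returns the ascending cut points for one numeric attribute; labels are class indices 0..k-1. */
    public static List<Double> cutPoints(double[] values, int[] labels) {
        Integer[] order = new Integer[values.length];
        for (int i = 0; i < order.length; i++) order[i] = i;
        Arrays.sort(order, (a, b) -> Double.compare(values[a], values[b]));
        double[] v = new double[values.length];
        int[] y = new int[values.length];
        int numClasses = 0;
        for (int i = 0; i < order.length; i++) {
            v[i] = values[order[i]];
            y[i] = labels[order[i]];
            numClasses = Math.max(numClasses, y[i] + 1);
        }
        List<Double> cuts = new ArrayList<>();
        split(v, y, 0, v.length, numClasses, cuts);
        return cuts;
    }

    public static void main(String[] args) {
        double[] values = {1, 2, 3, 4, 5, 6, 7, 8};
        // Cleanly separable attribute: one boundary, two ranges.
        System.out.println(cutPoints(values, new int[] {0, 0, 0, 0, 1, 1, 1, 1})); // prints [4.5]
        // Uninformative attribute: no boundary passes the criterion, so only one range remains.
        System.out.println(cutPoints(values, new int[] {0, 1, 0, 1, 0, 1, 0, 1})); // prints []
    }
}
```

In this sketch an empty list of cut points corresponds to an attribute with only one range, i.e. the kind of attribute that the operator removes unless remove_useless is disabled.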

Author:
Sebastian Land, Dirk Dach

Field Summary
static java.lang.String PARAMETER_REMOVE_USELESS
          Indicates whether useless discretized attributes, i.e. attributes with only a single range after discretization, should be removed.
static java.lang.String PARAMETER_USE_LONG_RANGE_NAMES
          Indicates if long range names should be used.
 
Fields inherited from class com.rapidminer.operator.preprocessing.PreprocessingOperator
PARAMETER_CREATE_VIEW, PARAMETER_RETURN_PREPROCESSING_MODEL
 
Constructor Summary
MinimalEntropyDiscretization(OperatorDescription description)
           
 
Method Summary
 Model createPreprocessingModel(ExampleSet exampleSet)
           
 java.util.List<ParameterType> getParameterTypes()
          Returns a list of ParameterTypes describing the parameters of this operator.
 
Methods inherited from class com.rapidminer.operator.preprocessing.PreprocessingOperator
apply, getInputClasses, getOutputClasses
 
Methods inherited from class com.rapidminer.operator.Operator
addError, addValue, addWarning, apply, checkDeprecations, checkForStop, checkIO, checkProperties, clearErrorList, cloneOperator, createExperimentTree, createExperimentTree, createFromXML, createMarkedExperimentTree, createMarkedProcessTree, createProcessTree, createProcessTree, getAddOnlyAdditionalOutput, getApplyCount, getDeliveredOutputClasses, getDeprecationInfo, getDesiredInputClasses, getEncoding, getErrorList, getExperiment, getInnerOperatorsXML, getInput, getInput, getInput, getInputDescription, getIOContainerForInApplyLoopBreakpoint, getIODescription, getLog, getName, getOperatorClassName, getOperatorDescription, getParameter, getParameterAsBoolean, getParameterAsColor, getParameterAsDouble, getParameterAsFile, getParameterAsFile, getParameterAsInputStream, getParameterAsInt, getParameterAsMatrix, getParameterAsString, getParameterList, getParameters, getParameterType, getParent, getProcess, getStartTime, getStatus, getUserDescription, getValue, getValues, getXML, hasBreakpoint, hasBreakpoint, hasInput, inApplyLoop, isDebugMode, isEnabled, isExpanded, isParallel, isParameterSet, log, logError, logNote, logWarning, performAdditionalChecks, processFinished, processStarts, register, registerOperator, remove, rename, resume, setApplyCount, setBreakpoint, setEnabled, setExpanded, setInput, setListParameter, setOperatorParameters, setParameter, setParameters, setParent, setUserDescription, toString, unregisterOperator, writeXML
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
 

Field Detail

PARAMETER_USE_LONG_RANGE_NAMES

public static final java.lang.String PARAMETER_USE_LONG_RANGE_NAMES
Indicates if long range names should be used.

See Also:
Constant Field Values

PARAMETER_REMOVE_USELESS

public static final java.lang.String PARAMETER_REMOVE_USELESS
Indicates whether useless discretized attributes, i.e. attributes with only a single range after discretization, should be removed.

See Also:
Constant Field Values
Constructor Detail

MinimalEntropyDiscretization

public MinimalEntropyDiscretization(OperatorDescription description)
Method Detail

createPreprocessingModel

public Model createPreprocessingModel(ExampleSet exampleSet)
                               throws OperatorException
Specified by:
createPreprocessingModel in class PreprocessingOperator
Throws:
OperatorException

getParameterTypes

public java.util.List<ParameterType> getParameterTypes()
Description copied from class: Operator
Returns a list of ParameterTypes describing the parameters of this operator. The default implementation returns an empty list if no input objects can be retained; otherwise it returns special parameters for those input objects which can be prevented from being consumed.

Specified by:
getParameterTypes in interface ParameterHandler
Overrides:
getParameterTypes in class PreprocessingOperator


Copyright © 2001-2009 by Rapid-I