Welcome,
Guest
. Please
login
or
register
.
Did you miss your
activation email?
Home
Help
Search
Login
Register
Rapid-I
Rapid-I Forum
»
RapidMiner
»
Feature Requests
»
Setting Ranges in a Declare Missing Value Operator
Pages: [
1
]
« previous
next »
Print
Author
Topic: Setting Ranges in a Declare Missing Value Operator (Read 941 times)
mnorth
Newbie
Posts: 6
Setting Ranges in a Declare Missing Value Operator
«
on:
October 26, 2010, 06:00:07 PM »
I asked about this on the Support forum, but it seems what I might actually be after is a new feature.
I have an attribute in my data set containing millions of examples (observations). This attribute, Crime_Instances, has no missing values, but has about 25 distinct outlier values comprising less than 1% of the total examples. I would like to use a Declare Missing Values operator to submit an expression such as Crime_Instances > 8, so that all examples that have a value of more than 8 in the Crime_Instances attribute would be set to missing. It wouldn't matter if one example had 15 in this attribute, while another had 25 and another had 90, they would all be set to missing using a single expression in a single operator. This feature would be very useful to me.
As a side note, Sebastian suggested a method for accomplishing this and it did work, though it required six extra operators to achieve my desired result, so it's not as if I can't do it, I just can't do it very easily right now.
Thanks!
Matt
Logged
Simon Fischer
Administrator
Sr. Member
Posts: 422
Re: Setting Ranges in a Declare Missing Value Operator
«
Reply #1 on:
October 28, 2010, 09:28:31 AM »
Hi,
the next RapidMiner version will have a constant MISSING which you can use in, e.g., "Generate Attributes". Then you just generate a new attribute
if(oldValue > 8, MISSING, oldValue)
Best,
Simon
Logged
Simon Fischer
Rapid-I
Want to meet others from the RapidMiner community and share your research and results? Visit
http://www.rcomm2011.org
mnorth
Newbie
Posts: 6
Re: Setting Ranges in a Declare Missing Value Operator
«
Reply #2 on:
November 02, 2010, 09:30:03 PM »
This looks like it will work for what I need. I'll look forward to this feature being added!
Logged
elgoya
Newbie
Posts: 1
Re: Setting Ranges in a Declare Missing Value Operator
«
Reply #3 on:
September 25, 2011, 06:28:31 AM »
The Declare Missing Operator has new mode "expression", this example works: if(Crime_Instances > 8, true, false)
This not function: Crime_Instances > 8
Excuseme for very bad english.
Logged
Pages: [
1
]
Print
« previous
next »
Jump to:
Please select a destination:
-----------------------------
General Community
-----------------------------
=> News and Updates
=> Data Mining
=> Chit Chat
-----------------------------
RapidMiner
-----------------------------
=> Getting Started
=> Data Mining / ETL / BI Processes
=> Problems and Support
=> Feature Requests
=> Development
-----------------------------
RapidAnalytics
-----------------------------
=> Getting Started
=> Applications and Integration
-----------------------------
RapidNet
-----------------------------
=> Getting Started
=> Problems and Support
Loading...