Welcome,
Guest
. Please
login
or
register
.
Did you miss your
activation email?
Home
Help
Search
Login
Register
Rapid-I
Rapid-I Forum
»
RapidMiner
»
Data Mining / ETL / BI Processes
»
Finding good classifier
Pages: [
1
]
« previous
next »
Print
Author
Topic: Finding good classifier (Read 1110 times)
Stephan
Guest
Finding good classifier
«
on:
October 25, 2008, 09:50:07 AM »
Hi,
I want to use a machine learning-based classification
algorithm to generate a classifier. The training set
was gathered by supervised learning and consists of about
600 examples, each of which has 30 features and a binary
label (example belongs to class or not).
What classification algorithm would yield most accurate
results (in terms of minimal classification error) in
my case?
Support Vector Machines seem to be a good choice but I
would like to hear your opinion.
If this question cannot be generally answered, I would
appreciate any hints (also papers,websites,books) how a
professional data miner proceeds in finding a good
algorithm.
Regards,
Stephan
Logged
Sebastian Land
Administrator
Hero Member
Posts: 2421
Re: Finding good classifier
«
Reply #1 on:
October 25, 2008, 11:45:25 AM »
Hi Stephan,
one cannot say, which algorithm will yield the best performance without knowing the distribution behind the data. But if you would know the distribution, you wouldn't need any learner at all...
With two classes, SVMs seems to be a good tool, since they can cover a wide range of model types depending on the chosen Kernel. They are able to use linear and quadratic decision boundarys with linear or polynomial kernel or implicitly model the densities using RBF Kernels.
Unfortunatly it can't be shortly described, how professional data miner work things out. If it would be, they wouldn't be professionals any more, rather becoming unemployed...
Choosing the right algorithm(s) is more a guided search, considering previous results on the path to the one best process. This search is espacially guided by experience and a deep understanding of the algorithms itself.
But now to the good news:
RapidMiner is designed to really fast change processes and exchange learners. You even might do this automatically by OperatorSelection and ParameterOptimiziation. If you will "play" a little bit with your processes, you will gain your own experience with the algorithms...
Greetings,
Sebastian
Logged
Hope to see you at RapidMiner Community Meeting and Conference (RCOMM 2011) in Dublin from June 7-10, 2011.
The Call for Paper is online now!
More information at
http://www.rcomm2011.org
Pages: [
1
]
Print
« previous
next »
Jump to:
Please select a destination:
-----------------------------
General Community
-----------------------------
=> News and Updates
=> Data Mining
=> Chit Chat
-----------------------------
RapidMiner
-----------------------------
=> Getting Started
=> Data Mining / ETL / BI Processes
=> Problems and Support
=> Feature Requests
=> Development
-----------------------------
RapidAnalytics
-----------------------------
=> Getting Started
=> Applications and Integration
-----------------------------
RapidNet
-----------------------------
=> Getting Started
=> Problems and Support
Loading...