Time |
Monday Training 1 |
Tuesday Conference 1 |
Wednesday Conference 2 |
Thursday Training 2 |
09:00 - 11:00 |
|
Introductory Speech Dr. Ingo Mierswa; Rapid-I
Data Mining - Learning under Resource Constraints (Invited Talk) Prof. Dr. Katharina Morik; University of Dortmund |
Forecasting Historical Volatility for Option Trading, a Rapidminer Approach (Invited Talk) Thomas N. Ott; Neural Market Trends
Application of Short Term Load Forecasting using Support Vector Machines in RapidMiner 5.0 Marin Matijas, Tomislav Lipic; HEP Opskrba, Institut Ruder Boskovic |
Training T1: Time Series Analysis and Forecasting |
11:00 - 13:00 |
Training M1: RapidMiner and RapidAnalytics Overview |
User Support
Integrated Tutorial Tool for RapidMiner 5 Goslin, Hofmann; ITB Dublin
Pattern Recognition Engineering Shafait, Reif, Kofler, Breuel; DFKI Kaiserslautern and Technical University Kaiserslautern
Data Mining Workflow Templates for Intelligent Discovery Assistance in RapidMiner Kietz, Serban, Bernstein, Fischer; University of Zürich, Rapid-I |
Architecture
Distributed Pattern Recognition in RapidMiner Arimond, Kofler, Shafait; DFKI Kaiserslautern
Implementing Hierarchical Heavy Hitters in RapidMiner: Solutions and Open Questions Stolpe, Friscke; University of Dortmund
High Speed Data Mining with Ingres VectorWise Olaf Laber; Ingres |
Training T2: Text Mining with RapidMiner |
13:00 - 14:00 |
Lunch |
Lunch |
Lunch |
Lunch |
14:00 - 16:00 |
Training M2: Data Mining with RapidMiner |
Algorithms
Cross-Validation: the illusion of reliable performance estimation Prekopcsak, Henk, Gaspar-Papanek; Budapest University of Technology and Economics
WhiBo - RapidMiner plug-in for component based data mining algorithm design Vukiecevic, Jovanovic, Delibasic, Suknovic; University of Belgrade Serbia
Landmarking for Meta-Learning using RapidMiner Abdelmessih, Shafait, Reif, Goldstein; German University in Cairo, DFKI Kaiserslautern |
Workshops
Collaborative Work with RapidMiner and RapidAnalytics Dr. Simon Fischer; Rapid-I
Integrating R into RapidMiner Sebastian Land; Rapid-I |
Training T3: Market Basket Analysis and Recommender Systems |
16:00 - 18:00 |
Training M3: Data Preprocessing and Meta Data Propagation |
Workshop and Game Show
Reporting with RapidMiner and RapidAnalytics Tobias Malbrecht, Rapid-I
Who want's to be a Data Miner? Game Show: Watch the Gurus and learn how to quickly design analysis processes or participate and battle for a price! Our host tonight: Dr. Simon Fischer |
Information and Relation Extraction
A RapidMiner Framework for Protein Interaction Extraction Fayruzov, Dittmar, Spence, De Cock, Teredesai; Ghent University, University of Washington
An Information Extraction Plugin for RapidMiner 5 Felix Jungermann; University of Dortmund
Two pre-processing plugins for improved learning from Semantic Web Data Khan, Grimnes, Dengel; DFKI Kaiserslautern |
|
19:00 |
|
Dinner |
|
|
Training M1- RapidMiner and RapidAnalytics Overview
- Data Import
- Repository
The training starts with an introduction to the RapidMiner's Workbench and shows how to efficiently utilize the graphical user interface to manage projects, processes, data sets and results. During the lesson you will be guided through the different available perspectives and views. After learning the basics, it is shown how to go the first step towards all Data Mining Tasks: Import the data into the integrated repositories. These integrated repositories are then described in more detail and it is shown how to configure RapidAnalytics Repositories for coordinating efficient team work and deployment of results. Training M2- Data Mining with RapidMiner
- Cross Validation
- Attribute Selection
After we got comfortable with the RapidMiner and RapidAnalytics Workbench, this lesson introduces into data mining. Together with some background knowledge from statistical learning theory first Data Mining processes are designed. For assessing how well these process fit the problem, the Cross-Validation is introduced as most common and reliable statistical tool for performance estimation. With the gained knowledge we can start to get first insight into our data: Which are the most important attributes? Does sex or age determine if one is interested in soccer? All this will be imparted by having hands-on exercises for designing processes which illustrate the theoretical background. Training M3- Data Preprocessing
- Process Design and Meta Data Propagation
- Quick Fixes
After the last lesson thought you how to perform Data Mining, we have to admit, that most data in real world isn't in the condition to be directly used for Data Mining. This lesson now will demonstrate the various preprocessing capabilities to prepare your data for the final mining. During this you will learn to appreciate the new Meta Data Propagation, which will help you a lot during process design. It will keep you informed about the effects of your operators, offers QuickFixes to repair broken parts of the process. All of this is done in background and in real time, without actually processing the possible millions of records in your data. Training T1Time Series Analysis and Forecasting- Data Preprocessing for Forecasting Scenarios
- Numerical Predictions
- Validation of Forecasting
This lesson will tackle the following type of tasks: A time series, i.e. a collection of measurements for different time points is given, and one searches the most probable value of this measurement for future time points. This includes the demonstration of necessary preprocessing steps like numerical data transformations, an introduction how to apply previously introduced statistical regression methods, neural networks, or support vector machines (SVM), and a discussion of validation methods in order to estimate the goodness of the predictions. These methods are especially useful for numerical predictions from series data as they often occur in financial markets but also in production settings and many others. Training T2Text Mining with RapidMiner
- Preprocessing of Unstructured Data
- Text Analysis
- Applications of Text and Web Mining
This lesson will shift the focus to knowledge discovery from unstructured data like text documents. It will show the necessary preprocessing steps and the most successful methods for automatic text classification (including Naive Bayes and Support Vector Machines, SVM) and text clustering. Practical exercises for different settings that frequently occur in different settings like for example e-mail spam detection, automatical e-mail routing, adaptive personal news filtering, sentiment analysis of text documents like news, web pages, blogs, e-mail or PDF documents will enable the participants to transfer the previously gained knowledge of general data mining to text mining problems. Training T3Market Basket Analysis- Data Preprocessing for Transaction Data
- Finding Itemsets and Association Rules
- Visualization and Application of Models
During this lesson it is demonstrated how to analyse data from customer transactions to better understand the needs of your customers, to predict their behaviour, and to increase sales and improve profitability by leveraging cross- and up-selling opportunities. Together with the necessary preprocessing steps, the fast and highly effective operators of RapidMiner for shopping basket analysis are introduced. After the patterns have been extracted, the practical steps which are necessary to create and exploit such models are exercised.
|