Topic: Questions on importing Excel files whose columns are of mixed formats
surfreta
« on: November 11, 2013, 10:35:11 PM »

Hello, I am trying to read an Excel file into RapidMiner. However, this Excel file has mixed data formats. For instance, a given column may contain some cells that hold numerical values, while other cells hold plain text. How should I set up the formats when using the Read Excel operator?
Marius
« Reply #1 on: November 13, 2013, 09:51:32 AM »

Hi,

RapidMiner can only handle columns with a consistent data type. In your case, you have to define all columns as polynominal so that the Read Excel operator can read the complete file correctly. You can then process the data further in RapidMiner. If there is a condition that determines whether a row holds text or a number, you could use Filter Examples to split the data set into one part that contains the text values and another that contains the numbers. Then use Parse Numbers on the numeric part to convert it from polynominal to numerical.
Please note that Parse Numbers will silently fail if there is a value in the column that cannot be converted to a number.
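
For illustration only, here is the same idea sketched in plain Python rather than in RapidMiner: treat every cell as text first, split the values by whether they parse as a number, and only then convert the numeric part. The helper names `is_number` and `split_mixed_column` and the sample column are made up for this sketch and are not part of any RapidMiner API.

```python
def is_number(text):
    """Return True if the text can be parsed as a float."""
    try:
        float(text)
        return True
    except ValueError:
        return False


def split_mixed_column(values):
    """Split a column that was read as text into (numbers, texts)."""
    numbers, texts = [], []
    for raw in values:
        cell = str(raw).strip()
        if is_number(cell):
            # Convert only the cells that are known to parse cleanly.
            numbers.append(float(cell))
        else:
            # Keep everything else as text instead of dropping it.
            texts.append(cell)
    return numbers, texts


# Hypothetical column, read entirely as text (like a polynominal attribute).
column = ["12", "3.5", "n/a", "42", "pending"]
numbers, texts = split_mixed_column(column)
print(numbers)
print(texts)
```

Checking each value before converting it keeps unconvertible cells visible in their own list, so they are not lost without notice.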

Best regards,
Marius