Pages: [1]
  Print  
Author Topic: [SOLVED] Text mining on text attribute of a CSV?  (Read 1029 times)
wypee
Newbie
*
Posts: 7


« on: January 04, 2014, 04:02:56 AM »

Sorry for this rather noob question. I am just getting started with RapindMiner by trying it on one of the Kaggle challenge that requires categorizing user comments as insulting/not-insulting. Now there is this input CSV with tens of thousands of entries of user comments but I notice that all of rapidminer text mining operators expect document. So, am I to split each row of the CSV as a file and then feed as a set of documents to rapidminer? That doesn't seem right. What is the right way of doing it?

Thanks in advance.
« Last Edit: January 11, 2014, 06:51:57 AM by wypee » Logged
awchisholm
Sr. Member
****
Posts: 398


WWW
« Reply #1 on: January 04, 2014, 11:50:23 AM »

Hello wypee

Use the Process Documents from Data operator. Before this you need to convert the type of the attribute containing the data you want to analyse to text. Use the Nominal to Text operator for this.

regards

Andrew
Logged

wypee
Newbie
*
Posts: 7


« Reply #2 on: January 05, 2014, 07:04:43 PM »

That's what I needed! Thanks a lot, Andrew.  Smiley
« Last Edit: January 05, 2014, 07:06:22 PM by wypee » Logged
Pages: [1]
  Print  
 
Jump to: