Pages: [1]
  Print  
Author Topic: How to add filename to the wordlist output?  (Read 420 times)
sridhar
Newbie
*
Posts: 2


« on: December 21, 2013, 11:59:56 AM »

Hi,

I am processing text files. I want to add text file name for the word list output.

I would like to see the output as follows:

TextFile_Name| Word | occurances
---------------------------------------------------
R1.doc | java | 2
R1.doc | oracle | 3
R1.doc | database | 1
R2.doc | sql | 1

Can you please suggest on how to achieve the same in Rapid Miner?

Thanks a lot for your help!

Regards
Sridhar
Logged
awchisholm
Sr. Member
****
Posts: 369


WWW
« Reply #1 on: December 21, 2013, 12:32:42 PM »

Hello

The blog post here http://rapidminernotes.blogspot.co.uk/2013/04/counting-words-in-lots-of-documents.html has an example where the file name is used in a text processing context. You could use this as a starting point.

regards

Andrew
Logged

Pages: [1]
  Print  
 
Jump to: