Pages: [1]
  Print  
Author Topic: Google trends in RapidMiner: sentences to value series  (Read 4310 times)
Katharina Morik
Newbie
*
Posts: 6


« on: April 16, 2013, 07:37:08 PM »

Did anyone frame texts from different time as a value series?
I would like to do something like Google's Trends over a historical corpus of sentences.
Now, I wonder how to turn the text representation into a value series.
Any experiences out there?

Curious
Katharina
Logged
awchisholm
Sr. Member
****
Posts: 390


WWW
« Reply #1 on: April 16, 2013, 09:54:11 PM »

Hello Katharina

Do you mean something like this as a final output where the numbers are something like term occurrences?

Time, word1, word2, ...wordN
10:01, 4, 0, ...9
10:04, 1, 1, ...2
13:05, 0, 0, ...0

I have done something very similar on log files where I treat each line as a document.

One issue is new words because some new word might turn up later that has never been seen before causing a gnashing of teeth and grinding of cog wheels. The solution is a strict word list and something to spot new words.

regards

Andrew
Logged

Pages: [1]
  Print  
 
Jump to: