As promised in last post, here I present word clouds generated from Flipkart and Amazon India tweets. A word cloud (http://en.wikipedia.org/wiki/Tag_cloud) is a graphical representation of word frequency with greater prominence to words which are more frequent in source text. I have generated two sets of word clouds where each set contains two clouds, one for Filpkart and another for Amazon. First set of clouds is based upon all the words present in extracted tweets however second set of clouds is based upon the sentiment words only which are present in extracted tweets for analysis. Corpus of tweets and sentiment words used in this analysis are same as that of used in earlier analysis which I shared in last post.
Before showing word clouds, here I summarize the key steps involved in analysis-
Now the same for Amazon India-
As you see these word clouds, you can get the sense of what people are talking about e.g. in case Flipkart people are mostly talking about Xiaomi mi3 mobile phone which has been a recent hit in Flipkart.
Now coming on the second set of word clouds where we focus only upon the words carrying some sort of sentiment, here is that for Flipkart-
And here is same for Amazon India-
As one can see in both of the above word clouds that most of the prominent words carry positive sentiments. However if you focus upon words carrying negative sentiments you would see that degree of prominence in case of Flipkart is more than that of Amazon India and this supports the outcome of analysis we saw in last post.
Thanks for now!
Before showing word clouds, here I summarize the key steps involved in analysis-
- Read the tweets in R as described in last post
- Perform some text processing. For this, I used text mining package in R (http://cran.r-project.org/web/packages/tm/index.html)
- Generate a two-column format structure from tweets where first column contains words and second column contains frequency of the word in all tweets
- Convert all words in lower case
- Remove stop words
- Remove punctuation marks
- Get top 100 most occurring words for first set of clouds and top 100 sentiment words for second set of clouds
- Generate word cloud. I used word cloud package in R (http://cran.rproject.org/web/packages/wordcloud/index.html) for this
Now the same for Amazon India-
As you see these word clouds, you can get the sense of what people are talking about e.g. in case Flipkart people are mostly talking about Xiaomi mi3 mobile phone which has been a recent hit in Flipkart.
Now coming on the second set of word clouds where we focus only upon the words carrying some sort of sentiment, here is that for Flipkart-
And here is same for Amazon India-
As one can see in both of the above word clouds that most of the prominent words carry positive sentiments. However if you focus upon words carrying negative sentiments you would see that degree of prominence in case of Flipkart is more than that of Amazon India and this supports the outcome of analysis we saw in last post.
Thanks for now!
Great Article
ReplyDeleteR Project Topics for Computer Science
FInal Year Project Centers in Chennai
JavaScript Training in Chennai
JavaScript Training in Chennai