Using quanteda’s fcm() and textplot_network(), you can perform visual analysis of social media posts in terms of co-occurrences of hashtags or usernames in a few steps. The dataset for this example contains only 10,000 Twitter posts, but you can easily analyze more than one million posts on your laptop computer.
library(quanteda)
load("data/data_corpus_tweets.rda")
head(tweet_dfm)
## Document-feature matrix of: 6 documents, 42,327 features (100% sparse).
head(topuser)
## [1] "@pablo_iglesias_" "@elenavalenciano" "@canete2014_"
## [4] "@nigel_farage" "@martinschulz" "@mlp_officiel"
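The listing refers to tweet_dfm, user_dfm and topuser without showing how they were built. The following is a minimal sketch of one plausible way to derive objects of this kind, run on a toy corpus rather than the real data. The toy_ names and the three example texts are hypothetical, and the sketch assumes a recent quanteda version in which tokens() keeps @usernames and #hashtags intact by default.

```r
library(quanteda)

# Toy stand-in for the tweet corpus (hypothetical texts, not the real data).
toy_tweets <- c(
  "@alice and @bob debate #eu2014",
  "@bob replies to @carol #eu2014",
  "@alice thanks @bob and @carol"
)

# Tokenize and drop punctuation, then build a document-feature matrix.
toy_toks <- tokens(toy_tweets, remove_punct = TRUE)
toy_dfm <- dfm(toy_toks)

# Keep only features that look like usernames (glob pattern "@*").
toy_user_dfm <- dfm_select(toy_dfm, pattern = "@*")

# Names of the most frequently mentioned usernames (top 2 in this toy case).
toy_topuser <- names(topfeatures(toy_user_dfm, 2))
```

With the real data, the same steps would presumably start from the corpus object stored in the .rda file and use a larger cutoff than 2 for the top users.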
user_fcm <- fcm(user_dfm)
head(user_fcm)
## Feature co-occurrence matrix of: 6 by 6 features.
## 6 x 6 sparse Matrix of class "fcm"
## features
## features @pacomarhuenda @pablo_iglesias_ @kopriths @gapatzhs
## @pacomarhuenda 0 1 0 0
## @pablo_iglesias_ 0 0 0 0
## @kopriths 0 0 0 1
## @gapatzhs 0 0 0 0
## @mariaspyraki 0 0 0 0
## @ernesturtasun 0 0 0 0
## features
## features @mariaspyraki @ernesturtasun
## @pacomarhuenda 0 0
## @pablo_iglesias_ 0 1
## @kopriths 1 0
## @gapatzhs 1 0
## @mariaspyraki 0 0
## @ernesturtasun 0 0
user_fcm <- fcm_select(user_fcm, topuser)
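To make the two steps above concrete, here is a small base R sketch of what a document-level co-occurrence count and a subsequent restriction to selected users amount to. It is not quanteda's implementation: it counts each pair at most once per post, whereas fcm() offers several counting schemes, and it zeroes the diagonal for simplicity. The matrix m and the vector top are hypothetical toy data.

```r
# Toy document-feature counts: rows are posts, columns are usernames.
m <- matrix(
  c(1, 1, 0,
    0, 1, 1,
    1, 1, 1),
  nrow = 3, byrow = TRUE,
  dimnames = list(paste0("post", 1:3), c("@alice", "@bob", "@carol"))
)

# Reduce counts to presence/absence, then count the posts
# in which each pair of usernames appears together.
present <- (m > 0) * 1
cooc <- crossprod(present)

# Keep each pair only once: zero the lower triangle and the diagonal.
cooc[lower.tri(cooc, diag = TRUE)] <- 0

# Restrict the matrix to a chosen set of users, analogous in spirit
# to selecting the top users from the co-occurrence matrix.
top <- c("@alice", "@bob")
cooc_top <- cooc[top, top]
```

Each nonzero cell of cooc is the number of posts that mention both the row user and the column user; these counts are what a network plot would draw as edges.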