6.4 Sentiment analysis
Now that we have assembled the data set with tweets pertaining to the 9 airlines, we are ready to do sentiment analysis. We will use get_nrc_sentiment()
from syuzhet
package. The input to this function is a character vector. Therefore, we will simply pull()
this vector out from airlines_df
.
Depending on the number of tweets this code will take a few minutes to execute so please be patient.
airlines_sent <- airlines_df %>%
pull(text) %>% # This returns a character vector
get_nrc_sentiment()
Take a look at the sentiment data using head()
. For my sample, the results are shown in Table 6.2,
anger | anticipation | disgust | fear | joy | sadness | surprise | trust | negative | positive |
---|---|---|---|---|---|---|---|---|---|
0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
1 | 1 | 0 | 0 | 1 | 0 | 1 | 1 | 1 | 1 |
0 | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 1 |
0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
0 | 2 | 0 | 2 | 0 | 1 | 0 | 1 | 2 | 1 |
1 | 2 | 1 | 0 | 1 | 1 | 1 | 2 | 1 | 2 |
0 | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 1 |
0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |