10.6 LDA output

Now we are ready see the words associated with each of the 20 topics. Recall that we used stemming, which means some of the words will be difficult to read. We print first 10 words for each topic using terms() function.

terms(lda_model, 10)
##       Topic 1    Topic 2    Topic 3     Topic 4    Topic 5    Topic 6  
##  [1,] "christma" "googl"    "speaker"   "internet" "memori"   "stick"  
##  [2,] "wife"     "store"    "sound"     "email"    "card"     "box"    
##  [3,] "daughter" "android"  "dot"       "web"      "black"    "stream" 
##  [4,] "enjoy"    "instal"   "qualiti"   "surf"     "storag"   "cabl"   
##  [5,] "happi"    "limit"    "tap"       "brows"    "friday"   "remot"  
##  [6,] "mom"      "appl"     "bluetooth" "check"    "expand"   "roku"   
##  [7,] "mother"   "expect"   "listen"    "basic"    "download" "faster" 
##  [8,] "son"      "download" "portabl"   "want"     "space"    "fast"   
##  [9,] "like"     "load"     "voic"      "video"    "movi"     "netflix"
## [10,] "birthday" "function" "connect"   "access"   "add"      "cut"    
##       Topic 7   Topic 8   Topic 9     Topic 10  Topic 11  Topic 12 
##  [1,] "prime"   "turn"    "version"   "charg"   "batteri" "smart"  
##  [2,] "free"    "issu"    "model"     "connect" "life"    "control"
##  [3,] "movi"    "star"    "upgrad"    "wifi"    "long"    "voic"   
##  [4,] "account" "reason"  "paperwhit" "problem" "charg"   "turn"   
##  [5,] "access"  "bit"     "older"     "charger" "hour"    "hous"   
##  [6,] "video"   "review"  "featur"    "issu"    "week"    "room"   
##  [7,] "stream"  "figur"   "origin"    "plug"    "last"    "command"
##  [8,] "member"  "problem" "reader"    "unit"    "day"     "skill"  
##  [9,] "netflix" "button"  "improv"    "power"   "hold"    "autom"  
## [10,] "content" "open"    "generat"   "replac"  "longer"  "integr" 
##       Topic 13   Topic 14 Topic 15   Topic 16  Topic 17    Topic 18
##  [1,] "weather"  "day"    "parent"   "qualiti" "reader"    "size"  
##  [2,] "question" "sale"   "control"  "pictur"  "paperwhit" "small" 
##  [3,] "list"     "order"  "daughter" "nice"    "white"     "cover" 
##  [4,] "answer"   "review" "child"    "camera"  "night"     "carri" 
##  [5,] "news"     "decid"  "son"      "excel"   "paper"     "fit"   
##  [6,] "fun"      "store"  "children" "clear"   "eye"       "case"  
##  [7,] "shop"     "servic" "case"     "pretti"  "bright"    "hold"  
##  [8,] "listen"   "want"   "learn"    "high"    "librari"   "nice"  
##  [9,] "timer"    "month"  "download" "fast"    "adjust"    "hand"  
## [10,] "joke"     "custom" "age"      "amaz"    "dark"      "travel"
##       Topic 19    Topic 20
##  [1,] "friend"    "ipad"  
##  [2,] "famili"    "money" 
##  [3,] "user"      "worth" 
##  [4,] "enjoy"     "cost"  
##  [5,] "learn"     "spend" 
##  [6,] "item"      "expens"
##  [7,] "fun"       "extra" 
##  [8,] "entertain" "pay"   
##  [9,] "technolog" "mini"  
## [10,] "member"    "compar"

LDA did a fairly good job of picking topics. For instance, Topic 1 is all about celebrations and festivities. Topic 2 seems to be about Google App Store, Android, and download speeds of the apps. Topic 3 is about speaker quality, Topic 4 is about Internet, Topic 5 is about memory and storage, and so on.

Topic 20 looks to be about iPad and how it is worth the money. We can expect that whenever this topic showed up, the reviewer probably rated the corresponding Amazon product lower. We will perform this analysis next.