I do not understand what we are being asked to do when the curriculum says (ECS Version 6.0, pages 246-247):
“Now it is your turn, Text mining—analyzing word counts.
Demonstrate how to do the following:
This is a good opportunity to explain that the tweets are stored in an array or vector, where the numbers in front indicate the place the tweet is in the vector.
Arrays are an important concept in computer science. Storing items in an array allows us to access particular elements, search and sort. Demo how to view the vector and point out that each of the array elements of the corpus matches the corresponding tweet in the data file.
Create a frequency table that separates out each word and counts how many times it appears in all the tweets.
Ask questions such as: What is the word that appears least frequently? What is the word that appears most frequently?
Demo how to produce frequency tables that show only the most frequently appearing words and the different sorting options.
Demo how to produce a bar chart of frequently occurring words.”…
I need help understanding where to start with this. I’m lost. Help!
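For anyone else stuck at the same spot, the steps in the quoted passage can be roughly sketched in plain Python. This is only an illustration of the ideas (tweets held in a vector, a word frequency table, sorting, showing the top words, a bar chart), not the curriculum's own tool or data. The three sample tweets are made up, and the names `tweets`, `word_counts`, `table` and `top_three` are hypothetical choices for this sketch.

```python
from collections import Counter
import re

# Made-up sample data (an assumption); the lesson uses the tweets in its own data file.
tweets = [
    "I love computer science",
    "Computer science is fun",
    "Science rocks",
]

# A Python list plays the role of the array or vector: each tweet has a position.
# Python numbers positions from 0; the tool used in class may number them differently.
for index, tweet in enumerate(tweets):
    print(index, tweet)


def word_counts(texts):
    """Split every text into lowercase words and count each word across all texts."""
    counts = Counter()
    for text in texts:
        counts.update(re.findall(r"[a-z']+", text.lower()))
    return counts


counts = word_counts(tweets)

# Frequency table sorted from most to least frequent; ties are sorted alphabetically.
table = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
for word, count in table:
    print(f"{word:10} {count}")

# Keep only the most frequently appearing words (here, the top three).
top_three = table[:3]

# A simple text bar chart: one '#' per occurrence of the word.
for word, count in top_three:
    print(f"{word:10} {'#' * count}")
```

Reading the sorted table from the top answers "which word appears most frequently?", and reading it from the bottom answers "which appears least frequently?". Changing the sort key (for example, sorting by word only) corresponds to the "different sorting options" mentioned in the passage.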