Lightweight text analytics from Voyant Tools include word frequency lists, frequency distribution charts, and Key Word in Context analysis. I employed Voyant Tools to examine word frequencies, collocates, trends, and keywords in context throughout an Alice in Wonderland sample corpus. The following was the corpus given:
This corpus has 1 document with 2,161 total words and 610 unique word forms. Created about a month ago.
Vocabulary Density: 0.282
Readability Index: 7.304
Average Words Per Sentence: 26.4
Most frequent words in the corpus: alice (27); little (15); way (13); think (11); like (11)
I employed Voyant Tools to examine word frequencies, collocates, trends, and keywords in context throughout an A COMEDY, CALL’D, THE Basset-Tabl. The following was the corpus given:
This corpus has 1 document with 17,961 total words and 3,924 unique word forms. Created about 15 minutes ago.
Vocabulary Density: 0.218
Readability Index: 10.641
Average Words Per Sentence: 30.4
Most frequent words in the corpus: sir (162); ha (121); lady (103); lord (81); madam (72)
I don’t believe you would have been able to asses all the calculations of things, such as a repetition of words. I also do not think anyone has time to see the average words per sentence. I believe that the vocabulary density and readability index gives you more insight into the works’ level of writing.
A very intriguing component of Voyant tool is the Reader Tool. The text is shown for viewing in the Reader Tool. You may get more material by scrolling down in the text reader. You can also mouse over a word to see how frequently it appears in the document. You can also click on a word or search for it in the search box to see how frequently it appears in your corpus. Overall I think it could be useful to analyze one work to see their writing style and the importance of repeated words. This is a great use, especially for those trying to read text distant.
A COMEDY, CALL’D, THE Basset-Tabl