Fill This Form To Receive Instant Help
Homework answers / question archive / FIN 4230 Assignment 4 Due time: 11:59pm@ May 12, 2022 Question background: In R, the package “gutenbergr” offers over 53,000 free books
FIN 4230 Assignment 4
Due time: 11:59pm@ May 12, 2022
Question background:
In R, the package “gutenbergr” offers over 53,000 free books. Each book has its own id number. For
instance, we can use following codes to install and visit the package:
install.packages("gutenbergr")
library(gutenbergr)
You also can use following code to examine the structure of the book database:
gutenberg_metadata
And the sample output is
You can see it has several contents like id, title, author, etc.
In this assignment, we are interested in analyzing Charles Dickens’ books. Charles Dickens wrote
fourteen and a half novels. In fact, you can use following code to find all his works id.
library(dplyr)
gutenberg_metadata %>% filter(author == "Dickens, Charles")
Question:
Please select five of Charles Dickens’ books (you also can select other authors’ books to your interests)
to perform sentiment and tf_idf analysis (similar to our CH3_4_newversion.R lecture). To begin
with, for example, you can use following code to collect sample five books contents and store in the
self-named dataframe: dickens_df:
dickens_df <- gutenberg_download(c(98, 1400, 46, 730, 786))
Then you can perform analysis and data manipulation similar to our class lecture. Please submit your
code, output graph and results. If you encounter coding errors, please also attach them so that I can give
partial credits.