JacobMannix/webscraping-songs-and-lyrics

Web scraping Wikipedia for a list of Billboard Hot 100 songs, then using this list to scrape Genius for the lyrics of those songs.

Web Scraping: Billboard Hot 100 in R

R project to create a tool that scrapes the web for data.

For this project, I first scraped Wikipedia to get a list of the top 100 songs on the Billboard Hot 100 over a number of years. I then used this song information to scrape the lyrics for these songs from Genius.
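
The two scraping steps could be sketched roughly as follows. This is a minimal illustration, not the repository's actual code. It assumes the `rvest` package, assumes each year's chart sits in the first `wikitable` on a Wikipedia page named `Billboard_Year-End_Hot_100_singles_of_<year>`, and assumes Genius lyric pages follow an `<Artist>-<title>-lyrics` URL pattern. The helper names (`wiki_chart_url`, `genius_url`, `scrape_chart`, `scrape_lyrics`) and the default CSS selector are hypothetical and may need adjusting.

```r
# Hypothetical helpers: names, URL patterns and selectors are assumptions,
# not taken from the repository.

# Build the Wikipedia URL for one year's Billboard Year-End Hot 100 list.
wiki_chart_url <- function(year) {
  paste0("https://en.wikipedia.org/wiki/Billboard_Year-End_Hot_100_singles_of_", year)
}

# Turn an artist and a title into a Genius-style lyrics URL.
genius_url <- function(artist, title) {
  slug <- paste(artist, title, "lyrics")
  slug <- gsub("'", "", slug, fixed = TRUE)    # "Don't" becomes "Dont"
  slug <- gsub("[^A-Za-z0-9]+", "-", slug)     # other non-alphanumerics become hyphens
  slug <- gsub("^-+|-+$", "", slug)            # trim leading/trailing hyphens
  # Capitalise only the first character, lower-case the rest.
  slug <- paste0(toupper(substring(slug, 1, 1)), tolower(substring(slug, 2)))
  paste0("https://genius.com/", slug)
}

# Download one year's chart as a data frame (requires rvest and network access).
scrape_chart <- function(year) {
  page <- rvest::read_html(wiki_chart_url(year))
  chart <- rvest::html_table(rvest::html_element(page, "table.wikitable"))
  chart$year <- year
  chart
}

# Download the lyrics for one song as a single string.
# The selector is an assumption about the Genius page markup.
scrape_lyrics <- function(artist, title,
                          selector = "div[data-lyrics-container='true']") {
  page <- rvest::read_html(genius_url(artist, title))
  blocks <- rvest::html_text2(rvest::html_elements(page, selector))
  paste(blocks, collapse = "\n")
}
```

For example, `genius_url("Katy Perry", "California Gurls")` builds the URL that `scrape_lyrics()` would request. Real titles with featured artists or unusual punctuation will likely need extra clean-up before they match an actual Genius page.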

I used the lyrics to plot the frequency of the words used in the songs and to try to detect differences in word choice across the decades.
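
As a rough sketch of the word-frequency step, the snippet below counts words per decade using base R only. It is an illustration under stated assumptions, not the project's code: the `songs` data frame holds invented toy lyrics in place of scraped data, and the function names `tokenize` and `word_freq_by_decade` are hypothetical.

```r
# Toy data standing in for scraped lyrics; the values are invented for illustration.
songs <- data.frame(
  year   = c(1985, 1987, 2012),
  lyrics = c("Love me, love me now", "Dance all night", "Love the night"),
  stringsAsFactors = FALSE
)

# Split one lyric string into lower-case words, keeping apostrophes.
tokenize <- function(text) {
  words <- strsplit(tolower(text), "[^a-z']+")[[1]]
  words[nzchar(words)]
}

# Count how often each word appears in each decade (long format).
word_freq_by_decade <- function(songs) {
  decade <- (songs$year %/% 10) * 10
  tokens <- lapply(songs$lyrics, tokenize)
  long <- data.frame(
    decade = rep(decade, lengths(tokens)),
    word   = unlist(tokens),
    stringsAsFactors = FALSE
  )
  counts <- as.data.frame(
    table(decade = long$decade, word = long$word),
    responseName = "n",
    stringsAsFactors = FALSE
  )
  counts <- counts[counts$n > 0, ]            # drop word/decade pairs that never occur
  counts[order(counts$decade, -counts$n), ]   # most frequent words first within a decade
}

freq <- word_freq_by_decade(songs)
print(freq)
```

The resulting long table can be passed to `barplot()` or a plotting package to compare decades. Removing common stop words first would probably make the differences between decades easier to see.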

Languages

R 100.0%

MIT License
Created February 17, 2020
Updated October 14, 2020