The following is the data validation report that the seller chose to provide:
Data description
This data set contains the top 100 songs (with lyrics) in the United States for each year from 1959 to 2023.
The top 100 songs of each year were web scraped from https://billboardtop100of.com and the lyrics were pulled from the Genius API.
The all_songs_data_processed.csv file is the output file of the starter notebook. It is a cleaned-up version of the data set in which part-of-speech data (verbs, nouns, etc.) has already been extracted from the lyrics. This file should be an easier starting point for people who want to jump right in.
I also provided a starter EDA notebook for those who want to just jump into analysis.
Data Description:
Data set size: 6500 rows x 12 columns (100 songs for each of the 65 years)
Each row describes one song using the following 12 variables:
- Album
- Album URL
- Artist
- Featured Artists
- Lyrics
- Media Link (i.e. YouTube link)
- Rank (at end of year)
- Release Date (year-month-day)
- Song Title
- Song URL (link to the lyrics, from the Genius API)
- Writers
- Year of ranking
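
The size and variable list above can be turned into a quick sanity check before starting any analysis. The following is only a minimal sketch: the header names in EXPECTED_COLUMNS are assumptions based on the variable list (the real CSV headers may be spelled differently), check_rows and expected_row_count are hypothetical helpers, and the two-row sample is made up for illustration rather than taken from the data set.

```python
import csv
import io

# Assumed header names, taken from the variable list above; the real CSV
# headers may be spelled differently.
EXPECTED_COLUMNS = [
    "Album", "Album URL", "Artist", "Featured Artists", "Lyrics",
    "Media Link", "Rank", "Release Date", "Song Title", "Song URL",
    "Writers", "Year",
]
FIRST_YEAR, LAST_YEAR, SONGS_PER_YEAR = 1959, 2023, 100


def expected_row_count():
    """Number of rows implied by 100 songs per year over the covered years."""
    return (LAST_YEAR - FIRST_YEAR + 1) * SONGS_PER_YEAR


def check_rows(csv_file):
    """Return a list of problems found in an open CSV file object."""
    reader = csv.DictReader(csv_file)
    header = reader.fieldnames or []
    missing = [c for c in EXPECTED_COLUMNS if c not in header]
    if missing:
        return [f"missing columns: {missing}"]
    problems = []
    for line_no, row in enumerate(reader, start=2):  # line 1 is the header
        try:
            rank, year = int(row["Rank"]), int(row["Year"])
        except (ValueError, TypeError):
            problems.append(f"line {line_no}: Rank or Year is not an integer")
            continue
        if not 1 <= rank <= SONGS_PER_YEAR:
            problems.append(f"line {line_no}: rank {rank} out of range")
        if not FIRST_YEAR <= year <= LAST_YEAR:
            problems.append(f"line {line_no}: year {year} out of range")
    return problems


# Made-up two-row sample, not real data from the set.
sample = io.StringIO(
    ",".join(EXPECTED_COLUMNS) + "\n"
    "A,u,Artist A,,la la,m,1,1959-01-01,Song A,s,W,1959\n"
    "B,u,Artist B,,na na,m,101,2024-01-01,Song B,s,W,2024\n"
)
print(expected_row_count())
for problem in check_rows(sample):
    print(problem)
```

To run the same check on a downloaded file, pass an open file object (for example one opened with newline="" and encoding="utf-8") to check_rows, after adjusting EXPECTED_COLUMNS to the actual headers.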
