The following is the data validation report the seller chose to provide:
Data Description
Context
IMDb is the most popular movie website. It combines movie plot descriptions, Metascore ratings, critic and user ratings and reviews, release dates, and many more aspects.
While browsing Kaggle, I quickly noticed that a Dataset covering TV Shows hadn't been uploaded so far. So I took it upon myself to make sure a usable dataset would be uploaded onto the platform.
Content
The Primary Datasets to be used are:
Shows_Episodes_Details.csv - Contains TV Series along with the associated Episodes which have been rated on IMDb, and the Episode Ratings.
TV Series Names 1k Votes.csv - 6605 Entries. Contains the TV Series Names, Ratings for the Series, and Number of Votes received.
If required, I can upload .txt versions of all the uploaded files.
The data was obtained from the IMDb Interfaces Page. The source data contains both TV Shows and Movies combined. I whittled it down to include only information pertaining to TV Shows, Mini Shows and the related Episodes. I used Access to assemble the data. My poor system couldn't handle the cleanup process using Python, so Access to the rescue.
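For anyone who would rather redo the whittling-down step in Python, here is a minimal sketch of the idea. It is not the method I used (that was Access). It assumes the source file follows the tab-separated layout of IMDb's title.basics file, with a titleType column, and that tvSeries, tvMiniSeries and tvEpisode are the types to keep. The function name filter_tv_titles and the sample rows are made up for illustration.

```python
import csv
import io

# Title types kept by this sketch. The values follow IMDb's title.basics
# file, but the exact set kept for this dataset is an assumption.
TV_TYPES = {"tvSeries", "tvMiniSeries", "tvEpisode"}


def filter_tv_titles(basics_tsv):
    """Return the rows of a title.basics-style TSV whose titleType is a TV type.

    Hypothetical helper for illustration; not part of the uploaded dataset.
    """
    # QUOTE_NONE: treat quote characters in titles as ordinary text.
    reader = csv.DictReader(
        io.StringIO(basics_tsv), delimiter="\t", quoting=csv.QUOTE_NONE
    )
    return [row for row in reader if row["titleType"] in TV_TYPES]


# Tiny made-up sample in the title.basics column layout (not real IMDb data).
SAMPLE = (
    "tconst\ttitleType\tprimaryTitle\n"
    "tt0000001\tmovie\tExample Movie\n"
    "tt0000002\ttvSeries\tExample Show\n"
    "tt0000003\ttvEpisode\tExample Episode\n"
    "tt0000004\ttvMiniSeries\tExample Mini Show\n"
)

tv_rows = filter_tv_titles(SAMPLE)
print([row["tconst"] for row in tv_rows])
```

Reading the rows one at a time with the csv module keeps memory use low, which may help on a system that struggles with loading the whole file at once.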
The data was last updated on 30 January 2021.
Acknowledgements
Data Scraped From: Information courtesy of IMDb (http://www.imdb.com). Used with permission.
A shout-out to Lucas Lunelli. He created a Viz on Tableau Public, and I reached out to him to get an idea of how he cleaned up the data using Python. It was a useful learning experience, even if the method I used for data assembly wasn't the same.
Projects
I have already created a Viz using the dataset, just to prove that work can be carried out on it. Hopefully folks here can expand on this work and create interesting Projects as well.
