
Data Description
About Dataset
Summary
The data contains details of American movies from the 1970s through the 2020s, taken from Wikipedia. It was compiled using the Wikipedia API and includes almost 18,000 movies.
The data is ideal for natural language processing and machine learning tasks such as recommender systems. The data is provided in CSV format and includes columns for title, image and plot.
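As a rough illustration of working with the CSV, the sketch below reads rows with Python's standard `csv` module. The exact header spelling (`title`, `image`, `plot`) is assumed from the column list above, `load_movies` is a hypothetical helper name, and the sample row is an invented placeholder rather than real data.

```python
import csv
import io


def load_movies(fileobj):
    """Read movie rows from an open CSV file into a list of dicts."""
    reader = csv.DictReader(fileobj)
    return [row for row in reader]


# Tiny in-memory stand-in for the real file; the row below is invented.
# Assumption: the header row is spelled exactly "title,image,plot".
sample = io.StringIO(
    "title,image,plot\n"
    'Example Movie,https://example.org/poster.jpg,"A hero sets out, then returns."\n'
)

movies = load_movies(sample)
print(movies[0]["title"])
```

With the downloaded file, the same function could be called on `open(path, newline="", encoding="utf-8")`, where `path` is wherever the CSV was saved. The UTF-8 encoding is also an assumption.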
Criteria used
For a movie to be included in the data, it must have had a Wikipedia page that:
- Appeared in: category:{decade}s_American_films
- Contained a "Plot" section (not a "Synopsis" or "Summary" section)
- Contained an image that was not a Wikipedia placeholder image (e.g. File:Question_Mark.svg)
As a result, the data does not include every American movie released from the 1970s to the 2020s.
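The three criteria above can be read as a single filter. The sketch below is not the code used to build the dataset. It assumes a simplified, hypothetical `Page` record holding a page's categories, section titles, and lead image, and it treats `File:Question_Mark.svg` as the only placeholder image because that is the only one named here.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Set

DECADES = range(1970, 2030, 10)  # 1970s through 2020s

# Assumption: only the placeholder named in the description is listed.
PLACEHOLDER_IMAGES = {"File:Question_Mark.svg"}


@dataclass
class Page:
    """Hypothetical, simplified view of a Wikipedia page."""
    title: str
    categories: Set[str] = field(default_factory=set)
    sections: List[str] = field(default_factory=list)
    image: Optional[str] = None


def decade_categories():
    """Build the category names, following the pattern given in the criteria."""
    return {f"category:{decade}s_American_films" for decade in DECADES}


def is_included(page):
    """Return True only if the page meets all three inclusion criteria."""
    in_category = bool(page.categories & decade_categories())
    has_plot = "Plot" in page.sections  # "Synopsis" or "Summary" do not count
    has_image = page.image is not None and page.image not in PLACEHOLDER_IMAGES
    return in_category and has_plot and has_image


kept = Page(
    title="Example Movie",
    categories={"category:1980s_American_films"},
    sections=["Plot", "Cast"],
    image="File:Example_poster.jpg",
)
dropped = Page(
    title="Another Example",
    categories={"category:1980s_American_films"},
    sections=["Synopsis"],
    image="File:Question_Mark.svg",
)
print(is_included(kept), is_included(dropped))
```

A page that fails any one check is excluded, which is why the dataset covers only a subset of the movies in each decade category.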

Wikipedia Movies (18.45 MB)