以下为卖家选择提供的数据验证报告:
数据描述
Plots extracted from Wikipedia for all movies with > 1000 ratings on IMDb and released between 1950 to 2023. Useful for a demo projects on Large Language models (LLMs)(e.g. a movie searching app - https://www.cinemattr.ca).
Details of Extraction
The plot summary section of each movie was cleaned of all links, references and other irrelevant stuff to get a pure text value.Missing plots were fallbacked to IMDb synopses.
Details of the data
89% movies have plot details, 100% have a short summary (untouched from wikipedia, useful for matching metadata and other details for a retriever application)
The columns stars, directors, genres are a list of values, useful for loading into a vector database.

Movie Plots from Wikipedia
95.78MB
申请报告