The following data verification report was provided at the seller's discretion:
Data description
This is a dataset for binary sentiment classification containing substantially more data than previous benchmark datasets. We provide a set of 25,000 highly polar movie reviews for training, and 25,000 for testing. There is additional unlabeled data for use as well. Raw text and already processed bag-of-words formats are provided.
Train / Validation / Test = 7 / 1 / 2
http://ai.stanford.edu/~amaas/data/sentiment
https://www.kaggle.com/lakshmi25npathi/imdb-dataset-of-50k-movie-reviews
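The listing does not say how the 7 / 1 / 2 split was produced. As a rough illustration only, the sketch below shows one way such a split could be made, assuming it is applied to the 50,000 labeled reviews after a seeded random shuffle. The function name `split_7_1_2` and the `seed` parameter are hypothetical and do not come from the dataset.

```python
import random


def split_7_1_2(items, seed=0):
    """Shuffle items and split them into train/validation/test at a 7:1:2 ratio.

    Hypothetical helper: the seller's actual split procedure is not documented.
    """
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)  # seeded so the split is reproducible

    n = len(shuffled)
    n_train = n * 7 // 10  # 70% for training
    n_val = n // 10        # 10% for validation; the remainder goes to test

    train = shuffled[:n_train]
    val = shuffled[n_train:n_train + n_val]
    test = shuffled[n_train + n_val:]
    return train, val, test


# Stand-in for the 50,000 labeled reviews: indices instead of review text.
train, val, test = split_7_1_2(range(50_000))
print(len(train), len(val), len(test))  # 35000 5000 10000
```

With 50,000 items this gives 35,000 training, 5,000 validation, and 10,000 test examples. A stratified split that keeps positive and negative reviews balanced in each part would be a reasonable alternative; whether the seller did that is not stated.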

IMDB Dataset Split
25.98MB