殿下万岁

IMDB Movie Ratings Sentiment Analysis

movies and tv shows, beginner, nlp, classification, binary classification

6

Sold: 0
50.28MB

Data ID: D17220457506112799

Published: 2024/07/27

The following is the data verification report that the seller chose to provide:

Data description

Description:

The dataset consists of tab-separated files with phrases from the Rotten Tomatoes dataset. The train/test split has been preserved for benchmarking, but the sentences have been shuffled from their original order. Each sentence has been parsed into many phrases by the Stanford parser. Each phrase has a PhraseId, and each sentence has a SentenceId. Phrases that are repeated (such as short or common words) are included only once in the data.

train.tsv contains the phrases and their associated sentiment labels. We have additionally provided a SentenceId so that you can track which phrases belong to a single sentence. test.tsv contains just phrases. You must assign a sentiment label to each phrase. The sentiment labels are:

  • 0 - negative
  • 1 - somewhat negative
  • 2 - neutral
  • 3 - somewhat positive
  • 4 - positive
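
The file layout described above can be read with Python's standard csv module. The following is a minimal sketch, assuming train.tsv has a header row with the columns PhraseId, SentenceId, Phrase, and Sentiment. The description names only the two IDs, so the Phrase and Sentiment column names are assumptions, and the inline sample rows are made up for illustration.

```python
import csv
import io
from collections import Counter, defaultdict

# Label meanings as listed in the description.
SENTIMENT_NAMES = {
    0: "negative",
    1: "somewhat negative",
    2: "neutral",
    3: "somewhat positive",
    4: "positive",
}


def read_phrases(handle):
    """Parse tab-separated rows into dicts with integer IDs and labels.

    Assumes a header row with PhraseId, SentenceId, Phrase, Sentiment
    (Phrase and Sentiment are assumed column names).
    """
    rows = []
    reader = csv.DictReader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        rows.append({
            "PhraseId": int(row["PhraseId"]),
            "SentenceId": int(row["SentenceId"]),
            "Phrase": row["Phrase"],
            "Sentiment": int(row["Sentiment"]),
        })
    return rows


def group_by_sentence(rows):
    """Map each SentenceId to the list of phrases parsed from it."""
    grouped = defaultdict(list)
    for row in rows:
        grouped[row["SentenceId"]].append(row["Phrase"])
    return dict(grouped)


# Made-up sample standing in for train.tsv. With the real file, pass
# open("train.tsv", newline="", encoding="utf-8") instead.
SAMPLE = (
    "PhraseId\tSentenceId\tPhrase\tSentiment\n"
    "1\t1\ta charming and often affecting journey\t4\n"
    "2\t1\ta charming\t3\n"
    "3\t2\tdull and lifeless\t0\n"
)

rows = read_phrases(io.StringIO(SAMPLE))
label_counts = Counter(SENTIMENT_NAMES[r["Sentiment"]] for r in rows)
phrases_by_sentence = group_by_sentence(rows)
```

Checking label_counts early is useful because a heavily skewed label distribution affects which evaluation metrics are meaningful.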

The dataset can be downloaded here: https://archive.ics.uci.edu/ml/datasets/spambase

Objective:

  • Understand the dataset and clean it up (if required).
  • Build classification models to predict the ratings of the movie.
  • Compare the evaluation metrics of various classification algorithms.
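
The last two objectives can be sketched in plain Python. The example below is only an illustration, not the listing's prescribed method: it compares a majority-class baseline with a small hand-written multinomial naive Bayes classifier (whitespace tokens, add-one smoothing) using accuracy and macro-averaged F1. The class names, helper functions, and toy phrases are all hypothetical and do not come from the dataset; in practice a library such as scikit-learn would usually replace the hand-written parts.

```python
import math
from collections import Counter, defaultdict


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 over the classes present in y_true."""
    scores = []
    for label in sorted(set(y_true)):
        pairs = list(zip(y_true, y_pred))
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


class MajorityClassifier:
    """Baseline: always predict the most frequent training label."""

    def fit(self, texts, labels):
        self.label_ = Counter(labels).most_common(1)[0][0]
        return self

    def predict(self, texts):
        return [self.label_] * len(texts)


class NaiveBayesClassifier:
    """Multinomial naive Bayes over whitespace tokens, add-one smoothing."""

    def fit(self, texts, labels):
        self.class_counts_ = Counter(labels)
        self.word_counts_ = defaultdict(Counter)
        for text, label in zip(texts, labels):
            self.word_counts_[label].update(text.lower().split())
        self.vocab_ = {w for c in self.word_counts_.values() for w in c}
        return self

    def _log_score(self, tokens, label):
        n_docs = sum(self.class_counts_.values())
        total = sum(self.word_counts_[label].values()) + len(self.vocab_)
        score = math.log(self.class_counts_[label] / n_docs)
        for tok in tokens:
            if tok in self.vocab_:  # ignore words never seen in training
                score += math.log((self.word_counts_[label][tok] + 1) / total)
        return score

    def predict(self, texts):
        preds = []
        for text in texts:
            tokens = text.lower().split()
            preds.append(max(self.class_counts_,
                             key=lambda c: self._log_score(tokens, c)))
        return preds


# Toy, made-up phrases with labels on the 0-4 scale described earlier.
train_texts = ["a wonderful moving film", "wonderful performances",
               "truly wonderful", "dull and lifeless", "a dull mess",
               "the story"]
train_labels = [4, 4, 4, 0, 0, 2]
test_texts = ["a wonderful film", "dull", "lifeless mess", "the story"]
test_labels = [4, 0, 0, 2]

results = {}
for name, model in [("majority", MajorityClassifier()),
                    ("naive_bayes", NaiveBayesClassifier())]:
    preds = model.fit(train_texts, train_labels).predict(test_texts)
    results[name] = {"accuracy": accuracy(test_labels, preds),
                     "macro_f1": macro_f1(test_labels, preds)}
```

Reporting macro F1 alongside accuracy matters here because a baseline that always predicts the most common label can score a deceptively reasonable accuracy while failing every other class.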