🍋柠檬🎨

yelp-reviews-for-sentianalysis-binary-np-csv

Binary ClassificatioUnited StatesRatings and ReviewsText

80

已售 0
161.73MB

数据标识:D17169524124017928

发布时间:2024/05/29

卖家暂未授权典枢平台对该文件进行数据验证,您可以向卖家

申请验证报告

数据描述

About Dataset

The Yelp reviews polarity dataset is constructed by considering stars 1 and 2 negative, and 3 and 4 positive. For each polarity 280,000 training samples and 19,000 testing samples are take randomly. In total there are 560,000 trainig samples and 38,000 testing samples. Negative polarity is class 1, and positive class 2.

The files train.csv and test.csv contain all the training samples as comma-sparated values. There are 2 columns in them, corresponding to class index (1 and 2) and review text. The review texts are escaped using double quotes ("), and any internal double quote is escaped by 2 double quotes (""). New lines are escaped by a backslash followed with an "n" character, that is "\n".

data icon
yelp-reviews-for-sentianalysis-binary-np-csv
80
已售 0
161.73MB
申请报告