🍋柠檬🎨

yelp-reviews-for-sentianalysis-binary-np-csv

Binary Classification, United States, Ratings and Reviews, Text

80

Sold: 0
161.73MB

Dataset ID: D17169524124017928

Published: 2024/05/29

About Dataset

The Yelp reviews polarity dataset is constructed by treating 1- and 2-star reviews as negative and 3- and 4-star reviews as positive. For each polarity, 280,000 training samples and 19,000 testing samples are drawn at random. In total there are 560,000 training samples and 38,000 testing samples. Negative polarity is class 1 and positive polarity is class 2.
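The labeling rule and the sample counts above can be sketched in a few lines of Python. This is only an illustration of the description, not the script used to build the dataset; the names `polarity_class`, `NEGATIVE`, `POSITIVE`, and `SAMPLES_PER_CLASS` are hypothetical, and returning `None` for star values the description does not mention is an assumption.

```python
from typing import Optional

# Class indices as stated in the dataset description.
NEGATIVE, POSITIVE = 1, 2


def polarity_class(stars: int) -> Optional[int]:
    """Map a star rating to a polarity class per the description.

    Assumption: star values the description does not mention
    are given no class (None).
    """
    if stars in (1, 2):
        return NEGATIVE
    if stars in (3, 4):
        return POSITIVE
    return None


# Per-class sample counts from the description; two classes per split.
SAMPLES_PER_CLASS = {"train": 280_000, "test": 19_000}
totals = {split: 2 * n for split, n in SAMPLES_PER_CLASS.items()}
print(totals)
```

The computed totals match the stated 560,000 training and 38,000 testing samples.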

The files train.csv and test.csv contain the training and testing samples, respectively, as comma-separated values. Each file has 2 columns: the class index (1 or 2) and the review text. The review text is enclosed in double quotes ("), and any internal double quote is escaped by doubling it (""). Newlines are escaped as a backslash followed by an "n" character, that is, "\n".
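A minimal sketch of reading this format with Python's standard `csv` module follows. The function name `read_reviews` and the inline sample rows are hypothetical, and the UTF-8 encoding in the commented file example is an assumption. The simple string replacement also assumes that a backslash followed by "n" always denotes an escaped newline, which the description implies but does not guarantee.

```python
import csv
import io
from typing import Iterable, Iterator, Tuple


def read_reviews(lines: Iterable[str]) -> Iterator[Tuple[int, str]]:
    """Yield (class_index, review_text) pairs from CSV lines."""
    # doublequote=True makes the reader turn "" inside a quoted field into ".
    reader = csv.reader(lines, quotechar='"', doublequote=True)
    for row in reader:
        if len(row) != 2:
            continue  # skip rows that do not have the two expected columns
        label, text = row
        # Undo the dataset's newline escaping: backslash + "n" -> real newline.
        yield int(label), text.replace("\\n", "\n")


# Made-up sample rows in the described format (not taken from the dataset).
sample = '1,"Bad ""service"".\\nNever again."\n2,"Great food!"\n'
rows = list(read_reviews(io.StringIO(sample, newline="")))
print(rows)

# For the real files (encoding is an assumption):
# with open("train.csv", newline="", encoding="utf-8") as f:
#     for label, text in read_reviews(f):
#         ...
```

Passing `newline=""` when opening the file leaves line-ending handling to the `csv` module, as its documentation recommends.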


