
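The seller has not yet authorized the 典枢 platform to run data validation on this file; you can request a validation report from the seller.
Data Description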
About Dataset
The Yelp reviews full star dataset is constructed by randomly taking 130,000 training samples and 10,000 testing samples for each review star from 1 to 5. In total there are 650,000 training samples and 50,000 testing samples.
The files train.csv and test.csv contain the training and testing samples, respectively, as comma-separated values. Each file has 2 columns: the class index (1 to 5) and the review text. The review texts are enclosed in double quotes ("), and any internal double quote is escaped by 2 double quotes (""). New lines are escaped by a backslash followed by an "n" character, that is "\n".
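The escaping rules above can be handled with Python's standard csv module. The following is a minimal sketch, not an official loader: the function name read_reviews and the two sample rows are made up for illustration, and it assumes that every literal backslash-n pair in a review stands for a line break.

```python
import csv
import io


def read_reviews(handle):
    """Yield (class_index, review_text) pairs from an open CSV handle.

    Hypothetical helper: the name and signature are illustrative only.
    """
    # doublequote=True makes the reader turn "" inside a quoted field into ".
    reader = csv.reader(handle, quotechar='"', doublequote=True)
    for row in reader:
        class_index = int(row[0])
        # Restore line breaks that were stored as a backslash followed by "n".
        review_text = row[1].replace("\\n", "\n")
        yield class_index, review_text


# Made-up rows in the described format, used here instead of the real files.
sample = io.StringIO(
    '5,"Great food.\\nThe chef said ""welcome"" to us."\n'
    '1,"Never again."\n'
)
reviews = list(read_reviews(sample))
```

To read the real files, pass an opened file instead of the in-memory sample, for example open("train.csv", newline="", encoding="utf-8"); the encoding is an assumption and may need adjusting.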

yelp-reviews-for-sa-finegrained-5-classes-csv
190.72MB