以下为卖家选择提供的数据验证报告:
数据描述
Preporcessed Toxic Comments Classification Dataset
The obstacle I faced in Toxic Comments Classification Challenge was the preprocessing part. One can easily improve their LB performance if the preprocessing is done right.
This is the preprocessed version of Toxic Comments Classification Challenge dataset. The code for preprocessing: https://www.kaggle.com/fizzbuzz/toxic-data-preprocessing

Cleaned Toxic Comments
43.68MB
申请报告