以下为卖家选择提供的数据验证报告:
数据描述
this data has been pre-processed and ready to be used in your model immediately what have been in the data is
- Removing Duplicated
- All HTML Tags Removed
- Removed Emails , URLS , Special characters and white Spaces
- digits
- all data in Low case
- Removed Stop words
- Tag the words with their part-of-speech And Lemmatize the words using their POS tags
- Labels converted To binary 0 for Negative 1 For positive
- Tokenization and padding

IMDB Reviews dataset processed 50 K
38.4MB
申请报告