雨蝶

verify-tagGoEmotions

exploratory data analysisnlptext miningstatistical analysistext

1

已售 0
17.6MB

数据标识:D17222352756155147

发布时间:2024/07/29

以下为卖家选择提供的数据验证报告:

数据描述

Context

GoEmotions is a corpus of 58k carefully curated comments extracted from Reddit, with human annotations to 27 emotion categories or Neutral.

Content

Number of examples: 58,009. Number of labels: 27 + Neutral. Maximum sequence length in training and evaluation datasets: 30. On top of the raw data, we also include a version filtered based on reter-agreement, which contains a train/test/validation split:

Size of training dataset: 43,410. Size of test dataset: 5,427. Size of validation dataset: 5,426. The emotion categories are: admiration, amusement, anger, annoyance, approval, caring, confusion, curiosity, desire, disappointment, disapproval, disgust, embarrassment, excitement, fear, gratitude, grief, joy, love, nervousness, optimism, pride, realization, relief, remorse, sadness, surprise.

For more details on the design and content of the dataset, please see our paper. .

Acknowledgements

Inspiration

Multi Classification of emotions

data icon
GoEmotions
1
已售 0
17.6MB
申请报告