老下头

Ten Million Reddit Answers

popular culture, nlp, text, online communities, social networks

19

Sold: 0
857.55MB

Data ID: D17175186446293787

Published: 2024/06/05

The following is the data verification report the seller chose to provide:

Data description

Context

The spiritual successor to our One Million Reddit Questions, this dataset presents ten million question-answer pairs, labelled by score and pre-analyzed sentiment.

Content

This dataset contains ten million comments on /r/AskReddit and their associated parent posts, procured using SocialGrep. Both the posts and the comments are labelled with their creation date and score.
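As a rough illustration of how comments and their parent posts might be combined into question-answer pairs, here is a minimal Python sketch. The field names (`post_id`, `title`, `body`, `score`) and the toy rows are assumptions made for this example; they are not the dataset's documented schema, so check the actual column names before adapting it.

```python
# Toy rows for illustration only. The field names (post_id, title, body, score)
# are assumptions, not the dataset's documented schema.
posts = [
    {"post_id": "p1", "title": "What is your favorite book?", "score": 120},
    {"post_id": "p2", "title": "What skill is worth learning?", "score": 45},
]
comments = [
    {"post_id": "p1", "body": "Dune.", "score": 30},
    {"post_id": "p1", "body": "No idea.", "score": -2},
    {"post_id": "p2", "body": "Cooking.", "score": 12},
]


def build_qa_pairs(posts, comments, min_score=1):
    """Join each comment to its parent post and keep answers scoring at least min_score."""
    # Index post titles by id so each comment lookup is a dictionary access.
    titles = {p["post_id"]: p["title"] for p in posts}
    pairs = []
    for c in comments:
        question = titles.get(c["post_id"])
        # Skip comments whose parent post is missing or whose score is too low.
        if question is not None and c["score"] >= min_score:
            pairs.append(
                {"question": question, "answer": c["body"], "score": c["score"]}
            )
    return pairs


pairs = build_qa_pairs(posts, comments)
```

Filtering on the score label is one simple way to favor answers the community rated well; the threshold is arbitrary and would need tuning on the real data.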

Acknowledgements

We would like to thank the Kaggle community. Without you, this dataset would not be here.

Inspiration

This dataset presents a novel corpus, ripe for training question-answering language models and much more. What can you do with it, reader? The sky is the limit.
