The following is the data validation report that the seller has chosen to provide:
Data description
[About r/Jokes] The funniest sub on Reddit. Hundreds of jokes are posted each day, and some of them aren't even reposts! You can use threadscore, upvotes, and comments to quantify how funny (or unfunny) the jokes are.
[Acknowledgements] Thank you to Anaconda Jupyter, Python, Microsoft Azure, and PRAW for libraries, services, and programming tools.
[Inspiration]
- Practice and hone your NLP skills on the dataset: create word clouds, run sentiment analysis, try topic modelling, and the like
- Quantify and predict how funny a joke is by using threadscore as the label/target variable
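
As a rough starting point for the two ideas above, here is a minimal sketch using only the Python standard library. It counts word frequencies (the input a word cloud needs) and averages threadscore over the jokes in which each word appears, as a crude signal of funniness. The sample rows, the `joke` field name, the stopword list, and the helper functions are all hypothetical; this listing does not describe the dataset's actual file layout or column names, so adjust them to match the real data.

```python
import re
from collections import Counter

# Hypothetical sample rows: the field names "joke" and "threadscore" are
# assumptions, not the dataset's confirmed column names.
SAMPLE_ROWS = [
    {"joke": "Why did the chicken cross the road? To get to the other side.", "threadscore": 12},
    {"joke": "I told my computer a joke. It did not get it.", "threadscore": 3},
    {"joke": "The chicken joke is older than the road itself.", "threadscore": 7},
]

# Tiny illustrative stopword list; a real analysis would use a fuller one.
STOPWORDS = {"the", "a", "to", "it", "is", "i", "my", "did", "not", "than", "why"}


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def word_frequencies(rows, stopwords=STOPWORDS):
    """Count non-stopword tokens across all jokes."""
    counts = Counter()
    for row in rows:
        counts.update(t for t in tokenize(row["joke"]) if t not in stopwords)
    return counts


def mean_score_by_word(rows, stopwords=STOPWORDS):
    """Average threadscore of the jokes in which each word appears."""
    totals, seen = Counter(), Counter()
    for row in rows:
        # Use a set so a word repeated within one joke is counted once.
        for token in set(tokenize(row["joke"])) - stopwords:
            totals[token] += row["threadscore"]
            seen[token] += 1
    return {word: totals[word] / seen[word] for word in totals}


if __name__ == "__main__":
    print(word_frequencies(SAMPLE_ROWS).most_common(3))
    print(mean_score_by_word(SAMPLE_ROWS)["chicken"])
```

The per-word averages are only a baseline for exploration; a real predictor would likely need proper text features and a held-out evaluation set.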
[Warning] Many of the jokes contain strong language and adult humor.

Reddit r/Jokes Dataset
83.65MB