The following is the data validation report the seller chose to provide:
Data description
RCGroups.com is a forum for model aircraft, trucks, boats, and general hobbyists. In 2011, a poll and a thread about climate change were started there. The thread has continued for over 10 years and includes over 60,000 unique posts about climate science denial and the associated arguments, from both sides of the debate. The scraped data includes authors, dates, and post content. In total, almost 500 unique users participated by posting replies in this thread, and almost 1,000 voted in the poll.
We also have the results of the poll, so we are able to include a file that lists users and their viewpoints as an AGW (anthropogenic global warming) denier, believer, or "unsure". This is great for classifying the users!
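As a rough illustration of how the poll file could be used for classification, the sketch below attaches each author's poll viewpoint to their posts. The field names (`author`, `date`, `content`), the sample rows, and the `label_posts` helper are assumptions made for this example, not the actual layout of the files. Posters with no entry in the poll file get a placeholder label here, on the assumption that not every poster necessarily voted.

```python
from collections import Counter

# Hypothetical sample rows; the real column names and values may differ.
posts = [
    {"author": "user_a", "date": "2011-06-01", "content": "First post."},
    {"author": "user_b", "date": "2011-06-02", "content": "A reply."},
    {"author": "user_c", "date": "2011-06-03", "content": "Another reply."},
    {"author": "user_a", "date": "2011-06-04", "content": "A follow-up."},
]

# Hypothetical poll results: user name -> viewpoint.
poll = {"user_a": "believer", "user_b": "denier"}


def label_posts(posts, poll, default="unknown"):
    """Return copies of the posts with a 'viewpoint' field taken from the poll."""
    return [{**post, "viewpoint": poll.get(post["author"], default)} for post in posts]


labeled = label_posts(posts, poll)

# Count posts per viewpoint to see how the thread volume is distributed.
counts = Counter(post["viewpoint"] for post in labeled)
```

The original `posts` list is left unmodified, so the labeling step can be rerun with a different poll mapping or default label.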
Link: [ https://www.rcgroups.com/forums/showthread.php?1452521 ] Archive: [ https://archive.ph/IuTMp ]
See the code section for a notebook that uses stopwords to clean up the text column and start sentiment analysis. The code section also includes a sample PostgreSQL database setup with sample queries, and a script that creates two new columns describing the sentiment of each post.
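For readers who want a feel for the stopword cleanup step before opening the notebook, here is a minimal sketch using only the Python standard library. It is not the notebook's code: the tiny `STOPWORDS` set and the `clean_text` helper are assumptions chosen for illustration, and a real analysis would likely use a fuller stopword list.

```python
import re

# A deliberately small, hypothetical stopword list for illustration only.
STOPWORDS = {"the", "a", "an", "is", "and", "of", "to", "in", "it", "that"}


def clean_text(text, stopwords=STOPWORDS):
    """Lowercase the text, keep alphabetic tokens, and drop stopwords."""
    tokens = re.findall(r"[a-z']+", text.lower())
    return " ".join(token for token in tokens if token not in stopwords)


cleaned = clean_text("The climate is changing, and it is warming.")
```

The cleaned string can then be passed to whichever sentiment tool is preferred.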
Note: the thread has continued since this data was scraped. Posts newer than April 2022 are not in this dataset.
