以下为卖家选择提供的数据验证报告:
数据描述
This is a dataset containing 60,000 Stack Overflow questions from 2016-2020. Questions are classified into three categories:
- HQ: High-quality posts without a single edit.
- LQ_EDIT: Low-quality posts with a negative score, and multiple community edits. However, they still remain open after those changes.
- LQ_CLOSE: Low-quality posts that were closed by the community without a single edit.
How to cite
Annamoradnejad, I., Habibi, J., & Fazli, M. (2022). Multi-view approach to suggest moderation actions in community question answering sites. Information Sciences, 600, 144-154.
@article{annamoradnejad2022multiview, title={Multi-View Approach to Suggest Moderation Actions in Community Question Answering Sites}, author={Annamoradnejad, Issa and Habibi, Jafar and Fazli, Mohammadamin}, journal = {Information Sciences}, volume = {600}, pages = {144-154}, year = {2022}, issn = {0020-0255}, doi = {https://doi.org/10.1016/j.ins.2022.03.085}, url = {https://www.sciencedirect.com/science/article/pii/S0020025522003127} }
Notes:
- Questions are sorted according to Question Id.
- Question body is in HTML format.
- All dates are in UTC format.
Source:
https://github.com/Moradnejad/StackOverflow-Questions-Quality-Dataset

60k Stack Overflow Questions with Quality Rating
21.04MB
申请报告