麻酱

verify-tag60k Stack Overflow Questions with Quality Rating

musicnlptext miningtext

22

已售 0
21.04MB

数据标识:D17171349182117466

发布时间:2024/05/31

以下为卖家选择提供的数据验证报告:

数据描述

This is a dataset containing 60,000 Stack Overflow questions from 2016-2020. Questions are classified into three categories:

  1. HQ: High-quality posts without a single edit.
  2. LQ_EDIT: Low-quality posts with a negative score, and multiple community edits. However, they still remain open after those changes.
  3. LQ_CLOSE: Low-quality posts that were closed by the community without a single edit.

How to cite

Annamoradnejad, I., Habibi, J., & Fazli, M. (2022). Multi-view approach to suggest moderation actions in community question answering sites. Information Sciences, 600, 144-154. 
@article{annamoradnejad2022multiview,   title={Multi-View Approach to Suggest Moderation Actions in Community Question Answering Sites},   author={Annamoradnejad, Issa and Habibi, Jafar and Fazli, Mohammadamin},   journal = {Information Sciences},   volume = {600},   pages = {144-154},   year = {2022},   issn = {0020-0255},   doi = {https://doi.org/10.1016/j.ins.2022.03.085},   url = {https://www.sciencedirect.com/science/article/pii/S0020025522003127} } 

Notes:

  • Questions are sorted according to Question Id.
  • Question body is in HTML format.
  • All dates are in UTC format.

Source:

https://github.com/Moradnejad/StackOverflow-Questions-Quality-Dataset

data icon
60k Stack Overflow Questions with Quality Rating
22
已售 0
21.04MB
申请报告