verify-tagNewsgroups (Text Classification)

classification

2

已售 0
41.64MB

数据标识:D17222543110632641

发布时间:2024/07/29

以下为卖家选择提供的数据验证报告:

数据描述

Newsgroups (Text Classification)

Comprehensive Collection of Text Classification Datasets


Source

Huggingface Hub: link

About this dataset

> The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across 20 different newsgroups. To the best of my knowledge, it was originally collected by Ken Lang, probably for his Newsweeder: Learning to filter netnews paper, though he does not explicitly mention this collection. The 20 newsgroups collection has become a popular data set for experiments in text applications of machine learning techniques, such as text classification and text clustering.

does not include cross-posts and includes only the "From" and "Subject" headers.

Research Ideas

> - Text classification > - Text clustering > - Sentiment analysis

Acknowledgements

> > > ### License > > > > License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication > > No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.

Columns

File: bydate_sci.electronics_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: 18828_sci.med_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: 19997_sci.med_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_comp.sys.ibm.pc.hardware_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_talk.politics.guns_test.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_comp.windows.x_test.csv

Column name Description
text The text of the newsgroup document. (string)

File: 18828_comp.graphics_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_rec.sport.hockey_test.csv

Column name Description
text The text of the newsgroup document. (string)

File: 18828_rec.autos_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_comp.graphics_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: 19997_rec.motorcycles_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: 18828_comp.windows.x_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: 19997_alt.atheism_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_rec.sport.baseball_test.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_comp.sys.mac.hardware_test.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_soc.religion.christian_test.csv

Column name Description
text The text of the newsgroup document. (string)

File: 19997_comp.sys.mac.hardware_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_rec.motorcycles_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_sci.space_test.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_talk.politics.misc_test.csv

Column name Description
text The text of the newsgroup document. (string)

File: 18828_comp.os.ms-windows.misc_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_soc.religion.christian_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: 19997_comp.sys.ibm.pc.hardware_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: 19997_misc.forsale_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: 19997_talk.politics.mideast_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: 19997_sci.space_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_rec.motorcycles_test.csv

Column name Description
text The text of the newsgroup document. (string)

File: 19997_talk.politics.misc_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: 18828_talk.politics.mideast_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: 18828_talk.politics.guns_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: 18828_sci.electronics_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: 18828_talk.religion.misc_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: 18828_comp.sys.ibm.pc.hardware_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: 18828_alt.atheism_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_talk.politics.mideast_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: 19997_soc.religion.christian_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_comp.sys.mac.hardware_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: 19997_sci.crypt_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_sci.crypt_test.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_misc.forsale_test.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_sci.electronics_test.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_rec.sport.hockey_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: 18828_talk.politics.misc_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_comp.os.ms-windows.misc_test.csv

Column name Description
text The text of the newsgroup document. (string)

File: 18828_rec.sport.hockey_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_talk.religion.misc_test.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_sci.crypt_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: 19997_sci.electronics_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_rec.autos_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: 18828_rec.sport.baseball_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: 18828_sci.crypt_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_talk.religion.misc_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_misc.forsale_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_alt.atheism_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: 19997_rec.sport.baseball_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_sci.med_test.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_rec.sport.baseball_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: 18828_comp.sys.mac.hardware_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_rec.autos_test.csv

Column name Description
text The text of the newsgroup document. (string)

File: 19997_comp.os.ms-windows.misc_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_comp.os.ms-windows.misc_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: 18828_misc.forsale_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_talk.politics.misc_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: 18828_soc.religion.christian_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: 19997_comp.windows.x_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_comp.graphics_test.csv

Column name Description
text The text of the newsgroup document. (string)

File: 19997_rec.sport.hockey_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: 19997_talk.religion.misc_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_comp.windows.x_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: 18828_sci.space_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_talk.politics.mideast_test.csv

Column name Description
text The text of the newsgroup document. (string)

File: 19997_rec.autos_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: 19997_comp.graphics_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: 19997_talk.politics.guns_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_comp.sys.ibm.pc.hardware_test.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_alt.atheism_test.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_talk.politics.guns_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: 18828_rec.motorcycles_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_sci.space_train.csv

Column name Description
text The text of the newsgroup document. (string)

File: bydate_sci.med_train.csv

Column name Description
text The text of the newsgroup document. (string)
data icon
Newsgroups (Text Classification)
2
已售 0
41.64MB
申请报告