Down Shift

AG News (News articles)

business, nlp, classification, news

14

Sold: 0
11.28MB

Data ID: D17171513875707900

Published: 2024/05/31

The following is the data validation report that the seller has chosen to provide:

Data description

AG News (News articles)

News Articles Text Classification


Source

Hugging Face Hub: link

About this dataset

> The ag_news dataset is a benchmark for text classification research. It consists of a training set of 120,000 examples and a test set of 7,600 examples, split evenly across four topic classes: World, Sports, Business, and Sci/Tech. This balance makes the dataset well suited to research into text classification methods.

How to use the dataset

> If you're looking to do text classification research, the ag_news dataset is a good place to start. It consists of a training set of 120,000 examples and a test set of 7,600 examples, split evenly across four topic labels (World, Sports, Business, Sci/Tech). The data is well balanced and should suit many text classification experiments.
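As a first step, it can help to load a split and confirm how the labels are distributed. The following is a minimal sketch using only the Python standard library. It assumes the files are named `train.csv` and `test.csv` as listed under Columns, that each has a header row with `text` and `label`, and that they are UTF-8 encoded. The helper names `load_split` and `label_counts` are illustrative, not part of the dataset.

```python
import csv
from collections import Counter
from pathlib import Path


def load_split(path):
    """Read one split into parallel lists of texts and integer labels."""
    texts, labels = [], []
    # newline="" lets the csv module handle quoted fields that contain line breaks.
    with open(path, newline="", encoding="utf-8") as f:
        for row in csv.DictReader(f):
            texts.append(row["text"])
            labels.append(int(row["label"]))
    return texts, labels


def label_counts(labels):
    """Count how many examples carry each label."""
    return Counter(labels)


if __name__ == "__main__":
    # Assumed file names; the loop skips any file that is not present.
    for name in ("train.csv", "test.csv"):
        if Path(name).exists():
            texts, labels = load_split(name)
            print(name, len(texts), dict(sorted(label_counts(labels).items())))
```

If the counts per label differ widely within a split, the file may be a subset or a re-export rather than the original split.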

Research Ideas

> - This dataset can be used to train a text classifier that automatically assigns news articles to topic categories.
> - This dataset can be used to develop a system that identifies the topic of incoming news articles.
> - This dataset can be used to study how the language of news reporting differs across topics.
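The first idea, training a classifier on the `text` and `label` columns, can be sketched with a small baseline. The block below is a dependency-free multinomial Naive Bayes with add-one smoothing. It is an illustrative baseline under simple assumptions (lowercased alphanumeric tokens, no stop-word handling), not a method recommended by the dataset authors. The names `tokenize`, `NaiveBayes`, and `accuracy` are hypothetical.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


class NaiveBayes:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)       # documents per class
        self.word_counts = defaultdict(Counter)   # token counts per class
        for text, label in zip(texts, labels):
            self.word_counts[label].update(tokenize(text))
        self.vocab = {w for counts in self.word_counts.values() for w in counts}
        self.total_words = {
            label: sum(counts.values()) for label, counts in self.word_counts.items()
        }
        return self

    def predict(self, text):
        n_docs = sum(self.class_counts.values())
        vocab_size = len(self.vocab)
        best_label, best_score = None, -math.inf
        for label, n in self.class_counts.items():
            # Log prior plus the sum of smoothed log likelihoods.
            score = math.log(n / n_docs)
            denom = self.total_words[label] + vocab_size
            for w in tokenize(text):
                if w in self.vocab:  # tokens unseen in training are ignored
                    score += math.log((self.word_counts[label][w] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


def accuracy(model, texts, labels):
    """Fraction of examples whose predicted label matches the given label."""
    hits = sum(model.predict(t) == y for t, y in zip(texts, labels))
    return hits / len(labels)
```

In use, the model would be fitted on the texts and labels from `train.csv` and scored with `accuracy` on those from `test.csv`. A TF-IDF plus linear model pipeline from a library such as scikit-learn is a common stronger baseline.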

Acknowledgements

> AG is a collection of more than 1 million news articles. The articles have been gathered from more than 2000 news sources by ComeToMyHead over more than one year of activity. ComeToMyHead is an academic news search engine that has been running since July 2004. The dataset is provided by the academic community for research purposes in data mining (clustering, classification, etc.), information retrieval (ranking, search, etc.), XML, data compression, data streaming, and any other non-commercial activity. For more information, please refer to http://www.di.unipi.it/~gulli/AG_corpus_of_news_articles.html.

License

> License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
>
> No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.

Columns

File: train.csv

| Column name | Description |
| --- | --- |
| text | The text of the news article. (string) |
| label | The label of the news article. (integer) |

File: test.csv

| Column name | Description |
| --- | --- |
| text | The text of the news article. (string) |
| label | The label of the news article. (integer) |
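Because both files share the same two-column layout, a quick schema check can catch a malformed download before any training run. The sketch below assumes a header row containing exactly `text` and `label`. The function name `check_schema` and its messages are illustrative.

```python
import csv
import io

EXPECTED_COLUMNS = {"text", "label"}


def check_schema(f):
    """Return a list of problems found in an open CSV file (empty if none)."""
    reader = csv.DictReader(f)
    problems = []
    # Stop early if the header does not match the documented columns.
    if set(reader.fieldnames or []) != EXPECTED_COLUMNS:
        problems.append(f"unexpected header: {reader.fieldnames}")
        return problems
    for record_no, row in enumerate(reader, start=1):
        if not (row["text"] or "").strip():
            problems.append(f"record {record_no}: empty text")
        try:
            int(row["label"])
        except (TypeError, ValueError):
            problems.append(
                f"record {record_no}: label {row['label']!r} is not an integer"
            )
    return problems


if __name__ == "__main__":
    # In-memory sample so the sketch runs without the real files.
    sample = io.StringIO('text,label\n"Wall St. rallies",2\n')
    print(check_schema(sample))
```

For the real files, the same function can be called on a handle opened with `open("train.csv", newline="", encoding="utf-8")`.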