淡然若水

verify-tagIrish Times - Waxy-Wany News

europelinguisticsnlptextnews

6

已售 0
51.76MB

数据标识:D17222386811731825

发布时间:2024/07/29

以下为卖家选择提供的数据验证报告:

数据描述

Context

This news dataset is a composition of 1.61 million headlines posted by the Irish Times that cover a quarter of a century.

Created over 160 years ago, the agency can provides long term birds eye view of the happenings in Europe. The major categories include business, sport, culture, lifestyle and opinion in addition to news.

See these reports for date distribution and missing dates.

Content

CSV Records: 1;611;495

  • 1 publish_date: Date of the article being published in yyyyMMdd format
  • 2 headline_category: Category of the headline, Ascii, dot delimited, lowercase values
  • 3 headline_text: Title of the article in English in UTF-8 charset

Start Date: 1996-01-01 ; End Date: 2021-06-30 ;

A separate bonus dataset containing fifteen months of observational data from Nigeria is included.

Inspiration

Special Thanks to the journalists who were involved in the creation of this dataset.

Great care has been taken to conserve these headlines exactly in the order in which they were published by the agency.

Minimal cleanup and processing was required for this dataset due to generally optimal categories, a clean site layout and formatting.

data icon
Irish Times - Waxy-Wany News
6
已售 0
51.76MB
申请报告