以下为卖家选择提供的数据验证报告:
数据描述
Context
This news dataset is a composition of 1.61 million headlines posted by the Irish Times that cover a quarter of a century.
Created over 160 years ago, the agency can provides long term birds eye view of the happenings in Europe. The major categories include business, sport, culture, lifestyle and opinion in addition to news.
See these reports for date distribution and missing dates.
Content
CSV Records: 1;611;495
- 1 publish_date: Date of the article being published in yyyyMMdd format
- 2 headline_category: Category of the headline, Ascii, dot delimited, lowercase values
- 3 headline_text: Title of the article in English in UTF-8 charset
Start Date: 1996-01-01 ; End Date: 2021-06-30 ;
A separate bonus dataset containing fifteen months of observational data from Nigeria is included.
Inspiration
Special Thanks to the journalists who were involved in the creation of this dataset.
Great care has been taken to conserve these headlines exactly in the order in which they were published by the agency.
Minimal cleanup and processing was required for this dataset due to generally optimal categories, a clean site layout and formatting.
