困困

verify-tagA Million News Headlines

australialinguisticsnlptextnews

1

已售 0
60.85MB

数据标识:D17220495897868779

发布时间:2024/07/27

以下为卖家选择提供的数据验证报告:

数据描述

Context

This contains data of news headlines published over a period of nineteen years.

Sourced from the reputable Australian news source ABC (Australian Broadcasting Corporation)

Agency Site: (http://www.abc.net.au)

Content

Format: CSV ; Single File

  1. publish_date: Date of publishing for the article in yyyyMMdd format
  2. headline_text: Text of the headline in Ascii , English , lowercase

Start Date: 2003-02-19 ; End Date: 2021-12-31

Inspiration

I look at this news dataset as a summarised historical record of noteworthy events in the globe from early-2003 to end-2021 with a more granular focus on Australia.

This includes the entire corpus of articles published by the abcnews website in the given date range. With a volume of two hundred articles per day and a good focus on international news, we can be fairly certain that every event of significance has been captured here.

Digging into the keywords, one can see all the important episodes shaping the last decade and how they evolved over time. Ex: afghanistan war, financial crisis, multiple elections, ecological disasters, terrorism, famous people, criminal activity et cetera.

Similar Work

Similar news datasets exploring other attributes, countries and topics can be seen on my profile.

Most kernals can be reused with minimal changes across these news datasets.

Prepared by Rohit Kulkarni

data icon
A Million News Headlines
1
已售 0
60.85MB
申请报告