verify-tagDelpher Dutch Newspaper Archive (1618-1699)

languageseuropefinanceeconomicslinguisticsnews

3

已售 0
66.63MB

数据标识:D17220864714172188

发布时间:2024/07/27

以下为卖家选择提供的数据验证报告:

数据描述

Context:

"Tulip mania, tulipmania, or tulipomania (Dutch names include: tulpenmanie, tulpomanie, tulpenwoede, tulpengekte and bollengekte) was a period in the Dutch Golden Age during which contract prices for bulbs of the recently introduced tulip reached extraordinarily high levels and then dramatically collapsed in February 1637. It is generally considered the first recorded speculative bubble (or economic bubble)." -- From Wikipedia, CC BY-SA

Market forecasting is difficult. There are many factors that may affect the market, and a high degree of uncertainty. One thing that some researchers have been investigating is whether natural language processing (NLP) of news texts can help with market forecasting. Recent publications suggest that it can be.

This dataset an interesting test case for these methodologies. It contains Dutch-language newspapers from the years immediately preceding and following tulip mania. Can you use NLP techniques to model the tulip market over time?

Content:

This dataset contains the texts of 8,559 newspaper deliveries from the 17th century, from June 14th, 1618 to December 31, 1699. The text is in Dutch. Since the text was scraped from old newspapers using OCR (optical character recognition), there are some errors in the text.

Acknowledgments:

This dataset was compiled by Delpher, an archive service provided by the National Library of the Netherlands. It is provided under a CC-BY 4.0 license. For more information, and newspapers from other years, please visit their website (in Dutch). If you use this dataset in your work, please include this citation:

Delpher open newspaper archive (1.0). Creative Commons Attribution 4.0 , The Hague, 2017 .

data icon
Delpher Dutch Newspaper Archive (1618-1699)
3
已售 0
66.63MB
申请报告