以下为卖家选择提供的数据验证报告:
数据描述
Description
Tweets were collectect between April 9 and July 16, 2020 using not only the SPX500 tag but also the top 25 companies in the index and "#stocks". 1300 tweets were manually classified and reviewed. The proposed specialised dictionary is also present in the data of this contribution. All the source code used to download tweets, check the top words and evaluate the sentiment are present.
Content
- ID -> Contains id used for the tweet. - Date and time -> Date and time when the tweet was tweeted. - Tweet -> Tweet/text written by the user. - Sentiment -> Wheter the tweet was postive or negative.
Inspiration
Currently exploring nlp and learning more about it, found this very easy to use dataset for the topic and now sharing with other fellow kaggelers.
Notebook and a task is missing for the dataset hence it shows as 9.4 usability.
Acknowledgements
Cite - Bruno Taborda, Ana de Almeida, José Carlos Dias, Fernando Batista, Ricardo Ribeiro, April 15, 2021, "Stock Market Tweets Data", IEEE Dataport, doi: https://dx.doi.org/10.21227/g8vy-5w61.
I do not own this dataset, thanks to the authors for creating this dataset. Please cite if using the dataset.
