[2006.09723] A Tweet-based Dataset for Company-Level Stock Return Predictionopen searchopen navigation menucontact arXivsubscribe to arXiv mailings

Public opinion influences events, especially related to stock market movement, in which a subtle hint can influence the local outcome of the market. In this paper, we present a dataset that allows for company-level analysis of tweet based impact on one-, two-, three-, and seven-day stock returns. Our dataset consists of 862, 231 labelled instances from twitter in English, we also release a cleaned subset of 85, 176 labelled instances to the community. We also provide baselines using standard machine learning algorithms and a multi-view learning based approach that makes use of different types of features. Our dataset, scripts and models are publicly available at: https://github.com/ImperialNLP/stockreturnpred.

1 mentions: @upura0
Keywords: dataset
Date: 2020/06/27 21:51

Related Entries

Read more Kaggle参加報告: Quora Insincere Questions Classification
0 users, 1 mentions 2019/12/14 00:51
Read more 「NLPコンペの知見を実務に活かすために」の題目で発表しました - u++の備忘録
0 users, 1 mentions 2020/02/28 14:58
Read more [1912.11762] The Application of Machine Learning Techniques for Predicting Results in Team Sport: A ...
0 users, 1 mentions 2020/03/29 08:21
Read more Weekly Kaggle News #18 | Revue
0 users, 1 mentions 2020/04/17 06:52
Read more [2005.04518] What Was Written vs. Who Read It: News Media Profiling Using Text Analysis and Social M...
0 users, 1 mentions 2020/05/18 20:21
Read more Google Cloud & NCAA® March Madness Analytics | Kaggle
0 users, 1 mentions 2020/05/23 06:52