Small description
This dataset contains the average sentiment of all tweets about bitcoin from 01/08/2017 until 21/01/2019. It also contains the finantial data of bitcoin for that same period.
How did I gathered the tweets?
To collect all tweets I used this github. Some days have missing data but I think it is minimal. I collected over 17.7 million tweets
How did I do sentiment analysis?
I used the library VaderSentiment. I added about 30 expresions and words to the dictionary. To score the expresions I used the same methodology as the authors described in their paper.
What do I want to accomplish?
I would like to create a predictive model for bitcoin's price using only twitter's data. Check the thesis paper here