MR
mraduldubey/bostonbombing
Getting started with Twitter data analysis.
twitter-data-analysis
The tweet dataset containing tweets about Boston Marathon Bombing, is cleaned, vectorized, extracted features upon, clustered using Kmeans and represented on
a Scatter Graph by Multi Dimensional Scaling (MDS) of distance b/w tweet-vectors calculated using Cosine Distance. Check the Jupyter Nottebook: "TweetClustering.ipynb".
Getting Started
These instructions will get you a copy of the project up and running on your local machine for development and testing purposes.
- Extract the downloaded the repository.
- Run "jupyter notebook" in the terminal.
- Open "TweetClustering.ipynb".
KMeans (Clusters=25)
Prerequisites
- Jupyter notebook
- NLTK
- sKlearn
- matplotlib
pip install matplotlib
Deployment
Use conda or pip to install the requirements.
Authors
- Mradul Dubey - Twitter Data Analysis - MradulDubey
License
This project is licensed under the GNU GENERAL PUBLIC LICENSE License - see the LICENSE.md file for details.
Acknowledgments
On this page
Languages
Jupyter Notebook99.9%Python0.1%
Contributors
GNU General Public License v3.0
Created April 11, 2017
Updated May 27, 2017
