GitHunt
MR

mraduldubey/bostonbombing

Getting started with Twitter data analysis.

twitter-data-analysis

The tweet dataset containing tweets about Boston Marathon Bombing, is cleaned, vectorized, extracted features upon, clustered using Kmeans and represented on
a Scatter Graph by Multi Dimensional Scaling (MDS) of distance b/w tweet-vectors calculated using Cosine Distance. Check the Jupyter Nottebook: "TweetClustering.ipynb".

Getting Started

These instructions will get you a copy of the project up and running on your local machine for development and testing purposes.

  • Extract the downloaded the repository.
  • Run "jupyter notebook" in the terminal.
  • Open "TweetClustering.ipynb".

KMeans (Clusters=25)

KMeans REsult

Prerequisites

Languages

Jupyter Notebook99.9%Python0.1%

Contributors

GNU General Public License v3.0
Created April 11, 2017
Updated May 27, 2017