Microsoft Research have released over 50 free data sets via their Open Data site. They include
- 38 million tweets from the 2012 US presidential election
- Profiles of 1 million celebrities (1000 with images)
Are there actually 1 million celebrities now?! Maybe someone can analyze the data to confirm. As in all data science we’ll need to start with firming up our definitions of terms—e.g “celebrity”.