Football-Data maintains comprehensive CSV files of football matches dating back to the early 90s. Leagues currently covered are Belgian, Dutch, English, French, German, Greek, Italian, Portuguese, Scottish, Spanish and Turkish. The data is well-formatted and free, so it is a great resource if you want to try out some data science techniques with simple, real-world data.
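As a rough sketch of a first look at a file like this, the Python below reads match rows and computes a couple of summary figures. It assumes the columns `HomeTeam`, `AwayTeam`, `FTHG` (full-time home goals), `FTAG` (full-time away goals) and `FTR` (full-time result: `H`, `D` or `A`); check the header of the file you download, since column names may differ. The sample rows and the `summarise` helper are invented for illustration and are not real results.

```python
import csv
import io
from collections import Counter

# Invented sample rows for illustration only; these are not real matches.
# The column names are an assumption about the file layout.
SAMPLE_CSV = """Div,Date,HomeTeam,AwayTeam,FTHG,FTAG,FTR
X1,01/01/00,Team A,Team B,2,1,H
X1,01/01/00,Team C,Team D,0,0,D
X1,08/01/00,Team B,Team C,1,3,A
X1,08/01/00,Team D,Team A,2,0,H
"""


def summarise(csv_text):
    """Return match count, home-win rate and goals per match."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    # Count how often each full-time result (H, D, A) occurs.
    results = Counter(row["FTR"] for row in rows)
    total_goals = sum(int(row["FTHG"]) + int(row["FTAG"]) for row in rows)
    return {
        "matches": len(rows),
        "home_win_rate": results["H"] / len(rows),
        "goals_per_match": total_goals / len(rows),
    }


summary = summarise(SAMPLE_CSV)
print(summary)
```

To run this against a downloaded file, read the file's text and pass it to `summarise` in place of `SAMPLE_CSV`.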
Learning Tree just published my article on conducting sentiment analysis using R and a web service.
Learning Tree published my article on using Principal Component Analysis to reduce the dimensionality of data.
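To give a flavour of the idea, here is a minimal sketch of Principal Component Analysis on two-dimensional data, written in plain Python rather than taken from the article. It uses the closed-form principal axis of a 2x2 covariance matrix to project each point onto a single dimension. The function name `pca_2d_to_1d` and the sample points are invented for illustration; real work would normally use a library that handles any number of dimensions.

```python
import math


def pca_2d_to_1d(points):
    """Project 2-D points onto their first principal component."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    # Centre the data so the covariance is measured around the mean.
    centred = [(x - mean_x, y - mean_y) for x, y in points]

    # Entries of the sample covariance matrix [[a, b], [b, c]].
    a = sum(x * x for x, _ in centred) / (n - 1)
    c = sum(y * y for _, y in centred) / (n - 1)
    b = sum(x * y for x, y in centred) / (n - 1)

    # Angle of the leading eigenvector of a symmetric 2x2 matrix.
    theta = 0.5 * math.atan2(2 * b, a - c)
    direction = (math.cos(theta), math.sin(theta))

    # One score per point: its coordinate along the principal axis.
    scores = [x * direction[0] + y * direction[1] for x, y in centred]
    return direction, scores


# Invented points lying on the line y = x, for illustration only.
sample_points = [(1.0, 1.0), (2.0, 2.0), (3.0, 3.0), (4.0, 4.0)]
direction, scores = pca_2d_to_1d(sample_points)
print(direction, scores)
```

Because the sample points lie exactly on a line, the single retained dimension keeps all of the variance; with noisier data some variance would be lost in the projection.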
SplashData, a purveyor of password managers, has produced its annual list of the year’s worst passwords. The top ten are: 123456, password, 12345, 12345678, qwerty, 123456789, 1234, baseball, dragon, football. I guess we should all be shocked at how poor these passwords are. However, there’s no breakdown of which sites these passwords came from. If […]
2016 is the year of the US presidential election. Prepare to be besieged by polls. The embarrassment of the 2015 UK parliamentary election predictions is a distant memory. We get to start over. However, Mona Chalabi reminds us, via the Guardian’s Datablog, of the challenges facing pollsters. She lists five: The media are fallible. They […]