FiveThirtyEight are sharing the data and code behind some of their articles: a goldmine for those wishing to learn more about data science.
R comes with a range of datasets that can be used when learning the basics or trying out a new approach/package. mtcars and weather are popular choices. However, most of the common datasets are “toy” examples. They are great for practising basic techniques, but are useless when it comes to realistically simulating data science tasks. The […]
Karl Broman and Kara Woo offer some good advice on organizing data in spreadsheets. They advocate confining the use of spreadsheets to data entry and storage—moving calculations and visualizations to other tools. This certainly avoids some of the biggest problems with using spreadsheets. However, spreadsheets don’t enforce any discipline. It’s up to the user to […]
Using Excel in business critical operations is risky to the point of recklessness. Phocas have just published an article highlighting seven problems with spreadsheets. The problems are: susceptibility to human errors, difficulties in troubleshooting, lack of agility, lack of collaborative features, lack of support for rapid decision-making, degree of complexity for the average business user […]
Wired has an article on a slot machine hack that seems to make use of data analytics. The hack relies on a weakness introduced in the pseudo-random number generator used by the machines from Austrian gaming company Novomatic. Teams in the casino video the machines using their cellphones. The videos are then sent to St […]