As the amount of data we collect continues to explode, attention needs to shift to making sense of it. Tools like Hadoop and Spark allow us to analyse these huge datasets, but they don't really make sense of it. Senior managers want insights. They want to have their business illuminated by the data. At present, this is done by data scientists analyzing the data and weaving it into a … [Read more...]
What’s hot at the 2015 Strata+Hadoop World conference?
It’s always refreshing to see data scientists turn the tools on themselves. Gives the field credibility. Benedikt Koehle looked at the frequency of ngrams in the abstracts for the 2015 Strata+Hadoop World conference. He concluded, as a result of this analysis, that 2015 will be probably known as the “Spark Strata” He also notes the resurgence of interest in R, at the expense … [Read more...]
Free on-line data science courses from Stanford
Stanford has a number of free on-line courses in session that might be of interest to data scientists. They started last week, but you can still join. Statistical Learning The Statistical Learning course is an introductory-level course in supervised learning with a focus on regression and classification methods. The course uses R, and the text book, "An Introduction to Statistical Learning with … [Read more...]
How to build a predictive model using Azure Machine Learning
I’ve just published an article about how to use Microsoft’s Azure Machine Learning platform . You can read it on the Learning Tree International blog. … [Read more...]
Microsoft to acquire Revolution Analytics
Microsoft have already shown R some love by supporting it in their Azure Machine Learning service. But, today they announced an acquisition that borders on infatuation---they are acquiring Revolution Analytics. Revolution Analytics is a commercial provider of R software, support and services---with strength in the big data area. They are the developers of the open-source RHadoop packages that … [Read more...]