As spreadsheet abuse has now been "Dilbertized" it's clearly entered mainstream consciousness. About time. We've been complaining about it for years. … [Read more...]
Data science with Microsoft
Jan Mulkens recently published an article on Microsoft's recent rush to enhance it's data science offering. And, as he illustrates, they have been very busy this year. He highlights a number of their flagship data science initiatives. Azure Machine Learning Power BI Cortana Analytics Suite Acquisition of Datazen & Revolution Analytics Integration of R in SQL Server Other significant data … [Read more...]
Twitter’s anomaly detection package for R
Twitter have an interest in detecting anomalies in their service. Anomalies could be down to user engagement, spamming or technical issues. Regardless of the reasons, it's something they want to know about when it happens. To aid detection of anomalies in their time series data they have developed, and open-sourced, an anomaly detection package for R. Their algorithm is based on the Generalized … [Read more...]
Visual p-hacking
It's that time of year when we start getting the "best x of 2015" posts. Nathan Yau of FlowingData just published his list of the best visualization projects. Yau reckons that this was the year of using visualization to teach about data and statistics. My favorite is "Science Isn't Broken" by Christie Aschwanden of FiveThirtyEight. It's a visual interactive demonstration of how you can shape the … [Read more...]
R is fastest growing topic on Stack Overflow
Joshua Kunst has analysed Stack Overflow postings and found that R is currently the fastest growing topic. Stack Overflow is a question and answer site popular with software developers. Questions are tagged with topics and these topics formed the basis of the analysis. Kunst used R to perform the analysis and he provides a walk-through---making extensive use of the pipeline operator from the … [Read more...]