Stanford has a number of free on-line courses in session that might be of interest to data scientists. They started last week, but you can still join.
Statistical Learning
The Statistical Learning course is an introductory-level course in supervised learning with a focus on regression and classification methods. The course uses R, and the text book, “An Introduction to Statistical Learning with Applications in R”, is currently available as a free download.
Mining Massive Datasets
The Mining Massive Datasets course teaches algorithms for extracting models and other information from very large amounts of data, with an emphasis on techniques that are efficient and scale well. The course textbook, “Mining of Massive Datasets”, is currently available as a free download.
Machine Learning
The Machine Learning course provides a broad introduction to machine learning, data-mining, and statistical pattern recognition.