Data Scientist, Part .1

24 July 2012

I want to be a data scientist. I want to learn in the most efficient way. I want to learn from the best.

One of the foremost data scientists is Hilary Mason, (blog@hmason). She has a tremendous ability to make difficult concepts easy to understand. See: An Introduction to Machine Learning in 30 Minutes.

What did I learn from that video? This can be fun!

In addition to learn the necessary math, I should use the most appropriate tools. A little sleuthing found a survey of the data scientists competing at Kaggle.com.

The winner? R , the open-source tool for statistical analysis.

The other tool to learn? Python, due to its ease of use and large number of libraries.

Combined, those two tools make it easy to find, consume, and analyze data from many places. Next up: math.