Skip to content

Data Science resources

January 21, 2020

Inspired by many similar lists, for example this one by Carl Anderson and this one by Yu Wu, below is a list of free, online resources for learning data science (i.e. programming, machine learning, and statistics). This list includes textbooks (published online or with a free PDF) as well as YouTube videos and online courses (MOOCs). As always with these things, your mileage may vary.

Introductory

James, Witten, Hastie & Tibshirani (2013) “An Introduction to Statistical Learning, with Applications in R” Springer.

Thomas (2018) “Mathematics for Machine Learning

Irizarry (2019) “Introduction to Data Science: Data Analysis and Prediction Algorithms with R”

Welling (2010) “A First Encounter with Machine Learning

Daumé III (2017) “A Course in Machine Learning

R Programming

Wickham & Grolemund (2017) “R for Data Science: Import, Tidy, Transform, Visualize, and Model Data” O’Reilly.

Wickham (2nd ed., 2019) “Advanced R” Chapman & Hall/CRC Press.

Wickham (2nd ed., 2015) “ggplot2: Elegant Graphics for Data Analysis

Lovelace, Nowosad & Muenchow (2019) “Geocomputation with R” CRC Press.

Python Programming

Downey (2nd ed., 2014) “ThinkStats: Exploratory Data Analysis in Python” O’Reilly.

Adhikari & DeNero “Computational and Inferential Thinking: The Foundations of Data Science”

Sklearn basics (Jupyter notebook)

Plotting and Visualization in Python (Jupyter notebook)

More Advanced

Hastie, Tibshirani & Friedman (2nd ed., 2009) “The Elements of Statistical Learning: Data Mining, Inference, and Prediction”

Goodfellow, Bengio & Courville (2016) “Deep Learning” MIT Press.

McElreath (2015; 2nd ed. 2020) “Statistical Rethinking: A Bayesian Course with Examples in R and Stan” YouTube videos

Wikle, Zammit-Mangion & Cressie (2019) “Spatio-Temporal Statistics with R” Chapman & Hall/CRC Press.

Collins II (2003) “Fundamental Numerical Methods and Data Analysis

Leskovec, Rajaraman & Ullman (3rd ed., 2020) “Mining of Massive Datasets” CUP.

Hyndman & Athanasopoulos (2nd ed., 2018) “Forecasting: Principles and Practice” OTexts.

Blitzstein & Hwang (2nd ed., 2019) “Introduction to Probability” CRC Press.

Petersen & Pedersen (2012) “The Matrix Cookbook

Courses

fast.ai (Jeremy Howard & Rachel Thomas)

Deep Learning Specialization (Andrew Ng, Coursera)

Intro to Hadoop and MapReduce (Udacity)

Statistical Learning (Trevor Hastie & Rob Tibshirani, Stanford Online)

Linear Algebra (Gilbert Strang, MIT OCW)

From → R

Leave a Comment

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Google photo

You are commenting using your Google account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s

Ella Kaye on Ella Kaye

Computational Bayesian statistics

Bayes' Food Cake

A bit of statistics, a bit of cakes.

RWeekly.org - Blogs to Learn R from the Community

Computational Bayesian statistics

Richard Everitt's blog

Computational Bayesian statistics

Let's Look at the Figures

David Firth's blog

Nicholas Tierney

Computational Bayesian statistics

Sweet Tea, Science

Two southern scientistas will be bringing you all that is awesome in STEM as we complete our PhDs. Ecology, statistics, sass.

Mad (Data) Scientist

Musings, useful code etc. on R and data science

Another Astrostatistics Blog

The random musings of a reformed astronomer ...

Darren Wilkinson's blog

Statistics, computing, functional programming, data science, Bayes, stochastic modelling, systems biology and bioinformatics

(badness 10000)

Computational Bayesian statistics

Igor Kromin

Computational Bayesian statistics

Statisfaction

I can't get no

Xi'an's Og

an attempt at bloggin, nothing more...

Sam Clifford

Postdoctoral Fellow, Bayesian Statistics, Aerosol Science

%d bloggers like this: