From Olav Laudy Data Science
Jump to: navigation, search



Topological Data Analysis and Machine Learning reinforce each other. Good read on how to visualize!

This is how to do large scale data science!

[must read]. Fantastically clear post on vector models and k-nn.

[Python] Great tutorial on word2vec and doc2vec

Great intro on Random Forests in Python and R

Continue reading this after you read the post on calculus on computational graphs.

Nicely explained: model based clustering, with examples in R.

[must read] Calculus on computational graphs.

[R] Simple tutorial on building word clouds

If you always wanted to understand LSTM's, this is your chance!

Light intro in some concepts of optimization.

[A good read] Controlling for confounding variables.

Nice intro to multilevel models - linear mixed models - random effect models in R.

Fairly tech paper, but the technique is new and fascinating! This could well be the next generation of models.

Good tips on advancing in data science.

Read this extremely clearly written article on Generalized Additive Models (gams) + how to do it in R.

Very nice overview of the basic data mining algorithms with R and Python code.

[mindblowing piece of video] how the brain does backpropagation

Cool article about Bayesian optimization of hyper parameters with Gaussian processes.

Interesting take on the maturity of the different categories of artificial intelligence.

Just for fun] 5 cool and unusual datasets to play around with

Great post on matrix factorization and the relation between k-means and PCA.

Survival analysis tutorial in R.

Simple tutorial on deeplearning with the Keras framework

Nice overview of Watson trade-off analytics.

Great read on recommendation systems and the technicalities behind the Netflix challenge.

Great article on Generalized Additive Models

15 Questions about plots in R.

Good post on preventing model leakage, illustrated by a cross validation example in Python.

[What a jewel!] If you want to get into deep learning, read this extremely accessible book.

Great article on word embeddings

Nice and well formulated tutorial on the R functions Apply, Mapply and Sapply.

Nice data science competition model write up

Great read on detecting fraud in online games

Just because you can: R and the location of letters in words.

Some good insights in feature creation using machine learning models.

Humor that only data scientists make smile.

Free data science trainings on the web.

[R code] Simple example: intro to gradient descent by deriving it for a linear model.

Well explained and useful intro introduction to Graph databases with an application of building a recommendation algorithm.

Stop hiring data scientists until ready!

A great set of data science tutorials on Git (including an explanation of Git hub)

Nice illustration of decision boundaries for various machine learning models.

The ultimate data science cheatsheet collection

Very useful tutorial on how to use Git with R.

Readable article on deep neural networks for vision and the recent ability for these networks to 'dream'.

Nice small writeup on an R model for a Kaggle competition.

Very useful R viz cheatsheet*J83sNCx8YssrIajdoqO/Capture1.PNG

[Technical] Tough read on uncertainty in deep learning models, but well worth it.

First steps: getting started with SparkR.

Practical guide to visualize high dimensional data

Play with this tool to show how deep neural nets 'dream'

Fantastic practical insight in modelbuilding

Great article about the difference between machine learning and statistical modeling.

Important article that discusses how to visualize what a deep neural network learns.

Nice Kaggle coding walkthough.

Fantastic to see how neural networks are equipped with episodic memory to give power of reasoning.

Understanding boosting, with nice vizualizations

Great way to explain complexity of an algorithm.

Informative read on a model building journey.

Quick walkthrough of machine learning models and deep learning.

[Nerd] Just because it's funny

Nice showcase of modeling on Spark: Word2Vec & Gradient Boosting Machines.

Insightful post on modeling human behavior.

This is how Facebook knows who you are, even without seeing your face.

Inspiring and insightful interview with Top Kaggler

Large scale flash memory failures: a good read into analytics at work to understand life cycle of hardware components. What is missing, is the forward looking part: can you see how to include that?

If you have nothing better to do today, analyze this terabyte dataset with your favorite Click Through Rate models.

Read up on ROC and AUC with an application of predicting the number of deaths from the Titanic. (soo useful).

Interesting read on hyper-parameter optimization.

The never ending possibilities of the neural network: teaching the computer to have conversations.

Simple (technical) read on Random forest

Brilliant article on R in the IBM Cloud.

Cool automation in R

This is how machine translation works. Cool stuff!

[YouTube+code] Neural network evolves to play Super Mario World.

Fascinating result! Machine learning method beats humans in verbal comprehension questions IQ test. Technical paper.

Mapping example in R, good example code, with an application to crime analytics.

A must read on model mixing! Well written, lots of examples, and not available in such collection and overview in literature.

Basic R: getting familiar with data frames. An easy to follow and well illustrated tutorial.

Interesting, non-technical, read on a recommendation system for an online retailer.

How to become a data scientist: a nice guide with lots of detail.

This makes me smile: logistic regression to find out the value of chess pieces.

Clearly written article on A/B testing and proving your analytical model by setting up an experiment.

I love the thinking! This is what we need to do more in data science.

Insightful paper on characteristics of fraud that are detectable in data by using analytics.

Good hints: speeding up your R code

Tuning the parameters of your Random Forest model

Simple introduction to text mining: bag of words and term frequency / inverse document frequency (TF-IDF)

A useful pointer: lessons learned in high-performance R.

Awesome paper from Google about the prediction of energy efficiency in their data centers. Well written, includes some examples how the predictions can be used to make the datacenter more efficient.

Long, but worth the read: Hofstadter, the author of Godel, Escher and Bach on intelligence, AI and machine learning.

Simple intro read into the top 10 data mining algorithms. The real trick is to start using them :)

Excellent series on the working of various machine learning models by understanding their decision boundery, shown in simple R code.

AirBnb rocks! Check out the nerd section on their home-grown modeling tool (and check out the article on handling missing data in Random Forests).

Simple Random Forest explanation and coding example.

An overlooked area in Machine Learning: prediction intervals

Great read to expand your intuition on high dimensional spaces

Insightful tips to improve your model

Nice overview of the different data scientist skills.

How to keep your data scientists: I like the first point (the other points make sense too)

Great article on how unstructured data became important to data science

Thought provoking examples on how different datasets give rise to the same regression equations

Good post on interpreting categorical regression coefficients

Interesting post on A/B testing and the need for statistical sound criteria

Questions from Data Science interviews

A simple post on model evaluation. Specially the picture is useful to explain this process to the business

A great post on handling Twitter responses using R

A good showcase for meta modeling; 3 three layer model is build from many model ensembles

How to filter out relevant predictors for your model

Practical article on K-means clustering