User:MPopov (WMF)/Reading List

From Meta, a Wikimedia project coordination wiki

I've written an employee operations manual for Discovery's future data analysts. We occasionally do A/B tests of new features, the guidelines for which are also posted here on Meta. My former colleague Oliver Keyes collaborated with our Legal department on Discovery's data access guidelines.

One of the reports I've done that I'm proud of is "From Zero to Hero" where I used the variable importance feature of random forests to see which features of a search query are important in predicting whether the query will yield zero results. For a really in-depth look into random forests, I recommend Understanding Random Forests: From Theory to Practice (Louppe, 2014).

Blogs[edit]

Some of my favorite data science & statistics-related blogs to read include: