Analytics

Open Distro for Elasticsearch

If you use Elasticsearch and Kibana for log centralization or analytics, then this new development is worth taking a peek at. It is an open-source distro of Elasticsearch that contains a number of the more “enterprise”-y features that you would normally have to pay for. Whether this is a good thing for the long-term health of Elasticsearch remains to be seen…


Python Data Science Handbook

For those interested in data science, and in how to use Python, pandas, NumPy, etc. in pursuit of said interest, this site provides Jupyter notebooks that serve as an interactive playground for learning. If you haven’t worked with Jupyter notebooks, this is a great place to start.


A Dirty Dozen: Twelve Common Metric Interpretation Pitfalls in Online Controlled Experiments

I saw this paper presented at KDD and thought it provided hard-won insights into the types of interpretation problems one encounters when running online tests. I’ve run into some of these issues myself, but this paper is still the best overview I’ve seen of problems you might not think about until they bite you.
