The Yhat Blog

machine learning, data science, engineering

  • Introducing Gobenchdb

    by Eric Cox | Jul 06 2015

    A command-line tool that stores Go benchmark data in a database.

  • 7 Datasets You've Likely Never Seen Before

    by Greg | Jun 22 2015

    Some datasets that may have fallen by the wayside.

  • ROC Curves in Python and R

    by Greg | Jun 15 2015

    ROC curves are a great tool for evaluating binary classifiers. Learn more in this post!

  • Rodeo 0.4: Spark, themes, resizeable panes

    by Greg | Jun 04 2015

    Introducing the latest version of our Rodeo IDE.

  • Rodeo: A data science IDE for Python

    by Greg | Apr 23 2015

    Introducing our latest open source project: the Rodeo IDE.

  • Building a Client-side Blog Search Algorithm

    by Greg | Apr 14 2015

    How we built a page recommender to power our blog's search engine.

  • db.py 0.4: Handlebars Meets SQL

    by Greg | Mar 13 2015

    Learn how to use db.py and Handlebars to make your SQL scripts shorter and easier to read.

  • ML Pitfalls: Measuring Performance (Part 1)

    by Eric | Mar 03 2015

    Common machine learning pitfalls and how to avoid them.

  • Base R Plots

    by Greg | Feb 23 2015

    An introduction to plotting and graphics in R (without ggplot2).

  • What is Linear Regression? A Qualitative Exploration

    by Greg | Feb 19 2015

    A high-level introduction to what linear regression is and how it works.