Data Cleaning IS Analysis, Not Grunt Work

The act of cleaning data is the act of preferentially transforming data so that your chosen analysis algorithm produces interpretable results. That is also the act of data analysis.

This is a long, but wonderful post. It is deeply expressive of a core belief of mine: data analysis and data transformation/preparation/cleaning are fundamentally inseparable. Attempting to separate them is (IMO) one of the biggest problems in many data teams, and unifying them is one of the biggest opportunities.

This core belief (based on deep personal experience as a data analyst) is what motivated me to want to build dbt back in 2016.


Want to receive more content like this in your inbox?