Logs were our lifeblood. Now they're our liability.


If the first ten years of data science were all about collecting and analyzing everything, the second ten will be about how to be deliberate and selective about collected and analyzed data.

I don't know that I agree with this perspective, but it's an interesting and contrarian one. My thought process around this gets pretty philosophical pretty quickly, so I'll spare you most of it. I do think it is quite legitimate for companies to use data as a source of competitive advantage. The problem today is that companies store logs with about as much care as the companies of yesteryear that stored passwords in plain text. Differential privacy should be a requirement: the value of this data shouldn't be in its ability to map to individuals, but rather in its ability to generate insights about populations.
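To make the "populations, not individuals" idea a bit more concrete, here is a minimal sketch of the Laplace mechanism, one standard way to answer a counting query with differential privacy. The log records, the `private_count` helper, and the choice of `epsilon` are all made up for illustration. This is a toy, not a hardened implementation.

```python
import random


def laplace_noise(scale: float, rng: random.Random) -> float:
    """Draw one sample from Laplace(0, scale).

    The difference of two i.i.d. exponentials with mean `scale`
    follows a Laplace distribution with that scale.
    """
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def private_count(records, predicate, epsilon: float, rng=None) -> float:
    """Return a noisy count of the records that match `predicate`.

    A counting query has sensitivity 1 (adding or removing one record
    changes the count by at most 1), so the noise scale is 1 / epsilon.
    Smaller epsilon means more noise and stronger privacy.
    """
    rng = rng or random.Random()
    true_count = sum(1 for record in records if predicate(record))
    return true_count + laplace_noise(1.0 / epsilon, rng)


# Hypothetical log records: (user_id, http_status)
logs = [("u1", 500), ("u2", 200), ("u3", 500), ("u4", 200), ("u5", 500)]

# How many requests failed? The answer is useful in aggregate, but the
# noise hides whether any single user's record is in the log.
noisy_failures = private_count(logs, lambda r: r[1] == 500, epsilon=0.5)
print(f"noisy failure count: {noisy_failures:.2f}")
```

Any single answer is off by a random amount, but averaged over a large population the noise washes out. That is the trade described above: insight about the group without a reliable map back to any one person.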

Anyway... IMO logs aren't going anywhere, but they will likely look different in the future.


