Folding Data #24
This is our last Folding Data of 2021. I hope that I've been adding more value than noise to your inbox 😉. The final issue of the year brings together top (most clicked-on) stories, talks, and tools.
Interesting Reads: Catch up on 7 articles you might have missed
Every week, we’ve shared an interesting read. These got the most attention from our readers in 2021:
- Thinking of Analytics Tools as Products
- In Defense of Zillow’s Besieged Data Scientists
- Reflecting on Four Years at Databricks
- Top 40+ Data Science Product Interview Questions
- Red Hot: The 2021 Machine Learning, AI and Data (MAD) Landscape
- Automating Data Protection at Scale, Part 1
- Every Single Cognitive Bias in One Infographic
Top 10 tools of 2021
Similarly, we shared interesting tools each week. These are the ones that seemed to get you all the most excited:
- Hex – next-generation data notebook experience
- Evidently – regression testing for ML models
- DataProfiler – data profiler you always wanted in Pandas
- Lux – because what good is profiling without charts
- Temporal – orchestration framework for [not only data] apps
- Malloy – post-SQL query language by Looker founders
- Materialize – streaming applications on top of good-old PostgreSQL
- Zingg – ML powered data deduplication & entity resolution
- Evidence – BI that's just SQL + Markdown
- GrowthBook – open-source experimentation platform
Best by Datafold in 2021
Here are posts & talks by Datafold that you might enjoy.
- The Modern Data Stack: Open-source Edition
- Proactive Data Quality on the Data Engineering Podcast
- The State of Data Quality in 2021
- Data Quality Management According to Lyft, Shopify, and Thumbtack
- 9 Best Tools for Data Quality in 2021
- Why Looker was and Lightdash is a big deal for BI
- Datafold: From Breaking Data to Series A
We also had a blast working with our customers. Huge thanks to the data teams at Patreon & Dutchie for sharing their journeys to better data!
Featured Data Quality Meetup Lightning Talks
We hosted four Data Quality Meetups this year with amazing contributions by the Data leaders from the industry, open-source, and vendor community. Here are the featured ⚡️talks of 2021:
- Setting Sisyphus Free: Data Discoverability in a Scaling Company - by Maura Church - Director of Data Science @ Patreon
- Fake It Till You Make It: A Backward Approach to Data Products - by Alex Viana, VP of Data @ HealthJoy
- Automating Data Quality at Thumbtack - by John Lee, Director, Product Analytics @ Thumbtack
All the memes that are fit to print