Tag: Data Science

data quality pipelines

Why Data Quality Checks Fail – Too Many Alerts, Not Enough Ownership

Every morning, your team wakes up to over one hundred data quality alerts. I know, I recall this issue at Facebook. Some of them would be fixable issues others should have just been warnings. It’s so easy to build data pipelines and add data quality checks these days that I am sure for some people…
Read more


May 24, 2026 0

How To Set-up Your Data Stack For 2026 – Data Infrastructure For AI

We are several years into the AI Revolution, so to speak, and with that has come an increased demand for data. The increased demand for data comes an increased demand for data infrastructure. Some companies already have reliable data stacks; others are looking to migrate to Snowflake, Databricks, or some other solution(I am sure some…
Read more


April 13, 2026 0
why data pipelines exist

What Are Data Pipelines And Why Do They Exist

The demand for data has grown substantially in this AI-driven world. Meaning, there are more and more data pipelines being created. The funny thing is, when I first started in the data world, no one around me used the term data pipeline. I am sure plenty of data teams used the term data pipeline. But…
Read more


April 1, 2026 0

The Most Common Types of Data Pipelines You’ll Actually Build

Whether you’re working at a large enterprise or a small business, there’s almost always a need to extract data from source systems, process it, and use it for operational or analytical purposes. That process, moving data from point A to point B, transforming it along the way, and making it usable, is what we typically…
Read more


February 12, 2026 0
real time vs batch

Batch Vs Real-Time Data Pipelines – Do We Still Need To Pick?

One of the questions most data engineers need to answer is whether this data pipeline should be real-time or batch. Sometimes posed as streaming vs batch. The tools you might use to do that have changed over the past few years, but that was always the question. The business, of course, would always ask for…
Read more


November 12, 2025 0
how to grow data team

When Should You Hire More Data Engineers And Analysts – How To Grow Your Data Team

Is your data team constantly feeling the pressure to deliver? Do members of your team say they feel like they’re doing work meant for two people? If the answer to either or both of these questions is a resounding yes, you may feel tempted to think, “We just need more hands on deck.” However, hiring…
Read more


September 15, 2025 0

6 Real-World ETL Use Cases with Estuary

After working in data for over a decade, one thing that remains the same is the need to create data pipelines. Whether you call them ETLs/ELTs or something else, companies need to move and process data for analytics. The question becomes how companies are actually building their data pipelines. What ETL tools are they actually…
Read more


May 9, 2025 0
parsing pdfs with python

Challenges You Will Face When Parsing PDFs With Python – How To Parse PDFs With Python

Scraping data from PDFs is a right of passage if you work in data. Someone somewhere always needs help getting invoices parsed, contracts read through, or dozens of other use cases. Most of us will turn to Python and our trusty list of Python libraries and start plugging away. Of course, there are many challenges…
Read more


November 19, 2024 0
metrics consulting

How Data Teams Drive Business Success by Understanding Core Metrics

A key responsibility for any data team is to understand the core metrics driving their business. Starting from the top, these metrics often include figures like gross revenue and expenses. However, these high-level metrics can feel too far removed and abstract from the actual business.  Many companies, therefore, break down these top-line metrics into more…
Read more


November 13, 2024 0
how to lead a data team

9 Must-Watch Videos for Aspiring Data Leaders: Bridging Tech and Business for Data Team Success

Leading data teams can be challenging. You’ve got management and non-technical teams constantly reaching out with ad-hoc data requests; you’re likely trying to figure out what tools will work best and not blow the bank. Not to mention, you’ve got to bridge the gap between business and technology. All while trying to grow your data…
Read more


November 6, 2024 0