Blog

Building a Data Warehouse on Amazon Redshift

Photo by Jezael Melgoza on Unsplash

As an organization grows, its data storage, monitoring, and analysis requirements also increase exponentially. Traditional data warehouses don’t always handle massive growth easily. This created a need for alternative solutions starting in the mid-2000s. One such solution is Amazon Redshift from Amazon Web Services. What is Amazon…
December 20, 2019

5 AWS Technologies That’ll Make Your Life Easier

Amazon Web Services (AWS) has simplified much of developers’ workflow over the past decade. AWS lets engineers command and control cloud-based infrastructure, data, and other technical components without the hassle of developing entire frameworks from scratch. Initially, AWS was launched to take care of online retail operations for Amazon, but…
December 7, 2019
cloud consulting

Airbnb’s Airflow Versus Spotify’s Luigi

Photo by Marcin Jozwiak on Unsplash

We recently wrote about ETLs and why they’re important, and we wanted to follow up with an outline of what ETL tools are. You could think of them as workflow tools that help manage moving data from point A to point B. Two of these popular workflow tools are Luigi by Spotify and…
November 25, 2019

Healthcare Fraud Detection With Python

This April, a $1.5 billion Medicare scheme took advantage of hundreds of thousands of seniors in the US. In reality, this is just a small sliver of the billions of dollars healthcare fraud costs both consumers and insurance providers annually. Healthcare fraud can come from many different directions. Some people might think of the…
November 6, 2019

What Are ETLs and Why Are They Important?

Creating a world of self-service analytics

Photo by chuttersnap on Unsplash

The rise of self-service analytics is a significant selling point in the business intelligence world. Part of the point of creating self-service analytics is having easy access to your organization’s data. The question is how you get your data from external application…
November 2, 2019

5 Use Cases for DynamoDB

Introduction

Web-based applications face scaling challenges as their user base grows and data traffic becomes more complex. Along with the modern complexity of business comes the need to process data faster and more robustly. Because of this, standard transactional databases aren’t always the best fit. Instead, databases such as DynamoDB have been designed…
October 30, 2019
predictive modeling

What Is Predictive Modeling?

Photo by Roman Mager on Unsplash

Today, it is hard to imagine visiting a website that doesn’t automatically personalize what you see or predict what product you will want to buy. It seems like the whole World Wide Web already knows who we are. Well, this is what predictive modeling enables us to…
October 15, 2019

Why Big Data Analytics is a Necessity in Digital Advertising

In the digital age, traditional advertising is slowly fading away: fewer brochures and more Facebook videos, fewer mailers and more personalized e-mails. An article on The Balance lists some of the cons of traditional advertising methods, which include hard-to-quantify results, expensive legwork, and unappealing hard-sell…
October 15, 2019

DynamoDB vs. Hadoop vs. MongoDB

Are All NoSQL Systems The Same?

Photo by Campaign Creators on Unsplash

Which database is best for your business usually depends on the skill set of your dev team and the applications already in place. Understanding which database system will best fit your company’s current and future needs is an important step. Databases…
October 5, 2019
data science agile

Using Agile Methodologies in Data Science

Photo by Matteo Vistocco on Unsplash

Agile is an umbrella term that refers to several methodologies that focus on being iterative and on getting tangible products and features out quickly at the end of what are often called sprints. This framework has been adapted for multiple domains, including programming and design. Similarly, data science has also…
October 4, 2019