NoSQL Pandas Dataframes

tldr; Pandas Dataframes are simply a table with some rows (McKinney, 2015) and many additional features built right in to make data scientists lives much easier. What are NoSQL Pandas dataframes? Pandas is an open source Python library that provides high performance yet easy to use data structures and data analysis Read more…

By Andrew Odendaal, ago

Graph Databases and their Properties

The concept of a graph in mathematics is simply a collection of elements which are typically called Nodes and are joined together by Edges. Each Node represents a piece of information in the graph and each Edge represents some relationship or connection between any of the two Nodes (Cox, 2017). Read more…

By Andrew Odendaal, ago

Relational (SQL) vs NoSQL Database Models

The Relational (SQL) database has been a central piece to businesses since the 1970s when they first had their claim to fame. Experts say that the world’s data is doubling every two years (Gajani, 2017) and this is simply something that traditional relational databases cannot manage with their architectural inner Read more…

By Andrew Odendaal, ago

Netflix Hadoop Big Data Marketing Use Case

Netflix is a video streaming service that has a wealth of information about their user base likes, dislikes, general consumer habits, retention lengths and much more. Netflix uses their big data to commission original programming content that it knows will succeed and be accepted in relevant published markets (O’Neill, 2016). Read more…

By Andrew Odendaal, ago

Data Warehouses vs Data Marts

Although the terms “data warehouse” and “data mart” sound similar, they are quite different. It is important to first understand how they differ in order to define some characteristics and practical applications for each. Serra (2012) has a great explanation of data warehouses as being “a single organizational repository of Read more…

By Andrew Odendaal, ago

An Introduction to Client-Server Systems

A client-server architecture/system is one where the server maintains all models of the backend datasets and business logic around taking and providing information to fulfill a service request from a client which in turn shows the representation thereof. The client/server model has quickly grown in popularity throughout the 1990s and Read more…

By Andrew Odendaal, ago