Exploring Criminal Activity Data
My primary data set to explore is the crime table in my database. I need to assess the values and consistency across each of the fields available in what was provided by the source. It is possible for data to be a little inconsistent across each of the years over the…
Category: data engineering
Introduction to the Elastic Stack
This post will describe the Elastic Stack, formerly known as the ELK Stack, and its individual components. In a follow-up post, I’ll demonstrate how to get the ELK Stack up and running. What is the ELK Stack? The ELK Stack consists of Elasticsearch, Logstash, and Kibana, all developed by Elastic. Later the company came out with…
Getting Started with the ELK Stack
In this post I’ll demonstrate how to get the ELK Stack up and running. Installing the ELK Stack First we will install Elasticsearch, then Logstash, and then finally Kibana. Most of the instructions are the same if you follow the official documentation (links provided below). If you follow my instructions, you’ll get the parts that…
Derive a Star Schema By Example
This post will describe the implementation of the star schema using the “Crime” table from my Criminal Analysis project. The original table represents criminal incidents in Washington DC from 2009 through October 2020. The table has 23 columns, 7 of which are spatial grouping categories. I’ll demonstrate how to decompose the table of data…
Data Storage: pgAdmin
pgAdmin is the leading Open Source management tool for Postgres, the world’s most advanced Open Source database. pgAdmin 4 is designed to meet the needs of both novice and experienced Postgres users alike, providing a powerful graphical interface that simplifies the creation, maintenance and use of database objects. https://www.pgadmin.org/docs/pgadmin4/4.28/index.html This post will describe how to…
Data Storage: Installing PostgreSQL and PostGIS
This post will describe how to install both the PostgreSQL database and the PostGIS geodatabase extension. If you have been following my Criminal Analysis: Data Storage posts, then this will be a repeat of information. Installing and Setting up PostgreSQL On a Linux machine, getting PostgreSQL installed and running is pretty easy and straightforward….
Criminal Analysis: Data Storage (part 2)
In this post I will go over setting up a geospatial database using PostGIS, an extension to PostgreSQL. For information about setting up a PostgreSQL database, please refer back to my previous Data Storage post. Below is the project plan to ensure I load up all my downloaded map data. Installing and Setting up First you…
Criminal Analysis: Data Storage
Now that we have collected our data, let’s work on building a database to store our project data. For this project I have decided to use a PostgreSQL database. The image below provides some planning details to help us implement the database. Installing and Setting up On a Linux machine, getting PostgreSQL installed and running…