In this example, I’m going to demonstrate using the base distributed computing package to download GeoJSON files, perform some processing on those files and then write the resultant tables to a CSV file. The data being used comes from open source criminal activity data provided by Washington DC. You can view the original post written…
Category: ETL
Criminal Analysis: Data Storage (Part 3)
In this post, I will demonstrate loading my criminal activity data into ElasticSearch sot it can be explored, analyzed and visualized in Kibana. For instructions on installing and configuring the Elastic (formerly ELK) Stack, see my previous post. Although this post will specially reference the crime data from my PostgreSQL database, I will include additional…
Getting COVID-19 Data (Julia)
In this post, I will cover getting open source COVID-19 data for the United States using Julia. The data pipeline demonstrated here is very simple example and could easily be adapted into a Prefect, Apache NiFi or Apache AirFlow ETL process. Data Search Performing a quick search on DuckDuckGo I got The COVID Tracking Project,…
Getting COVID-19 Data (Python)
In this post, I will cover getting open source COVID-19 data for the United States using Python. The data pipeline demonstrated here is very simple example and could easily be adapted into a Prefect, Apache NiFi or Apache AirFlow ETL process. Data Search Performing a quick search on DuckDuckGo I got The COVID Tracking Project,…
Getting COVID-19 Data (R)
In this post, I will cover getting open source COVID-19 data for the United States using R. The data pipeline demonstrated here is very simple example and could easily be adapted into a Prefect, Apache NiFi or Apache AirFlow ETL process. Data Search Performing a quick search on DuckDuckGo I got The COVID Tracking Project,…
Introduction to the Elastic Stack
This post will describe the Elastic Stack, also and formerly the ELK Stack, and its individual components. In a follow-up post, I’ll demonstrate how to get the ELK Stack up and running. What is the ELK Stack? The ELK Stack consists of Elasticsearch, Logstash and Kibana developed by Elastic. Later the company came out with…
Getting Started with the ELK Stack
In this post I’ll demonstrate how to get the ELK Stack up and running. Installing the ELK Stack First we will install Elasticsearch, then Logstash, and then finally Kibana. Most of the instructions are the same if you follow the official documentation (links provided below). If you follow my instructions, you’ll get the parts that…