As part of my Getting COVID-19 Data posts in R, Python and Julia, I will now advance to part two of the conversion process. In Part 1 of this post series, we duplicated the R scripts into the language-specific script folders and changed the file extensions to match each language. In…
Category: Python
Getting COVID-19 Data (Python)
In this post, I will cover getting open source COVID-19 data for the United States using Python. The data pipeline demonstrated here is a very simple example and could easily be adapted into a Prefect, Apache NiFi or Apache Airflow ETL process. Data Search: Performing a quick search on DuckDuckGo, I got The COVID Tracking Project,…
Converting R scripts to Python (Part 1)
As part of my Criminal Analysis project, I have initially been coding everything in R. I will eventually convert everything to Python and then probably to Julia. In preparation for this, I created my project directory to reflect this (as seen below). The following operations in the command line interface (CLI) will be in…
Criminal Analysis: Data Search (part 1)
Now that we have our crime data, let's work on gathering the other data I listed in the previous Planning post. Part 1 of Data Search will focus on the Map Data branch of the project plan. Map Data / Boundary Data: Using the same Open Data website, I just queried for DC boundary and…
Criminal Analysis: Data Search (part 0)
Searching for Criminal Activity Data: My initial search begins with a query for “Washington DC” and “Crime Data”. This is very broad, but it lets me see what is available. I can always get more specific. https://duckduckgo.com/?q=Washington+DC+crime+data&t=lm&ia=web My results yielded a couple of pages of interest. Check out the sites that seem interesting or could provide the…
Criminal Analysis: Planning
UPDATE (12/6/20): I have removed the original content in this post concerning the Data Search work for the crime data. You can now find that portion of the original post here. In this series, I will be exploring criminal activity in Washington DC over the past decade. I have explored the data in previous…