In this example, I’m going to demonstrate using the base distributed computing package to download GeoJSON files, process those files, and then write the resulting tables to a CSV file. The data used is open source criminal activity data provided by Washington, DC. You can view the original post written…
Category: Julia
Benchmarking CSV vs CSVFiles packages: Write
This post will cover benchmarking in Julia, using a specific case to evaluate the functions in the CSV and CSVFiles packages for writing a CSV file. Packages and Versioning: In this use case, I am using Julia v1.5.3 with the following packages: CSV, CSVFiles, DataFrames, BenchmarkTools. Please reference each package’s documentation for more details. Setting up…
Benchmarking CSV vs CSVFiles packages: Read
This post will cover benchmarking in Julia, using a specific case to evaluate the functions in the CSV and CSVFiles packages for reading a CSV file. Packages and Versioning: In this use case, I am using Julia v1.5.3 with the following packages: CSV, CSVFiles, DataFrames, BenchmarkTools. Please reference each package’s documentation for more details. Setting up…
PostgreSQL Table Creation and Bulk Insertion
As part of converting my Criminal Analysis Data Project code from R to Julia, I thought I would create a series of short posts, each detailing one component of translating the data operations. This particular post will show a solution for how to take tabular data from a CSV and load it…
Julia’s Gadfly for R ggplot2 Users
Over the past week I have been reading the documentation and playing with Julia’s Gadfly package. I thought it would be helpful to fellow R users coming from the world of ggplot2 to put together a quick reference guide to show the translation from one to the other. The coding and style for creating data…
Linux/Ubuntu 20.04: Upgrading Julia (v1.4.1 to v1.5.3)
Currently, the version of the Julia programming language available through the APT package management interface is 1.4.1, which was released on 2020-04-14. Recently I decided to just get the latest stable version, which was released on 2020-11-09. In going through the process, I thought it would be helpful to document it for…
Converting R scripts to Julia (Part 2)
As part of my Getting COVID-19 Data posts in R, Python and Julia, I will now advance to part two of the conversion process. As we saw in Part 1 of this post series, we duplicated the R scripts into the language-specific script folder and changed the file extensions to match the target language. In…
Getting COVID-19 Data (Julia)
In this post, I will cover getting open source COVID-19 data for the United States using Julia. The data pipeline demonstrated here is a very simple example and could easily be adapted into a Prefect, Apache NiFi or Apache Airflow ETL process. Data Search: Performing a quick search on DuckDuckGo, I got The COVID Tracking Project,…
Converting R scripts to Julia (Part 1)
UPDATE (8-JAN-2021): I have decided to demonstrate the conversion process using my Getting COVID-19 Data post and script instead of my Criminal Analysis project for right now. I will still be working through the conversion process for those scripts as well, but for now, I will demonstrate the conversion and translation process on a shorter…
Converting R scripts to Python (Part 1)
As part of my Criminal Analysis project, I have been initially coding everything in R. I am going to eventually convert everything to Python then probably to Julia. In preparation for this, I created my project directory to reflect this (as seen below). The following operations in the command line interface (CLI) will be in…