Using ogr2ogr to convert Shapefiles to GeoJSON

nyc map outline graphic

Fireworks are a great summer pastime, and another great pastime here in NYC is complaining.

NYC map of firework complaints by zip code

Put the two together and you've got fireworks complaints.

This map was created from NYC zip code boundaries GIS data, which comes as a shapefile that you can download from NYC Open Data.

The focus of this post is just on preparing the zip code boundaries data, but not actually creating the map - you can find the code for the map here.

We will use a command line tool called ogr2ogr from the GDAL library.

If you're familiar with GIS data, feel free to skip right to the command.

To-do

  1. Convert the zip code shapefile into GeoJSON format.
  2. We will also be converting the map data from its current projected coordinate system into latitude and longitude coordinates.

We can do all of this with one command using ogr2ogr.

Working with GIS data

These are the files you will get if you download the zip code boundaries data, as of June 2020.

ZIP_CODE_040114.dbf
ZIP_CODE_040114.sbn
ZIP_CODE_040114.shp
ZIP_CODE_040114.shx
ZIP_CODE_040114.prj
ZIP_CODE_040114.sbx
ZIP_CODE_040114.shp.xml

The shapefile is ZIP_CODE_040114.shp, and this is the file we are most concerned with.

A shapefile contains data that describes geographic features.

If you look at the zip code map above, you can see that each zip code area makes up a polygon, and the data describes the boundaries of these polygons.

polygon

The data contains the coordinates that make up each of the polygons, as well as any other attributes about the data you might want to include, such as the zip code associated with each polygon.


In another post about working with GeoDjango, I used the GDAL library to examine a shapefile and the attributes it contains.

I won't go into that in this post, but if you are interested please check it out here.


Projected coordinate systems

This zip code data is in the Lambert Conic Conformal projection.

You can read more about projection here, but basically it is a way to flatten a spherical surface (the Earth) onto a plane (the map).

We are going to convert the data from the Lambert Conic Conformal projection to latitude and longitude coordinates.

The map is unprojected when latitude and longitude coordinates are used.

Okay, now it's time to convert.

ogr2ogr

The docs for ogr2ogr can be found here.

First let's look at the projection metadata, which is in the .prj file.

If you open ZIP_CODE_040114.prj you will see that the projection is Lambert Conformal Conic.

PROJECTION["Lambert_Conformal_Conic"]

With GDAL we need to update the projection in the .prj file to "Lambert_Conformal_Conic_2SP”.

That is the supported version name - see the docs.

So just make that change in ZIP_CODE_040114.prj and save the file.

Now we are ready to convert the map coordinates, as well as the file format into GeoJSON.

ogr2ogr -f GeoJSON -s_srs ZIP_CODE_040114.prj -t_srs EPSG:4326 zipcodes_nyc.json ZIP_CODE_040114.shp
  • The -f flag indicates the format for the output, which is GeoJSON.
  • The -s_srs flag specifies the source file spatial reference system (SRS) which is the projection we are starting with, and ogr2ogr can parse the file ending in .prj and get the projection from it.
  • The t_srs flag specifies the SRS that we are converting to, which in this case is EPSG:4326, which represents latitude and longitude coordinates.
  • The output file name, zipcodes_nyc.json.
  • The input file name, ZIP_CODE_040114.shp.

Each spatial reference system has an EPSG code, and the code for geographic coordinates is 4326 - read more here.

Now you should have a file zipcodes_nyc.json that is ready to go.

Merry Map-Making!

If you have questions or comments, feel free to write them below or reach out to me on Twitter @LVNGD.

And please show me any interesting maps you make as well!

blog comments powered by Disqus

Recent Posts

abstract_tree.png
Solving the Lowest Common Ancestor Problem in Python
May 9, 2023

Finding the Lowest Common Ancestor of a pair of nodes in a tree can be helpful in a variety of problems in areas such as information retrieval, where it is used with suffix trees for string matching. Read on for the basics of this in Python.

Read More
rectangles_cover.png
How to write a custom fragment shader in GLSL and use it with three.js
April 16, 2023

This blog post walks through the process of writing a fragment shader in GLSL, and using it within the three.js library for working with WebGL. We will render a visually appealing grid of rotating rectangles that can be used as a website background.

Read More
streaming data
Streaming data with Flask and Fetch + the Streams API
April 10, 2023

Streaming can be a great way to transfer and process large amounts of data. It can help save space and/or time, if the data uses a lot of memory, or if you want to start processing or visualizing the data as it comes in.

Read More
Get the latest posts as soon as they come out!