Using ogr2ogr to convert Shapefiles to GeoJSON

nyc map outline graphic

Fireworks are a great summer pastime, and another great pastime here in NYC is complaining.

NYC map of firework complaints by zip code

Put the two together and you've got fireworks complaints.

This map was created from NYC zip code boundaries GIS data, which comes as a shapefile that you can download from NYC Open Data.

The focus of this post is just on preparing the zip code boundaries data, but not actually creating the map - you can find the code for the map here.

We will use a command line tool called ogr2ogr from the GDAL library.

If you're familiar with GIS data, feel free to skip right to the command.

To-do

  1. Convert the zip code shapefile into GeoJSON format.
  2. We will also be converting the map data from its current projected coordinate system into latitude and longitude coordinates.

We can do all of this with one command using ogr2ogr.

Working with GIS data

These are the files you will get if you download the zip code boundaries data, as of June 2020.

ZIP_CODE_040114.dbf
ZIP_CODE_040114.sbn
ZIP_CODE_040114.shp
ZIP_CODE_040114.shx
ZIP_CODE_040114.prj
ZIP_CODE_040114.sbx
ZIP_CODE_040114.shp.xml

The shapefile is ZIP_CODE_040114.shp, and this is the file we are most concerned with.

A shapefile contains data that describes geographic features.

If you look at the zip code map above, you can see that each zip code area makes up a polygon, and the data describes the boundaries of these polygons.

polygon

The data contains the coordinates that make up each of the polygons, as well as any other attributes about the data you might want to include, such as the zip code associated with each polygon.


In another post about working with GeoDjango, I used the GDAL library to examine a shapefile and the attributes it contains.

I won't go into that in this post, but if you are interested please check it out here.


Projected coordinate systems

This zip code data is in the Lambert Conic Conformal projection.

You can read more about projection here, but basically it is a way to flatten a spherical surface (the Earth) onto a plane (the map).

We are going to convert the data from the Lambert Conic Conformal projection to latitude and longitude coordinates.

The map is unprojected when latitude and longitude coordinates are used.

Okay, now it's time to convert.

ogr2ogr

The docs for ogr2ogr can be found here.

First let's look at the projection metadata, which is in the .prj file.

If you open ZIP_CODE_040114.prj you will see that the projection is Lambert Conformal Conic.

PROJECTION["Lambert_Conformal_Conic"]

With GDAL we need to update the projection in the .prj file to "Lambert_Conformal_Conic_2SP”.

That is the supported version name - see the docs.

So just make that change in ZIP_CODE_040114.prj and save the file.

Now we are ready to convert the map coordinates, as well as the file format into GeoJSON.

ogr2ogr -f GeoJSON -s_srs ZIP_CODE_040114.prj -t_srs EPSG:4326 zipcodes_nyc.json ZIP_CODE_040114.shp
  • The -f flag indicates the format for the output, which is GeoJSON.
  • The -s_srs flag specifies the source file spatial reference system (SRS) which is the projection we are starting with, and ogr2ogr can parse the file ending in .prj and get the projection from it.
  • The t_srs flag specifies the SRS that we are converting to, which in this case is EPSG:4326, which represents latitude and longitude coordinates.
  • The output file name, zipcodes_nyc.json.
  • The input file name, ZIP_CODE_040114.shp.

Each spatial reference system has an EPSG code, and the code for geographic coordinates is 4326 - read more here.

Now you should have a file zipcodes_nyc.json that is ready to go.

Merry Map-Making!

If you have questions or comments, feel free to write them below or reach out to me on Twitter @LVNGD.

And please show me any interesting maps you make as well!

blog comments powered by Disqus

Recent Posts

point_polygon_yn.png
Point in Polygon search with MongoDB
Nov. 12, 2022

In a recent project, we had a large number of points on a canvas, where a user could draw a region of interest to see only the points within that area. Here is a demo of how to do that using MongoDB with a geospatial 2D-index. Visualized using D3.

Read More
main_graphic.jpg
Image Similarity with Python Part II: Nearest Neighbor Search
Feb. 18, 2022

This is Part II of my post on image similarity in Python with perceptual hashing. In this post, we will use Spotify's Annoy library to perform nearest neighbors search on a collection of images to find similar images to a query image.

Read More
kruskal animation shot
Kruskal's Algorithm Animation + Maze Generation
Feb. 7, 2022

Kruskal's algorithm finds a minimum spanning tree in an undirected, connected and weighted graph. We will use a union-find algorithm to do this, and generate a random maze from a grid of points.

Read More
Get the latest posts as soon as they come out!