Mapping and Plotting Tools

geopandas provides a high-level interface to the matplotlib library for making maps. Mapping shapes is as easy as using the plot() method on a GeoSeries or GeoDataFrame.

Loading some example data:

In [1]: world = geopandas.read_file(geopandas.datasets.get_path('naturalearth_lowres'))

In [2]: cities = geopandas.read_file(geopandas.datasets.get_path('naturalearth_cities'))

We can now plot those GeoDataFrames:

# Examine country GeoDataFrame
In [3]: world.head()
Out[3]: 
     pop_est  ...                                           geometry
0     920938  ...  MULTIPOLYGON (((180.00000 -16.06713, 180.00000...
1   53950935  ...  POLYGON ((33.90371 -0.95000, 34.07262 -1.05982...
2     603253  ...  POLYGON ((-8.66559 27.65643, -8.66512 27.58948...
3   35623680  ...  MULTIPOLYGON (((-122.84000 49.00000, -122.9742...
4  326625791  ...  MULTIPOLYGON (((-122.84000 49.00000, -120.0000...

[5 rows x 6 columns]

# Basic plot, random colors
In [4]: world.plot();
../../_images/world_randomcolors.png

Note that in general, any options one can pass to pyplot in matplotlib (or style options that work for lines) can be passed to the plot() method.

Choropleth Maps

geopandas makes it easy to create Choropleth maps (maps where the color of each shape is based on the value of an associated variable). Simply use the plot command with the column argument set to the column whose values you want used to assign colors.

 # Plot by GDP per capita
In [5]: world = world[(world.pop_est>0) & (world.name!="Antarctica")]

In [6]: world['gdp_per_cap'] = world.gdp_md_est / world.pop_est

In [7]: world.plot(column='gdp_per_cap');
../../_images/world_gdp_per_cap.png

Creating a legend

When plotting a map, one can enable a legend using the legend argument:

# Plot population estimates with an accurate legend
In [8]: import matplotlib.pyplot as plt

In [9]: fig, ax = plt.subplots(1, 1)

In [10]: world.plot(column='pop_est', ax=ax, legend=True)
Out[10]: <AxesSubplot:>
../../_images/world_pop_est.png

However, the default appearance of the legend and plot axes may not be desirable. One can define the plot axes (with ax) and the legend axes (with cax) and then pass those in to the plot() call. The following example uses mpl_toolkits to vertically align the plot axes and the legend axes:

# Plot population estimates with an accurate legend
In [11]: from mpl_toolkits.axes_grid1 import make_axes_locatable

In [12]: fig, ax = plt.subplots(1, 1)

In [13]: divider = make_axes_locatable(ax)

In [14]: cax = divider.append_axes("right", size="5%", pad=0.1)

In [15]: world.plot(column='pop_est', ax=ax, legend=True, cax=cax)
Out[15]: <AxesSubplot:>
../../_images/world_pop_est_fixed_legend_height.png

And the following example plots the color bar below the map and adds its label using legend_kwds:

# Plot population estimates with an accurate legend
In [16]: import matplotlib.pyplot as plt

In [17]: fig, ax = plt.subplots(1, 1)

In [18]: world.plot(column='pop_est',
   ....:            ax=ax,
   ....:            legend=True,
   ....:            legend_kwds={'label': "Population by Country",
   ....:                         'orientation': "horizontal"})
   ....: 
Out[18]: <AxesSubplot:>
../../_images/world_pop_est_horizontal.png

Choosing colors

One can also modify the colors used by plot() with the cmap option (for a full list of colormaps, see the matplotlib website):

In [19]: world.plot(column='gdp_per_cap', cmap='OrRd');
../../_images/world_gdp_per_cap_red.png

To make the color transparent for when you just want to show the boundary, you have two options. One option is to do world.plot(facecolor="none", edgecolor="black"). However, this can cause a lot of confusion because "none" and None are different in the context of using facecolor and they do opposite things. None does the “default behavior” based on matplotlib, and if you use it for facecolor, it actually adds a color. The second option is to use world.boundary.plot(). This option is more explicit and clear.:

In [20]: world.boundary.plot();
../../_images/world_gdp_per_cap_transparent.png

The way color maps are scaled can also be manipulated with the scheme option (if you have mapclassify installed, which can be accomplished via conda install -c conda-forge mapclassify). The scheme option can be set to any scheme provided by mapclassify (e.g. ‘box_plot’, ‘equal_interval’, ‘fisher_jenks’, ‘fisher_jenks_sampled’, ‘headtail_breaks’, ‘jenks_caspall’, ‘jenks_caspall_forced’, ‘jenks_caspall_sampled’, ‘max_p_classifier’, ‘maximum_breaks’, ‘natural_breaks’, ‘quantiles’, ‘percentiles’, ‘std_mean’ or ‘user_defined’). Arguments can be passed in classification_kwds dict. See the mapclassify documentation for further details about these map classification schemes.

In [21]: world.plot(column='gdp_per_cap', cmap='OrRd', scheme='quantiles');
../../_images/world_gdp_per_cap_quantiles.png

Missing data

In some cases one may want to plot data which contains missing values - for some features one simply does not know the value. Geopandas (from the version 0.7) by defaults ignores such features.

In [22]: import numpy as np

In [23]: world.loc[np.random.choice(world.index, 40), 'pop_est'] = np.nan

In [24]: world.plot(column='pop_est');
../../_images/missing_vals.png

However, passing missing_kwds one can specify the style and label of features containing None or NaN.

In [25]: world.plot(column='pop_est', missing_kwds={'color': 'lightgrey'});

In [26]: world.plot(
   ....:     column="pop_est",
   ....:     legend=True,
   ....:     scheme="quantiles",
   ....:     figsize=(15, 10),
   ....:     missing_kwds={
   ....:         "color": "lightgrey",
   ....:         "edgecolor": "red",
   ....:         "hatch": "///",
   ....:         "label": "Missing values",
   ....:     },
   ....: );
   ....: 
../../_images/missing_vals_grey.png ../../_images/missing_vals_hatch.png

Other map customizations

Maps usually do not have to have axis labels. You can turn them off using set_axis_off() or axis("off") axis methods.

In [27]: ax = world.plot()

In [28]: ax.set_axis_off();
../../_images/set_axis_off.png

Maps with Layers

There are two strategies for making a map with multiple layers – one more succinct, and one that is a little more flexible.

Before combining maps, however, remember to always ensure they share a common CRS (so they will align).

# Look at capitals
# Note use of standard `pyplot` line style options
In [29]: cities.plot(marker='*', color='green', markersize=5);

# Check crs
In [30]: cities = cities.to_crs(world.crs)

# Now we can overlay over country outlines
# And yes, there are lots of island capitals
# apparently in the middle of the ocean!
../../_images/capitals.png

Method 1

In [31]: base = world.plot(color='white', edgecolor='black')

In [32]: cities.plot(ax=base, marker='o', color='red', markersize=5);
../../_images/capitals_over_countries_1.png

Method 2: Using matplotlib objects

In [33]: import matplotlib.pyplot as plt

In [34]: fig, ax = plt.subplots()

# set aspect to equal. This is done automatically
# when using *geopandas* plot on it's own, but not when
# working with pyplot directly.
In [35]: ax.set_aspect('equal')

In [36]: world.plot(ax=ax, color='white', edgecolor='black')
Out[36]: <AxesSubplot:>

In [37]: cities.plot(ax=ax, marker='o', color='red', markersize=5)
Out[37]: <AxesSubplot:>

In [38]: plt.show();
../../_images/capitals_over_countries_2.png

Control the order of multiple layers in a plot

When plotting multiple layers, use zorder to take control of the order of layers being plotted. The lower the zorder is, the lower the layer is on the map and vice versa.

Without specified zorder, cities (Points) gets plotted below world (Polygons), following the default order based on geometry types.

In [39]: ax = cities.plot(color='k')

In [40]: world.plot(ax=ax);
../../_images/zorder_default.png

We can set the zorder for cities higher than for world to move it of top.

In [41]: ax = cities.plot(color='k', zorder=2)

In [42]: world.plot(ax=ax, zorder=1);
../../_images/zorder_set.png

Pandas Plots

Plotting methods also allow for different plot styles from pandas along with the default geo plot. These methods can be accessed using the kind keyword argument in plot(), and include:

  • geo for mapping

  • line for line plots

  • bar or barh for bar plots

  • hist for histogram

  • box for boxplot

  • kde or density for density plots

  • area for area plots

  • scatter for scatter plots

  • hexbin for hexagonal bin plots

  • pie for pie plots

In [43]: gdf = world.head(10)

In [44]: gdf.plot(kind='scatter', x="pop_est", y="gdp_md_est")
Out[44]: <AxesSubplot:xlabel='pop_est', ylabel='gdp_md_est'>
../../_images/pandas_line_plot.png

You can also create these other plots using the GeoDataFrame.plot.<kind> accessor methods instead of providing the kind keyword argument.

In [45]: gdf.plot.bar()
Out[45]: <AxesSubplot:>
../../_images/pandas_bar_plot.png

For more information check out the pandas documentation.

Other Resources

Links to jupyter Notebooks for different mapping tasks:

Making Heat Maps