While it is often an overlooked aspect of research and commercial applications, joining tabular country data or performing statistical summaries of raster and vector feature data within country boundaries is a key component of geographical, political, climate, and social sciences. The CShapes database (Weidmann, Kuse, and Gleditsch 2010) is one of a small number of vectorized global municipal boundaries. Additional global vectorized municipal boundaries are available from The Database of Global Administrative Boundaries [GADM; University of California, Berkley (2018)], Natural Earth [NE; Kelso and Patterson (2010)], Esri/Garmin (Esri, Garmin International, Inc., U.S. Central Intelligence Agency, and National Geographic Society 2020), the Food and Agricultural Organization [GAUL; Food and Agricultural Organization of the United Nations (2015)], and the United Nations [SALB; The United Nations (2015)].
Whereas most of these data sets are fantastic resources for maps and current analysis, they provide no resources for historic visualizations or time-series analysis of spatially explicit country-year processes. The CShapes dataset presents ADMIN 0 boundaries, however, this is not the primary focus; the purpose of CShapes is to provide historical state boundaries dating back to 1946. This presents a few interesting questions: 1) what defines an independent state in this context, 2) what defines the spatial extent of a state, and 3) what constitutes a change in state boundaries? Weidmann, Kuse, and Gleditsch (2010) wisely chose to offload recognition of an independent state by relying on 2 well established lists of states born out of political science and conflict data sets: 1) Gleditsch and Ward (1999) and 2) the Correlates of War Project (2017). Most nation-state boundary data sets take practical approaches to the spatial extent of a state by relying on internationally accepted boundaries. States may often make wild claims about their territories (see Venezuela), however, as a practical matter, the actual boundaries are those by which they exert control over. This component of the CShapes dataset was developed internally and is described in detail within Weidmann, Kuse, and Gleditsch (2010). Lastly, CShapes relays temporal territorial changes by observing the merger, emergence, disappearance, or dissolution of independent states. These distinctions are also outlined in detail in Weidmann, Kuse, and Gleditsch (2010). CShapes is available as a standalone vectorized shapefile and a package for the R software package. As an added bonus, the
cshapes R package offers functionality to calculate paired country distances based using either the distances between capitals, the distance between the centroid of the nation-states, or the minimum distance between state boundaries. These are excellent tools for several modeling exercises.
NE is the only fully open-source and public domain pre-packaged database of vectorized administrative boundaries. Natural Earth presents global coverage of ADMIN 0 and ADMIN 1 boundaries, but no ADMIN 2 coverage and is available as downloadable shapefiles, SQlite, and GeoPackage, or a limited number of features via the
rnaturalfeatures R package. Unlike GADM, GAUL, and SALB, Natural Earth also features the premiere collection of additional public domain cultural and physical vector data themes. These include, but are not limited too, disputed areas, urban areas, parks and protected areas, water boundaries, coastlines, reefs, lakes, and bathymetry. Although NE delivers the largest collection of public domain vectorized boundaries layers, users should be aware of limited data quality issues. There are sparse instances of boundary overlaps, bad geometries, and topology errors that may cause processing errors in some GIS software or packages. The Esri/Garmin World Countries dataset suffers from similar issues.
GADM is available as a standalone vectorized shapefile layer and is also distributed through the R
raster package. GADM presents highly detailed boundaries to ADMIN 2 municipal subdivisions (U.S. county equivilent), while Esri/Garmin databases offer ADMIN 1 boundaries (U.S. States), Natural Earth provides ADMIN 1 boundaries for a limited number of countries, and GAUL and SALB provide ADMIN 1 and ADMIN 2 boundaries for select nation-states. Although GADM’s level of detail and delineation is superior to its competitors, there are drawbacks. The high resolution, specifically along coastlines, is often cumbersome when processing the data at large spatial extents with geospatial software. GADM boundaries may require simplification procedures to reduce processing times. Furthermore, GADM boundaries are not validated by any federal or international entity; consequently, while they are detailed, they are not necessarily accurate. A 2011 comparison of vectorized global administrative boundaries found that GADM boundaries were less accurate than SALB and GAUL (Brigham, Gilbert, and Xu 2011). For personal use and research exercises, the lack of validation is likey inconsequential; it’s also not commonplace in most geographic boundary databases. SALB promotes federally validated geo-databases, but participating nation-state boundaries are decentralized and must be acquired individually through the SALB web portal that links to federal geo-spatial repositories. GADM nation-state layers are aggregated into a singular data product that is more convenient for most applications.
As a final note, although GADM has no restrictions for personal and academic use, commercial use requires a licensing fee that is potentially prohibitive for small to medium sized business useage. GAUL and SALB also prohibits commercial use of their data products. This is in stark contrast to Natural Earth and CShapes, which implement Creative Commons licenses, and Esri/Garmin’s World Countries data product that is bundled with Esri GIS products and have complex transference rules.
Screenshot or Representative Figure:
Free for all use.
We describe CShapes, a new dataset that provides historical maps of state boundaries and capitals in the post-World War II period. The dataset is coded according to both the Correlates of War and the Gleditsch and Ward (1999) state lists, and is therefore compatible with a great number of existing databases in the discipline. Provided in a geographic data format, CShapes can be used directly with standard GIS software, allowing a wide range of spatial computations. In addition, we supply a CShapes package for the R statistical toolkit. This package enables researchers without GIS skills to perform various useful operations on the GIS maps. The paper introduces the CShapes dataset and structure and gives three examples of how to use CShapes in political science research. First, we show how results from quantitative analysis can be depicted intuitively as a map. The second application gives an example of computing indicators on the CShapes maps, which can then be used in statistical tests. Third, we illustrate the use of CShapes for generating different weights matrices in spatial statistical applications. All the examples can be replicated using the freely available R package and do not require specialized GIS skills. The dataset is available for download from the CShapes website (http://nils.weidmann.ws/projects/cshapes).
- West Bounding Coordinate: -180.00
- East Bounding Coordinate: 180.00
- North Bounding Coordinate: 83.11387
- South Bounding Coordinate: -55.90223
Spatial Reference Information:
- Coordinate System: Latitude and Longitude
- Geodetic Model: WGS1984
Time Period Information:
- Beginning Date: 1946
- Ending Date: 2016
- Resolution: annual