Abstract
Redistricting is the process of dividing a geographic area consisting of spatial units-often represented as spatial polygons-into smaller districts that satisfy some properties. It can therefore be formulated as a set partitioning problem where the objective is to cluster the set of spatial polygons into groups such that a value function is maximized [1]. Widely used algorithms developed for point-based data sets are not readily applicable because polygons introduce the concepts of spatial contiguity and other topological properties that cannot be captured by representing polygons as points. Furthermore, when clustering polygons, constraints such as spatial contiguity and unit distributedness should be strategically addressed. Toward this, we have developed the Constrained Polygonal Spatial Clustering (CPSC) algorithm based on the A* search algorithm that integrates cluster-level and instance-level constraints as heuristic functions. Using these heuristics, CPSC identifies the initial seeds, determines the best cluster to grow, and selects the best polygon to be added to the best cluster. We have devised two extensions of CPSC-CPSC* and CPSC*-PS-for problems where constraints can be soft or relaxed. Finally, we compare our algorithm with graph partitioning, simulated annealing, and genetic algorithm-based approaches in two applications- congressional redistricting and school districting.
Original language | English (US) |
---|---|
Article number | 5936062 |
Pages (from-to) | 2065-2079 |
Number of pages | 15 |
Journal | IEEE Transactions on Knowledge and Data Engineering |
Volume | 24 |
Issue number | 11 |
DOIs | |
State | Published - 2012 |
Keywords
- Spatial clustering
- constraint-based processing
- data mining
- polygonal clustering
- spatial databases and GIS
ASJC Scopus subject areas
- Information Systems
- Computer Science Applications
- Computational Theory and Mathematics