This R script performs a combined SOM/SuperSOM clustering of the 640 administrative districs of India. Fowlkes-Mallows Similarity Index is used to identify robust initializations of the clustering. Data_India.txt is a specially conceived geographic database of 55 indicators, covering issues of economic activity, urban structure, socio-demographic development, consumption levels, infrastructure endowment and basic geographical positioning within the Indian space. Data refer to 2011 or to 2001-2011 evolutions.
Robust SOM Clustering is part of the R-Geo-Soft Models Project.
Fusco G., Perez J., 2015, Spatial Analysis of the Indian Subcontinent: the Complexity Investigated through Neural Networks, CUPUM 2015 - 14th International Conference on Computers in Urban Planning and Urban Management, MIT, Cambridge (Ma.), July 5th-7th 2015, Proceedings, 287, 1-20, http://web.mit.edu/cron/project/CUPUM2015/proceedings/Content/analytics/287_fusco_h.pdf
Perez J., 2015, Spatial Structures in India in the Age of Globalisation. A Data-Driven Approach, Phd in geography, University of Avignon (France)
Wehrens R., Buydens L., 2007, Self- and Super-organizing Maps in R: The ko-honen Package, Journal of Statistical Software, Vol. 21, Issue 5.