Published September 5, 2020 | Version 4.4.0
Dataset Open A collective open dataset of COVID-19 outbreak in the south Indian state of Kerala

Description is a consolidated multi-source open dataset of metadata from the COVID-19 outbreak in the Indian state of Kerala. It is created and maintained by volunteers of ‘Collective for Open Data Distribution-Keralam’ (CODD-K), a nonprofit consortium of individuals formed for the distribution and longevity of open-datasets. covers a set of correlated temporal and spatial metadata of SARS-CoV-2 infections and prevention measures in Kerala. Static releases of this dataset snapshots are manually produced from a live database maintained as a set of publicly accessible Google sheets. This dataset is made available under the Open Data Commons Attribution License v1.0 (ODC-BY 1.0). 

Schema and data package
Datapackage with schema definition is accessible at Provided datapackage and schema are based on Frictionless data Data Package specification.

Temporal and Spatial Coverage 

This dataset covers COVID-19 outbreak and related data from the state of Kerala, India, from January 31, 2020 till the date of the publication of this snapshot. The dataset shall be maintained throughout the entirety of the COVID-19 outbreak.  

The spatial coverage of the data lies within the geographical boundaries of the Kerala state which includes its 14 administrative subdivisions. The state is further divided into Local Self Governing (LSG) Bodies. Reference to this spatial information is included on appropriate data facets. Available spatial information on regions outside Kerala was mentioned, but it is limited as a reference to the possible origins of the infection clusters or movement of the individuals.  

Longevity and Provenance 

The dataset snapshot releases are published and maintained in a designated GitHub repository maintained by CODD-K team. Periodic snapshots from the live database will be released at regular intervals. The GitHub commit logs for the repository will be maintained as a record of provenance, and archived repository will be maintained at the end of the project lifecycle for the longevity of the dataset.

Data Stewardship 

CODD-K expects all administrators, managers, and users of its datasets to manage, access, and utilize them in a manner that is consistent with the consortium’s need for security and confidentiality and relevant legal frameworks within all geographies, especially Kerala and India. As a responsible steward to maintain and make this dataset accessible— CODD-K absolves from all liabilities of the damages, if any caused by inaccuracies in the dataset. 


This dataset is made available by the CODD-K consortium under ODC-BY 1.0 license. The Open Data Commons Attribution License (ODC-By) v1.0 ensures that users of this dataset are free to copy, distribute and use the dataset to produce works and even to modify, transform and build upon the database, as long as they attribute the public use of the database or works produced from the same, as mentioned in the citation below. 

Disclaimer is provided under the ODC-BY 1.0 license as-is. Though every attempt is taken to ensure that the data is error-free and up to date, the CODD-K consortium do not bear any responsibilities for inaccuracies in the dataset or any losses—monetary or otherwise—that users of this dataset may incur. 


Files (846.6 kB)

Name Size Download all
846.6 kB Preview Download

Additional details


  • A citizen science initiative for open data and visualization of COVID-19 outbreak in Kerala, India Jijo Pulickiyil Ulahannan, NIkhil Narayanan, Nishad Thalhath, Prem Prabhakaran, Sreekanth Chaliyeduth, Sooraj P Suresh, Musfir Mohammed, Rajeevan E, Sindhu Joseph, Akhil Balakrishnan, Jeevan Uthaman, Manoj Karingamadathil, Sunil Thonikkuzhiyil Thomas, Unnikrishnan Sureshkumar, Shabeesh Balan, Neetha Nanoth Vellichirammal medRxiv 2020.05.13.20092510; doi: