pyQuARC: Open Source Library for Earth Observation Metadata Quality Assessment
Description
pyQuARC is designed to read and evaluate Earth observation metadata records hosted within the Common Metadata Repository (CMR), a centralized metadata repository for all of NASA's Earth observation data products. The CMR serves as the backend for NASA's Earthdata Search, so high-quality metadata helps connect users to the data available there. pyQuARC implements the Analysis and Review of CMR (ARC) team's metadata quality assessment framework to provide prioritized recommendations for improving metadata and optimizing search results. pyQuARC performs basic validation checks, pinpoints inconsistencies between dataset-level (i.e., collection) and file-level (i.e., granule) metadata, and identifies opportunities for more descriptive and robust information. It currently supports the DIF10 (collection), ECHO10 (collection and granule), UMM-C, and UMM-G metadata standards. As open source software, pyQuARC can be adapted to add customized checks, implement future metadata standards, or support other metadata types.
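The three kinds of checks described above (basic validation, collection/granule consistency, and descriptiveness) and the idea of prioritized recommendations can be illustrated with a minimal, self-contained sketch. This is not pyQuARC's API: the `Finding` class, the check functions, the flat dictionary field names, the severity labels, and the 100-character abstract threshold are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Dict, List

# Hypothetical severity ranking used to order recommendations (not pyQuARC's).
SEVERITY_RANK = {"error": 0, "warning": 1, "info": 2}


@dataclass
class Finding:
    """One result produced by a check."""
    check: str
    severity: str  # "error", "warning", or "info"
    message: str


def parse_utc(value: str) -> datetime:
    """Parse an ISO 8601 timestamp such as '2020-01-01T00:00:00Z'."""
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def check_required_fields(record: Dict[str, str], required: List[str]) -> List[Finding]:
    """Basic validation: flag required fields that are missing or empty."""
    return [
        Finding("required_field", "error", f"'{name}' is missing or empty")
        for name in required
        if not record.get(name)
    ]


def check_granule_within_collection(
    collection: Dict[str, str], granule: Dict[str, str]
) -> List[Finding]:
    """Consistency: a granule's time range should fall inside its collection's."""
    c_start, c_end = parse_utc(collection["begin"]), parse_utc(collection["end"])
    g_start, g_end = parse_utc(granule["begin"]), parse_utc(granule["end"])
    if g_start < c_start or g_end > c_end:
        message = "granule time range falls outside the collection time range"
        return [Finding("temporal_consistency", "warning", message)]
    return []


def check_abstract_length(record: Dict[str, str], minimum: int = 100) -> List[Finding]:
    """Descriptiveness: suggest expanding a very short abstract."""
    if len(record.get("abstract", "")) < minimum:
        message = f"abstract is shorter than {minimum} characters"
        return [Finding("abstract_length", "info", message)]
    return []


def prioritize(findings: List[Finding]) -> List[Finding]:
    """Order findings so the most severe come first."""
    return sorted(findings, key=lambda f: SEVERITY_RANK[f.severity])


# Made-up records with simplified, flat field names.
collection = {
    "title": "Example Collection",
    "abstract": "Short text.",
    "begin": "2020-01-01T00:00:00Z",
    "end": "2020-12-31T23:59:59Z",
}
granule = {"begin": "2021-01-05T00:00:00Z", "end": "2021-01-05T01:00:00Z"}

findings = prioritize(
    check_abstract_length(collection)
    + check_granule_within_collection(collection, granule)
    + check_required_fields(collection, ["title", "doi"])
)
for finding in findings:
    print(f"[{finding.severity}] {finding.check}: {finding.message}")
```

Real metadata records in DIF10, ECHO10, UMM-C, or UMM-G are nested XML or JSON documents, so an actual check would first need to locate the relevant fields in each standard; the flat dictionaries here stand in for that step.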
Files
Name | Size | Checksum |
---|---|---|
NASA-IMPACT/pyQuARC-v1.2.5.zip | 947.7 kB | md5:a32f2adb2d1b0826ac951db4bbf8c8fc |
Additional details
Related works
- Documents
- Other: https://wiki.earthdata.nasa.gov/display/CMR/CMR+Metadata+Best+Practices%3A+Landing+Page (URL)
- Is supplement to
- Software: https://github.com/NASA-IMPACT/pyQuARC/tree/v1.2.5 (URL)
Dates
- Available: 2021-08-19 (V1.0.0 Beta Release)
- Updated: 2021-09-02 (Support for DIF10 collection level metadata)
- Updated: 2022-02-07 (Include 'version.txt' in package build)
- Updated: 2022-02-28 (Minor bug fixes and enhancements)
- Updated: 2022-03-03 (Bug fixes and enhancements)
- Updated: 2022-03-11 (Bug fixes and enhancements)
- Updated: 2022-07-21 (Support for ECHO10 granule, UMM-C, and UMM-G metadata)
- Updated: 2022-12-02 (Support for regression testing and enhancements)
- Updated: 2022-12-22 (Bug fixes)
- Updated: 2023-08-02 (Revisions and enhancements)
- Updated: 2023-10-11 (Minor revisions and additional checks)
- Updated: 2024-02-28 (Minor Updates)
Software
- Repository URL: https://github.com/NASA-IMPACT/pyQuARC
- Programming language: Python
- Development Status: Active
References
- Bugbee, K., Roux, J. L., Sisco, A., Kaulfus, A., Staton, P., Woods, C., Dixon, V., Lynnes, C., & Ramachandran, R. (2021). Improving discovery and use of NASA's Earth observation data through metadata quality assessments. Data Science Journal, 20. https://doi.org/10.5334/dsj-2021-017
- White House Office of Science and Technology Policy (OSTP). (2016). Common Framework for Earth-Observation Data. https://obamawhitehouse.archives.gov/sites/default/files/microsites/ostp/common_framework_for_earth_observation_data.pdf