Published November 10, 2024 | Version 1
Presentation Open

New Python based Architecture for the Keck Observatory Archive

  • 1. ROR icon California Institute of Technology
  • 2. ROR icon NASA Exoplanet Science Institute

Description

We describe the development of the Keck Observatory Archive (KOA) Data Discovery Service, a web-based dashboard that returns metadata for wide-area queries of the entire KOA archive within seconds. Currently in beta, this dashboard will support exploration, visualization, and data access across multiple instruments. This effort is underpinned by open-source VO compliant query infrastructure, and will offer services that can be hosted on web pages or in Jupyter notebooks. The effort also informs the design of a new modern landing page that meets expectations of accessibility and ease of use.

 

The new query infrastructure is based on nexsciTAP, a component-based, DBMS-agnostic Python implementation of the IVOA Table Access Protocol, developed at NExScI and integrated into the NASA Exoplanet Archive and the NEID archive, and into the PyKOA Python client. This infrastructure incorporates R-tree spatial indexing, built as memory-mapped files as part of Montage, a software toolkit used to create composite astronomical images. R-trees are used most often in geospatial analysis, here they enable searches of the entire KOA archive, an eclectic collection of 100 million records of imaging and spectroscopic data, in 2 seconds, and speeds up spatial searches by x20. The front end is built on the open-source Plotly-Dash framework, which allows users to build an interactive user interface based on a single python file.

Files

adass_2024_koa_dds.pdf

Files (779.7 kB)

Name Size Download all
md5:4af3a10697a87498ff6f74cfaba95612
779.7 kB Preview Download

Additional details

Dates

Submitted
2024-11-03
Presented at ADASS 2024

Software

Programming language
Python
Development Status
Active