emem: A Content-Addressed, Verifiable Earth-Memory Protocol for AI Agents over Foundation-Model Embeddings
Description
LLM agents asked “what is on the ground at this place, right now” have no stable handle for a patch of Earth and no way to prove the answer they return; the same agents asked “what did we learn here” fall back on unsigned, per-session scratchpads a server can rewrite. We present emem, a protocol giving both questions one trust surface. Every spatial fact is keyed by a 64- bit geographic cell (cell64, about 9.55 m at the equator), a band, and a temporal slot (tslot); the fact is serialized to deterministic CBOR, content-addressed by a truncated BLAKE3 digest (a 26-character base32 fact_cid), and returned inside an Ed25519 receipt over a domain-separated, length-prefixed preimage that any party can verify offline against the responder’s public key, with-out trusting the issuer. Facts are grounded in frozen Earth-observation foundation embeddings: Tessera (128-D, CPU-streamed from Cloud-Optimized GeoTIFFs), Clay v1.5 and Prithvi-EO-2.0 (1024-D, GPU-pinned), and Galileo (multimodal), whose independent receptive fields drive a triple-encoder change-consensus algorithm in which agreement is signal and a lone vote flags receptive-field aliasing. A read layer offers bi-temporal recall, Lance IVF_PQ k-NN with a TurboQuant binary-rotation Hamming fast path, region analytics, and a temporal-freshness kernel; a memory layer adds Anthropic-memory-tool file verbs over a CoALA-typed, capability-bound scratchpad, signed temporal edges, multi-attester contradiction scoring, and episodic-to-semantic merge with an opt-in sleep-time refiner. Cold cells materialize lazily on first request: any cell on Earth answers citeably with no pre-seeded corpus, and absence is itself a signed, typed, content-addressed receipt rather than a 404. We describe the protocol, its mathematics, a single-binary Rust reference implementation serving mirrored MCP and REST surfaces, and a mapping onto recent agent-memory literature.
Files
emem-zenodo.pdf
Files
(564.6 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:48ae4f8a782c9957a07f36109a586abf
|
564.6 kB | Preview Download |
Additional details
Dates
- Issued
-
2026-06-15Date of manuscript completion
Software
- Repository URL
- https://github.com/Vortx-AI/emem
- Programming language
- Rust , Python
- Development Status
- Active
References
- [1] Tessera Project. "Tessera: a global per-pixel annual Earth-observation embedding (geotessera)." Cambridge / geotessera, 128-D annual embedding stack 2017-2024, published as Cloud-Optimized GeoTIFF tiles. https://github.com/ucam-eo/geotessera [2] Clay Foundation. "Clay v1.5: an open foundation model for Earth." Wavelength-conditioned ViT- L/8 MAE with DINOv2 teacher, 1024-D, Apache-2.0. https://github.com/Clay-foundation/model [3] Jakubik, J., et al. "Prithvi-EO-2.0: a versatile multi-temporal foundation model for Earth observation applications." IBM/NASA, Prithvi-EO-2.0-300M-TL, 2024. arXiv:2412.02732. https://arxiv.org/abs/2412.02732 [4] Tseng, G., et al. "Galileo: learning global and local features of many remote sensing modal- ities." NASA Harvest, multimodal (S1/S2/DEM/climate) geospatial foundation model, 2025. arXiv:2502.09356. https://arxiv.org/abs/2502.09356 [5] McClelland, J.L., McNaughton, B.L., O'Reilly, R.C. "Why there are complementary learn- ing systems in the hippocampus and neocortex: insights from the successes and failures of connectionist models of learning and memory." Psychological Review 102(3):419-457, 1995. https://doi.org/10.1037/0033-295X.102.3.419 [6] Rasmussen, P., et al. "Zep: a temporal knowledge graph architecture for agent memory (Graphiti)." arXiv:2501.13956, 2025. https://arxiv.org/abs/2501.13956 [7] Sumers, T., Yao, S., Narasimhan, K., Griffiths, T. "Cognitive Architectures for Lan- guage Agents (CoALA)." Transactions on Machine Learning Research, 2024. arXiv:2309.02427. https://arxiv.org/abs/2309.02427 [8] Packer, C., et al. "MemGPT: Towards LLMs as Operating Systems." arXiv:2310.08560, 2023. https://arxiv.org/abs/2310.08560 [9] MIRIX: Multi-Agent Memory System for LLM-Based Agents (six memory types). arXiv:2507.07957, 2025. https://arxiv.org/abs/2507.07957 [10] Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory. arXiv:2504.19413, 2025. https://arxiv.org/abs/2504.19413 [11] Hu, Y., et al. "Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions (MemoryAgentBench)." arXiv:2507.05257, 2025. https://arxiv.org/abs/2507.05257 [12] Lewis, P., et al. "Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks." NeurIPS, 2020. arXiv:2005.11401. https://arxiv.org/abs/2005.11401 [13] Lei, J., Zhang, D., Li, J., et al. "delta-mem: Efficient Online Memory for Large Language Models." arXiv:2605.12357, 2026. (Compact in-attention associative memory.) [14] Laurie, B., Langley, A., Kasper, E. "Certificate Transparency." RFC 6962, IETF, 2013. (Merkle leaf/node domain separation.) https://www.rfc-editor.org/rfc/rfc6962 [15] Josefsson, S., Liusvaara, I. "Edwards-Curve Digital Signature Algorithm (EdDSA)." RFC 8032, IETF, 2017. (Ed25519.) https://www.rfc-editor.org/rfc/rfc8032 [16] Bormann, C., Hoffman, P. "Concise Binary Object Representation (CBOR)." RFC 8949, IETF, 2020. (Deterministic encoding, Section 4.2.) https://www.rfc-editor.org/rfc/rfc8949 [17] O'Connor, J., Aumasson, J.-P., Neves, S., Wilcox-O'Hearn, Z. "BLAKE3: one function, fast everywhere." 2020. https://github.com/BLAKE3-team/BLAKE3-specs [18] Josefsson, S. "The Base16, Base32, and Base64 Data Encodings." RFC 4648, IETF, 2006. (base32- nopad.) https://www.rfc-editor.org/rfc/rfc4648 [19] Bormann, C. "Concise Binary Object Representation (CBOR) Tags for Object Identifiers / IPLD multibase 'b'." RFC 9090 / IPLD CID. (Tag 42, base32 multibase.) https://www.rfc- editor.org/rfc/rfc9090 [20] Uber Technologies. "H3: A Hexagonal Hierarchical Geospatial Indexing System." (Migration target DGGS, resolution 13.) https://h3geo.org 55 [21] Google. "S2 Geometry Library: spherical cell hierarchy for geographic indexing." https://s2geometry.io [22] Anthropic. "Memory tool (context-management-2025-06-27): file-op verbs for agent memory." Claude developer documentation, 2025. https://docs.anthropic.com [23] Anthropic. "Model Context Protocol (MCP): Streamable HTTP, JSON-RPC 2.0, specification revision 2025-11-25." https://modelcontextprotocol.io [24] STAC Contributors. "SpatioTemporal Asset Catalog (STAC) Specification, version 1.0.0." 2021. https://stacspec.org [25] Xiao, S., et al. "C-Pack: Packed Resources for General Chinese Embeddings." 2023. arXiv:2309.07597. (BGE family; the bge-base-en-v1.5 variant, 768-D, powers memory_search and topic routing.) https://arxiv.org/abs/2309.07597 [26] LanceDB. "Lance columnar format and IVF_PQ approximate-nearest-neighbor index." https://lancedb.github.io/lance [27] Zandieh, A., et al. "TurboQuant: Online Vector Quantization with Near-optimal Distortion Rate." Google Research, 2025. arXiv:2504.19874. (Randomized sign-flip rotation for binary / scalar quanti- zation without a codebook.) https://arxiv.org/abs/2504.19874 [28] Drusch, M., et al. "Sentinel-2: ESA's optical high-resolution mission for GMES operational ser- vices." Remote Sensing of Environment 120:25-36, 2012. (10 m optical; Copernicus Open / MPC.) [29] Justice, C.O., et al. "The Moderate Resolution Imaging Spectroradiometer (MODIS): land remote sensing for global change research." IEEE TGRS 36(4):1228-1249, 1998. (MOD11/MOD13/MCD64A1 connectors.) [30] European Space Agency / Airbus. "Copernicus DEM (GLO-30): 30 m global digital elevation model." ESA, 2021. https://doi.org/10.5270/ESA-c5d3d65 [31] Pekel, J.-F., Cottam, A., Gorelick, N., Belward, A.S. "High-resolution mapping of global sur- face water and its long-term changes." Nature 540:418-422, 2016. (JRC Global Surface Water.) https://doi.org/10.1038/nature20584 [32] Hansen, M.C., et al. "High-resolution global maps of 21st-century forest cover change." Science 342:850-853, 2013. (Hansen GFC v1.12 loss-year.) https://doi.org/10.1126/science.1244693 [33] Zanaga, D., et al. "ESA WorldCover 10 m 2021 v200." ESA / CC-BY-4.0, 2022. https://doi.org/10.5281/zenodo.7254221 [34] Poggio, L., et al. "SoilGrids 2.0: producing soil information for the globe with quantified spatial uncertainty." SOIL 7:217-240, 2021. https://doi.org/10.5194/soil-7-217-2021 [35] Kerner, H., et al. "Fields of The World (FTW): a global benchmark and product of ~3.17 billion agricultural field boundaries, 241 countries, 10 m, CC-BY-4.0." source.coop PMTiles archive, 2024. https://fieldsofthe.world [36] Beck, H.E., et al. "Present and future Koppen-Geiger climate classification maps at 1-km resolution." Scientific Data 5:180214, 2018. (climate_archetype seed centroids.) https://doi.org/10.1038/sdata.2018.214 [37] Healey, S.P., et al. "Mapping forest change using stacked generalization: an ensemble ap- proach." Remote Sensing of Environment 204:717-728, 2018. (Triple-consensus threshold provenance.) https://doi.org/10.1016/j.rse.2017.09.029 [38] European Union. "Regulation (EU) 2023/1115 on deforestation-free supply chains (EUDR)." Official Journal L150, 2023; application deferred via Reg. 2024/3234 and 2025/2650. https://eur- lex.europa.eu/eli/reg/2023/1115 [39] Overture Maps Foundation. "Places, buildings, transportation, and divisions themes (divi- sions/division_area)." ODbL / CDLA-Permissive, 2024. https://overturemaps.org [40] GeoNames. "cities-5000: 68,581 populated places with population >= 5,000, CC-BY-4.0." https://www.geonames.org 56 [41] Dong, S., Xu, S., He, P., Li, Y., Tang, J., Liu, T., Liu, H., Xiang, Z. "Memory Injec- tion Attacks on LLM Agents via Query-Only Interaction (MINJA)." arXiv:2503.03704, 2025. (In- jection success rate above 90%; motivating threat model for unsigned outside-channel memory.) https://arxiv.org/abs/2503.03704 [42] Oke, T.R. "Boundary Layer Climates," 2nd ed. Methuen, 1987. (Urban thermal diffusivity for /v1/heat_solve FTCS solver.) [43] Snyder, J.P. "Map Projections: A Working Manual." USGS Professional Paper 1395, 1987. (UTM in emem-fetch::proj.) [44] Dosovitskiy, A., et al. "An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale (ViT)." ICLR, 2021. arXiv:2010.11929. https://arxiv.org/abs/2010.11929 [45] He, K., Chen, X., Xie, S., Li, Y., Dollár, P., Girshick, R. "Masked Autoencoders Are Scalable Vision Learners (MAE)." CVPR, 2022. arXiv:2111.06377. https://arxiv.org/abs/2111.06377 [46] Oquab, M., et al. "DINOv2: Learning Robust Visual Features without Supervision." Transactions on Machine Learning Research, 2024. arXiv:2304.07193. (Clay v1.5 teacher.) https://arxiv.org/abs/2304.07193 [47] Assran, M., et al. "Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture (I-JEPA)." CVPR, 2023. arXiv:2301.08243. (Architecture pattern for the dynamics head.) https://arxiv.org/abs/2301.08243 [48] Wu, D., et al. "LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory." arXiv:2410.10813, 2024. https://arxiv.org/abs/2410.10813 [49] Lin, K., et al. "Sleep-time Compute: Beyond Inference Scaling at Test-time." arXiv:2504.13171, 2025. https://arxiv.org/abs/2504.13171 [50] Merkle, R.C. "A Digital Signature Based on a Conventional Encryption Function." Advances in Cryptology — CRYPTO '87, LNCS 293, pp. 369-378, 1988. https://doi.org/10.1007/3-540-48184-2_ 32 [51] Bernstein, D.J., Duif, N., Lange, T., Schwabe, P., Yang, B.-Y. "High-speed high-security signatures (Ed25519)." Journal of Cryptographic Engineering 2(2):77-89, 2012. https://doi.org/10.1007/s13389- 012-0027-1 [52] Moon, B., Jagadish, H.V., Faloutsos, C., Saltz, J.H. "Analysis of the Clustering Properties of the Hilbert Space-Filling Curve." IEEE Transactions on Knowledge and Data Engineering 13(1):124-141, 2001. https://doi.org/10.1109/69.908985 [53] Vancutsem, C., et al. "Long-term (1990-2019) monitoring of forest cover changes in the humid tropics (JRC TMF)." Science Advances 7:eabe1603, 2021. https://doi.org/10.1126/sciadv.abe1603 [54] Bourgoin, C., et al. "GFC2020: A Global Map of Forest Land Use for year 2020 to support the EU Deforestation Regulation." Earth System Science Data 18:1331-1365, 2026. https://doi.org/10.5194/essd-18-1331-2026 [55] Reiche, J., et al. "Forest disturbance alerts for the Congo Basin using Sentinel-1 (RADD)." Envi- ronmental Research Letters 16:024005, 2021. https://doi.org/10.1088/1748-9326/abd0a8 [56] Sims, M.J., et al. "Global drivers of forest loss at 1 km resolution (WRI Global Drivers)." Envi- ronmental Research Letters 20:074027, 2025. https://doi.org/10.1088/1748-9326/add606 [57] Tatem, A.J. "WorldPop, open data for spatial demography." Scientific Data 4:170004, 2017. https://doi.org/10.1038/sdata.2017.4