Inverse Causal Discovery: Retrieving the Generative Topology of Empirical Matter
Authors/Creators
Description
Inverse Physics and Generative Representations in Structural Biology
Current structural biology operates under a “Big Data” paradigm: the Protein Data Bank exceeds 200,000 structures, totaling terabytes of coordinate files. Yet this representation captures noise alongside signal, treating biological matter as static collections of atoms rather than as outputs of generative processes.
We propose that biological structures—and matter more generally—are deterministic outputs of low-entropy generative seeds. If this is true, the inverse problem becomes tractable: given empirical coordinates (possibly noisy), can we recover the underlying generative parameters?
We demonstrate an “Inverse Physics” framework that addresses this problem by integrating geometric reconstruction (RMSD minimization) with topological constraints derived from persistent homology. The topological loss function forces the algorithm to preserve fundamental connectivity—holes, tunnels, and voids—before fitting atomic positions. This effectively acts as an infinite-resolution denoiser, discarding measurement noise while retaining structural truth.
We validate this framework on two systems:
-
α-helical protein motifs, where we recover Pauling–Corey parameters (radius r = 2.27 Å, pitch = 5.40 Å) from noisy coordinates with RMSD = 0.15 Å, superior to typical X-ray resolution, achieving 28:1 compression.
-
Genus-2 topological manifolds (double torus), where standard algorithms collapse the structure to a sphere, but our persistent homology constraints preserve both holes with parameter recovery error below 0.5%.
These results suggest a paradigm shift: from descriptive biology (storing coordinates) to generative biology (storing executable seeds). We discuss implications for semantic structural search, distributed biomanufacturing, and the fundamental nature of biological information.
Other
POLYFORM NONCOMMERCIAL LICENSE 1.0.0
COPYRIGHT NOTICE
================
Software/Work: Topological Restructuring of Artificial Intelligence: A 100× Efficiency Leap via Ramanujan Spectral Reservoirs.
Copyright (c) 2026 Andrés Sebastián Pirolo. All rights reserved.
Contact: apirolo@abc.gob.ar
TERMS AND CONDITIONS
====================
1. Rights Granted
The "Licensor" (Andrés Sebastián Pirolo) grants you the rights to:
* Install and execute the software/algorithms for personal, educational, or academic research use.
* Modify the software/algorithms for personal, educational, or academic research use.
* Distribute the software/algorithms, provided that:
* You do not charge any fee for the software/algorithms.
* You include this license file and the copyright notice in all copies.
2. Limitations
You may NOT use the software/algorithms, or any modified version of them, for any Commercial Purpose.
"Commercial Purpose" includes, but is not limited to:
* Using the software/algorithms to provide a service for a fee.
* Using the software/algorithms to develop a product for sale (e.g., drug discovery pipelines, compression software, proprietary databases).
* Using the software/algorithms in any business operations or commercial research and development.
3. No Warranty
THE SOFTWARE/WORK IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
Files
Pirolo2026_Inverse_discovery-3.pdf
Files
(147.2 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:877f4de9e4b0e3b8b5508d97bfbca381
|
145.2 kB | Preview Download |
|
md5:4b5d81b0fb19bf419f161e55da4f9b98
|
2.0 kB | Preview Download |
Additional details
Related works
- Has part
- Preprint: 10.5281/zenodo.18502829 (DOI)