From Rational Number Reconstruction to Set Reconciliation and File Synchronization
Description
This work revisits set reconciliation, the problem of synchronizing two multisets of fixed-size values while minimizing transmission complexity. We propose a new number-theoretic reconciliation protocol called Divide and Factor (D&F;) that achieves optimal asymptotic transmission complexity — as do previously known alternative algorithms. We analyze the computational complexities of various D&F; variants, study the problem of synchronizing sets of variable-size files using hash functions and apply D&F; to synchronize file hierarchies taking file locations into account. We describe btrsync, our open-source D&F; implementation, and benchmark it against the popular software rsync. It appears that btrsync transmits much less data than rsync, at the expense of a relatively modest computational overhead.
Files
article.pdf
Files
(465.1 kB)
Name | Size | Download all |
---|---|---|
md5:48ecbc5a2794e217e0a780b9cf9227ea
|
465.1 kB | Preview Download |