There is a newer version of the record available.

Published January 6, 2019 | Version v1.4.0
Software Open

ekzhu/datasketch: hashfunc to replace hashobj

  • 1. University of Toronto
  • 2. @src-d
  • 3. Free University of Berlin
  • 4. Adobe
  • 5. Six Five Design

Description

MinHash and HyperLogLog now support a hashfunc parameter. The old hashobj parameter has been removed.

# Let's use MurmurHash3.
import mmh3
from datasketch import MinHash

# We need to define a new hash function that outputs an integer that
# can be encoded in 32 bits.
def _hash_func(d):
    # mmh3.hash returns a 32-bit hash; signed=False keeps it non-negative.
    return mmh3.hash(d, signed=False)

# Use this function in the MinHash constructor.
m = MinHash(hashfunc=_hash_func)
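
The same hashfunc contract (bytes in, an integer that fits in 32 bits out) can be met with the standard library alone. The following is a minimal sketch, not the library's own implementation: sha1_uint32 is a hypothetical helper name, and the datasketch import is guarded so the helper can be tried without the package installed.

```python
import hashlib
import struct


def sha1_uint32(data: bytes) -> int:
    """Hypothetical helper: map bytes to an unsigned 32-bit integer via SHA-1."""
    # Read the first 4 bytes of the SHA-1 digest as a little-endian uint32.
    return struct.unpack("<I", hashlib.sha1(data).digest()[:4])[0]


try:
    from datasketch import MinHash, HyperLogLog
except ImportError:
    # datasketch is optional for this sketch; the helper works on its own.
    MinHash = HyperLogLog = None

if MinHash is not None:
    # Pass the custom function through the hashfunc parameter.
    m = MinHash(num_perm=128, hashfunc=sha1_uint32)
    h = HyperLogLog(hashfunc=sha1_uint32)
    for token in [b"alpha", b"beta", b"gamma"]:
        # update() takes bytes and applies hashfunc to them.
        m.update(token)
        h.update(token)
    print("estimated cardinality:", h.count())
```

Because the function receives the raw value passed to update(), callers would typically encode strings to bytes first, for example token.encode("utf8").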

Files

ekzhu/datasketch-v1.4.0.zip

Files (2.5 MB)

Name: ekzhu/datasketch-v1.4.0.zip
Size: 2.5 MB
Checksum: md5:4c4b76e205742a3590df4291d6bc520d

Additional details

Related works