Published April 20, 2020 | Version v1
Presentation Open

CTD2020: Learned Representations from Lower-Order Interactions for Efficient Clustering

  • 1. Lawrence Berkeley National Laboratory

Description

Efficient agglomerative clustering is reliant on the ability to exploit useful lower-order information contained within data, yet many real-world datasets do not consist of features which are naturally amenable to metric functions as required by these algorithms. In this work, we present a framework for learning representations which contain such metric structure, allowing for efficient clustering and neighborhood queries of data points. We demonstrate how this framework fits in with both traditional clustering pipelines, and more advanced approaches such as graph neural networks. Finally, we present numerical results on the TrackML particle tracking challenge dataset, where our framework shows favorable results in both physics-based tracking methods, and new end-to-end deep learning approaches with graph neural networks developed in the context of the Exa.TrkX project.

Files

talk.pdf

Files (823.9 kB)

Name Size Download all
md5:970847bd4f423d60974db2e8c6cc0878
823.9 kB Preview Download

Additional details

Related works

Is identical to
Presentation: https://indico.cern.ch/event/831165/contributions/3717131 (URL)
Is part of
https://cern.ch/CTD2020 (URL)
Is supplemented by
Video/Audio: https://youtu.be/vcG3vt1mCgQ (URL)