Published April 24, 2024 | Version v1
Journal Open

Margin-Based Active Learning of Classifiers

  • 1. ROR icon University of Milan
  • 2. ROR icon Politecnico di Milano
  • 3. Google Research

Description

We study active learning of multiclass classifiers, focusing on the realizable transductive setting. The input is a finite subset X of some metric space, and the concept to be learned is a partition C of X into k classes. The goal is to learn C by querying the labels of as few elements of X as possible. This is a useful subroutine in pool-based active learning, and is motivated by applications where labels are expensive to obtain. Our main result is that, in very different settings, there exist interesting notions of margin that yield efficient active learning algorithms. First, we consider the case X ⊂ Rm, assuming that each class has an unknown “personalized” margin separating it from the rest. Second, we consider the case where X is a finite metric space, and the classes are convex with margin according to the geodesic distances in the thresholded connectivity graph. In both cases, we give algorithms that learn C exactly, in polynomial time, using O(log n) label queries, where O(·) hides a near-optimal dependence on the dimension of the metric spaces. Our results actually hold for or can be adapted to more general settings, such as pseudometric and semimetric spaces.

Files

22-1127.pdf

Files (1.3 MB)

Name Size Download all
md5:411bc892f3ec817017f75f9332e6294a
1.3 MB Preview Download

Additional details

Funding

European Commission
ELIAS - European Lighthouse of AI for Sustainability 101120237