Who Are You? Cartel Detection Using Unlabeled Data
- 1. Department of Economics, University of Alberta, Canada
- 2. Institute for Applied Economic Research (Ipea), Brazil
- 3. Department of Economics, University of Brasilia, Brazil
Description
We propose a data-driven machine learning approach to flag bid-rigging cartels in the Brazilian road maintenance sector. First, we apply a clustering algorithm to group tenders by their attributes. Second, we train a classifier to predict the labels produced by the clustering algorithm, using those labels as the target variable. We rank the predictors by their relevance in order to reduce the number of false positives (flagging a cartel where none exists) and false negatives (failing to flag a cartel that does exist). Our results highlight the need for a range of predictors to recognize the wide variety of strategies practiced by bid-rigging cartels, such as misleading competitive dynamics, bid combination, and cover bidding. Our method can improve cartel deterrence in different economic sectors, especially when labeled data are not available. In a controlled environment with a simulated dataset, the overall average accuracy of the algorithm is 99.33%. In a real-world cartel case with a labeled dataset, the overall average accuracy is 80.25%. When applied to the road maintenance dataset, our model identified a group of 273 suspicious tenders (31% of the total). We conclude with a discussion of policy prescriptions for antitrust authorities.
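The two-step idea described above (cluster the tenders, then predict the cluster labels and rank the predictors) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the description does not name the clustering algorithm or the classifier, so plain k-means and a one-feature threshold classifier (a decision stump) stand in for them, the feature names `cv_bids`, `bid_spread`, and `noise` are hypothetical, and the data are synthetic. The accuracies are computed in-sample, not on held-out tenders.

```python
import random
from statistics import mean, pstdev

FEATURES = ["cv_bids", "bid_spread", "noise"]  # hypothetical predictor names

def make_tenders(n=200, seed=0):
    """Synthetic tenders: two groups differ on the first two features only."""
    rng = random.Random(seed)
    rows = []
    for i in range(n):
        if i % 2 == 0:
            rows.append([rng.gauss(0.20, 0.02), rng.gauss(0.50, 0.05), rng.gauss(0, 1)])
        else:
            rows.append([rng.gauss(0.05, 0.02), rng.gauss(0.10, 0.05), rng.gauss(0, 1)])
    return rows

def standardize(rows):
    """Z-score each column so no feature dominates the distance."""
    cols = list(zip(*rows))
    mus = [mean(c) for c in cols]
    sds = [pstdev(c) for c in cols]
    return [[(x - m) / s for x, m, s in zip(row, mus, sds)] for row in rows]

def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def kmeans(rows, k=2, iters=50, seed=0):
    """Step 1: plain Lloyd's k-means; returns one cluster label per row."""
    rng = random.Random(seed)
    centers = [list(r) for r in rng.sample(rows, k)]
    labels = [0] * len(rows)
    for _ in range(iters):
        labels = [min(range(k), key=lambda c: sq_dist(row, centers[c])) for row in rows]
        new_centers = []
        for c in range(k):
            members = [row for row, lab in zip(rows, labels) if lab == c]
            # Keep the old center if a cluster ends up empty.
            new_centers.append([mean(col) for col in zip(*members)] if members else centers[c])
        if new_centers == centers:
            break
        centers = new_centers
    return labels

def stump_accuracy(values, labels):
    """Step 2: best in-sample accuracy of a single-threshold rule on one feature."""
    n = len(values)
    best = 0.0
    for t in values:
        acc = sum((1 if v > t else 0) == y for v, y in zip(values, labels)) / n
        best = max(best, acc, 1 - acc)  # allow either orientation of the rule
    return best

rows = standardize(make_tenders())
labels = kmeans(rows, k=2)  # cluster labels act as the target variable
ranking = sorted(
    ((name, stump_accuracy([r[j] for r in rows], labels)) for j, name in enumerate(FEATURES)),
    key=lambda pair: pair[1],
    reverse=True,
)
for name, acc in ranking:
    print(f"{name}: {acc:.2f}")
```

In this toy setting, features that separate the clusters predict the cluster labels almost perfectly, while the uninformative feature stays close to chance, which is the sense in which predictors are ranked by relevance here.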
Files
Name | Size |
---|---|
md5:eb0db7f8fceeacf613d146d2ee10625c | 6.0 MB |