From Edit Wars to Agent Consensus: What 20 Years of Platform Governance Teach Us About Multi-Agent Knowledge Curation
Authors/Creators
Description
This scoping review bridges two decades of platform governance research (Wikipedia, Reddit, Stack Overflow) to the emerging challenge of multi-agent knowledge curation. As AI agents begin contributing to shared knowledge bases,they inherit familiar governance problems—Sybil attacks, coordinated manipulation, quality decay—while introducing qualitatively new failure modes: sycophancy-driven consensus collapse, cascading hallucinations through internal citation chains, and adaptive manipulation at machine speed.
We screen 160 papers across platform governance, trust systems, and multi-agent collaboration (PRISMA-ScR methodology), identifying which platform defenses transfer to the agent setting, which break down, and which gaps require entirely new mechanisms. We propose nine design considerations grounded in empirical platform evidence and adapted for agent-specific threats. Our central finding is that conduct-based governance alone cannot address hallucination debt: without external verification, a system will accumulate false knowledge regardless of its behavioral sophistication.
To our knowledge, no existing survey bridges platform governance research to multi-agent knowledge curation. This paper establishes the research agenda for protocol formalization and adversarial evaluation in this space.
Files
paper2-1.pdf
Files
(244.2 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:401adf2d8f82cad622cf88ab4402bd1b
|
244.2 kB | Preview Download |
Additional details
Identifiers
Related works
- Continues
- Preprint: arXiv:2603.20833 (arXiv)
Dates
- Created
-
2026-03
- Submitted
-
2026-03-27submitted arxiv
- Submitted
-
2026-05-10submitted zenodo
References
- Arksey, H. and O'Malley, L. "Scoping Studies: Towards a Methodological Framework." International Journal of Social Research Methodology, 8(1), 2005. Allison, K. and Bussey, K. "Communal Quirks and Circlejerks: A Taxonomy of Processes Contributing to Insularity in Online Communities." ICWSM, 2020. Amodei, D. et al. "Concrete Problems in AI Safety." arXiv:1606.06565, 2016. Bikhchandani, S., Hirshleifer, D., and Welch, I. "A Theory of Fads, Fashion, Custom, and Cultural Change as Informational Cascades." Journal of Political Economy, 100(5), 1992. Boella, G. and van der Torre, L. "Regulative and Constitutive Norms in Normative Multiagent Systems." KR, 2004. Grisel, F. "Canceling Disputes: How Social Capital Affects the Arbitration of Disputes on Wikipedia." Law & Social Inquiry, 2023. Cantone, D. et al. "Review Bombing as Ideology-Driven Polarisation of User Ratings." Quality & Quantity, 2024. Carman, M. et al. "Manipulating Visibility of Political and Apolitical Threads on Reddit via Score Boosting." ICWSM Workshop, 2018. Chen, L. et al. "Governance and Design of Digital Platforms: A Review and Assessment." Journal of Management, 2021. Chen, Z. et al. "AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases." arXiv, 2024. de Condorcet, M. "Essai sur l'application de l'analyse à la probabilité des décisions rendues à la pluralité des voix." 1785. Del Vicario, M. et al. "The Spreading of Misinformation Online." PNAS, 113(3), 2016. Douceur, J. "The Sybil Attack." IPTPS, 2002. Du, Y. et al. "Improving Factuality and Reasoning in Language Models through Multiagent Debate." ICML, 2023. Esteva, S., de la Cruz, D., and Sierra, C. "ISLANDER: An Electronic Institutions Editor." AAMAS, 2002. Friess, D. and Eilders, C. "A Systematic Review of Online Deliberation Research." Policy & Internet, 7(3), 2015. Gordon, M. et al. "Jury Learning: Integrating Dissenting Voices into Machine Learning Models." CHI, 2022. Greshake, K. et al. "Not What You've Signed Up For: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection." AISec, 2023. Halfaker, A., Geiger, R. S., Morgan, J. T., and Riedl, J. "The Rise and Decline of an Open Collaboration System: How Wikipedia's Reaction to Popularity Is Causing Its Decline." American Behavioral Scientist, 57(5), 2013. Hoffman, K., Zage, D., and Nita-Rotaru, C. "A Survey of Attack and Defense Techniques for Reputation Systems." ACM Computing Surveys, 42(1), 2009. Huang, J.-T. et al. "On the Resilience of LLM-Based Multi-Agent Collaboration with Faulty Agents." arXiv:2408.00989, 2024. Ismail, R. and Josang, A. "The Beta Reputation System." Bled eConference, 2002. Jeong, Y. et al. "Coordinated Manipulation of Upvotes and Downvotes." ICWSM, 2020. Johnson, S. "Governance-Aware Vector Subscriptions for Agent Knowledge Bases." arXiv:2603.20833, 2026. Josang, A., Ismail, R., and Boyd, C. "A Survey of Trust and Reputation Systems for Online Service Provision." Decision Support Systems, 43(2), 2007. Kampik, T., Mansour, A., Boissier, O., Kirrane, S., and Padget, J. "Governance of Autonomous Agents on the Web: Challenges and Opportunities." ACM Transactions on Internet Technology, 22(4), 2022. Khan, A. et al. "Debating with More Persuasive LLMs Leads to More Truthful Answers." ICML, 2024. Köster, R., Hadfield-Menell, D., Everett, R., Weidinger, L., and Hadfield, G. "Spurious Normativity Enhances Learning of Compliance and Enforcement Behavior in Artificial Agents." PNAS, 119(3), 2022. Kumar, S. et al. "Community Interaction and Conflict on the Web." WWW, 2018. Liang, T. et al. "Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate." EMNLP, 2024. Mazloomzadeh, V. et al. "Reputation Gaming in Stack Overflow." arXiv preprint, 2021. Muller-Birn, C. et al. "Work-to-Rule: The Emergence of Algorithmic Governance in Wikipedia." C&T, 2013. Navajas, J. et al. "Aggregated Knowledge from a Small Number of Debates Outperforms the Wisdom of Large Crowds." Nature Human Behaviour, 2(2), 2018. Ostrom, E. "Governing the Commons: The Evolution of Institutions for Collective Action." Cambridge University Press, 1990. Park, P. et al. "AI Deception: A Survey of Examples, Risks, and Potential Solutions." Patterns, 5(1), 2024. Pitre, A. et al. "CONSENSAGENT: Consensus-Driven Multi-Agent Collaboration." ACL Findings, 2025. Pitt, J. et al. "Interleaving Multi-Agent Systems and Social Networks for Organized Adaptation." Computational and Mathematical Organization Theory, 19(3), 2013. Savarimuthu, B. T. R., Ranathunga, S., and Cranefield, S. "Harnessing the Power of LLMs for Normative Reasoning in MASs." LNCS, 2025. Seering, J. "Reconsidering Self-Moderation: The Role of Research in Supporting Community-Based Models for Online Content Moderation." CSCW, 2020. Sharma, M. et al. "Towards Understanding Sycophancy in Language Models." ICLR, 2024. Shumailov, I. et al. "AI Models Collapse When Trained on Recursively Generated Data." Nature, 631, 2024. Solorio, T. et al. "Sockpuppet Detection in Wikipedia: A Corpus of Real-World Deceptive Writing." 2013. Tillmann, A. "Literature Review of Multi-Agent Debate for Problem-Solving." arXiv:2506.00066, 2025. Tricco, A. C. et al. "PRISMA Extension for Scoping Reviews (PRISMA-ScR): Checklist and Explanation." Annals of Internal Medicine, 169(7), 2018. Vosoughi, S., Roy, D., and Aral, S. "The Spread of True and False News Online." Science, 359(6380), 2018. Wang, Y. et al. "Stop Overvaluing Multi-Agent Debate." arXiv, 2025. Weber, D. and Neumann, F. "Amplifying Influence through Coordinated Behaviour in Social Networks." Social Network Analysis and Mining, 11(1), 2021. Wojcik, S. et al. "Birdwatch: Crowd Wisdom and Bridging Algorithms Can Inform Understanding and Reduce the Spread of Misinformation." arXiv, 2022. Yamak, Z. et al. "SocksCatch: Automatic Detection and Grouping of Sockpuppets in Social Media." Knowledge-Based Systems, 2018. Yao, S. et al. "How Sycophancy Shapes Multi-Agent Debate." arXiv, 2025. Yasseri, T. et al. "Dynamics of Conflicts in Wikipedia." PLoS ONE, 7(6), 2012. Yu, H. et al. "SybilGuard: Defending Against Sybil Attacks via Social Networks." SIGCOMM, 2006. Zhang, J. et al. "Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View." arXiv, 2024. Zhou, F. et al. "A Survey of Information Cascade Analysis: Models, Predictions, and Recent Advances." ACM Computing Surveys, 54(2), 2021. Zou, W. et al. "PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models." USENIX Security, 2025.