Published November 29, 2022 | Version v1
Conference paper Open

Application of blockchain and the SimHash algorithm to detect plagiarism

Creators

  • 1. Vasyl Stefanyk Precarpathian National University

Description

In this paper, a novel, unified approach for detecting plagiarism was proposed. It is built on the Ethereum blockchain environment and uses the SimHash algorithm for similarity detection, scalability and plagiarism detection in education field as main area of application. Through a network of social and smart contracts, Blockchain technology will function as an open technology that protects against unwanted access, but at the same time will save timestamp and current state of the analyzed documents. Also, the unified algorithm was compared to widely used TF-IDF algorithm on the same dataset and showed improvement by nearly 7%.

Notes

R. Dorosh, "Application of blockchain and the SimHash algorithm to detect plagiarism," 2022 International Conference on Innovative Solutions in Software Engineering (ICISSE), Vasyl Stefanyk Precarpathian National University, Ivano-Frankivsk, Ukraine, Nov. 29-30, 2022, pp. 130-134, doi: 10.5281/zenodo.8429018

Files

2022_ICISSE_Dorosh.pdf

Files (257.1 kB)

Name Size Download all
md5:16ee53cfd7990ff46fa76f6e17aa4eb1
257.1 kB Preview Download

Additional details

Identifiers

ISBN
978-966-640-534-3

References

  • M. S. Ali, M. Vecchio, M. Pincheira, K. Dolui, F. Antonelli and M. H. Rehmani, "Applications of Blockchains in the Internet of Things: A Comprehensive Survey," in IEEE Communications Surveys & Tutorials, vol. 21, no. 2, pp. 1676-1717, Secondquarter 2019, doi: 10.1109/COMST.2018.2886932
  • S. Nakamoto, "Bitcoin: A Peer-to-Peer Electronic Cash System", Aug 2022, [online] Available: https://bitcoin.org/bitcoin.pdf
  • V.Buterin, "Ethereum: A Next-Generation Smart Contract and Decentralized Application Platform", Aug 2022, [online] Available: https://ethereum.org/669c9e2e2027310b6b3cdce6e1c52962/Ethereum_Whitepaper_-_Buterin_2014.pdf
  • Anzelmi, Daniele, Domenico Carlone, Fabio Rizzello, Robert Thomsen and Dil Muhammad Akbar Hussain. "Plagiarism Detection Based on SCAM Algorithm". In Proceedings of the International MultiConference on Engineers and Computer Scientists 2011 (Vol. Volume I, pp. 272-277). Newswood Limited, International Association of Engineers, IAENG
  • M. Kozlenko, I. Lazarovych, V. Tkachuk and V. Vialkova, "Software Demodulation of Weak Radio Signals using Convolutional Neural Network," 2020 IEEE 7th International Conference on Energy Smart Systems (ESS), 2020, pp. 339-342, doi: 10.1109/ESS50319.2020.9160035
  • M. Kozlenko and V. Vialkova, "Software Defined Demodulation of Multiple Frequency Shift Keying with Dense Neural Network for Weak Signal Communications," 2020 IEEE 15th International Conference on Advanced Trends in Radioelectronics, Telecommunications and Computer Engineering (TCSET), 2020, pp. 590-595, doi: 10.1109/TCSET49122.2020.235501
  • M. Kozlenko,V. Sendetskyi, O. Simkiv, N. Savchenko, A. Bosyi, "Identity Documents Recognition and Detection using Semantic Segmentation with Convolutional Neural Network (short paper)". Cybersecurity Providing in Information and Telecommunication Systems 2021 (CPITS), 2021, pp. 234-242
  • Dutchak, M., Kozlenko, M., Lazarovych, I., Lazarovych, N., Pikuliak, M., Savka, I. "Methods and Software Tools for Automated Synthesis of Adaptive Learning Trajectory in Intelligent Online Learning Management Systems". Innovations in Smart Cities Applications Volume 4. SCA 2020. Lecture Notes in Networks and Systems, vol 183. Springer, Cham. https://doi.org/10.1007/978-3-030-66840-2_16
  • C. Pungilă, D. Galis, V. Negru, "A New High-Performance Approach to Approximate Pattern-Matching for Plagiarism Detection in Blockchain-Based Non-Fungible Tokens (NFTs)", arXiv:2205.14492 [cs.CR], May 2022
  • G.S. Manku, A. Jain, A.D. Sarma, "Detecting Near-Duplicates for Web Crawling", research.google.com, https://static.googleusercontent.com/media/research.google.com/en//pubs/archive/33026.pdf (accessed Aug. 23, 2022)
  • "All the news dataset". Kaggle.com https://www.kaggle.com/datasets/snapcrack/all-the-news (accessed Aug. 27, 2022)