Published April 4, 2023 | Version 1.0.0
Conference paper Open

Machine Learning-based Classification of Online Industrial Datasets

  • 1. Slovak University of Technology in Bratislava, Bratislava, Slovakia
  • 2. Slovnaft, a.s., Bratislava, Slovakia
  • 3. Slovak University of Technology in Bratislava Bratislava, Slovakia


We aim to incorporate data analytics into industrial process control by utilizing machine learning (ML) algorithms to classify the real-time data of online analyzers. Real-time visualization of results onto a front-end system (i.e., refinery control room) provides an extensive view of the production process, increasing efficiency of production. Selected ML classifiers are assessed according to the performance metrics based on individual scores. These parameters, along with the complexity of implementation, provide an adequate pointer for selecting a suitable classifier model to serve as a decision-making tool. In our use case, accurate categorization of measurements provides a cheap validation guideline that would otherwise be not possible. Computed metrics indicate a difficulty to classify the cases when the slight deviations (drifts) occur from real values. Based on the true positivity rate, linear SVM separation is desirable for data drift prediction (64 %), while k-Means is more successful in detecting outliers (65 %) and normal operation (99 %).



Files (3.1 MB)

Name Size Download all
3.1 MB Preview Download

Additional details


FrontSeat – Fostering Opportunities Towards Slovak Excellence in Advanced Control for Smart Industries 101079342
European Commission


  • Thumeera R. Wanasinghe, Ray Gosine, Lesley James G.K.I. Mann, Oscar De Silva, and Peter J. Warrian. The internet of things in the oil and gas industry: A systematic review. IEEE, 2020.
  • Hutama A. Bramantyo, Bagus Satrio Utomo, and Efrilia M. Khusna. Data processing for iot in oil and gas refineries. J. Commun. Netw., 2022.
  • James G Speight. The refinery of the future. Gulf Professional Publishing, Elsevier, 2020.
  • Tyler Wall, CFE Media and Technology. How to get started with industrial data analytics, 2022.
  • Muzammil khan, Salman Raza Naqvi, Zahid Ullah, Syed Ali Ammar Taqvi, Muhammad Nouman Aslam Khan, Wasif Farooq, Muhammad Taqi Mehran, DagmarJuchelková, and Libor Štepanec. Applications of machine learning in thermochemical conversion of biomass-a review. Fuel, 332:126055, 2023.
  • Batta Mahesh. Machine learning algorithms -a review. IJSR, 2019.
  • Martin Ester, Hans-Peter Kriegel, Jiirg Sander, and Xiaowei Xu. A density-based algorithm for discovering clusters in large spatial databases with noise. 1996.
  • Michael D Twa, Srinivasan Parthasarathy, Cynthia Roberts, Ashraf M Mahmoud, Thomas W Raasch, and Mark A Bullimore. Automated decision tree classification of corneal shape. Optometry and Vision Science, 82(12):1038––1046, 2005.
  • Aized Amin Soofi and Arshad Awan. Classification techniques in machine learning: Applications and issues. J. Basic Appl., 13:459–465, 2017
  • Muzammil Khan Muhammad Taqi Mehran, Zeeshan Ul Haq, Zahid Ullah, Salman Raza Naqvi, Mehreen Ihsan, and Haider Abbass. Applications of artificial intelligence in covid-19 pandemic: A comprehensive review. Expert Systems with Applications, 185:115695, 2021.
  • Sven IvarHommeltoft. Isobutane alkylation: Recent developments and future perspectives. Applied Catalysis A: General, 221(1):421–428, 2001.
  • Fuels PALL Corporation and Chemicals. Refineries: Application focus - h2so4 alkylation unit, 2018.
  • Brian Malley and Daniele Ramazzotti and Joy Tzung-yu Wu . Data Pre-processing, pages 115–141. 2016.
  • Max Kuhn and Kjell Johnson. Data Pre-processing, pages 27–59. 2013.
  • Anmol Tomar. Stop using elbow method in k-means clustering, instead, use this!, 2022.
  • Erich Schubert. Stop using the elbow criterion for k-means and how to choose the number of clusters instead, 2022.
  • Salma Ghoneim. Accuracy, recall, precision, f-score & specificity, which to optimize on?, 2019.
  • Corinna Cortes and Vladimir Vapnik. Support-vector networks, 1995.
  • Naeem Seliya, Taghi M. Khoshgoftaar, and Jason Van Hulse. A study on the relationships of classifier performance metrics. 21st IEEE ICTAI, 2009.
  • Bradley J. Erickson and Felipe Kitamura. Performance metrics for machine learning models. PubMed Central, 2021.