Efficient allocation of image recognition and LLM tasks on multi-GPU system

Lawenda, Marcin; Samborski, Krzesimir; Khloponin, Kyrylo; Szustak, Łukasz

doi:10.48550/arXiv.2503.15252

Published March 20, 2025 | Version v1

Preprint Open

Efficient allocation of image recognition and LLM tasks on multi-GPU system

1. Poznan Supercomputing and Networking Center
2. Częstochowa University of Technology

This work is concerned with the evaluation of the performance of parallelization of learning and tuning processes for image classification and large language models. For machine learning model in image recognition, various parallelization methods are developed based on different hardware and software scenarios: simple data parallelism, distributed data parallelism, and distributed processing. A detailed description of presented strategies is given, highlighting the challenges and benefits of their application. Furthermore, the impact of different dataset types on the tuning process of large language models is investigated. Experiments show to what extent the task type affects the iteration time in a multi-GPU environment, offering valuable insights into the optimal data utilization strategies to improve model performance. Furthermore, this study leverages the built-in parallelization mechanisms of PyTorch that can facilitate these tasks. Furthermore, performance profiling is incorporated into the study to thoroughly evaluate the impact of memory and communication operations during the training/tuning procedure. Test scenarios are developed and tested with numerous benchmarks on the NVIDIA H100 architecture showing efficiency through selected metrics.

Files

view.pdf

Files (614.7 kB)

Name	Size	Download all
view.pdf md5:df1d59ff3c12aea018bc48d05b7b1dbf	614.7 kB	Preview Download

Additional details

Issued: 2025-03-20

	All versions	This version
Views	55	55
Downloads	48	48
Data volume	34.4 MB	34.4 MB

Efficient allocation of image recognition and LLM tasks on multi-GPU system

Authors/Creators

Description

Files

view.pdf

Files (614.7 kB)

Additional details

Dates