Speaker Diarization Based on Speech Signal Approximation by Step Function

Latypov, Rustam; Stolov, Evgeni

doi:10.5281/zenodo.4514965

Published January 29, 2021 | Version v1

Conference paper Open

1. Kazan Federal University

In this paper, we describe a new method for speaker clustering in an audio file. The main idea is to replace the speech signal with a step function having a limited number of levels. The research goal is to determine which characteristics of the signal can be obtained by analyzing the resulting step function. The step function is created by setting multiple levels that divide the signal range into non-overlapping strips. Every source signal value that falls inside a strip is replaced by that strip's mark. Using the sine function as a template, we derive recommendations for choosing a step function that best preserves the features of the source signal. We apply the obtained results to the problem of speaker diarization. The developed diarization algorithm requires few computational resources. The experiments show that the results are suitable for online diarization.
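
The step-function construction described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `step_approximation`, the use of the strip index as the strip's mark, and the three example levels are all assumptions made here for illustration.

```python
import bisect
import math


def step_approximation(signal, levels):
    """Replace each sample with the mark of the strip that contains it.

    `levels` are the boundaries that split the signal range into
    len(levels) + 1 non-overlapping strips. The mark of a strip is
    assumed here to be its index (0, 1, 2, ...); the paper may use
    a different marking.
    """
    bounds = sorted(levels)
    # bisect_right counts the boundaries that are <= the sample,
    # which is the index of the strip holding that sample.
    return [bisect.bisect_right(bounds, x) for x in signal]


# Sine template: one period sampled at 16 points (hypothetical sizes).
sine = [math.sin(2 * math.pi * n / 16) for n in range(16)]
marks = step_approximation(sine, levels=[-0.5, 0.0, 0.5])
```

With three levels the sine template is mapped onto four marks, so the step function keeps the coarse shape of the period while discarding the exact amplitudes.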

Resource type

Conference paper

Publisher

FRUCT Oy

Published in

Proceedings of the 28th FRUCT conference, 28, 598-604, 2021.

Imprint

ISBN: 978-952-69244-4-1.

Conference

The 28th IEEE Conference of Open Innovations Association FRUCT (FRUCT28), Moscow, Russia, 25-29 January 2021

Languages

English

Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited.

Technical metadata

Created: February 6, 2021
Modified: July 19, 2024
