On Audio Processes in the Artificial Intelligence [self.]

doi:10.5281/zenodo.50355

Published March 1, 2016 | Version v1

Conference paper Open

1. Norwegian University of Science and Technology, Trondheim, Norway

This paper describes [self.], an open source art installation that embodies artificial intelligence (AI) in order to learn, react, and respond to stimuli from its immediate environment. Biologically inspired models are implemented to achieve this behavior, and Csound is used for most of the audio processing in the system. The artificial intelligence is physically represented by a robot head, built on a modified moving head for stage lighting. Everything but the motors of the stage lighting unit was removed, and a projector, camera, and microphones were added. No form of knowledge or grammar has been implemented in the AI; the system starts in a "tabula rasa" state and learns everything via its own sensory channels, forming categories in a bottom-up fashion. The robot recognizes sounds and faces, and is able to recognize similar sounds, link them with the corresponding faces, and use the knowledge of past experiences to form new sentences. Since the utterances of the AI are based solely on audio and video items it has learned from interaction with people, an insight into the learning process (i.e. what it has learned from whom) can be glimpsed. This collage-like composition has guided several design choices regarding the aesthetics of the audio and video output. This paper focuses on the audio processes of the system: audio recording, segmentation, analysis, processing, and playback.

Files

Brandtsegg_Tidemann.pdf (1.7 MB, md5:2a5e149a6741a1ec3f9b39fdf01beca4)

Resource type

Conference paper

Publisher

The Bonch-Bruevich St. Petersburg State University of Telecommunications

Imprint

Proceedings of the Third International Csound Conference, 44-59. St. Petersburg, Russia. ISBN: 978-5-89160-124-6.

Conference

The Third International Csound Conference (ICSC2015), St. Petersburg, Russia, 2-4 October 2015

Creative Commons Attribution Share Alike 4.0 International

Technical metadata

Created: May 18, 2016
Modified: January 20, 2020
