University of Kentucky Master's Theses

Archived

This content is available here for research, reference, and/or recordkeeping.

AUDIO SCENE SEGEMENTATION USING A MICROPHONE ARRAY AND AUDITORY FEATURES

Harikrishnan Unnikrishnan, University of KentuckyFollow

Date Available

12-7-2011

Year of Publication

2010

Document Type

Thesis

Degree Name

Master of Science (MS)

College

Engineering

Department/School/Program

Electrical Engineering

Faculty

Dr. Kevin D. Donohue

Faculty

Dr. Stephen D. Gedney

Abstract

Auditory stream denotes the abstract effect a source creates in the mind of the listener. An auditory scene consists of many streams, which the listener uses to analyze and understand the environment. Computer analyses that attempt to mimic human analysis of a scene must first perform Audio Scene Segmentation (ASS). ASS find applications in surveillance, automatic speech recognition and human computer interfaces. Microphone arrays can be employed for extracting streams corresponding to spatially separated sources. However, when a source moves to a new location during a period of silence, such a system loses track of the source. This results in multiple spatially localized streams for the same source. This thesis proposes to identify local streams associated with the same source using auditory features extracted from the beamformed signal. ASS using the spatial cues is first performed. Then auditory features are extracted and segments are linked together based on similarity of the feature vector. An experiment was carried out with two simultaneous speakers. A classifier is used to classify the localized streams as belonging to one speaker or the other. The best performance was achieved when pitch appended with Gammatone Frequency Cepstral Coefficeints (GFCC) was used as the feature vector. An accuracy of 96.2% was achieved.

Recommended Citation

Unnikrishnan, Harikrishnan, "AUDIO SCENE SEGEMENTATION USING A MICROPHONE ARRAY AND AUDITORY FEATURES" (2010). University of Kentucky Master's Theses. 622.
https://uknowledge.uky.edu/gradschool_theses/622

Download

Included in

Electrical and Computer Engineering Commons

COinS

University of Kentucky Master's Theses

Archived

AUDIO SCENE SEGEMENTATION USING A MICROPHONE ARRAY AND AUDITORY FEATURES

Date Available

Year of Publication

Document Type

Degree Name

College

Department/School/Program

Faculty

Faculty

Abstract

Recommended Citation

Included in

Search

Browse by Author

Author Corner

Links

Connect

University of Kentucky Master's Theses

Archived

AUDIO SCENE SEGEMENTATION USING A MICROPHONE ARRAY AND AUDITORY FEATURES

Author

Date Available

Year of Publication

Document Type

Degree Name

College

Department/School/Program

Faculty

Faculty

Abstract

Recommended Citation

Included in

Share

Search

Browse by Author

Author Corner

Links

Connect