Abstract

Since transmission capacities have significantly increased over the past few years, researchers are now able to transmit a larger amount of data, namely multichannel audio content, in the consumer applications. What has not been investigated in a systematic way yet is how to deliver the multichannel content. Specifically, researchers' attention is focused on the quest of a standardized immersive reproduction format that incorporates height loudspeakers coupled with the new high-resolution and three-dimensional (3D) media content for a comprehensive 3D experience. To better understand and utilize the immersive audio reproduction, this research focused on the (1) interaction between the positioning of height loudspeakers and the signals fed to the loudspeakers, (2) investigation of the perceptual characteristics associated with the height ambiences, and (3) the influence of inverse filtering on perceived sound quality for the realistic 3D sound reproduction. The experiment utilized the existence of two layers of loudspeakers: horizontal layer following the ITU-R BS.775 five-channel loudspeaker configuration and height layer locating a total of twelve loudspeakers at the azimuth of ±30°, ±50°, ±70°, ±90°, ±110° and ±130° and elevation of 30°. Eight configurations were formed, each of which selected four height-loudspeakers from twelve. In the subjective evaluation, listeners compared, ranked and described the eight randomly presented configurations of 4-channel height ambiences. The stimuli for the experiment were four nine-channel (5 channels for the horizontal and 4 for the height loudspeakers) multichannel music. Moreover, an approach of Finite Impulse Response (FIR) inverse filtering was attempted, in order to remove the particular room's acoustic influence. Another set of trained professionals was informally asked to use descriptors to characterize the newly presented multichannel music with height ambiences rendered with inverse filtering. The experimental results indicate the significance of the positioning of the loudspeakers with respect to the signals being fed to those loudspeakers in delivering a 3D sound field. Furthermore, it has been revealed that the perceptual characteristics that listeners linked for multichannel music with height ambiences include envelopment, elevated-ness and fullness. Last but not least, after applying the inverse filtering the subjective preference was not affected significantly. This allows for the author to believe that, in fact, the room's influence with respect to the subjective evaluation is not as important as the appropriate loudspeaker-positioning for the multichannel-reproduced music with height ambiences.

Library of Congress Subject Headings

Multichannel communication--Data processing; Loudspeakers--Data processing

Publication Date

5-19-2014

Document Type

Thesis

Student Type

Graduate

Degree Name

Telecommunications Engineering Technology (MS)

Department, Program, or Center

Electrical, Computer and Telecommunications Engineering Technology (CAST)

Advisor

Sungyoung Kim

Advisor/Committee Member

Mark J. Indelicato

Advisor/Committee Member

Clark Hochgraf

Comments

Physical copy available from RIT's Wallace Library at TK5015.15 .K37 2014

Campus

RIT – Main Campus

Plan Codes

TCET-MS

Share

COinS