
Universal Speech and Audio Coding AlgorithmsforMultimedia and Teleconferencing Applications |
|
|
| Contents |
| Introduction |
What are Speech and Audio Coding?Speech and audio coding or compression is the field concerned with compact digital representations of speech or audio signals for the purpose of efficient transmission or storage. The central objective is to represent a signal with a minimum number of bits while maintaining perceptual quality. Current applications for speech and audio coding algorithms include cellular and personal communications networks (PCNs), teleconferencing, desktop multi-media systems, and secure communications. Historically, coding algorithms using incompatible compression techniques have been optimized for particular signal classes, i.e., narrowband (telephone quality; 4 kHz BW), wideband (AM grade; 7 kHz BW), high-quality (FM grade; 15 kHz BW), and high-fidelity (CD quality; 20 kHz BW). |
| Objectives |
The goal of this project is to develop a new family of universal, scalable, and interoperable speech and audio compression algorithms for teleconferencing and multimedia applications.In particular, the project objectives are to:
|
| Research Topics |
Several research topics are currently under study, including:
|
| Recent Accomplishments |
Until now, our work has progressed separately on two parallel paths along the lines of low rate coding of narrowband speech and high-fidelity audio coding. Recently, we have:IN LOW RATE NARROWBAND SPEECH CODING, we have:
|
| Presentation on NDTC Project (April 1997) |
We recently presented to Intel a summary of our research activities during 1997.Click here to view the presentation on low-rate coding of narrowband speech.Click here to view the presentation on variable-rate coding of high-fidelity audio. |
| Example Speech and Audio Coders |
We have developed several speech and audio coders during the course of this project.Click here to see an example 2400 bit-per-second narrowband speech coder.Click here to see an example variable rate high-fidelity audio coder. |
| Research Group |
This page describes speech/audio coding research conducted by several people, including:Dr. Andreas Spanias, Principal InvestigatorSassan Ahmadi, Research Associate Ted Painter, Research Associate |
| Publications |
Several PostScript documents are also available which give more details on our work:Painter, E. M., and Spanias, A. S. (1997). A Review of Algorithms for Perceptual Coding of Digital Audio Signals. |
| Related Sites |
Other sites related to this project include the following:Telecommunications Research CenterCollege of Engineering and Applied Sciences Arizona State University, which is located in the city of Tempe, Arizona 85287-7206 USA |
| Acknowledgements |
The work described on this page is sponsored by a grant from the Intel Corporation.We gratefully acknowlege the generous support of the Intel Corporation's NDTC group which has made possible the work desribed on this site. In addition to several research grants, the Intel NDTC group has donated to the ASU-TRC Speech Lab several high performance workstations fully equipped with application software, including two high-end Pentium and two state-of-the art Pentium-Pro NT workstations. |
| Contacts |
For further information, direct all correspondance to:Dr. Andreas S. Spanias <spanias@asu.edu> |