Use this URL to cite or link to this record in EThOS: https://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.380170
Title: Adaptive differential pulse code modulation and sub-band coding of speech signals
Author: Wong, Kam-Him James
Awarding Body: University of Southampton
Current Institution: University of Southampton
Date of Award: 1987
Availability of Full Text:
Access from EThOS:
Full text unavailable from EThOS. Please try the link below.
Access from Institution:
Abstract:
Developments of Adaptive Differential Pulse Code Modulation (ADPCM) and Sub-band Coder for speech signals are presented that increase their robustness of transmission errors. The impetus for the study derives from the current activities in Europe to produce a continental digital cellular mobile radio system. In order to transmit digitised speech in the mobile radio environment, the speech codec must have special properties. Apart from being able to produce toll quality speech at a reasonable cost, operate at a bit rate below 24 kbit/s, they must also be able to withstand burst and distributed transmission errors, and have the capability to operate in noisy acoustical ambient conditions. Our attention is focused on two strong contenders that satisfy these requirements, namely adaptive differential pulse code modulation (ADPCM) and sub-band coding (SBC). An ADPCM codec is designed with step-size prediction and vector predictors. The step-size prediction increases the codec's dynamic range; while the vector predictors enhance the speech quality and increase the codec's robustness to transmission errors. The codec yields toll quality speech when the encoded bit rate is 24 kbit/s. The sub-band codec separates the speech signal into sub-bands by means of quadrature mirror filters and encodes each sub-band signal using one-word memory quantizers. A semi-adaptive bit allocation strategy for the quantizers is used dependent on whether the speech is voiced, unvoiced or in transition, or whether data signals are present. We have developed the SBC for operation over mobile radio channels. Specifically, we apply either step-size forcing or step-size leakage algorithms. The resultant codec configuration produces near toll quality speech while operating at transmission bit rates of 16 kbit/s. Its speech quality can be maintained without significant degradation for bit error rates (BER) up to 3x10⁻³. Entropy coding has been embedded into the sub-band codec. We classify the sub-band signals and the adaptive quantizer step-sizes to produce a number of codebooks. By this technique we have achieved a reduction in the average bit rate from 8% to 15%. Joint optimisation of the SBC and Reed-Solomon or convolution coding followed by post speech enhancement techniques is studied. We find that embedded systematic Reed-Solomon coding enables 16 kbit/s SBC speech to be conveyed over Rayleigh fading channels at 24 kbit/s to yield toll quality speech at BERs < 10⁻².
Supervisor: Not available Sponsor: Not available
Qualification Name: Thesis (Ph.D.) Qualification Level: Doctoral
EThOS ID: uk.bl.ethos.380170  DOI: Not available
Keywords: Mobile radio speech coding
Share: