• 沒有找到結果。

Introduction to hearing aid systems

Chapter 1 Introduction

1.2 Introduction to hearing aid systems

The number of hearing impaired individuals who need to use hearing aids is rapidly increasing [1]. But the cost of hearing aids is still a big concern for those in need. The goal of the NCTU hearing-aid team is to develop a low cost and low power-consumption, but yet high-performance hearing aid. It is therefore very important to consider signal processing tasks in terms of ease of implementation and power consumption in addition to effectiveness. In the following, the basic structure of hearing-aids is presented; along the way, major issues and signal processing tasks will be discussed and the main theme of this thesis will become clear.

Current hearing aids are digital devices that embody input (microphone) and output (receiver/subminiature loudspeaker) transducers, analog input conditioning circuits, an analog-to-digital converter (ADC), a digital signal processor (DSP), a digital-to-analog converter (DAC) and a power amplifier [2]. The block diagram of a digital hearing aids is shown in Fig. 1.2. Hearing aids amplify incoming sound according to the hearing-loss profile of the patient so as to increase the signal level and compensate part of the hearing loss.

Fig. 1.2 Block diagram of a digital hearing aid.

There are two major signal processing issues discussed in this thesis, namely the feedback problem and the noise reduction problem. First, consider the feedback problem.

Due to imperfect earmold fitting and venting in the hearing aid device, there is acoustic leakage from the receiver to the microphone. The leakage causes a regenerative feedback loop, which frequently makes the hearing aid oscillate and results in a whistling sound. For people with severe hearing loss, this problem is serious because at high gains there is an increased risk of squealing which may render the hearing aid useless. Feedback becomes more apparent when an object is close to the ear (i.e. when using a telephone) or when the jaw is moving (i.e. when chewing). In case of squealing, hearing aid users tune down the hearing aid gain or fit hearing aids more tightly in the ear canal, but such adjustments may compromise the function and comfort of the hearing aid [3].

To solve the problem of feedback, modern hearing-aids usually utilize adaptive filtering to imitate and eliminate the feedback signal. But before doing that, knowledge about typical feedback path is essential in making specifications on the adaptive feedback canceling filter. The first contribution of this thesis therefore consists of establishing an acoustic measuring environment and carrying out feedback path measurement and estimation task. An ITC hearing-aid earmold embodied with a receiver and a microphone, together with an artificial ear in the acoustic chamber, is used in the experiment. More technical details can be found in Chapter two. Our measurement results provide a reference point for the specification and implementation of the feedback canceling filter.

The second major issue is the background noise experienced in the daily lives of hearing aid users; this greatly reduces the speech intelligibility as well as comfort of listening [4]. Quite often noise reduction is a built-in feature for modern hearing aids, especially when the users’ primary need is to comprehend human speech. To achieve effective reduction, the usual approach is first to identify the defining characteristics of the speech and non-speech signals, then do different operations to signals with different characteristics. Not surprisingly, the most common operation is filtering.

In Chapter three, we consider the possibility of using the Kalman filter in hearing-aids for noise reduction. The single-band Kalman filter is studied while the multi-band Kalman filter is kept aside due to the possible surge in computational cost. We did simulation and comparison studies between the Kalman filter and the well-known spectrum subtraction method. The Kalman filter, in general, has the potential of obtaining better performance in reducing the noise, while the spectral subtraction method enjoys simple formulation and easy implementation. In considering signal processing tasks for real-time applications like hearing-aids, it is important to keep in mind that performance evaluation is not always the

major criterion of choosing particular signal processing algorithms. Easy implementation and low consumption, which often come with the low computation cost, are rather important, if not more important, in making the decision. We also suggest the possibility of combining the Kalman filter and the existing analysis/synthesis filter bank as the future development in order to achieve a balance between the performance and low power/cost.

Chapter 2

Measurement of feedback paths in hearing aids

2.1 Introduction to the feedback path

Undesirable acoustic feedback is an uncomfortable squeal that occurs when the output sound from the receiver leaks to the microphone and that leaked sound gets re-amplified.

Unstable acoustic feedback develops into a problem when the earmold/earshell is loosely fitted (or a large vent is part of the earmold/earshell) and when there is large signal amplification in the hearing instrument. From a signal processing viewpoint, this feedback mechanism is usually modeled as a linear operation and typical system identification methods are used to obtain the feedback path model.

The most straightforward method to reduce the possibility of unstable acoustic feedback is to reduce the gain, but this is often unacceptable because reducing gain means that the effectiveness of hearing-aids would be severely limited. To maintain the amplifying gain while prevent the onset of unstable acoustic feedback, many modern hearing instruments feature adaptive feedback canceling algorithms. Among them the popular methods include adaptive notch filters, automatic gain reduction in a multi-channel system and several variations of adaptive feedback cancellation [4],[5]. Of these, the adaptive

feedback cancellation methods that estimate the feedback path and subtract it from the input signal of the hearing instrument are arguably the most effective to date. From a clinical perspective, although many manufacturers claim an increased gain of ≧15dB (with the application of these algorithms) before the onset of unstable acoustic feedback , an increased gain of <10dB is often the case.

The hearing instrument system, in particular the acoustic feedback path, has been studied, especially been modeled as a linear system depicted in Fig. 2.1. For example, Lybarger [6] measured the frequency response, but the frequency response was limited only to magnitude responses at a few selected frequencies. Bustamante [5] chose click sequences as stimuli and measured the response. The impulse response of the system was calculated by averaging the responses. Stinson [7] measured the frequency response with a sweep stimulus. Egolf et al. [8] modeled each component of a glass type analog hearing instrument by two-port methods that took load effects into consideration. Kates [9] transferred each component of an ITE hearing instrument from frequency domain into time domain that matched the magnitude response, but ignored the phase response. Jingbo Yang [2] used sweep stimulus and white noise method to model the EFP in an ITE hearing instrument and verify the Nyquist criterion. Then he employed an adaptive LMS FIR filter to determine the impulse response of the EFP.

In this chapter, we describe a measuring platform to facilitate research for the development of acoustic feedback algorithms. Specially we consider the magnitude and phase response of the external feedback path (EFP) obtained by the sweep stimulus method.

Using the characteristics of EFP obtained, we predict theoretically (Nyquist criterion) the

impulse response. An adaptive Finite-Impulse-Response (FIR) filter can be used to match the actual EFP impulse response such that the effects of the EFP can be nullified (Fig. 2.1) [2].

Fig. 2.1 Block diagram of the basic hearing instrument system with adaptive feedback cancellation [2].

2.2 ITC digital hearing instrument

In this section, we will describe the experimental setup of the measurement platform.

The hearing instrument used in this setup is a custom-made In-The-Canal (ITC) hearing instrument embodying only a microphone and a receiver. The block diagram is shown in Fig.

2.2.

Fig. 2.2 Experimental setup with the ITC placed in-situ.

Legend

(1) Artificial head (Head and Torso Simulator-Type 4128C) (2) Artificial right ear (Head and Torso Simulator-Type 4128C) (3) Artificial pinna (Head and Torso Simulator-Type 4128C) (4) ITC hearing instrument shell with a microphone and receiver (5) Knowles FG-23742-D36 microphone

(6) Knowles FK-23451-000 receiver (7) Acoustical feedback path

(8) Interface between microphone and ADC (9) Interface between DAC and receiver (10) PC, B&K3560-C

2.3 Sweep stimulus method

The method we used to obtain the frequency response of the EFP is called the sweep stimulus method. In the method, a sweep signal (its frequency changes from low to high) is utilized as the stimulus to the receiver. The sinusoidal sweep signal 1volt(rms) from 100Hz to 25000Hz is injected into (A) and the output of (C) is measured. In general hearing aids just consider the frequency bandwidth which is from 200Hz to 8000Hz. But in our experiment we measure the frequency response from 100Hz to 25000Hz to obtain the broader result and get a better impulse response. We depict the signal flow of the measurement platform in Fig. 2.3. The stimulus sweep signal is obtained from the pulse generator (B&K 3560C). Because the microphone in the hearing aid is unstable above 10000Hz, we use another microphone (B&K4190L) put near the hearing aid’s microphone to measure the feedback path from 100Hz to 25000Hz. By means of the platform, we obtain the sound pressure level of EFP. Then we transfer it to voltage and then depict the frequency response in Fig. 2.4.

Fig. 2.3 Experimental system to measure the EFP.

Fig. 2.4 Frequency response of EFP.

The Nyquist criterion states that oscillations will occur if the magnitude response of the loop-gain is greater than unity and the phase response is a multiple of 2π. Most hearing aids have different gains in different frequency. Usually the maximum gain is above 40dB.

Therefore we suggest the gain of our hearing aid system is a constant. So we plus the gain 40dB and get the closed-loop response as Fig. 2.5. Therefore in our system the oscillations may occur in 3000Hz and 9000Hz. As a result, the howling may occur in 3000Hz in a general hearing aid.

Fig. 2.5 Frequency response of the closed-loop.

On the other hand, we also used the hearing aid microphone and the microphone in right ear simulator to model the acoustic feedback path which is equivalent to the transfer function from (B) to (D) in Fig. 2.3. We assume that the sound pressure level in ear canal is almost the same as that of the receiver. So the SPL of hearing aid divided by that of the ear canal is equivalent to the transfer function of the acoustic feedback. The frequency response of the acoustic feedback path is as Fig. 2.6. We also give it a 40dB gain to obtain the closed-loop response which is as Fig. 2.7. Therefore the hearing aid may oscillate in 6200Hz.

Fig. 2.6 Frequency response of the acoustic feedback path.

2.4 Modeling the EFP in the time-domain

In the previous section, we have modeled the frequency response of the EFP with sweep stimulus. As most of the recent feedback cancellation methods are based on adaptive filtering ( the filter response changes with time according to the variance of the EFP ), it is imperative to model the EFP in the time domain. The time-domain model proposed by Kates [9], as mentioned earlier, considered only the magnitude response and ignored the phase response. However, as we have shown in the previous section, the phase of the EFP is important for determining the frequency where acoustic oscillation occurs.

To determine the impulse response of the external feedback path, we employed a Matlab program which uses the IFFT function to transfer the frequency response to the time-domain that is impulse response. In order to get a more natural impulse response, we took the broadband frequency response as Fig. 2.4 to deal with. The impulse response is depicted in Fig. 2.8. And we can obtain the feedback time which is about 3 milliseconds (ms). As a result , the processing time of feedback cancellation should be in 3 ms. Therefore the time-domain model (or impulse response) can therefore be used for the development of acoustic feedback cancellation algorithms in the future.

Fig. 2.8 Impulse response of the EFP.

Chapter 3

Adaptive noise cancellation algorithms in hearing aids

3.1 Introduction

Noise problem naturally arise in many areas, such as voice communication, speech recognition, and hearing aids. As a result, noise cancellation becomes an important research topic and many studies have been using techniques such as short-time spectral amplitude estimation [10]-[13], iterative Wiener filtering [14]-[16], audio-based filtering [17],[18], signal-subspace processing [19],[20], and hidden Markov modeling (HMM) [21],[22].

Although significant results have been achieved, most of them are not suitable for real-time implementation because their computational complexities are generally too high.

The Kalman filter is well known in signal processing for its efficient structure, and it can be used for time-varying system. It is used in a wide range of engineering applications from radar to computer vision, and is an important topic in control theory and noise cancellation. The Kalman filter is a recursive estimator, that is to say, it only needs the estimated state from the previous time step and the current measurement to estimate the current state. Since it has the potential of high performance, is naturally adaptive, and may qualify for low computational cost, we decided to study the possibility of applying it to noise cancellation in hearing aids.

In [23], Paliwal and Basu used a Kalman filter to enhance speech corrupted by white noise. On a short-time base, speech signals were modeled as stationary AR process and AR parameters were assumed to be known. Gibson, Koo, and Gray considered speech enhancement with colored noise in [24]. They modeled both speech and colored noise as AR processes and developed scalar and vector Kalman filtering algorithms. To estimate the AR coefficients, an EM-based algorithm was employed. In [25], Lees and Ann proposed a non-Gaussian AR model for speech signals. They modeled the distribution of the driving-noise as a Gaussian mixture and applied a decision-directed nonlinear Kalman filter.

Again, an EM-based algorithm was used to identify unknown parameters. Niedzwiecki and Cisowki [26] assumed that speech signals are nonstationary AR processes and used a random-walk model for the AR coefficients. An extended Kalman filter was then used to simultaneously estimate speech and AR coefficients, Note that the stability of the extended Kalman filter is not guaranteed and dimensions of the Kalman filter are greatly increased.

The aforementioned Kalman filtering algorithms still require extensive computations for two reasons: first, using EM algorithms to identify AR coefficients costs a lot, and second, using Kalman filters usually involves matrix inversion. However, the above costs depend on the order of the Kalman filter used. In order to carefully assess the pros and cons of using Kalman filters in hearing aids, we intend to use low-order Kalman filters and low-cost Yule-Walker method for estimating AR coefficients. As a future plan, Kalman filters incorporating with sub-band filtering structure will also be studied; in this case, zero or first-order Kalman filters may be sufficient and the calculation only involves scalar operations, thus saving a considerable amount of computation [27].

3.2 Kalman filtering for white noise cancellation

We derived the following from [27],[24]. On a short-time basis, a speech sequence ( )n

x can be represented as an AR process, which is essentially the output of an all-pole linear system driven by a white noise sequence:

speech signal z n is assumed to be contaminated by a zero-mean additive Gaussian ( ) noise v n( ) , i.e., z n

( )

=x n

( ) ( )

+v n . Let v n( ) be white and

( ) [ ( ) (n x n x n 1) x n( p 1)]T

=Δ − ⋅⋅⋅ − +

x . Equation (3.1) and the corrupted speech ( )z n can be formulated in the state-space domain as

( )x n =Fx(n− +1) HTw n( ) (3.2) From standard Kalman filtering theory for white message and measurement noise, the state vector estimate is

xˆ =Fxˆ(n−1)+G(n)[z(n)−HFxˆ(n−1)] (3.6)

With the initial condition xˆ(0)=0. The gain and error covariance equations are

G(n)=M(n)HT[HM(n)HT +R]1 (3.7) M(n)=FS(n−1)FT +HTQH (3.8) S(n)=[IG(n)H]M(n) (3.9) where G(n)is the Kalman gain vector, M(n)is the prediction-error covariance matrix,

) (n

S is the filtering-error covariance matrix, Rv2 is the variance of the noise sequence{ nv( )}, and Qw2 is the variance of the driving term{ nw( )}. With the initial autocorrelation method from the noisy observations when the true values are not known.

The frame length of 360 samples corresponds to 15ms and was selected arbitrarily from within the common parameter update range of 10 to 30 ms often used in linear predictive systems. The frame length is not critical to the filter performance within these bounds. The noise variance σv2 is estimated in some time interval before speech is present. Fig. 3.1 is the flowchart of the Frame-based AR Kalman filtering for white noise cancellation. At each iteration, we alternately estimate the parameters and filter the speech.

For the complexity of estimating AR coefficients, we propose a simpler method to reduce the computation of AR coefficients. First we estimate the AR coefficients from a clean speech, and then we use the Kalman filter with the fixed AR coefficients to filter the noisy speech. Fig. 3.2 is the flowchart of the fixed AR Kalman filtering for white noise cancellation. We reduce the computation and get a not bad performance in Kalman filtering for white noise cancellation. The simulation results are listed in section 3.3.

Fig. 3.1 The flowchart of the frame-based AR Kalman filtering for white noise cancellation.

Fig. 3.2 The flowchart of the fixed AR Kalman filtering for white noise cancellation.

3.3 Simulation

In this section, we take a girl’s sound “girl , meat ball” in Mandarin from Hearing and Speech Engineering Lab in National Yang-Ming University to be the clean input speech.

This sentence of sound is sampling in 24KHz and about 3.2sec. The white noise we used is a function “rand” in Matlab. We add the white noise in some level of SNR and used the Kalman filter to enhance the noisy signal. Here are some simulation results and we will discuss them in the next section.

In the frame-based AR method we estimate AR coefficients and filter speech in every frame. Fig. 3.3 and Fig. 3.4 are the plots of frame-based method. Then we evaluate the SNRseg and PESQ of speech frames in Table 3.1 and Table 3.2 in different AR orders and in different SNR. Differently we estimate the AR coefficients once from the clean sentence and filter the noisy sentence with the fixed AR coefficients. Then we have the simulation plots of fixed AR method in Fig. 3.5 and Fig. 3.6. After that we also construct Table 3.3 and Table 3.4 with evaluation of speech frames in different AR orders and in different SNR.

Finally we also take five girl’s speech samples from Hearing and Speech Engineering Lab in National Yang-Ming University to be the clean input speech. We contaminate the five speech samples in SNR0 and evaluate those PESQ as the benchmark. Then we use the fixed AR method and framed-base AR method in AR order 4 to enhance these noisy speeches. The simulation results are as Fig. 3.11 and Fig. 3.12.

Fig. 3.3 Time plot of enhanced SNR-5 white noisy speech by frame-based AR4 Kalman filtering.

Fig. 3.4 Spectrogram of enhanced SNR-5 white noisy speech by frame-based AR4 Kalman filtering.

Fig. 3.5 Time plot of enhanced SNR-5 white noisy speech by fixed AR4 Kalman filtering.

Table 3.1 SNRseg of speech frames for frame-based AR Kalman filtering.

Fig. 3.7 SNRseg of speech frames for frame-based AR Kalman filtering.

Table 3.2 PESQ of speech frames for frame-based AR Kalman filtering.

Fig. 3.8 PESQ of speech frames for frame-based AR Kalman filtering.

Table 3.3 SNRseg of speech frames for fixed AR Kalman filtering.

Fig. 3.9 SNRseg of speech frames for fixed AR Kalman filtering.

Table 3.4 PESQ of speech frames for fixed AR Kalman filtering.

Fig. 3.10 PESQ of speech frames for fixed AR Kalman filtering.

1.5

Fig. 3.11 PESQ of sentences in different SNR0 speech samples.

Fig. 3.11 PESQ of sentences in different SNR0 speech samples.

相關文件