The TS-FNN based adaptive predictor - Proposed TS-FNN based coding system

5.2 Proposed TS-FNN based coding system

5.2.1 The TS-FNN based adaptive predictor

In this chapter, the predictor is implemented by using the TS-type fuzzy neural network (TS-FNN) proposed by Takagi and Sugeno [37]-[39]. It is noted that a highly complex nonlinear model can be easily described in the TS-FNN system with a small number of fuzzy “If-Then” rules [37]-[39]. In general, the kth rule, R_k, of the TS-FNN model can be expressed by

R_k: If z₁(t) is F_1k . . . and z_i(t) is F_ik Then

y_k(t) = A_kx(t) + B_ku(t) f or k = 1, 2, . . . , L, (5.1) where z₁(t), . . . , z_i(t) are the input variables, F_1k, . . . , F_ik are the fuzzy quan-tiﬁers (fuzzy sets) associated with corresponding input variables, y_k(t) ∈ R^p represents the output of the kth rule, t denotes the current discrete time index, L denotes the number of rules, A_k ∈ R^p×n, x(t) ∈ Rⁿ, B_k ∈ R^p×m, and u(t)∈ R^m are the consequent parts of the rule.

The proposed TS-FNN based predictor is shown in Fig. 5.2. As can be seen in Fig. 5.2 , the proposed network has four layers. The nodes in Layer1 (the input layer) transmit the input signal to their output directly, i.e., it plays the role of signal buﬀering. In the proposed approach, there are two nodes in the input layer (i = 2). Layer2 is the “membership layer”. In this layer, the number of nodes associated with corresponding input variable is set to be three (j = 3). Layer3 is the “rule layer”. Since the interconnection topology between nodes in layer2 is fully connected, we have nine (i.e., jⁱ) nodes (i.e., rules) in layer3. Layer4 is the “output layer”. The output layer has only one node in the proposed approach, and its output is the prediction

Layer1 Layer2 Layer3 Layer4

Rule Layer

Layer1 Layer2 Layer3 Layer4

Rule Layer

Rule R_k, k=1..9 z_i, i=1,2 m_ij, j=1..3

Figure 5.2: Proposed TS-FNN predictor.

value of the coding pixel. It is noted that the pixel values are normalized by 255 before they are feed into the system, suppose the input image has 8 bits per pixel, and the output of the network is scaled by 255 and then rounded to the nearest integer as the prediction value.

For notation convenience, the net input to the jth node in layer l and the corresponding activation function are denoted by net^(l)_j and f_j^(l) respectively, and the output of jth node in layer l, is given by O_j^(l) = f_j^(l)(net^(l)_j ). A brief description about the proposed TS-FNN based predictor is given below.

Fig. 5.3 deﬁnes the texture context of the coding pixel. Besides, the input vector z(t) of the proposed network is given by

x₁

Figure 5.3: Texture context around the coding pixel.

where t denotes the discrete coding sequence. Obviously, z₁ measures the strength of vertical deviation (i.e., horizontal edge detection), while z₂ is for that of horizontal deviation (i.e., vertical edge detection). It should be noted that all the pixels used should be normalized by 255 before feed into the network in order not to fall into the saturation region during the network learning process.

Layer1: Input layer

The nodes in this layer just perform the buﬀering operations. That is, the input variables z₁ and z₂ are transmitted to layer2 directly. Therefore, the output of the ith node in layer1 is given by

O_i⁽¹⁾ = z_i for i = 1, 2. (5.3)

Layer2: Membership layer

In this layer, the activation function of the jth node associate with the ith input, z_i, in layer1 is denoted by f_ij⁽²⁾. Each node in this layer performs the membership degree measurement of a Gaussian function, and its output, µ_ij = O⁽²⁾_ij , speciﬁes the degree to which the given input z_i satisﬁes the fuzzy

quantiﬁer f_ij⁽²⁾, that is,

µ_ij = O_ij⁽²⁾= f_ij⁽²⁾(z_i) = exp{−(z_i− m_ij)²

σ_ij² } for i = 1, 2, and j = 1, 2, 3, (5.4) where m_ij and σ_ij denotes the center and the width (i.e., standard devia-tion) of the Gaussian membership function respectively, and the subscript ij indicates the jth node associated with the ith input z_i in layer1.

Layer3: Rule layer

The links in this layer are used to implement the antecedent matching.

The matching operation (or the fuzzy “AND” aggregation) is chosen as the simple “product” operation instead of “min” operation. Therefore, each node in this layer multiplies incoming signals and sends the product to its output. The output of the kth node in this layer represents the ﬁring strength of the kth rule, and is given by

α_k = O_k⁽³⁾ = µ_1p∗ µ2q = net⁽³⁾_k for k = 1, 2, . . . , 9,

p = 1, 2, 3, and 1≤ q = k − 3(p − 1) ≤ 3. (5.5)

Layer4: Output layer

This layer performs the defuzziﬁcation process to get the numerical out-put. As can be seen in Fig. 5.2, the output of each node in layer3 (i.e., the ﬁring strength α_k) is ﬁrst weighted by a factor w_k, and then summed up to-gether as the net input of layer4. The connection weight w_k, which represents the output action for the kth rule, is given by

w = A x(t) + B u(t) for k = 1, 2, . . . , 9, (5.6)

where x(t) = [x₁, x₂, . . . , x₆]^T is composed of the six causal neighbors around the coding pixel, A_k = [a¹_k, a²_k, . . . , a⁶_k]^T can be regarded as the sixth-order predictor coeﬃcients associated with the kth rule, u(t) = [u₁, u₂]^T = [x₁ − x₃, x₂ − x₃]^T = z(t) is used for horizontal and vertical edge detection, and the vector B_k = [b₁, b₂]^T is the weighting coeﬃcients associated with the kth rule for the vector u(t). As we know that a large prediction error can take place around an edge. Besides, an edge among image pixels during the coding process can be regarded as a step command in control system.

Moreover, we ﬁnd both the predictive coding and a control system have the same objective to minimize the diﬀerence between the desired and the actual output, i.e., the so-called error signal. These observations lead to the motivation of enhancing the prediction result with control technologies. For this, the term B_ku(t), which is called “P-controller” in control system, is applied in (5.6) for prediction error suppression. It should be noted that the proposed P-controller compensator is quite diﬀerent to the so-called “error modeling” or “bias cancelation technique” in [14], [19], and both of which can be applied jointly for further reﬁnement of prediction errors.

The output of the TS-FNN system is then a linear combination of the consequent part. That is, the ﬁnal output of the predictor network is a weighted summation of the individual output of the rules in layer3 (Fig. 5.2), and is given by

y(t) = net⁽⁴⁾ =

As can be seen in (5.7), the output of the proposed TS-FNN based pre-dictor is implemented in an un-normalized manner which has the advantages of a faster training rate and a much simpler input/output sensitivity equa-tion [41], [44], [45]. In addiequa-tion, it should be noted again that all the pixels

used are normalized by 255 before they are feed into the proposed predictor network. Therefore, the predictor output should be scaled by 255 this time and bounded in the range of [0, 255] as the prediction value.

在文檔中具邊界前瞻之非失真影像預測編碼技術 (頁 88-93)