Chapter 4. Experiments and Results…

4.2. Simulation data

4.2.3. Preprocessing…

In this part, we generate the original training data set by sampling from the uniform distribution on (-1, 1). We then apply the tangent sigmoid function to transform the original training data set into a new, Gaussian-based one. We examine the assumption that the distribution of the training data, concentrated around the center or closer to uniform, affects the analysis performance. The following figures illustrate this.

Figures 4.22 and 4.23 show, respectively, the original training data set X and the new training data set X' obtained by applying the tangent function.
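
The data generation step can be sketched as follows. This is only a minimal illustration under stated assumptions: "tangent sigmoid" is taken to mean the hyperbolic tangent, and the sample size, the `gain` parameter, the seed, and the helper names `make_datasets` and `histogram` are hypothetical choices that do not come from the text. How closely the transformed set resembles a Gaussian depends on the exact transfer function and scaling, which this section does not specify.

```python
import math
import random


def make_datasets(n_samples=1000, gain=1.0, seed=0):
    """Draw X from U(-1, 1) and map it through tanh (assumed transfer function)."""
    rng = random.Random(seed)
    x = [rng.uniform(-1.0, 1.0) for _ in range(n_samples)]
    # gain is a hypothetical scaling factor applied before the transfer function.
    x_new = [math.tanh(gain * v) for v in x]
    return x, x_new


def histogram(values, n_bins=10, lo=-1.0, hi=1.0):
    """Count how many values fall into each of n_bins equal-width bins on [lo, hi]."""
    counts = [0] * n_bins
    width = (hi - lo) / n_bins
    for v in values:
        idx = min(int((v - lo) / width), n_bins - 1)
        counts[idx] += 1
    return counts


x, x_new = make_datasets()
print("original    X :", histogram(x))
print("transformed X':", histogram(x_new))
```

Comparing the two histograms gives a quick check of how the transfer function reshapes the distribution before any calibration model is trained.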

Figure 4.22 The original training data set X

Figure 4.24 RMSE as a function of FWHM under SCSP

Figure 4.25 Correlation coefficient as a function of FWHM under SCSP

Chapter 5 Discussion

Regarding regularization, almost all inverse problem methods involve a trade-off between two optimizations: agreement between data and solution, and smoothness of the solution. We take the best solution to lie between the unconstrained minimum of agreement and the unconstrained minimum of smoothness, as sketched in Figure 5.1. The open question is how to define, or locate, the best solution between the "Best smoothness" line and the "Best agreement" line.
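
To make the trade-off concrete, the sketch below minimizes agreement plus a weighted smoothness term for a noisy one-dimensional signal. It is an illustration only, not the method of this thesis: the first-difference penalty, the weight `lam`, the synthetic sine signal, and the function names are all assumptions introduced here. With `lam = 0` the solution reproduces the data exactly (best agreement), and as `lam` grows the solution flattens toward a constant (best smoothness).

```python
import math
import random


def smooth(y, lam):
    """Minimise sum((u - y)^2) + lam * sum((u[i+1] - u[i])^2) over u.

    The normal equations (I + lam * D^T D) u = y are tridiagonal, so they
    are solved with the Thomas algorithm.
    """
    n = len(y)
    if n < 2:
        return list(y)
    # Diagonal of I + lam * D^T D: end points take part in one difference,
    # interior points in two.
    diag = [1.0 + lam * (1.0 if i in (0, n - 1) else 2.0) for i in range(n)]
    off = -lam  # constant sub- and super-diagonal entry
    c = [0.0] * n  # modified super-diagonal
    d = [0.0] * n  # modified right-hand side
    c[0] = off / diag[0]
    d[0] = y[0] / diag[0]
    for i in range(1, n):
        denom = diag[i] - off * c[i - 1]
        c[i] = off / denom
        d[i] = (y[i] - off * d[i - 1]) / denom
    u = [0.0] * n
    u[-1] = d[-1]
    for i in range(n - 2, -1, -1):
        u[i] = d[i] - c[i] * u[i + 1]
    return u


def agreement(u, y):
    """Squared misfit between solution and data (smaller is better agreement)."""
    return sum((a - b) ** 2 for a, b in zip(u, y))


def roughness(u):
    """Sum of squared first differences (smaller is smoother)."""
    return sum((u[i + 1] - u[i]) ** 2 for i in range(len(u) - 1))


rng = random.Random(0)
y = [math.sin(2 * math.pi * i / 50) + rng.gauss(0.0, 0.2) for i in range(50)]
for lam in (0.0, 0.1, 1.0, 10.0, 100.0):
    u = smooth(y, lam)
    print(f"lam={lam:6.1f}  misfit={agreement(u, y):8.4f}  roughness={roughness(u):8.4f}")
```

Sweeping `lam` traces a curve between the two extremes, and choosing a point on that curve is the question raised above.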

Figure 5.1 Where is the best solution

The two evaluation criteria, RMSE and correlation coefficient, may themselves involve a trade-off. In our experiments, we want a low RMSE and a high correlation coefficient to verify the proposed method, so further verification is needed to explain this issue. We assume that the proposed method and PLS may follow different trade-off curves, as shown in Figure 5.2. In further study, we will provide a fundamental proof for this issue.
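
For reference, the two criteria can be computed as in the sketch below. The definitions used here (mean over n samples for RMSE, Pearson correlation) are the common ones and are an assumption, since this section does not state the exact formulas; the sample values are made up for illustration.

```python
import math


def rmse(y_true, y_pred):
    """Root mean squared error between reference and predicted values."""
    n = len(y_true)
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / n)


def correlation(y_true, y_pred):
    """Pearson correlation coefficient between reference and predicted values."""
    n = len(y_true)
    mean_t = sum(y_true) / n
    mean_p = sum(y_pred) / n
    cov = sum((t - mean_t) * (p - mean_p) for t, p in zip(y_true, y_pred))
    var_t = sum((t - mean_t) ** 2 for t in y_true)
    var_p = sum((p - mean_p) ** 2 for p in y_pred)
    return cov / math.sqrt(var_t * var_p)


# Hypothetical values: a prediction with a constant offset of 1.0.
y_true = [1.0, 2.0, 3.0, 4.0]
y_biased = [t + 1.0 for t in y_true]
print("RMSE:", rmse(y_true, y_biased))
print("correlation:", correlation(y_true, y_biased))
```

The offset prediction keeps a correlation of 1 while its RMSE equals the offset, which shows that the two criteria measure different things and need not improve together.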

Figure 5.2 Trade-off curves of Bayesian-based PLS and PLS

For preprocessing, we transformed the original data set to Gaussian form to examine whether the performance improves, and we varied the FWHM width to verify the proposed method. However, the results clearly show that the preprocessing hypothesis did not meet our expectation. The results after preprocessing might be limited by the tangent function: the data after the tangent transformation may diverge, which would in turn affect the analysis results.
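
Assuming that FWHM here refers to the full width at half maximum of a Gaussian profile, the width relates to the standard deviation by FWHM = 2 sqrt(2 ln 2) sigma, roughly 2.355 sigma. The sketch below uses that relation to build a unit-height Gaussian from a given FWHM; the function names and the example widths are hypothetical and are not taken from the experiments.

```python
import math

# Conversion factor between FWHM and standard deviation of a Gaussian.
FWHM_FACTOR = 2.0 * math.sqrt(2.0 * math.log(2.0))


def fwhm_to_sigma(fwhm):
    """Standard deviation of a Gaussian with the given full width at half maximum."""
    return fwhm / FWHM_FACTOR


def gaussian(x, center, fwhm):
    """Unit-height Gaussian profile parameterised by its FWHM."""
    sigma = fwhm_to_sigma(fwhm)
    return math.exp(-0.5 * ((x - center) / sigma) ** 2)


# Example widths chosen only for illustration.
for fwhm in (0.5, 1.0, 2.0):
    half_point = gaussian(fwhm / 2.0, 0.0, fwhm)
    print(f"FWHM={fwhm:3.1f}  sigma={fwhm_to_sigma(fwhm):.4f}  value at FWHM/2={half_point:.4f}")
```

By construction, the profile falls to half of its peak value at a distance of FWHM/2 from the center, whatever width is chosen.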

The problem of local versus global minima is another concern. We would like to find a solution that approximates the global minimum as closely as possible.

Chapter 6 Conclusions and Future works

6.1 Conclusions

We have established a probability-based analysis method that combines the advantages of regularization with the properties of PLS to form a novel calibration model.

The proposed method, Bayesian-based PLS, is able to reduce the noise hidden in the training data. It gives better analysis results than the original PLS method when the training data are accompanied by noise during the calibration phase. We can therefore apply our method to on-line analysis systems in further applications.

6.2 Future works

For data preprocessing, we may try other kinds of transfer function (e.g., the arcsine function) to address the data divergence problem and to overcome the limited transformation accuracy, in order to obtain better performance in further study. Tracking the best solution between agreement and smoothness is our next objective. We also plan to bring the results closer to the global minimum so that the proposed method can be applied to weight initialization of a backpropagation network. One further issue must be taken into account: the choice of prior will probably affect the analysis result, so we need to study the prior probability to make sure that we do not use a poor or wrong one.
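
The sensitivity to the prior can be illustrated with a toy example. The sketch below is not the Bayesian-based PLS model; it assumes a single weight w in y = w x + noise, a zero-mean Gaussian prior with precision `alpha`, and Gaussian noise with precision `beta`, all of which are hypothetical choices made here for illustration. Under these assumptions the posterior mean of w has a closed form.

```python
import random


def posterior_mean(x, y, alpha, beta):
    """Posterior mean of w for y = w * x + noise.

    Assumes the prior w ~ N(0, 1 / alpha) and Gaussian noise with precision beta.
    """
    sxy = sum(a * b for a, b in zip(x, y))
    sxx = sum(a * a for a in x)
    return beta * sxy / (alpha + beta * sxx)


# Synthetic data with a known weight (values chosen only for illustration).
rng = random.Random(1)
true_w = 2.0
noise_std = 0.5
x = [rng.uniform(-1.0, 1.0) for _ in range(20)]
y = [true_w * a + rng.gauss(0.0, noise_std) for a in x]
beta = 1.0 / noise_std ** 2

for alpha in (1e-3, 1.0, 1e3):
    print(f"alpha={alpha:8.3f}  posterior mean of w={posterior_mean(x, y, alpha, beta):.4f}")
```

A very broad prior (small `alpha`) leaves the estimate close to the least-squares value, while an overly narrow prior (large `alpha`) pulls it toward zero regardless of the data, which is the kind of effect a poorly chosen prior could have on the analysis result.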

