Chapter 4. Experiments and Results…
4.2. Simulation data
4.2.3. Preprocessing…
In this part, we generate original training data set taken from the uniform distribution in (-1, 1). Then we use tangent sigmoid as a function to transfer the original training data set to a new one which is Gaussian-based. We make an assumption about whether the training data set is center distribution or more uniform will make better analyzed performance. They will give some illustration in following figures.
Figure 4.22 and 4.23 represent the original training data set X and the new training data set 'X which is transferred by tangent function respectively.
Figure 4.22 The original training data setX
45
Figure 4.24 RMSE as a function of FWHM under SCSP
Figure 4.25 Correlation coefficient as a function of FWHM under SCSP
Chapter 5 Discussion
For regularization concept, almost all inverse problem methods involve a trade-off between two optimizations:agreement between data and solution, and smoothness of the solution. We define that the unconstrained minimum of agreement and the unconstrained minimum of smoothness is the best solution. Figure 5.1 will give you a brief thought about that. Here, we have a question for how to define or find out the location of the best solution between “Best smoothness” line and “Best agreement” line.
Figure 5.1 Where is the best solution
The estimated criterion RMSE and correlation coefficient would involve a trade-off relationship. In our data experiment results, we hope the RMSE is low and correlation coefficient is high to verify our proposed method. So, we need some
47
verification to explain this problem. We make a assumption that our proposed method and PLS may have different curves as shown in Figure 5.2. In further study, we will have a fundamental proof for this issue.
Figure 5.2 Trade-off curves of Bayesian-based PLS and PLS
The preprocessing result, we transfer the original data set to Gaussian form to examine whether the performance is better or not. We make different widths for FWHM to verify our proposed method. But we could obviously find out the hypothesis for data preprocessing doesn’t accomplish to our expectation. The results after preprocessing might be influenced by the limitation of tangent function. The data after tangent function transferring may be divergent so that the analyzed results would be affected for this reason.
The local and global minimum problem is another issue we concern. We would like to find the best solution to approximate nearly global minimum.
Chapter 6
Conclusions and Future works
6.1 Conclusions
We have established a probability based analyzed method which combines the advantages of regularization and the properties of PLS for a novel calibration model.
The proposed method, Bayesian-based PLS, is able to reduce the noise signal hidden in the training data. And it has better analyzed results than original PLS method when training data accompanying noise signal during calibration phase. So we can apply our method to on-line analyzed system for further application.
6.2 Future works
In data preprocessing issue, we might to make tries for other kinds of transfer function (e.g., arcsine function) to make sure the data divergent problem and improve the limitation of transformation accuracy to obtain better performance for further study. The track of best solution between the agreement and smoothness is our next objective to achieve. Then, we also consider to make the results approximated to the global minimum so that we can apply the proposed method for weights initialization of backpropogation network. There still have another issue we have to take into account. The selection of appropriate prior would probably affect the analyzed result.
So we need to make a study about the prior probability to make sure that we don’t have a bad or wrong one.
49
References
[1] Hsiao TC, Lin CW, Chiang HH, “Partial least squares algorithm for weights initialization of the back-propagation network”, Neurocomputing, vol.
50, pp. 237-247, 2003.
[2] Chen S, Chng ES, Alkadhimi K, “Regularized orthogonal least squares algorithm for constructing radial basis function networks”, International Journal of Control, vol. 64, pp. 829-837, 1996.
[3] Chang SH, Chiou YJ, Yu C, Lin CW, Hsiao TC, “A Novel Multivariate Analysis Method with Noise Reduction”, 4th European Congress for Medical and Biomedical Engineering, 2008.
[4] MacKay DJC, “Bayesian interpolation”, Neural Computation, vol. 4, pp.
415-447, 1992.
[5] Bhandare P, Mendelson Y, Peura RA, Janatsch G, Kruse-Jarres JD, Marbach R, Heise HM, “Multivariate determination of glucose in whole blood using
partial least-squares and artificial neural networks based on mid-infrared spectroscopy”, Applied Spectroscopy, vol. 47, pp. 1214-1221, 1993.
[6] Möcks J, Verleger R, “Multivariate methods in biosignal analysis: application of principal component analysis to event-related”, Techniques in the behavioral
and neural sciences, vol. 5, pp. 399-458, 1991.
[7] Castellanos G, Delgado E, Daza G, Sanchez LG, Suarez JF, “Feature Selection in Pathology Detection using Hybrid Multidimensional Analysis”, Proceedings of International Conference of EMBS, pp. 5950-5953, 2006.
[8] Oja E, “A simplified neuron model as a principal component analyzer”, Journal of Mathematics and Biology, vol. 15, pp. 267-273, 1982.
[9] Harald M, Tormod N, “Multivariate Calibration”, 2nd Edition, John Wiley &
Sons, Great Britain, 1996.
[10] Huang KY, “Neural Networks and Pattern Recognition”, 2nd Edition, 維科圖書 有限公司, 2003.
[11] Oja E, Karhunen J, “Recursive construction of Karhunen-Loeve expansions for pattern recognition purposes”, Proceedings of 5th Int. Conf. on Pattern Recognition, pp. 1215-1218, 1980.
[12] Hsiao TC, Lin CW, Zeng MT, Chiang Kenny HH, “The Implementation of Partial Lease Squares with Artificial Neural Network Architecture”, 20th Annual International Conference of the IEEE Engineering in Medicine Biology Society, vol. 3, pp. 1341-1343, 1998.
[13] Chen S, Cowan CFN, Grant PM, “Orthogonal least squares learning algorithm for radial basis function networks”, IEEE Transactions on Neural Networks, vol. 2, pp. 302-309, 1991.
[14] Press HW, Vetterling WT, Teukolsky SA, Flannery BP, “Numerical Recipes in C: the art of scientific computing”, 2nd Edition, Cambridge University Press, 1993.
[15] Orr MJL, “Regularization in the selection of radial basis function centers”, Neural Computation, vol. 7, pp. 606-623, 1995.
[16] Hertz J, Krough A, Palmer R, “Introduction to the Theory of Neural Computation”, Redwood city, California, USA, Addison-Wesley, 1991.
[17] Ham FM, Kostanic I, “A Neural Network Architecture for Partial Least Squares Regression with Supervised Adaptive Modular Hebbian Learning”, Neural, Parallel, Scientific Computation, vol. 6, pp. 35-72, 1998.
[18] Jeffreys H, “Theory of Probability”, Oxford University Press, 1939.
[19] Gull SF, “Bayesian inductive inference and maximum entropy”, Maximum Entropy and Bayesian Methods in Science and Engineering, vol. 1, pp. 53-74, 1988.