A CASE STUDY: HYPERTENSION DETECTION
6.2 Further Research
As compared with neural network based method, L-J approach with combined kernel functions was observed to have a better performance. In addition, L-J method has the advantage on the basis of a single training run and is easier to compute for feature selection as compared with other SVM based methods. However, the computation speed is relatively slow when the kernel functions are complicated.
Hence, this subject is worth investigating in the future.
In addition to computation speed problems, the rules generalization is another important issue. Obviously, SVM based methods do not provide the ability of rule generalization although they have the strong foundation in statistical learning theory.
Most of the engineering or medical problems with regard to the rule generalization are still more exigent than feature selection for operators or non-experts. Hence, how to provide a function of rule generalization extend the SVM based method deemed an important issue in the interdisciplinary applications.
REFERENCES
1. Amaldi, E. and Kann, V., On the approximation of minimizing non zero variables or unsatisfied relations in linear systems, Theoretical Computer Science, 209, 237-260, 1998.
2. Aronszajn, N., theory of reproducing kernels, Transactions of the American Mathematical Society, 68(3), 337-404, 1950.
3. Bekkerman, R., El-Yaniv, R., Tishby, N. and Winter, Y., Distributional word cluster vs. words for text categorization, Journal of Machine Learning Research, 3, 1183-1208, 2003.
4. Bi, J., Bennett, K., Embrechts, M., Breneman, C. and Song, M., Dimensionality reduction via sparse support vector machines, Journal of Machine Learning Research, 3, 1229-1243, 2003.
5. Blake C.L. and Merz, C.J., UCI repository of machine learning databases, Dept.
Infor. Comput. Sci., Univ. California, Irvine, CA, 1998.
6. Breiman, L., Friedman, J.H., Olshen, R.A. and Stone, C.J., Classification and Regression Trees, Wadsworth and Brooks, 1984.
7. Caruana, R. and de Sa, V., Benefitting from the variables that variable selection discards, Journal of Machine Learning Research, 3, 1245-1264, 2003.
8. Chen, W., Bao, W., Begum, S., Elkasabany, A., Srinivasan, S.R. and Berenson, G..S., Age-related patterns of the clustering of cardiovascular risk variables of syndrome X from childhood to young adulthood in population made up of black and white subjects: the Bagalusa Heart Study. Diabetes, 49, 1042-1048, 2000a. 9. Chen, C.H., Lin, K.C., Tsai, S.T. and Chou, P., Different association of
hypertension and insulin-related metabolic syndrome between man and women in 8437 nondiabetic Chinese. American Journal of Hypertension, 13(7), 846-853, 2000b.
10. Chiang, C.F., A Feature Selection for Support Vector Machines, M.S. thesis, Tung Hai University, 2002.
11. Cortes, C. and Vapnik, V., Support-vector networks, Machine learning, 20(3), 273-297, 1995.
12. Cristianini, N. and Taylor J.S., An introduction to SVMs and other kernel-based learning methods. Combridge University Press, 2000.
13. Dhillon. I., Mallea, S. and Kumar, R., A divisive information-theoretic feature clustering algorithm for text classification, Journal of Machine Learning Research, 3, 1265-1287, 2003.
14. Dong, B., Cao, C. and Lee, S.E., Applying support vector machines to predict building energy consumption in tropical region, Energy and Buildings, 37, 545-553, 2005.
15. Dudoit, S., Fridlyand, J. and Speed, T., Comparison of discrimination methods for the classification of tumors using gene expression data, Journal of American Statistical Association, 97(457), 77-87, 2002.
16. Erwin, T., Reinhard, M., Karl, S. and Gilbert, R., Artificial neural networks compared to factor analysis for low-dimensional classification of high-dimensional body fat topography data of healthy and diabetic subjects, Computers and Biomedical Research, 33, 365-374, 2000.
17. Fayyad, U., Piatetsky-Shapiro, G. and Smyth, P., From data mining to knowledge discovery: An overview. In Fayyad, U., Piatetsky-Shapiro, G., Smyth, P. and Uthurusamy, R., editors, Advances in Knowledge Discovery and Data Mining, pages 495-515, AAAI Press / The MIT Press, 1996.
18. Guyon, I. and Elisseeff A., An introduction to variable and feature selection, Journal of Machine Learning Research, 3, 1157-1182, 2003.
19. Hammer, B. and Gersmann, K., A note on the universal approximation capability of support vector machines, Neural Processing Letters, 17, 43-53, 2003.
20. Haykin, S., Neural networks: a comprehensive foundation, New Jersey, 2nd ed., Prentice Hall, 1999.
21. Hermes, L. and Buhmann, J.L., Feature selection for support vector machines, Proc. of the International Conference on Pattern Recognition, 2, 716-719, 2000.
22. Jain, A.K., Duin, R.P.W. and Mao, J., Statistical pattern recognition: a review, IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(1), 4-37, 2000.
23. Jeppesen, J., Hein, H.O., Suadicani, P. and Gyntelberg, F. High triglycerdes and low HDL cholesterol and blood pressure and risk of ischemic heart disease.
Hypertension, 36, 226-232, 2000.
24. Joachims, T., The Maximum-Margin Approach to Learning Text Classifiers:
Methods, Theory, and Algorithms. PhD thesis, Universitaet Dortmund, 2000.
25. Kim, Y.I., Kim, C.H., Choi, C.S., Chung, Y.E., Lee, M.S., Lee, S.I., Park, J.Y., Hong, S.K. and Lee, K.U., Microalbuminuria is associated with the insulin resistance syndrome independent of hypertension and type 2 diabetes in the Korean population. Diabetes Research and Clinical Practice, 52, 145-152, 2001.
26. Kohavi, R. and John, G.., Wapper for feature selection, Artificial Intelligence, 97(1-2), 273-324, 1997.
27. LeCun, Y., Jackel, L. D., Bottou, L., Brunot, A., Cortes, C., Denker, J. S., Drucker, H., Guyon, I., Muller, U. A., Sackinger, E., Simard, P. and Vapnik, V., Comparison of learning algorithms for handwritten digit recognition, International Conference on Artificial Neural Networks, Fogelman, F. and Gallinari, P. (Ed.), 53-60, 1995.
28. Lin, C.J., Formulations of support vector machines: a note from an optimization
point of view, Neural Computation, 13(2), 307-317, 2001.
29. Lin, J.D., Chiou, W.K., Weng, H.F., Tsai, Y.H., & Liu, T.H. Comparison of three-dimensional anthropometric body surface scanning to waist-hip ratio and body mass index in correlation with metabolic risk factors. Journal of Clinical Epidemiology, 55, 757-766, 2002.
30. Liu, H. and Motoda, H., Feature selection for knowledge discovery and data mining, Norwell, MA: Kluwer Academic, 1998.
31. Lukas, L., Devos, A., Suykens, J.A.K., Vanhamme, L., Howe, F.A., Majos, C., Moreno-Torres, A., Van Der Graff, M., Tate, A.R., Arus, C. and Van Huffel, S., Brain tumor classification based on long echo proton MRS signals, Artificial Intelligence in Medicine, 31, 73-89, 2004.
32. Muller, K. R., Mika, S., Ratsch, G., Tsuda, K. and Scholkopf, B., An introduction to kernel-based learning algorithms, IEEE Transactions on Neural Networks, 12(2), 181-201, 2001.
33. Mykkanen, L., Haffner, S.M., Ronnemaa, T., Bennemaa, T., Bergman, R.N. and Laakso, M., Low insulin sensitive is associated with clustering of cardiovascular disease risk factors. American Journal of Epidemiology, 146, 315-321, 1997.
34. McNeely, M.J., Boyko, E.J., Shofer, J.B., Newell-Morris, L., Leonetti, D.L. and Fujimoto, W.Y., Standard definitions of overweight and central adiposity for determing diabetes risk in Japanese American, American Journal of clinical Nutrition, 74, 101-107, 2001.
35. Osuna E., Freund, R. and Girosi, F., Training support vector machines: an application to face detection, Proc. Computer Vision and Pattern Recognition ’97, 130-136, 1997.
36. Oyang, Y.J., Hwang, S.C., Ou, Y.Y., Chen, C.Y. and Chen, Z.W., Data classification with radial basis function networks based on a novel kernel density estimation algorithm, IEEE Transactions on Neural Networks, 16(1), 225-236, 2005.
37. Papadopoulos, A., Fotiadis, D.I. and Likas, A., Characterization of clustered microcalcifications in digitized mammograms using neural networks and support vector machines, Artificial Intelligence in Medicine, 34, 141-150, 2005.
38. Pardo, M. and Sberveglieri G.., Classification of electronic nose data with support vector machines, Sensors and Actuators B, 107, 730-737, 2005.
39. Quinlan, J.R., Induction of decision trees, Machine Learning, 1(1), 81-106, 1986.
40. Rakotomamonjy, A., Variable selection using SVM-based criteria, Journal of Machine Learning Research, 3, 1357-1370, 2003.
41. Reunanen, J., Overfitting in making comparisons between variable selection methods, Journal of Machine Learning Research, 3, 1371-1382, 2003.
42. Roth, V. and Steinhage., Nonlinear discriminant analysis using kernel functions. In:
Advances in neural information processing system, 12, 568-574, 2000.
43. Sanchez, V.D.A., Advanced support vector machines and kernel methods, Neurocomputing, 55, 5-20, 2003.
44. Schmidt, M., Identifying Speakers with Support Vector Networks, in Interface ’96 Proceedings, Sydney, 1996.
45. Scholkopf, B. and Smola, A.J., Learning with kernels: support vector machines, regularization, optimization, and beyond, Cambridge, Mass: MIT Press, 2002.
46. Scholkopf, B., Smola, A.J. and Muller, K.R., Nonlinear component analysis as a kernel eigenvalue problem, Neural Computation, 10(5), 1299-1319, 1998.
47. Scholkopf, B., Support Vector Learning, PhD Theiss, R. Oldenbourg Verlag, Munich, 1997.
48. Scholkopf, B., Burges, C.J.C. and Smola, A.J., Extracting support data for a given task. In U.M. Fayyad and R. Uthursamy, editors, Proceedings, First International Conference on Knowledge Discovery and Data mining, Menlo Park, 1995. AAAI Press.
49. Su, C.T., Hsu, H.H. and Tsai, C.H., Knowledge mining from trained neural networks. Journal of Computer Information Systems, 42(4), 61-70, 2002.
50. Tefas, A., Kotropoulos, C. and Pitas, I., Using support vector machines to enhance the performance of elastic graph matching for frontal face authentication, IEEE Transactions on Pattern Analysis and Machine Intelligence, 23(7), 2001.
51. Torkkola, K., Feature extraction by non-parametric mutual information maximization, Journal of Machine Learning Research, 3, 1415-1438, 2003.
52. Vapnik, V., The Nature of Statistical Learning Theory, Springer-Verlag, New York, 1995.
53. Vapnik, V., Statistical Learning Theory, Wiely: New York, 1998.
54. Wang, C.W., Index of Kernel Functions for Support Vector Machine, M.S. Thesis, National Cheng Kung University, 2003.
55. Wang, M.L., Li, W.J. and Xu, W.B., Support vector machines for prediction of peptidyl prolyl cis/trans isomerization, Journal of Peptide Research, 63, 23-28, 2004.
56. Wahba, G., An introduction to model building with reproducing kernel Hilbert
space, Computing Science and Statistics, 32,
/I2000Proceedings/GWahba/wahba_short.pdf.
57. Weston, J., Elisseff, A., Schoelkopf, B. and Tipping, M., Use of the zero norm with linear models and kernel methods, Journal of Machine Learning Research, 3, 1439-1461, 2003.
58. Yao, X.J., Panaye, A., Doucet, J.P., Chen, H.F., Zhang, R.S., Fan B.T., Liu M.C.
and Hu, Z.D., Comparative classification study of toxicity mechanisms using support vector machines and radial basis function neural networks, Analytica Chimica Acta, 535, 259-273, 2005.
59. Yeang, C.H., Ramaswamy, S., Tamayo, P., Mukherjee, S., Rifkin, R.M., Angelo, M., Reich, M., Lander, E., Mesirov, J. and Golub, T., Molecular classification of multiple tumor types, Bioinformatics: Discovery Note, 1(1), 1-7, 2001.
60. Zhu, Y., Li, C. and Zhang, Y., A practical parameters selection method for SVM, Springer-Verlag Berlin Heidelberg, 2004.
61. Zien, A., Ratsch, G., Mika, S., Scholkopf, B., Lengauer, T. and Muller, K.R., Engineering support vector machine kernels that recognize translation initiation sites, Bioinformatics, 16, 851-824, 2000.