• 沒有找到結果。

第四章、 實驗

4.2 實驗 B

4.2.1 資料收集與實驗設計

資料的收集方法是在 PDA 上安裝 GPS 座標收集程式,並且走訪各地,

本篇論文實地前往蘭嶼,收集20930 筆資料,分布圖可見圖 4.2-1。收集完的 資料先進行zero-mean 的處理,之後隨機選出其中 80%的資料做為訓練集合,

20%的資料做為測試集合。之後再以本論文提出的方法,分別對東距(TE)和北

距(T )兩個依變數進行迴歸分析,找出東距和北距各別的迴歸公式。 N

(a) 蘭嶼地圖 (b) 實地接收資料

圖 4.2-1 蘭嶼 GPS 資料

小波迴歸的部分,實驗參數如所示表 4.2-1。所選用的母小波為墨西哥帽,

SRGP 的部份參數如表 4.2-2 所示,並採用兩種運算子集合

{

+,-,*,/

}

{

+,-,*,/,sin,cos,pow

}

。實驗的輸入則為三維資料,即 GPS 裝置所接收到的經 度(ψ)、緯度(λ)和高度(h)。輸出則為 2-D 平面的東距(TE)和北距(T )兩條迴N 歸公式。

表 4.2-1 Wavelet Regression 參數

WR 參數 設定值

母小波 墨西哥帽

a0 2

b0 0.015625 位子取樣 65

伸縮程度 7

門檻值選擇 前三名係數 表 4.2-2 SRGP 參數

SRGP 參數 設定值

停止條件1 Fitness

( )

⋅ ≥1.0

停止條件2 Max. Generation = 10000 停止條件3 1000 代 Fitness Value 毫無改進 族群大小 100

突變機率 0.01 最大樹節點各數 1023 適應度比重ω 1 0.7 適應度比重ω 2 0.3

運算子

{

+,-,*,/

}

{

+,-,*,/,sin,cos,pow

}

避免結果過於偏頗,相同參數的實驗重複十次。實驗分為三大部分,第一 階段訓練階段(Training Phase)是以隨機挑選出來的 80%資料集合進行迴歸計 算,所找出的公式再進行第二階段的驗證階段(Testing Phase)。將驗證完成的 迴歸公式,用虛擬的路線再次驗證,則是第三階段(Navigation Phase)。

4.2.2 實驗結果

實驗以同樣參數重複十次之後所得到的迴歸結果如圖 4.2-2 所示,實驗平 均後的數據如表 4.2-3 和表 4.2-4 所示。從數據上來看,東距和北距的迴歸誤 差都不大。這是由於蘭嶼本身就是個小島,再加上資料量有2 萬多筆,因此所 得到的迴歸結果精準度非常的高。

表 4.2-3 東距TE結果 Avg Miss Error 0.011625924 Abs AME 0.0567445 MSE 19.24798808 STDEV 3.214468702

表 4.2-4 北距T 結果 N Avg Miss Error 0.00033886 Abs AME 0.002516617 MSE 0.155121984

STDEV 0.2366143

圖 4.2-2 蘭嶼作標轉換迴歸結果

下圖 4.2-3 和圖 4.2-4 是東距和北距對應每一個輸入變數(經度、緯度、高 度)的迴歸結果。

296000 298000 300000 302000 304000 306000 308000 310000

121.48 121.5 121.52 121.54 121.56 121.58

(E) exTree

(a) 經度與東距

296000 298000 300000 302000 304000 306000 308000 310000

22 22.02 22.04 22.06 22.08 22.1

(E) exTree

(b) 緯度與東距

296000 298000 300000 302000 304000 306000 308000 310000

-100 0 100 200 300 400

(E) exTree

(c) 高度與東距 圖 4.2-3 東距迴歸結果

2432000 2434000 2436000 2438000 2440000 2442000 2444000

121.48 121.5 121.52 121.54 121.56 121.58

(N) exTree

(a) 經度與北距

2432000 2434000 2436000 2438000 2440000 2442000 2444000

22 22.02 22.04 22.06 22.08 22.1

(N) exTree

(b) 緯度與北距

2432000 2434000 2436000 2438000 2440000 2442000 2444000

-100 0 100 200 300 400

(N) exTree

實驗的第二階段以 20%的資料進行驗證階段(Testing Phase),東距迴歸公 式和北距迴歸公式表示在下表所示。

表 4.2-5 驗證階段結果 Avg Miss Error Abs AME 東距 0.003165313 0.218385085 北距 0.066605948 0.144373673

實驗第三階段導航驗證(Navigation Phase)在圖之中。由於蘭嶼外海的部份 並沒有資料點,找出的迴歸公式在外海部分誤差較為高,但也在1.2 以下,而 蘭嶼島之內的導航結果,精準度都在0.3 之內。

圖 4.2-5 導航驗證結果

從導航結果來看,符號式迴歸引擎所找出的座標轉換迴歸公式的確可以代 替傳統的複雜座標轉換公式。實驗結果

第五章、結論與討論

迴歸分析是希望透過數學方式找出資料變數之間的關係。透過迴歸分析可 以解釋變數之間的關聯,也可以藉此預測資料的走向。一對一的簡單迴歸可以 使用數學計算找出精準的迴歸公式,但是一對多的複迴歸是一個非常困難的問 題。透過符號式迴歸可以找出一條迴歸公式,而且這條迴歸公式可以寫成數學 形式,但是符號式迴歸的計算是非常花費時間的運算。

本論文提出結合小波迴歸以及基因規劃兩種技術來增進符號式迴歸引擎 的效能。基因規劃為基礎的符號式迴歸引擎因為演化過程會花費大量時間在尋 找資料趨勢,小波迴歸正好可以解決這問題。透過小波分析出資料的大方向趨 勢,再以逆轉換找到資料趨勢迴歸公式,以此作為基因規劃的初始條件。最後 再用基因規劃的演化找出更精準的迴歸公式。

從實驗結果可以知道,本研究的確可以找到準確的迴歸公式。小波迴歸分 析所得的迴歸公式可以找出資料的趨勢,用此提升基因規劃初始染色體的適應 度,可以讓基因規劃的運算資源集中在尋找更精準的迴歸方程式。 無論是在 人造資料之中或是真實世界中的 GPS 座標轉換的例子之中,都可以得到誤差 值非常低的迴歸公式。

由於小波分析在處理多維度資料時候,其計算量會和資料維度成次方成 長。同時演化式計算中最令人詬病的就是大量的計算時間,因此如何加速小波 分析的處理速度以及縮減基因規劃的演化時間是未來可以研究的方向。

參考文獻

[ 1 ] A.-K. Seghouane and S.-I. Amari, “The aic criterion and symmetrizing the kullbackvleibler divergence,” IEEE Transactions on Neural Networks, vol.

18, no. 1, pp. 97-106, 2007

[ 2 ] B. McKay, M. J. Willis, and G. W. Barton, “Using a tree structured genetic algorithm to perform symbolic regression,” in First International Conference on Genetic Algorithms in Engineering Systems: Innovations and Applications, GALESIA, vol. 414. Sheffield, UK: IEE, 12-14 Sept. 1995, pp.

487-492.

[ 3 ] B. Tresp, “Dynamic neural regression models,” Instituts fur Statistik der Ludwig-Maximilians-Universitat Munchen, Tech. Rep. 181, 2000.

[ 4 ] C. Ferreira, “Expression programming: A new adaptive algorithm for solving problems,” Complex Systems, vol. 13, no. 2, pp. 87-129, 2001.

[ 5 ] C. H. Kummell, “Reduction of Observation Equations Which Contain More Than One Observed Quantity,” The Analyst, vol. 6, no. 4, pp. 97-105, 1879.

[ 6 ] C. Zhou, W. Xiao, T. M. Tirpak, and P. C. Nelson, “Evolving accurate and compact classification rules with gene expression programming," IEEE Transactions on Evolutionary Computation, vol. 7, no. 6, pp. 519-531, 2003.

[ 7 ] C.-C. Lai and S.-H. Doong, “An Optimal material distribution system based on nested genetic algorithm,” IEICE Transaction on Information and Systems, vol. E87-D, no. 3, pp. 780-784,2004

[ 8 ] C.-D. Si, J.-J. Lian, Z.-H. Qie, S.-Q. Li, and X.-M. Wu, “Research on the genetic regression model of earth-rock dam safety monitoring and its application,” in Proceedings of 2005 International Conference on Machine Learning and Cybernetics, 18-21 Aug. 2005, pp. 2874-2879.

[ 9 ] D. A. Augusto and H. J. C. Barbosa, “Symbolic regression via genetic programming,” in VI Brazilian Symposium on Neural Network, 22-25 Jan.

2000, pp. 173-178.

[ 10 ] D. Beasley, D. R. Bull, and R. R. Martin, “A Sequential Niche Technique for Multimodal Function Optimization,” Evolutionary Computation, vol. 1, no.

2, pp. 101-125, 1993

[ 11 ] D. Castano and A. Kunoth, “Multilevel regularization ofwavelet based fitting of scattered data—Some experiments,” Numer. Algor., vol. 39, no. 1-3, pp.

81-96, 2005.

[ 12 ] D. Castano and A. Kunoth, “Robust Regression of Scatterd Data with Adaptive Spline-Wavelets,” IEEE Transactions of image processing, vol. 15, no. 6, 2006.

[ 13 ] D. Castano, “Adaptive scattered data fitting with tensor product spline-wavelets,” Ph.D. thesis, Universitat Bonn, Bonn, Germany, 2004.

[ 14 ] D. L. Donoho and I. M. Johnstone, “Adapting to unknown smoothness via wavelet shrinkage,” Journal of the American Statistical Association, vol. 90, no. 432, pp. 1200-1224, 1995

[ 15 ] D. L. Donoho and I. M. Johnstone, “Ideal spatial adaptation by wavelet shrinkage,” Biometrika, vol. 81, no. 3, pp. 425-455, 1994

[ 16 ] E. E. Korkmaz and G. Ucoluk, “A controlled genetic programming approach for the deceptive domain," IEEE Transactions on Systems, Man, and Cybernetics, PartB, vol. 34, no. 4, pp. 1730-1742, 2004.

[ 17 ] E. O. Costa and A. Pozo, “A (μ+λ) - GP algorithm and its use for regression problems,” in Proceedings of 18th IEEE International Conference on Tools with Artificial Intelligence, Nov. 2006, pp. 10 - 17.

[ 18 ] F. Yang, M. A. White, A. R. Michaelis, K. Ichii, H. Hashimoto, P. Votava, A.-X. Zhu, and R. R. Nemani, “Prediction of continental-scale evapotran-spiration by combining MODIS and ameriflux data through support vector machine,” IEEE Transactions on Geoscience and Remote Sensing, vol. 44, no.11, Part, pp. 3542-3461, 2006.

[ 19 ] Francis Galton, “Regression towards mediocrity in hereditary stature,”

Journal of the Anthropological Institute, vol. 15, pp. 246-263, 1886.

[ 20 ] H. Drucker, C. J. C. Burges, L. Kaufman, A. Smola, and V. Vapnik, “Support vector regression machines,“ in Advances in Neural Information Processing Systems, vol. 9. The MIT Press, p. 155, 1997.

[ 21 ] H.-J. Chou, “A GP-Based Symbolic Regression Engine for the Transformation of GPS Coordinates,” Master thesis, National University of Kaohsiung, Kaohsiung, 2007.

[ 22 ] H.-S. Oh and T. C.M. Lee, “Hybrid local polynomial wavelet shrinkage:

wavelet regression with automatic boundary adjustment,” Computational Statistics & Data Analysis, vol. 48, pp. 809-819, 2005.

[ 23 ] H.-S. Oh, P. Naveau, and G. Lee, “Polynomial boundary treatment for wavelet regression,” Biometrika, vol. 88, pp. 291-298, 2001

[ 24 ] Haar A., Zur Theorie der orthogonalen Funktionensysteme, Mathematische Annalen, 69, pp 331–371, 1910.

[ 25 ] I. D. Coope, “Circle fitting by linear and nonlinear least squares,” Journal of Optimization Theory and Applications, vol. 76, no. 2, pp. 381-388, 1993.

Fitting: Variable Bandwidth and Spatial Adaptation,” Journal of the Royal Statistical Society. Series B (Methodological), vol. 57, no. 2, pp. 371-394, 1995

[ 28 ] J. R. Koza, Genetic Programming: On the Programming of Computers by Means of Natural Selection. The MIT Press, 1992.

[ 29 ] J.-T. Chien and C.-H. Huang, “Aggregate a posteriori linear regression adaptation, “ IEEE Transactions on Speech and Audio Processing, vol. 14, no. 3, pp. 797-807, 2006.

[ 30 ] L. Lindbom, M. Sternad, and A. Ahlen, “Tracking of time-varying mobile radio channels .1. the wiener lms algorithm,” IEEE Transactions on Communications, vol. 49, no. 12, pp. 2207-2217, 2001.

[ 31 ] M. C. Jones, J. S. Marron and S. J. Sheather, “A Brief Survey of Bandwidth Selection for Density Estimation,” Journal of the American Statistical Association, vol. 91, no. 433, pp. 401-407, 1996.

[ 32 ] O. Reiersol, “Identifiability of a Linear Relation between Variables Which Are Subject to Error,” Econometrica, vo. 18, no. 4, pp. 375-389, 1950.

[ 33 ] P. Craven and G. Wahba, “Smoothing noisy data with spline functions,”

Numerische Mathematik, vol 31, no. 4, pp. 377-403, 1978.

[ 34 ] P. Naveau and H.-S. Oh, “Polynomial Wavelet Regression for Images With Irregular Boundaries,” IEEE Transactions on Image Processing, vol. 13, no.

6, pp. 773-781, 2004.

[ 35 ] R. Collobert and S. Bengio, “Support vector machines for large-scale regression problems,” Machine Learning Research, vol. 1, pp. 143-160, 2001.

[ 36 ] R. J. Jennrich, A Introduction to Computational Statistics: Regression Analysis. Englewood Cliffs, NJ: Prentice-Hall, 1995.

[ 37 ] S. C. Schwartz, “Estimation of Probability Density by an Orthogonal Series,”

The Annals of Mathematical Statistics, vol. 38, no. 4, pp. 1261-1265, 1967.

[ 38 ] S. G. Mallat, “A theory for Multiresolution Signal Decomposition: The Wavelet Representation,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 11, no. 7, pp. 674-693,1989

[ 39 ] S. G. Mallat, “Multiresolution approximations and wavelet orthonormal bases of L2(R),” Transactions of the American Mathematical Society, vol. 315, no.

1, pp. 69-87, 1989

[ 40 ] S. Gustafson, E. K. Burke, and N. Krasnogor, “On improving genetic programming for symbolic regression," in Proceedings of the 2005 IEEE Congress on Evolutionary Computation, vol. 1. IEEE Press, 2-5 Sep 2005, pp. 912-919.

[ 41 ] S. Liu and J. Wang, “A simplified dual neural network for quadratic programming with its KWTA application,” IEEE Transactions on Neural Networks, vol. 17, no. 6, pp. 1500-1510, 2006

[ 42 ] S.-H. Doong, C.-C. Lai, and C.-H. Wu, “Genetic subgradient method for solving location-allocation problems,” Applied Soft Computing, vol. 7, pp.

373-386, 2007.

[ 43 ] T. C. M. Lee and H.-S. Oh, “Automatic polynomial wavelet regression,”

Statistics and Computing, vol. 14, pp.337-341, 2004

[ 44 ] T. I. Aksyonova, V. V. Volkovich, and I. V. Tetko, “Robust polynomial neural networks in quantative-structure activity relationship studies,” Systems Analysis Modelling Simulation, vol. 43, no. 10, pp. 1331-1339, 2003.

[ 45 ] X. Xiao, Y. Li, and R. Mukkamala, “A model order selection criterion

with applications to cardio-respiratory-renal systems,“ IEEE Transactions on Biomedical Engineering, vol. 52, no. 3, pp. 445-453, 2005.

[ 46 ] Y. Liu and S. D. Brown, “Wavelet multiscale regression from the perspective of data fusion: new conceptual approaches,” Anal Bioannal Chem, vol. 380, pp. 445-452, 2004.

[ 47 ] Z. Li Wu, C. hung Li, J. K.-Y. Ng, and K. R. Leung, “Location estimation via support vector regression,” IEEE Transactions on Mobile Computing, vol. 6, no. 3, pp. 311-321, 2007.

[ 48 ] John Holland, “Adaptation in Natural and Artificial Systems,” Ann Arbor Univ. of Michigan Press, 1975.

[ 49 ] N. L. Cramer, “A representation for the Adaptive Generation of Simple Sequential Programs,” Proceedings of an International Conference on Genetic Algorithms and the Applications, pp. 183-187,

[ 50 ] Charles K, “An introduction to Wavelets,” San Diego Academic Press, 1992 [ 51 ] Joseph Fourier, “Théorie Analytique de la Chaleur”, Paris, 1822

相關文件