第五章 結論與建議
第二節 建議
本節依據研究結果及未完備之處,提供後續使用演算法於組卷研究者以下幾點 建議,做後續參考。
ㄧ、本模擬研究使用 MRCMLM 架構,探討不測驗長度與組卷設計之效果,而在學 術研究上另有其它新提出的理論模式,可供後續做組卷之相關研究。
二、演化式演算法有很多種方法,本研究使用 Kalyanmoy Deb 基因演算與粒子群演 算法,未來也可以嘗試比較不同的演算法求得最佳解。
三、本實驗設計使用複本測驗與BIB設計兩種,後續研究可嘗試使用不同情境組卷;
而實驗設計為模擬研究,所使用之試題與能力值皆在理想狀態下,未來後續研 究能使用實徵資料進行探討。
71
參考文獻
中文參考文獻
尹邦嚴、嚴亞男(2007)。使用粒子族群最佳化產生多目標平行試卷組題。暨南大 學資訊管理研究所碩士論文。
余民寧(1997)。教育測驗與評量-成就測驗與教學評量。台北:心理出版社。
余民寧(1997)。試題反應理論IRT及其應用。台北:心理出版社。
巫眇鴦(2002)。運用線性規劃於選題策略之研究。國立台南師範學院資訊教育研 究所碩士論文,台南市。
何榮桂(1994),電腦化題庫概述,測驗與輔導,126 期,2576-2577 頁。
凃柏原、吳玫娟(2011)。多向度試題反應模式在國中基測自然科試題分析之應用。
國立臺南大學測驗統計研究所碩士論文。
施淑娟(1997)。認知網路評量模式及其實例應用之研究-以「分數的加法」單元 為例。國立台中師範學院國民教育研究所碩士論文,台中市。
孫光天、陳岳宏、賴膺守、謝凱隆、陳新豐(1999)。使用貪婪演算法作為一有效 益之選題策略。中華民國八十八年全國計算機會議,A379-386,私立淡江大學。
孫光天、陳新豐、謝凱隆、蔡志煌(1999)。利用類神經網路技術於選題策略之研 究。第八屆電腦輔助教學研討會學生論文,25-36,私立逢甲大學。
孫光天、戴伯昌、賴膺守(1999)。利用基因演算法於選題策略之研究。全國計算 機會議(NCS’ 99)論文集。私立淡江大學。
孫光天、蔡淑燕(2003)。運用多區間動態規劃於選題策略之研究。國立臺南師範 學院資訊教育研究所碩士論文,台南市。
孫光天、程千芬、蔡淑燕(2003)。運用進階基因演算法於選題策略之研究。全國 計算機會議。
72
陳仁欽、王文中(2005,10 月)。多分題之多向度電腦適性測驗。發表於第四十四 屆心理學年會。桃園:中原大學。
陳柏熹、王文中(2000,9 月)。測驗組之題間多向度電腦化適性測驗。發表於中 華心理學會第三十九屆年會。台北:台灣師範大學。
陳柏熹、王文中(2000,12 月)。題間與題內多向度電腦化適性測驗。發表於 2000 年教育與測驗學術研討年會。台北:台灣師範大學。
Ackerman, T. (1989). An alternative methodology for creating parallel test forms using the IRT information function. The Annual Meeting of the National Council for Measurement in Education, San Francisco.
Ackerman, T. A. (1991). The use of unidimensional parameter estimates of
multidimensional items in adaptive testing. Applied Psychological Measurement, 13, 113-127.
Ackerman, T. A. (1992). A Didactic Explanation of Item Bias, Item Impact, and Item Validity From a Multidimensional Perspective. Journal of Educational Measurement, 29(1),
73
67-91.
Ackerman, T. A. (1994). Using Multidimensional Item Response Theory to Understand What Items and Tests Are Measuring. Applied Measurement In Education, 7(4), 255-278.
Ackerman, T. A. (1996). Graphical Representation of Multidimensional Item Response Theory Analyses. Applied Psychological Measurement, 20(4), 311-329.
Ackerman, T. A., Gierl, M. J., & Walker, C. M. (2003). Using Multidimensional Item Response Theory to Evaluate Educational and Psychological Tests. Education Measurement: Issues and Practice, 22(3), 37-53.
Adams, R. J., Wilson, M., & Wang, W.-C. (1997). The multidimensional random coefficientsmultinomial logit model. Applied Psychological Measurement, 21(1), 1-23.
Baker, F. B. (1992). Item Response Theory:Parameter Estimation Techniques. New York:Marcel-Dekker.
Baker, F. B., Cohen, A. S., & Barmish, B. R. (1988). Item characteristics of tests constructed by linear programming. Applied Psychological Measurement, 12(2), 189-199.
Birnbaum, A. (1968). Some latent trait model and their use in inferring an examinee’s ability. In F. M. Lord and M. R. Novick, Statistical theories of mental test scores, 17-20. Reading, Mass: Addison-Wesley.
Bock, R. D., & Aitken, M. (1981). Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm. Psychometrika, 46, 443-459.
Bock, R. D., & Gibbons, R., & Muraki, E.(1988)Full information item factor analysis.
Applied Psy-chological Measurement, 12,261-280
Bock, R. D. & Mislevy, R. J. (1982). Adaptive EAP estimation of ability in a
microcomputer environment. Applied Psychological Measurement, 6, 431-444.
Bramlette, M. F. (1991). Initialization, mutation and selection methods in genetic algorithms for function optimization. In R. K. Belewand L. B. Booker, eds.,
74
Proceedings of the Fourth International Conference on Genetic Algorithm, Morgan Kaufmann.
Burke, D. S., Grefenstette, J. J., De Jong, K. A., Wu, A. S., & Ramsey, C. L.(1998).
Putting more genetics into genetic algorithms. Evolutionary Computation, 6(4), 387-410.
Clark, D. P., & Russell, L. D. (1997). Molecular Biology- Made simple and fun. Illinois:
Cache River Press.58
Cool, L. L., & Hambleton, R. K. (1978). A comparative study of item selection methods utilizing latent trait theoretic models and concepts (Rep. No. 88). Amherst: School of Education, University of Massachusetts, Laboratory of Psychometric and
Evaluative Research.
Davis, L. (Ed.). (1991). Handbook of Genetic Algorithms. New York: van Nostrand Reinhold.
Deb, K. (2000). An efficient constraint handling method for genetic algorithms. Computer Methodsin Applied Mechanics and Engineering, 186(2-4), 311-338.
Deb. K. (2001). Genetic Algorithms for Optimization. (Rep. No. 2001002). India:
Department of Mechanical Engineering Indian Institute of Technology Kanpur, Kanpur Genetic Algorithms Laboratory(KanGAL).
De Gruijter, D. N. M. (1990). Test construction by means of linear programming. Applied Psychological Measurement, 14, 175-181.
Eberhart, R.C. and Kennedy, J. (1995). A new optimizer using particle swarmtheory.
Proc. Sixth International Symposium on Micro Machine and Human Science, Nagoya, Japan, pp.39-43.
Embretson, S. E., & Reise, S. P. (2000). Item response theory for psychologists.Mahwah, NJ:Lawrence Erlbaum Associates.
Hambleton, R. K., & Swaminathan, H. (1985). Item Response Theory: Principles and Applications. Hingham, MA: Kluwer, Nijhoff.
Hambleton, R. K., Swaminathan, H., & Rogers, H. J.(1991). Fundamentals of Item
75
Response Theory. California, NP:Sage.
Harvey, I. (1992). The SAGA cross: The mechanics of crossover for variable-length genetic algorithms. Parallel problem solving from nature, 2, 269-278. Amsterdam, NL: Elsevier.59
Hattie, J. (1981). Decision criteria for determining unidimensional and multidimensional normal ogive models of latent trait theory. Armidale, New South Wales, Australia:
The University of New England, Center for Behavioral Studies.
Holland, J. H. (1975). Adaptation in natural and artificial systems (2nd ed.: MIT Press, Cambridge, Massachusetts, 1992). Ann Arbor, Michigan: University of Michigan Press.
Hulin, C. L., Lissak, R.I., & Drasgow, F. (1982). Recovery of two- and three parameter logistic item characteristic curves: a Monte Carlo study. Applied Psychological Measurement, 6,249-260.
Kennedy, J. and Eberhart, R.C. (2001). Swarm Intelligence. Morgan Kaufmann Press.
Kubota, N., Fukuda, T., Arakawa, T., & Shimojima, K. (1997). Evolutionary Transition on Virus Evolutionary Genetic Algorithm. Proceedings of International Conference on Evolutionary Computation, 1997, 291-296.
Lord, F. M. (1977). Practical applications of item characteristic curve theory. Jaurnal of Educational Measurement, 14, 117-138.
Lord, F. M. (1980). Applications of item response theory to practical testing problems.
Hillsdale, NJ: Lawrence Eribaum Associates.
Lord and M.R. Novick(1968). Statistical theories of mental test scores. Addison-Wesley Reading,Massachusetts.
McDonald, R. P. (1999). Test theory: A unified treatment. Mahwah, NJ: Erlbaum.
McDonald, R. P. (2000). A basis for multidimensional item response theory. Applied Psychological Measurement, 24, 99-114.
Mitchell, M. (1996). An introduction to genetic algorithms. London: MIT Press.
76
Mullis, I. V.S., Martin, M. O., Ruddock, G. J., O`Sullivan, C. Y., Arora, A., & Erberber, E.(2005). TIMSS 2007 Assessment Frameworks. Chestnut Hill, MA Boston College.
Papadimitrion, C. H., & Steiglitz, K. (1982). Combinatorial Optimization: Algorithms and Complexity. Prentice-Hall, Inc., NJ:Englewood Cliffs.
Rasch, G. (1980). Probability models for some intelligence and attainment tests. Chicago:
The University of Chicago Press (Original edition published in 1960).
Reckase, M. D. (1985). The difficulty of test items that measure more than one ability.
Applied Psychological Measurement, 9, 401-412.
Reckase, M. D., & Mckinley, R. L. (1991). The discrimination power of items that measure more than one ability. Applied Psychological Measurement, 15, 361-373.
Reckase, M. D. (1997). A linear logistic multidimensional model for dichotomous item response data. In W. J. van der Linden & R. K. Hambleton (Eds.), Handbook of modern item response theory(pp. 271-286). New York: Springer-Verlag.
Reynolds, C. W. (1987). Flocks, herds and schools: a distributed behavioral model.
Computer Graphics, 21(4), 25-34.
Ribeiro Filho, J. L., & Treleaven, C. (1994). Genetic algorithm programming environment. IEEEComputer, 27(6), 28-43.
Sanders, P. F., & Verschoor, A. J. (1998). Parallel Test Construction Using Classical Item Parameters. Applied Psychological Measurement, 22(3), 212-223.
Segall, D. O. (1996). Multidimensional adaptive testing. Psychometrika, 61, 331-345.
Shih, C.-L. & Wang W.-C. (2007, July). Comparison of Item Selection Strategies in Multidimensional Computerized Adaptive Testing. Pacific Rim Objective Measurement Symposium, Taiwan.
Shi, Y. and Eberhart, R.C. (1998). A modified particle swarm optimizer. IEEE International Conference on Evolutionary Programming, Alaska, May 4-9.
Shi, Y. and Eberhart, R.C. (1999). Empirical study of particle swarm
optimization.Proceedings of the Evolutionary Computation 1999 Congress, Vol 3, pp.1945-1950.
77
Stocking, M. L., & Swanson, L. (1993a). A method for severely constrained item
selection in adaptive testing. Applied Psychological Measurement, 17(3), 277-292.
Stocking, M. L., & Swanson, L. (1993b). A model and heuristic for solving very large item selection problems, Applied Psychological Measurement, 17(2), 151-166.
Stocking, M. L. (1994). Three practical issues for modern adaptive testing item pools.
Educational Testing Service, Princeton, N. J.
Sun, K. T. (2000). A greedy approach to test construction problems. Proceedings of the National Science Council (Part D): Mathematics, Science, and Technology
Education, 11(2), 78-87.
Sun, K. T. (2002). A Genetic Algorithm for Parallel Test Forms (Rep. No. 00101).
Taiwan, Tainan:National Tainan Teachers College, AI & ICAI Lab.
Suganthan, P. N. (1999). Particle swarm optimizer with neighbourhood operator.Proc.
Congress on Evolutionary Computation, pp.1958-1962.
Sympson, J. B. (1978). A model for testing with the multidimensional items. In D. J.
Weiss (Ed.). Item response theory and computerized adaptive testing conference proceedings. MN: University of Minnesota press.
Sympson, J. B., & Hetter, R. D. (1985). Controlling item-exposure rates in computerized adaptive testing. Proceeding of the 27th annual meeting of the Military Testing Association (pp. 973-977). San Diego, CA: Navy Personnel Research and Development Center.
Vale, D.C., & Gialluca, K. A. (1988). Evaulation of efficiency of iem calibration. Applied Psychological Measurement, 12(1), 53-67. 61
van der Linden W. J. & Hambleton, R. K. (1997), Handbook of Modern Item Response Theory. (pp. 258-270). New York: Springer-Verlag press.
van der Linden, W. J. (1998). Optimal assembly of psychological and educational tests.
Applied Psychological Measurement, 22(3), 195-209.
van der Linden, W. J., & Reese, L. M. (1998). A model for optimal constrained adaptive testing. Applied Psychological Measurement, 22(3), 259-270.
78
79
附錄一、題庫訊息矩陣
題庫A在單峰分布之訊息矩陣
能力值區塊 -2 -1 0 1 2
-2 139.477 172.965 196.960 197.701 174.493
-1 171.873 205.361 229.357 230.097 206.889
0 193.013 226.501 250.497 251.237 228.029
1 191.342 224.831 248.826 249.566 226.359
2 167.914 201.402 225.398 226.138 202.930
題庫B在單峰分布之訊息矩陣
能力值區塊 1 2 3 4 5
1 195.511 217.967 219.110 201.187 173.715
2 218.543 241.000 242.142 224.220 196.748
3 219.273 241.729 242.871 224.949 197.477
4 199.863 222.320 223.462 205.539 178.067
5 170.930 193.386 194.529 176.606 149.134
題庫C在單峰分布之訊息矩陣
能力值區塊 1 2 3 4 5
1 93.432 121.785 150.919 169.872 168.504
2 121.539 149.892 179.026 197.979 196.611
3 150.162 178.515 207.650 226.602 225.234
4 168.421 196.774 225.908 244.861 243.493
5 166.382 194.735 223.870 242.822 241.454