未來研究方向

第五章結論與建議

第二節未來研究方向

本研究所提出之以題組結構為基礎之適性測驗選題策略已有不錯的成效，未來可發展的建議如下：

壹、可實驗不同單元並評估成效

因本研究限於人力、物力所以只針對其中一個單元，將來可嘗試運用在其他單元上，看成效如何。

貳、擴充更多系統模組

因為目前僅有單純的系統功能，若將來要做大量的試題管理或更新試題的功能並非相當完整，倘若可以擴充更多更人性化的模組，在系統的使用上會更方便。

參、多媒體試題的開發

目前系統內的試題是以文字與圖形為主，可以考慮設計多媒體試題，以增加試題型態的趣味性。

肆、可研發多點防猜機制

研究者在分析學生的作答情形後發現：在同一份試卷中，部分低分數學生的電腦適性測驗成績，遠高過全測之分數，表示電腦適性測驗的預測精準度太低，這樣的差異結果值得加以探討並設法預防，以加強適性測驗系統之預測精準度。

伍、使其符合系統建置標準

因應學習標準化的趨勢，可改良系統使其符合SCORM、QTI規格，以利日後題庫資源的交換使用。

陸、可結合貝氏網路進行推論

因為目前的選題策略是依據試題的上下位關係來做判斷的，將來也可結合貝氏網路進行推論，這樣可以得到更多的訊息。

這個系統到目前為止並不十分完善，還有許多改良的空間。而且在我國的教育圈中無論是教育政策或教材都是不斷的在變化與更新，就跟本系統一樣，需要不斷的維護與開發，才能真正達到這個研究欲完成的目標。

參考文獻

王德蕙(2006)。題組(testlet)測驗分數信度估計方法之比較~以閱讀理解測驗為例。國立臺南大學測驗統計研究所碩士論文。

白曉珊、劉育隆、郭伯臣、施慶麟 (2006)。電腦化適性診斷測驗與適性補救教學模式之研發─以「整數四則」單元為例。2006 年第七屆海峽兩岸心理與教育測驗學術研討會，2006 年 10 月 28~29 日，國立政治大學。

何政翰 (2004)。國小數學領域電腦適性化測驗系統之建製。國立台中師範學院教育測驗統計研究所碩士論文。

余民寧（2003）。教育測驗與評量－成就測驗與教學評量。台北：心理出版社。

林文質（2005）。以多點計分試題結構為基礎的電腦適性測驗演算法。台中健康暨管理學院資訊工程學系研究所碩士論文。

林立敏、白曉珊、郭伯臣、劉育隆 (2006)。數位個別指導教材研發與適性補救教學模式之研究－以國小五年級數學「因數與倍數」單元為例。TANET2006 台灣區網際網路研討會，2006 年 11 月 3 日，花蓮教育大學。

洪碧霞、吳裕益（1996）。國民小學數學標準參照測驗編製。台南：台南師院測驗發展中心。

胡豐榮（2001）。SS 分析法的基本特性與數學性質介紹。測驗統計簡訊 43 期，

17-31 頁。台中市。台中師範學院。

莊惠萍、劉育隆、郭伯臣、曾彥鈞（2006）。電腦化適性測驗與數位個別指導整合教學模式之研發。「台灣教育傳播暨科技學會」2006 年學術研討會。2006 年12 月 16 日，國立台灣師範大學。

許元（1998）。資訊系統分析、設計與製作。台北市：松崗。

許志毅（2004）。國小數學領域電腦化適性診斷測驗及補救教學系統之內容開發及試用—以「扇形」單元為例。國立台中師範學院教育測驗統計研究所碩士論文。

張大鈞（2001）。互動式線上學習系統發展之研究－以微處理機課程為例。國立

彰化師範大學工業教育研究所碩士論文，未出版，彰化縣。

郭伯臣（2003）。國小數學科電腦化適性診斷測驗(I)。國科會研究專案報告 NSC-91-2520-S-142-001。

郭伯臣（2004）。國小數學科電腦化適性診斷測驗(II)。國科會研究專案報告 NSC-92-2521-S-142-003。

郭伯臣（2005）。國小數學科電腦化適性診斷測驗(III)。國科會研究專案報告 NSC-93-2521-S-142-004。

郭伯臣、謝友振、張峻豪、蔡坤穎（2005）。以結構理論為基礎之適性測驗與適性補救教學線上系統。台灣數位學習發展研討會，2005 年 5 月 6-7 日，國立台灣師範大學。

黃珮璇、王暄博、郭伯臣、劉湘川（2006）。國小數學科電腦化適性診斷測驗強韌性探究。2006 年電腦與網路科技在教育上的應用研討會。2006 年 3 月 23-24 日，國立新竹教育大學。

黃朝恭（2000）。國民小學國語科多媒體線上測驗系統建置之相關研究。臺中師範學院教育測驗統計研究所，未出版，台中市。

曾彥鈞（2007）。以知識結構為基礎的適性診斷測驗系統及降低猜測機制之研發。

國立台中教育大學數學教育學系碩士班碩士論文。

曾彥鈞、劉育隆、郭伯臣、楊智為（2006）。以知識結構為基礎之適性化診斷測驗系統建置，TANET2006 台灣區網際網路研討會。2006 年 11 月 1-3 日，

花蓮教育大學。

楊智為、張雅媛、郭伯臣、許天維（2006）。以試題結構理論為基礎之適性測驗選題策略強韌性探究。2006 數位科技與創新管理國際研討會，華梵大學，

2006 年 4 月 1 日。

趙琬津(2006)。數位個別指導模式與教材研發-以「三角形」單元為例。國立台中教育大學教育測驗統計所碩士論文。

劉湘川（2003）。混合型語義結構分析之研究。測驗統計年刊 11 輯，16 頁。台

中市。台中師範學院。

劉湘川、楊志良（2003）。態度問題關聯結構分析方法之發展--以健保態度問題為例。第六屆工程科技與中西醫學應用研討會。台中縣。台中健康暨管理學院。

劉湘川、簡茂發（2004）。混合型態度問題關聯結構分析。第六屆兩岸心理與教育測驗學術研討會。中國測驗學會。陜西師範大學。

蔡昆穎（2004）。國小數學領域電腦化適性診斷測驗及補救教學系統之內容開發及試用─以「擴分、約分」單元為例。國立台中師範學院教育測驗統計研究所碩士論文。

盧炎成(2006)。個別化數位補救教學模式之成效以「小數的除法」單元為例。國立台中教育大學教育測驗統計所碩士論文。

竹谷誠（1987）。評定尺度ﾗﾞ一ﾀの意味分析法。日本行動計量學會誌。14，2，

10-17。

Allen, S., & Sudweeks, R. R. (2001, April). Identifying and managing local item dependence in context-dependent item sets. Paper presented at the Annual Meeting of the American Educational Research Association, Seattle, WA.

Appleby, J., Samules, P., &Treasure-Jones, T. (1997). Diagnosys: A knowledge-based diagnostic test of basic mathematical skills. Computers & Education, Vol.28, No.2, pp.113-131.

Airasian, P.W., & Bart, W.M. (1973). Ordering Theory: A new and useful measurement model. Journal of Educational Technology, Vol. 5. pp.56-60.

Bart, W.M., & Krus, D.J. (1973 ). An ordering theoretic method to determine hierarchies among items. Educational and Psychological Measurement, 33, pp.291-300.

Bunderson, C. V., Inouye, D. K., & Olsen, J. B. (1989). The four generations of computerized educational measurement. In R. L. Linn (Ed.), Educational measurement (3rd ed.) (pp. 367-407). New York: Macmillan.

Brown, J.S. and Burton, R.(1978) “Diagnostic models for procedural bugs in basic mathematical skills”, Cognitive Science, 2:155-192.

Carmines, E. G., & Zeller, R. A. (1979). Reliability and validity assessment. Beverly Hills,CA：Sage.

Chang,K.E.,Liu, & Chen, S.W. (1998). A testing system for diagnosing misconceptions in DC electric circuits. Computers & Education,31, pp .195-210.

Crehan, K. D., Sireci, S. G., Haladyna, T. M., & Henderson, P. A. (1993, April). A comparison of testlet reliability for polytomous scoring methods. Paper presented at the Annual Meeting of the American Educational Research Association, Atlanta, GA.

Ebel, R. L. (1951). Writing the test item. In E. F. Lindquist (Ed.), Educational Measurement(pp.185-249). Washington, DC: American Council on Education.

Gessaroli, M. E., & Folske, J. C. (2002). Generalizing the reliability of tests comprised of testlets. International Journal of Testing, 2(3&4), 277-295.

Haladyna, T. M. (1992). Context-dependent item sets. Educational Measurement:

Issues andPractice, 11(1), 21-25.

Lee, G. (2000a). A comparison of methods of estimating conditional standard errors of measurement for testlet-based test scores using simulation techniques. Journal of Educational Measurement, 36(2), 91-112.

Lee, G. (2000b). Estimating conditional standard errors of measurement for tests composed of testlets. Applied Measurement in Education, 13(2), 161-180.

Lee, G., Brennan, R. L., & Frisbie, D. A. (2000). Incorporating the testlet concept in test score analyses. Educational Measurement: Issues and Practice, 19, 9-15.

Lee, G., & Frisbie, D. A. (1999). Estimating reliability under a generalizability theory model for test scores composed of testlets. Applied Measurement in Education, 12(3), 237-255.

Lee, Y. W. (2002, August). Score reliability of a test composed of passage-based testlets: A generalizability theory perspective. Paper presented at the

International Conference of the Korea English Education Society, Chungbuk, Korea.

Takeya（1999）Structure analysis methods for instruction, Takushoku University Press, Hachioji, Tokyo, Japan.

Takeya (1991) New item structure theorem. Tokyo: Waseda University.

VanLehn K.(1988) “Student models. In Polson M.C. & Richardson J.J. (eds.)”, Foundations of intelligent tutoring systems. Lawrence Erlbaum. Hillsdale.

Wainer, H. et al. (Eds.). (1990). Computerized adaptive testing: A primer. Hillsdale, NJ:

Lawrence Erlbaum Associates.

Wainer, H. (2000), “Computerized adaptive testing: A primer (2nd ed.).Hillsdale”, NJ:

Lawrence Erlbaum Publishers.

Wainer, H., & Kiely, G. L. (1987). Item clusters and computerized adaptive testing: A case for testlets. Journal of Educational Measurement, 24(3), 185-201.

Wainer, H., & Lewis, C. (1990). Toward a psychometrics for testlets. Journal of Educational Measurement, 27(1), 1-14.

Wenger, E.(1987)“Artificial Intelligence and Tutoring Systems. Morgan Kaufmann”, Los Altos, CA 94022.

Wesman, A. G. (1971). Writing the test item. In R. L. Thorndike (Ed.), Educational measurement (2nd ed., pp. 81-129). Washington, DC: American Council on Education.

Yen, W. M. (1993). Scaling performance assessments: Strategies for managing local item dependence. Journal of Educational Measurement, 30(3), 187-213.

在文檔中題組式適性診斷測驗系統之建置 (頁 55-62)

第五章 結論與建議

第二節 未來研究方向

參考文獻

第五章結論與建議

第二節未來研究方向