• 沒有找到結果。

結論與未來研究方向

7.1 結論

本研究提出對於關聯式資料庫之關鍵字查詢處理技術,是將查詢關鍵字進行 拆解,拆解成一至多個查詢條件並且分組。接著透過給定設計查詢條件單位以邏 輯運算子組合運算的優先順序指定括弧位置,產生最可能表達使用者給予關鍵字 查詢的搜尋意圖的條件組合,最後再轉換成所對應的 SQL 查詢語句,並且執行於 關聯式資料庫。此外,我們根據該查詢條件所得到的查詢結果,依統計觀點提供 具有一致性和分佈變化量大幅改變的特殊資料摘要呈現給使用者。

實驗中評估本研究所提出方法能處理的關鍵字查詢之轉換正確率,透過控制 型和非控制型的實驗結果得到,在多數情況下能正確處理查詢型態為在單一屬性 值、資料型態和屬性值組合、數值條件式的查詢。在查詢結果摘要的部分,所提 出根據一致性和分佈變化差異的摘要方法,對於使用者得知隱含在查詢結果的重 要資訊也是有幫助。

7.2 未來研究方向

本研究自動將關鍵字和數值條件式描述的查詢轉換成結構化查詢語言的

SQL 語句時,只考慮 基本資料庫查 詢條件句型 ,未來可增 加像是 聚合 函數

(aggregation functions),例如:Avg、Count、Min、Max 等,可提供更多樣化的查 詢功能。

此外,現階段並沒有考慮查詢關鍵字和資料庫綱目或資料庫內容上,文字不 同卻有語意關聯接近的情形。若能利用一個外部參考資源去輔助處理語意,查詢 關鍵字和資料庫綱目比對或是查詢關鍵字和資料庫內容比對時,除了字詞上完全 比對之外,還可處理使用者下查詢關鍵字中所對應相似字的詞彙,能更有彈性甚 至找出更多相關性結果。也可先行收集使用者過往所輸入過的查詢關鍵字,利用 查詢記錄檔(Query Log)去得知使用者使用查詢關鍵字的使用習慣,了解使用者輸 入查詢關鍵字時,通常最可能所表達的意圖為何,對於查詢意圖上的分析能更有 幫助。

在查詢結果摘要部份,則可考量以其他方法觀點做摘要,不僅只從查詢結果 的統計資訊做分析。依照本論文所提出之處理方式,得知查詢結果後,再開始統 計該查詢結果的各個相關統計資訊,需要處理時間而無法立即呈現給使用者。若 所需要的參考資訊能從事先建立的資訊去做分析,對於摘要即時呈現的角度而言,

才可以進一步提昇摘要時的處理效率。

參考文獻

[1] B. Aditya, G. Bhalotia, S. Chakrabarti, A. Hulgeri, C. Nakhe, Parag,and S.

Sudarshan, “Banks: Browsing and keyword searching in relational databases,” in Proceedings of the 28th international conference on Very Large Data Bases (VLDB), 2002.

[2] S. Agrawal, S. Chaudhuri, and G. Das, “DBXplorer: A system for keyword-based search over relational databases,” in Proceedings of the 18th International

Conference on Data Engineering (ICDE), 2002.

[3] V. Bicer, T. Tran and R. Nedkov, “Ranking Support for Keyword Search on Structured Data using Relevance Models,” in Proceedings of the 20th ACM international conference on Information and knowledge management (CIKM), 2011.

[4] A. Chapman, A. Elkiss, M. Jayapandian, Y. Li, A. Nandi and C. Yu, “Making database systems usable,” in Proceedings of ACM international conference on Management of data (SIGMOD), 2007.

[5] S.Cheng, A.Termehchy and Vagelis Hristidis, “Predicting the Effectiveness of Keyword Queries on Databases,” in Proceedings of the 21st ACM international conference on Information and knowledge management (CIKM), 2012.

[6] J. Coffman and A. C. Weaver, “A framework for evaluating database keyword search strategies,” in Proceedings of the 19th ACM international conference on Information and knowledge management (CIKM), 2010.

[7] M. Drosou and E. Pitoura, “ReDRIVE: result-driven database exploration through recommendations,” in Proceedings of the 20th ACM international conference on Information and knowledge management (CIKM), 2011.

[8] S. Fakhraee and F. Fotouhi, “DBSemSXplorer: Semantic-based Keyword Search System over Relational Databases for Knowledge Discovery,” in Proceedings of the 3rd International Workshop on Keyword Search on Structured Data (KEYS), 2012.

[9] K.Golenberg, B. Kimelfeld and Y. Sagiv, “Keyword proximity search in complex data graphs,” in Proceedings of ACM international conference on Management of data (SIGMOD), 2008.

[10] V. Hristidis, L. Gravano, and Y. Papakonstantinou, “Efficient IR-Style keyword search over relational databases,” in Proceedings of the 29th international conference on Very Large Data Bases (VLDB), 2003.

[11] V. Hristidis and Y. Papakonstantinou, “DISCOVER: Keyword search in relational databases,” in Proceedings of the 28th international conference on Very Large Data Bases (VLDB), 2002.

[12] Y. Huang , Z. Liu and Y. Chen, “Query biased snippet generation in XML search,”

in Proceedings of the 2008 ACM SIGMOD international conference on Management of data (SIGMOD), 2008.

[13] A. Hulgeri and C. Nakhe, “Keyword Searching and Browsing in Databases using BANKS,” in Proceeding of the 18th International Conference on Data Engineering (ICDE), 2002.

[14] X. Liu, H. Fang, C. L. Yao and M. Wang, “Finding Relevant Information of Certain Types from Enterprise Data,” in Proceedings of the 20th ACM

international conference on Information and knowledge management (CIKM), 2011.

[15] Z. Liu, P. Sun and Y. Chen, “Structured search result differentiation,” in Proceedings of the VLDB Endowment Volume 2 : 313-324, Aug. 2009.

[16] Y. Mass and Y. Sagiv, “Language Models for Keyword Search over Data Graphs,”

in Proceedings of the fifth ACM international conference on Web search and data mining (WSDM), 2012.

[17] R. Patil and Z. Chen, “STRUCT: Incorporating Contextual Information for English Query Search on Relational Databases,” in Proceedings of the 3rd International Workshop on Keyword Search on Structured Data (KEYS), 2012.

[18] J. Pound, A. K. Hudek, I. F. Ilyas and G. Weddell, “Interpreting keyword queries over web knowledge bases,” in Proceedings of the 21st ACM international

conference on Information and knowledge management (CIKM), 2012.

[19] J. X. Yu, L. Qin and L. Chang, “Keyword search in relational databases: A survey,”

in in Proceedings of the 26th International Conference on Data Engineering, 2010.

[20] Z. Zeng, Z. Bao, T. W. Ling and M. L. Lee, “ISearch: An Interpretation based Framework for Keyword Search in Relational Databases,” in Proceedings of the 3rd International Workshop on Keyword Search on Structured Data (KEYS), 2012.

[21] L.Zhang, Y.Zhang, Y.Chen, “Summarizing Highly Structured Documents for Effective Search Interaction,” in Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval (SIGIR), 2012.

附錄 A 查詢控制型實驗搜尋意圖描述

IMDB 查詢控制型 搜尋意圖描述 搜尋意圖描述 1

電影名稱為 Seven Pounds

搜尋意圖描述 2

Steven Spielber 導過(direct)的電影

搜尋意圖描述 3

George 2008 年之後擔任攝影師(cinematographer)過的電 影,或 Julia Roberts 擔任女演員(actress)的在 2010 年之後 的電影

搜尋意圖描述 4

演員 Will smith 或是電影劇中角色(character) wise jr. 的電 影

搜尋意圖描述 5

Will smith 在 2008 或 2010 年演過的電影

搜尋意圖描述 6

allen , Dreyfus James, audrey 演過的電影

搜尋意圖描述 7

男演員(actor) allen, Dreyfus James , 女演員(actress) Audrey 演過的電影

搜尋意圖描述 8

電影 Godfather 的演員(actor)或女演員(actress)

DBLP 查詢控制型 搜尋意圖描述

搜尋意圖描述 1

作者(author) Smith 在 SIGMOD 發表過的 Jorunal paper

搜尋意圖描述 2

Smith 在 SIGMOD 發表過的 Journal Paper 或是