執行時間 - 實驗結果與評估方法 - 實驗結果與分析

Chapter 5 實驗結果與分析

5.1 實驗結果與評估方法

5.1.1 執行時間

很直觀的，隨著要求的隱匿度愈高與想要隱匿的配對節點愈多，所需要的執 行時間也會愈長。圖 5.1 中，KMPN 的執行時間在一對節點（H1）中，隨著 k 值 的提升，上升趨勢也較 H2、H3 緩慢，圖 5.2 中 KMPA 亦是如此。在實驗的過程 中，若次短路徑無法滿足修改之條件（如：已無邊可選擇、可修改之權重小於必

須修改之權重等），則程式碼可能會不斷的尋找下一條次短路徑進行修改，在此情

況下，需要比較多的時間來完成修改條件。而圖 5.1 與圖 5.2 中，在 k 需求較小 時，無法比較 KMPN 與 KMPA 之間的效率，但隨著配對數量與 k 的需求愈高時，

KMPA 明顯地較 KMPN 花費更多的時間。

圖 5.1 KMPN 之執行時間

圖 5.2 KMPA 之執行時間 5.1.2 被擾動的邊之比率

由於本論文採取了平均分配權重的原故，導致使得每一條尚未經過之邊，都 會盡可能地分配修改之權重。KMPN 採用最短路徑尚未經過之邊做為修改之對象，

KMPA 則包含了前（k-1）對配對之 AV 來進行權重之刪減，故 KMPA 將會較 KMPN 所擾動之數量略多。如圖圖 5.3 及圖 5.4 中，KMPA 之修改比率比 KMPN 些微增 加，但不相差甚遠。

圖 5.3 KMPN 被擾動的邊之比率

圖 5.4 KMPA 被擾動的邊之比率 5.1.3 被擾動的權重比率

權重修改的大小攸關於次短路徑與最短路徑之間的權重差距，若挑選之次短 路徑權重差距越大，則比率會明顯地上升，圖 5.5 與圖 5.6 中，當 k 值漸漸上升 時，修改權重比率於 KMPN 與 KMPA 之數值，大部分皆位於 0.05 至 0.20 之區間。

而 KMPA 之 H1 中，k=10 的部分有較明顯地上升，原因在於尋找到的次短路徑，

因無法達成修改條件，讓演算法不斷的尋找下一條次短路徑，使路徑之總權重有明顯的差距。

圖 5.5 KMPN 被擾動的權重比率

圖 5.6 KMPA 被擾動的權重比率 5.1.4 資訊損失

絕大部分之隱匿相關研究，除保護資訊隱私之外，也詳加考量隱匿後資料之實用性。若為了保護敏感資訊不會被攻擊者所發現，進而濫用隱私，則公布後之資料反而無法被應用，反而失去了保護之意義。本研究也考量圖形透過隱匿後，

與原始圖形之間的差異，我們採用 Kullback and Leibler (KL) divergence 計算修改過後的圖形與修改之前圖形的差異。

KL divergence 是用來衡量擁有相同隨機變數 x 下的機率分配 f 與 g 之相異性，

通常分配 f 代表真實的分布數據，分配 g 代表一個理論、模型或是近似 f 的另一項 分配。本研究中，我們將分配 f 視為修改前之權重於路徑上的比率分配，而分配 g 視為修改後的分配結果，透過 KL divergence 來比較修改前與修改過後的差異性之大小，當兩者之間沒有差異，則 KL divergence 之值為 0，若差異愈大，KL divergence 之值也愈大。本研究所使用的 KL divergence 公式如下：

圖 5.7 KMPN 所形成之資訊損失

圖 5.8 KMPA 所形成之資訊損失

Chapter 6 結論與未來方向

過去，有許多學者提出防範社會網絡之隱私攻擊的相關隱匿研究，包含針對社會網絡個體的連結度、社會網絡個體間的關係結構與社會網絡路徑間的權重隱私，想防範其他攻擊者對於不同的社會網絡類型所造成的隱私危害。但是都無法 防範攻擊者透過權重圖形，揭露最短路徑資訊之威脅。本研究提出了 k-anonymous path privacy 之想法，並使輸出後的圖形，讓攻擊者能夠辨識原始之最短路徑達到

（1/k）以下，達成資訊保護之效果。在方法中，本論文比較了兩項修改方式，分 別為針對 NV 之 KMPN，以及修改範圍包含針對前（k-1）個配對的 AV 之 KMPA。

由結果顯示，無論是 KMPN 亦或是 KMPA，都有其優劣點。在相同的配對之下，

KMPN 之方法能修改的邊數略 KMPA 少，但每一條邊上修改的權重數量較多；而 KMPA 修改的邊之數量較多，但單條邊上所修改的權重數量較少。兩種方法的資訊 損失數據上是相差不遠的，我們無法以資訊損失的數據來判定其優劣。但若以修 改之邊的數量愈少為佳的角度來觀察，則 KMPN 是比較可行的方法。

本研究之方法可以解決最短路徑被輕易識別之問題，但此方法每當完成一條路徑，

便會重新排名，使得修改所需求的時間量大幅增加。在未來，我們將會針對修改時間提出較有效率的改善方法，並持續研究相關議題，提防攻擊者由隱匿過後之圖形，得到隱私之情報。

參考文獻

Backstrom, L., Dwork, C., & Kleinberg, J. (2011). Wherefore art thou {R3579X}?:

anonymized social networks, hidden patterns, and structural steganography.

Communications of the ACM, 54(12), 133-141. New York, NY, USA: ACM.

Backstrom, L., Huttenlocher, D., Kleinberg, J., & Lan, X. (2006). Group formation in large social networks: membership, growth, and evolution. (J. Chodorow,

Ed.)Science, pages(9), 44-54. ACM.

Barbaro, M., & Zeller, T. (2006). A Face Is Exposed for AOL Searcher No. 4417749.

New York Times. Retrieved from

http://www.nytimes.com/2006/08/09/technology/09aol.html

Boyd, D., & Ellison, N. B. (2007). Social Network Sites: Definition, History, and Scholarship. Journal of Computer-Mediated Communication, 13(1-2).

Carminati, B., Ferrari, E., & Perego, A. (2007). Private Relationships in Social

Networks. Data Engineering Workshop, 2007 IEEE 23rd International Conference on (pp. 163-171).

Cheng, J., chee Fu, A. W., & Liu, J. (2010). K-isomorphism: privacy preserving network publication against structural attacks. Proceedings of the 2010 international

conference on Management of data (pp. 459-470). New York, NY, USA: ACM.

Das, S., Egecioglu, O., & El Abbadi, A. (2010). Anonymizing weighted social network graphs. Data Engineering (ICDE), 2010 IEEE 26th International Conference (pp.

904-907).

Gross, R., & Acquisti, A. (2005). Information revelation and privacy in online social networks. Proceedings of the 2005 ACM workshop on Privacy in the electronic society (pp. 71-80). New York, NY, USA: ACM.

Hay, M., Miklau, G., Jensen, D., Towsley, D., & Weis, P. (2008). Resisting structural re-identification in anonymized social networks. Proc. VLDB Endow., 1(1), 102-114. VLDB Endowment.

He, Y., & Naughton, J. F. (2009). Anonymization of Set-Valued Data via Top-Down , Local Generalization. Proceedings of the VLDB Endowment, 2(1), 934–945.

VLDB Endowment.

Korolova, A., Motwani, R., Nabar, S. U., & Xu, Y. (2008). Link privacy in social networks. CIKM’08: Proceeding of the 17th ACM conference on Information and

knowledge management (pp. 289-298). New York, NY, USA: ACM.

Li, N., Zhang, N., & Das, S. K. (2011). Preserving Relation Privacy in Online Social Network Data. IEEE Internet Computing, 15, 35-42. Los Alamitos, CA, USA:

IEEE Computer Society.

Li, Y., & Shen, H. (2011). Anonymizing Graphs Against Weight-based Attacks with Community Preservation. JCSE, 5(3), 197-209.

Liu, K., & Terzi, E. (2008). Towards identity anonymization on graphs. SIGMOD ’08:

Proceedings of the 2008 ACM SIGMOD international conference on Management of data (pp. 93-106). New York, NY, USA: ACM.

Liu, K., Das, K., Grandison, T., & Kargupta, H. (2008). Chapter 21 Privacy-Preserving Data Analysis on Graphs and Social Networks. (H. Kargupta, J. Han, P. Yu, R.

Motwani, & V. Kumar, Eds.)DisClosure, 415-434.

Liu, L., Liu, J., & Zhang, J. (2010). Privacy Preservation of Affinities in Social Networks. IADIS ICIS, 372-376.

Liu, L., Wang, J., Liu, J., & Zhang, J. (2009). Privacy Preservation in Social Networks with Sensitive Edge Weights. SDM (pp. 954-965). SIAM.

Martins, E. V., & Pascoal, M. B. (2003). A new implementation of Yen's ranking loopless paths algorithm. Quarterly Journal of the Belgian French and Italian Operations Research Societies, 1(2), 121-133.

Meyerson, A., & Williams, R. (2004). On the complexity of optimal K-anonymity.

PODS ’04: Proceedings of the twenty-third ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems (pp. 223-228). New York, NY, USA:

ACM.

Motwani, R., & Nabar, S. U. (2008). Anonymizing Unstructured Data. CoRR, abs/0810.5.

Park, H., & Shim, K. (2007). Approximate algorithms for K-anonymity. Proceedings of the 2007 ACM SIGMOD international conference on Management of data (pp.

67-78). New York, NY, USA: ACM.

Samarati, P., & Sweeney, L. (1998). Generalizing data to provide anonymity when disclosing information. Proceedings of the seventeenth ACM

SIGACT-SIGMOD-SIGART symposium on Principles of database systems (pp.

188-202). New York, NY, USA: ACM.

Sweeney, L. (2002). k-anonymity: a model for protecting privacy. Int. J. Uncertain.

Fuzziness Knowl.-Based Syst., 10, 557-570. River Edge, NJ, USA: World Scientific Publishing Co., Inc.

Tai, C. H., Yu, P. S., Yang, D. N., & Chen, M. S. (2011). Privacy-preserving social network publication against friendship attacks. Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining (pp.

1262-1270). New York, NY, USA: ACM.

Wasserman, S., & Faust, K. (1994). Social Network Analysis: Methods and Applications (Structural Analysis in the Social Sciences) (1st ed.).

Wu, W., Xiao, Y., Wang, W., He, Z., & Wang, Z. (2010). k-symmetry model for identity anonymization in social networks. EDBT ’10: Proceedings of the 13th

International Conference on Extending Database Technology (pp. 111-122). New York, NY, USA: ACM.

Yen, J. Y. (1971). Finding the K Shortest Loopless Paths in a Network. Management Science, 17(11), 712-716.

Ying, X., & Wu, X. (2008). Randomizing Social Networks: a Spectrum Preserving Approach. SDM (pp. 739-750).

Zheleva, E., & Getoor, L. (2008). Preserving the Privacy of Sensitive Relationships in Graph Data. In F. Bonchi, E. Ferrari, B. Malin, & Y. Saygin (Eds.), Privacy, Security, and Trust in KDD (Vol. 4890, pp. 153-171).

Zhou, B., & Pei, J. (2008). Preserving Privacy in Social Networks Against Neighborhood Attacks. Data Engineering, 2008. ICDE 2008. IEEE 24th International Conference on (pp. 506-515).

Zhou, B., Pei, J., & Luk, W. (2008). A brief survey on anonymization techniques for privacy preserving publishing of social network data. SIGKDD Explor. Newsl., 10(2), 12-22. New York, NY, USA: ACM.

Zou, L., Chen, L., & Ö zsu, M. T. (2009). k-automorphism: a general framework for privacy preserving network publication. Proc. VLDB Endow., 2, 946-957.

在文檔中社會網絡權重路徑隱匿之探討 (頁 32-0)