本研究探討如何利用雲端運算來計算龐大圖形中,任意兩點的最短路徑距離,
並同時保護圖形上節點的敏感資訊。為了保護圖形上鄰居隱私與特定敏感路徑,
本研究提出兩種隱私保護,分別為(1) k-skip 最短路徑隱私與(2) 敏感路徑隱私。
敏感路徑隱私可將特定敏感路徑的起點、終點分散到不同外包子圖,讓攻擊 者即使辨識出子圖上的節點,也無法在子圖上計算出兩點的最短路徑距離。
另外,為了更有效率地進行圖形轉換與在雲端環境中計算最短路徑距離,本 研究提出結合 k-skip 最短路徑、節點階層還有由下而上之圖形切割技術,可以避 免事前計算所有最短路徑,並加速最短路徑距離的查詢時間。
在實驗分析中,我們使用圖形產生器產生 3 個不同規模,具有冪次法則與小 世界現象的無尺度網路,並透過變動隱藏鄰居層數與敏感路徑比例來檢驗提出演 算法之特性。首先,實驗結果呈現使用 k-skip 最短路徑技術來保護鄰居隱私,其 圖形轉換效率優於事前計算所有最短路徑再保護鄰居隱私之方法。另外,在實驗 k-skip 最短路徑隱私可將外包子圖上的鄰居隱私保護擴展到 0 至 k-1 層鄰居隱私,
並維持最短路徑距離正確性。結果中顯示當想要隱藏的鄰居隱私層數增加時,處 理時間會快速增加。當需要保護的敏感路徑比例增加時,最短路徑查詢時間與圖 形轉換時間也會隨之增加。最後,切割子圖數量增加,最短路徑查詢時間會隨之 減少,顯示出分散運算之效果。
本研究提出之演算法,可透過調整 k 值增加隱匿鄰居節點的範圍,降低節點 被辨識之機會,並且在無尺度網路為模擬對象的實驗中可有效率地進行圖形轉換 與最短路徑距離查詢。但是,針對增加 k 值能提升多少隱私保護效果,本研究尚 未設計衡量指標來數據化隱私保護之效果。另外,本研究提出之圖形分散演算法 目前只使用於無尺度網路,尚未對其它類型之網路結構做設計考量。
33
因此,本研究之未來研究方向可以分為以下兩點進行:
第一,設計能評估本研究隱私保護效果之衡量指標。本研究未來可導入機率 圖模型 (Probabilistic Graph Model, PGM),用來計算各節點或外包圖出現之機率。
例如採用貝式網路 (Bayesian Network, BN)或隱馬可夫鏈 (Hidden Markov Model, HMM)之機率計算方法,並參考其中模型選擇 (Model Selection)之方法,例如採 用 Akaike information criterion (AIC) 、 Bayesian information criterion (BIC) 或 Mutual information (MI)等指標,來做為本研究隱私保護效果之衡量指標。
第二,設計能應用在其他網路結構之圖形分散方法與最短路徑距離計算方 法。本研究目前適用於無尺度網路結構,比如道路網路、網路拓樸之隱私保護。
未來可基於本研究之三層架構,設計適用於社群網路等有多個團(clique)之網路 結構之圖形分散與最短路徑距離計算方法,以提供各種情境下更好的隱私保護效 果與更有效率的查詢速度。
34
參考文獻
[1] Abraham, A. Fiat, A. V. Goldberg, and R. F. Werneck, “Highway dimension, shortest paths, and provably efficient algorithms,” in Proceedings of the Twenty-First Annual ACM-SIAM Symposium on Discrete Algorithms, Philadelphia, PA, USA, 2010, pp. 782–793.
[2] D. Agrawal, A. El Abbadi, S. Antony, and S. Das, “Data management challenges in cloud computing infrastructures,” in Proceedings of the 6th international conference on Databases in Networked Information Systems, Berlin, Heidelberg, 2010, pp. 1–10.
[3] S. Das, O. Egecioglu, and A. El Abbadi, “Anonymizing weighted social network graphs,” in Proceedings of the 2010 IEEE 26th International Conference on Data Engineering (ICDE), 2010, pp. 904–907.
[4] A. W.-C. Fu, H. Wu, J. Cheng, and R. C.-W. Wong, “IS-Label: An Independent-set Based Labeling Scheme for Point-to-point Distance Querying,”
Proc. VLDB Endow., vol. 6, no. 6, pp. 457–468, Apr. 2013.
[5] J. Gao, J. X. Yu, R. Jin, J. Zhou, T. Wang, and D. Yang, “Neighborhood-privacy protected shortest distance computing in cloud,” in Proceedings of the 2011 ACM SIGMOD International Conference on Management of data, New York, NY, USA, 2011, pp. 409–420.
[6] R. Geisberger, P. Sanders, D. Schultes, and D. Delling, “Contraction hierarchies:
faster and simpler hierarchical routing in road networks,” in Proceedings of the 7th international conference on Experimental algorithms, Berlin, Heidelberg, 2008, pp. 319–333.
[7] H. Hacigumus, B. Iyer, and S. Mehrotra, “Providing database as a service,” in Proceedings of the 18th International Conference on Data Engineering, 2002, pp.
29–38.
[8] M. Hay, G. Miklau, D. Jensen, D. Towsley, and C. Li, “Resisting structural re-identification in anonymized social networks,” The VLDB Journal, vol. 19, no.
6, pp. 797–823, Dec. 2010.
[9] IPython.parallel, http://ipython.org/ipython-doc/dev/parallel/
[10] K. Liu and E. Terzi, “Towards identity anonymization on graphs,” in Proceedings of the 2008 ACM SIGMOD international conference on Management of data, New York, NY, USA, 2008, pp. 93–106.
[11] Liu, L., Liu, J., and Zhang, J. ”Privacy preservation of affinities in social networks”. In Proceedings of the Inter-national conference on Information Systems, pp. 372-376,Porto, Portugal, 2010.
35
[12] L. Liu, J. Wang, J. Liu, J. Zhang, Privacy preservation in social networks with sensitive edge weights, In SDM, pages 954-965, 2009.
[13] S.L. Wang, Z.Z. Tsai, T.P. Hong, and H.H. Ting, “Anonymizing Shortest Paths on Social Network Graphs”, in Proceeding of the Third Asian Conference on Intelligent Information and Database Systems (ACIIDS). April, 2011, Daegu, Korea
[14] S.-L. Wang, C.-C. Shih, I.-H. Ting, and T.-P. Hong, “Degree Anonymization for K-Shortest-Path Privacy,” in Proceeding of the 2013 IEEE International Conference on Systems, Man, and Cybernetics (SMC), 2013, pp. 1093–1097.
[15] R. H. Möhring, H. Schilling, B. Schütz, D. Wagner, and T. Willhalm,
“Partitioning Graphs to Speed Up Dijkstra’s Algorithm,” in Experimental and Efficient Algorithms, S. E. Nikoletseas, Ed. Springer Berlin Heidelberg, 2005, pp.
189–202.
[16] Networkx, http://networkx.github.io/
[17] P. Sanders and D. Schultes, “Highway hierarchies hasten exact shortest path queries,” in Proceedings of the 13th annual European conference on Algorithms, Berlin, Heidelberg, 2005, pp. 568–579.
[18] Y. Tao, C. Sheng, and J. Pei, “On k-skip shortest paths,” in Proceedings of the 2011 ACM SIGMOD International Conference on Management of data, New York, NY, USA, 2011, pp. 421–432.
[19] VirtualBox, https://www.virtualbox.org/
[20] M. L. Yiu, Y. Lin, and K. Mouratidis, “Efficient verification of shortest path search via authenticated hints,” in Proceedings of the 2010 IEEE 26th International Conference on Data Engineering (ICDE), 2010, pp. 237–248.
[21] B. Zhou and J. Pei, “Preserving Privacy in Social Networks Against Neighborhood Attacks,” in Proceedings of the 2008 IEEE 24th International Conference on Data Engineering, Washington, DC, USA, 2008, pp. 506–515.
[22] L. Zou, L. Chen, and M. T. Özsu, “Distance-join: pattern match query in a large graph database,” Proc Vldb Endow, vol. 2, no. 1, pp. 886–897, Aug. 2009.