Sensitivity Analysis - Performance Analysis

Performance Analysis

4.3 Sensitivity Analysis

In this subsection, we experiment the impact of diﬀerent parameters by varying the value of one parameter while keeping other parameters unchanged. In each experiment we let the sensors operates for 1000 time-units. We repeat each experiment 10 times and present the average values.

In the first experiment, the communication range of each sensor is set to be 40. The error threshold is set to 0.5^◦C. We varied k, the number of hops for each sensor to exchange its readings in algorithm DClocal, from 1 to 6. In Figure 4.4, the total remaining energy of

0 50000 100000 150000

1 2 3 4 5 6

Total Remaining Energy

DCglobal DClocal

Figure 4.4: The impact of the value of k on the total remaining energy.

0 5 10 15 20 25 30 35 40

1 2 3 4 5 6

Number of Representatives

DCglobal DClocal

Figure 4.5: The impact of the value of k on the number of r-nodes.

0 50000 100000 150000

10 20 30 40 50

Stan dard Error

Total Remaining Energy

DCglobal DClocal Naive

Figure 4.6: The impact of the standard error on the total remaining energy.

DCglobal and DClocal at the end of the simulation is plotted.

As can be seen in Figure 4.5, due to the property of DCglobal, the number of r-nodes selected by DCglobal is not aﬀected by k and the available energy is almost constant. On the other hand, when the value of k is less than 3, the available energy of DCglobal is higher than that of DClocal. This is because that, for smaller k, more r-nodes are selected by DClocal.

Therefore, the energy consumed in answering queries is higher. In Figure 4.5, when the value of k increases, the number of r-nodes selected by DClocal decreases. Since the number of r-nodes for answering queries decreases, the available energy of DClocal increases. As shown in Figure 4.4, the available energy of DClocal is higher than that of DCglobal when k is equivalent to 3 or 4.

However, the energy consumption in exchanging readings also increases as the value of k increases. As mentioned before, if the cost of increasing the value of k is higher than the benefit of increasing, the available energy decreases, as can be seen in Figure 4.4 when the value of k is larger than 4. Therefore, from the observation we can concluded that in this example the optimal value of k is 3 or 4.

0 2 4 6 8 10 12 14 16 18

10 20 30 40 50

Stand ard Error

Number of Representatives

DCglobal DClocal

Figure 4.7: The impact of the standard error on the number of r-nodes.

In the second experiment, we varied the standard error of the readings of the events in our network simulator. The standard error controls the volatility of the reading of sensors since the readings of sensors are the weighted average of the events in the network. The total remaining energy of diﬀerent situation at the end of the simulation is shown in Figure 4.6.

In this experiment, we set the value of k is 3, thus the available energy of DClocal is always higher than that of DCglobal for diﬀerent values of standard error. In Figure 4.6, for both DCglobal and DClocal, the available energy decreases as the standard error increases. This is because that for more volatile environment, more r-nodes are selected to fully data-cover the whole network. The number of r-nodes selected by DCglobal and DClocal can be seen in Figure 4.7. Therefore the energy consumption of the network in more volatile environment is larger. Since no r-nodes are selected in Naïve scheme, the available energy of Naïve is always the smallest one.

In the third experiment, we varied the values of the error threshold. The available energy of DClocal with diﬀerent value of k and DCglobal is shown in Figure 4.8. As can be seen in the figure, the available energy of each algorithm is inversely proportional to the error threshold.

0 50000 100000 150000

0.1 0.3 0.5 0.8 1

Error Threshold

Total Remaining Energy

DCglobal DClocal with k = 1 DClocal with k = 3 DClocal with k = 5

Figure 4.8: The impact of the error threshold on the total remaining energy.

This is because that, for larger value of the error threshold, the number of r-nodes needed to fully data-cover the whole network is less. The number of r-nodes selected by DClocal with diﬀerent value of k and DCglobal can be seen in Figure 4.9. In addition, the available energy of DClocal with k = 3 is the highest among all the cases, which is consistent with the discussion of Figure 4.4. The available energy of DClocal with k = 1 is the lowest, since the number of r-nodes in this case is the most.

In addition, for the value of error threshold larger than 0.8, the available energy of the 4 cases tends to be constant. This is because that the r-nodes almost cover all their neighbors within k-hops distance when the value of the error threshold is 0.8.

0 10 20 30 40 50 60

0.1 0.3 0.5 0.8 1

Error Thresho ld

Number of Representatives

DCglobal DClocal with k = 1 DClocal with k = 3 DClocal with k = 5

Figure 4.9: The impact of the error threshold on the number of r-nodes.

Chapter 5 Conclusions

In this paper we focused on approximate query processing in sensor networks. Since the readings of nearby sensors are usually correlated, it is power-eﬃcient to group these sensors together and let only one of them respond to queries. We proposed a innovative concept called data-coverage to address the problem. We proved that the data-covering problem which selects minimal number of sensors as r-nodes with the union of their data-coverages data-cover all the sensors in the network is an NP-complete problem by reducing the set-covering problem to the data-covering problem. We then devised two heuristic algorithms DCglobal and DClocal to solve the data-covering problem. Algorithm DCglobal is a centralized algorithm executed at sink. The ratio between the number of selected r-nodes selected by DCglobal and that of the optimal solution is bounded. Since the message cost of selecting r-nodes in executing DCglobal is very high, we devised another distributed algorithm DClocal. Each sensor discovers its data-coverage by exchanging readings with only nearby neighbors. The sensors exchange their data-coverage size with nearby sensors to select sensors with locally largest coverage size as r-nodes. Through the experimental study, it can be seen the performance of DClocal is almost as good as DCglobal with energy consumption less than the energy consumption of DCglobal. The selected r-nodes answer queries within the pre-specified error threshold and the network lifetime is prolonged.

Bibliography

[1] I. F. Akyildiz, W. Su, Y. Sankarasubramaniam, and E. Cayirci. Wireless sensor networks:

a survey. Computer Networks, 38(4):393—422, 2002.

[2] A. D. Amis, R. Prakash, D. Huynh, and T. Vuong. Max-min d-cluster formation in wireless ad hoc networks. In INFOCOM, pages 32—41, 2000.

[3] P. Bonnet, J. Gehrke, and P. Seshadri. Towards sensor database systems. In Mobile Data Management, pages 3—14, 2001.

[4] C. Buragohain, D. Agrawal, and S. Suri. Power aware routing for sensor databases. In Infocom, Dec. 29 2004.

[5] T. Cormen, C. Leiserson, and R. Rivest. Introduction to Algorithms, pages 1033—1038.

The MIT Press, 1989.

[6] A. Deshpande, C. Guestrin, S. Madden, J. M. Hellerstein, and W. Hong. Model-driven data acquisition in sensor networks. In Proceedings of the Thirtieth International Confer-ence on Very Large Data Bases, Toronto, Canada, August 31 - September 3 2004, pages 588—599, 2004.

[7] Y. T. Hou, Y. Shi, and H. D. Sherali. Rate allocation in wireless sensor networks with network lifetime requirement. In MobiHoc, pages 67—77, 2004.

[8] I. Kang and R. Poovendran. Maximizing network lifetime of broadcasting over wireless stationary ad hoc networks. MONET, 10(6):879—896, 2005.

[9] Y. Kotidis. Snapshot queries: Towards data-centric sensor networks. In Proceedings of the 21st International Conference on Data Engineering, ICDE 2005, 5-8 April 2005, Tokyo, Japan, pages 131—142, 2005.

[10] S. Madden, M. J. Franklin, J. M. Hellerstein, and W. Hong. Tag: A tiny aggregation service for ad-hoc sensor networks. In OSDI, 2002.

[11] S. Madden, M. J. Franklin, J. M. Hellerstein, and W. Hong. The design of an acquisi-tional query processor for sensor networks. In Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, San Diego, California, USA, June 9-12, 2003, pages 491—502, 2003.

[12] A. M. Mainwaring, D. E. Culler, J. Polastre, R. Szewczyk, and J. Anderson. Wireless sensor networks for habitat monitoring. In WSNA, pages 88—97, 2002.

[13] A. Meka and A. K. Singh. Distributed spatial clustering in sensor networks. In EDBT, pages 980—1000, Feb. 31 2006.

[14] S. Ratnasamy, B. Karp, L. Yin, F. Yu, D. Estrin, R. Govindan, and S. Shenker. GHT:

a geographic hash table for data-centric storage. In Proceedings of the First ACM In-ternational Workshop on Wireless Sensor Networks and Applications (WSNA-02), pages 78—87, New York, Sept. 28 2002. ACM Press.

[15] W. Wang, J. Yang, and R. R. Muntz. Sting: A statistical information grid approach to spatial data mining. In Proceedings of 23rd International Conference on Very Large Data Bases, August 25-29, 1997, Athens, Greece, pages 186—195, 1997.

[16] A. Y. Wu, M. Garland, and J. Han. Mining scale-free networks using geodesic clus-tering. In W. Kim, R. Kohavi, J. Gehrke, and W. DuMouchel, editors, Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Seattle, Washington, USA, August 22-25, 2004, pages 719—724. ACM, 2004.

[17] Y. Yao and J. Gehrke. The cougar approach to in-network query processing in sensor networks. SIGMOD Record, 31(3):9—18, 2002.

[18] Y. Yao and J. Gehrke. Query processing in sensor networks. In Proceedings of the 1st International Conference on Innovative Data Systems Research, Asilomar, CA, USA, January 5-8, 2003, 2003.

[19] O. Younis and S. Fahmy. Heed: A hybrid, energy-eﬃcient, distributed clustering approach for ad hoc sensor networks. IEEE Trans. Mob. Comput., 3(4):366—379, 2004.

在文檔中利用資料涵蓋觀念解決無線感測網路中之近似查詢處理 (頁 36-44)