• 沒有找到結果。

Adaptive Sequential Hypothesis Testing

Our proposed adaptive sequential hypothesis testing provides estimates of θ0 and θ1 adaptively based on observations of the outcomes of FCC requests.

We will consider only the estimation procedure of θ0. The estimation procedure for θ1 is similar.

The basic idea of our proposed estimation procedure is as follows. An estimate of θ0, denoted by θ , is generated when the total number of remote ˆ0 hosts that are detected as benign is greater than or equal to K, where K is a design parameter. Let Si and Fi represent, respectively, the numbers of successful and failed FCC requests sent by ri when it is detected as benign. Furthermore, remote host makes an FCC request to a local destination, its likelihood ratio is

updated according to the outcome, i.e., success or fail, of the FCC. If the FCC request is classified as success, S of i Hash r( ) is increased by one, where

Hash r( ) represents the hash result of IP address r. On the contrary, if the FCC request is classified as fail, Fi of ( )Hash r is increased by one.

Table 3: Data structure of the adaptive sequential hypothesis testing algorithm.

Hash r( ) Λ X

( )

n Si Fi

611 5.545177 0 2

849 6.415920 3 4

965 -4.674434 3 0

1540 -5.361835 7 2

… … … …

The remote host r is detected as benign if its likelihood ratio is lower than threshold η0. On the other hand, if its likelihood ratio is higher than threshold

η1, the remote host r is declared as malicious. Once remote host r is decided as benign or malicious, the corresponding Si and Fi values are added to the data structure shown in Table 4.

Table 4: Data structure for updating θ and ˆ0 θ . ˆ1 benign or malicious. However, it achieves better accuracy and thus is

worthwhile to sacrifice the decision time. In our design, θˆ0 and θˆ1 are updated for the first time when a total of K remote hosts are decided as benign or malicious, respectively. Based on ordered statistics [10], for a group of benign remote hosts which issue FCC requests randomly to local hosts, the first few hosts that are detected as benign tend to have zero or very few failed FCC requests. Similarly, the first few malicious remote hosts that are detected as malicious tend to have zero or very few successful FCC requests. Consequently, the estimates may largely deviate from the real values if we set K = . In general, a large value of 1

K provides better accuracy but longer detection time. We select K =10 in our experiments presented in the next chapter.

Chapter 5.

Experimental Results

In this chapter, we present simulation results for the TRW algorithm (with known θ and unknown θ ) and our proposed adaptive sequential hypothesis testing algorithm. The desired false positive and false negative probabilities are both set to 0.01. In other words, we choose α =0.01 and β =0.99 in our experiments. Simulations are performed for 900 benign hosts and 100 malicious hosts. The probabilities of success for an FCC request generated by a benign host or a malicious host are equal to θ0 and θ1, respectively. We performed simulations for different values of θ0 and θ1.

Figure 1 and Figure 2 compare, respectively, the false positive and false negative probabilities of the TRW algorithm with or without knowing θ0 and θ1 and our proposed adaptive algorithm, for various values of θ0 and θ1. We assume that θ0 =0.8 and θ1=0.2 are used for the TRW algorithm without knowing θ0 and θ1. As one can see, the false positive and false negative probabilities are very low for the TRW algorithm with perfect knowledge of θ0

and θ1. However, without knowing the real values of θ0 and θ1, its false positive and false negative probabilities of TRW could be much greater than the desired values when θ0 is small and θ1 is large (say, θ0 =0.6 and θ1=0.4).

The reason is that the step size of moving upward using θˆ0 =0.8 and θˆ1=0.2 is significantly larger than the step size of moving upward using θ0 =0.6 and

1 0.4

θ = . Using our proposed scheme (i.e., Adaptive SHT), the false positive and false negative probabilities are almost lower than 5% for all cases (except for

0 0.55

θ = and θ1=0.45). The results are close to the desired values because the estimates of θ0 and θ1 in our proposed scheme are quite accurate (as Table 5 shows). Note that in our proposed scheme, the false positive probabilities are larger than false negative probabilities when θ0 is large and θ1 is small. This is because the number of benign hosts is much larger than the number of malicious hosts. As a result, θˆ0 is updated much earlier than θˆ1. As mentioned before, the earlier detected benign hosts tend to have many more successful FCC requests than failed ones. This implies θˆ0 tends to be larger than the real value which makes it easier to detect a remote host as benign.

Figure 1: Comparison of false positive probabilities.

Figure 2: Comparison of false negative probabilities.

Table 5: Estimates of θ0 and θ1 for the proposed adaptive algorithm.

Table 6 and Table 7 show, respectively, the average number of FCC requests sent by a remote host to be detected as benign or malicious. The TRW algorithm with unknown θ0 and θ1 is fast in making a decision because the large step sizes. Unfortunately, as illustrated in Figures 1 and 2, its false positive and false negative probabilities are not satisfactory. The average number of FCC requests for our proposed adaptive algorithm are comparable to those for the TRW

algorithm with known θ0 and. Let ˆθ and ˆ 'θ be two successive estimates of θ . One can stop updating ˆθ if ˆθ θ'− ˆ < for a given ε ε to speed up the detection time. In other words, the time spent to obtain a stable estimate of θ can be regarded as the period of training. Of course, to adapt to a changing

environment, the training procedure should be reactivated once in a while.

Table 6: The average number of FCC requests to detect a remote host as benign.

θ0 95% 90% 85% 80% 75% 70% 65% 60% 55%

θ1 5% 10% 15% 20% 25% 30% 35% 40% 45%

TRW (known

θ0 and θ1)

2.21 3.74 4.24 6.61 9.92 14.81 26.32 59.14 225.61

TRW (unknown

θ0 and θ1)

4.45 5.00 5.70 6.63 7.80 9.36 11.26 13.40 15.23

Adaptive_SHT 15.82 17.66 20.01 23.03 27.35 33.97 46.15 77.19 304.51

Table 7: The average number of FCC requests to detect a remote host as malicious.

θ0 95% 90% 85% 80% 75% 70% 65% 60% 55%

θ1 5% 10% 15% 20% 25% 30% 35% 40% 45%

TRW (known

θ0 and θ1)

2.21 3.74 4.23 6.63 9.91 14.78 26.42 59.15 224.71

TRW (unknown

θ0 and θ1)

4.45 5.01 5.71 6.63 7.80 9.36 11.24 13.38 15.24

Adaptive_SHT 15.33 17.13 19.33 22.11 25.98 31.75 41.43 61.81 139.21

Chapter 6.

Conclusion

We have presented in this paper an adaptive sequential hypothesis testing algorithm for accurate detection of scanning worms. Numerical results show that our proposed adaptive algorithm provides accurate estimates of θ0 and θ1 and thus achieves false positive and false negative probabilities close to the desired values. The proposed adaptive estimation procedure for θ0 and θ1 is an important enhancement of the sequential hypothesis testing algorithm because it makes the algorithm much more robust to variation of θ0 and θ1. The

proposed adaptive detection algorithm is only suitable for scanning worms.

How to effectively detect other types of worms remains to be further studied.

Bibliography

[1] J. Jung, V. Paxson, A. W. Berger, and H. Balakrishnan, “Fast Portscan Detection Using Sequential Hypothesis Testing,” In Proceedings of the IEEE Symposium on Security and Privacy, May 9-12 2004.

[2] S. E. Schechter, J. Jung, and A. W. Berger, “Fast Detection of Scanning Worms Infections,” In Proceedings of the 7th International Symposium on Recent Advances in Intrusion Detection (RAID 2004), September 15-17 2004.

[3] N. Weaver, S. E. Schechter, V. Paxson, “Very Fast Containment of Scanning Worms,” In Proceedings of the 13th USENIX Security Symposium, August 9-13 2004.

[4] M. Roesch, “Snort: Lightweight Intrusion Detection for Networks,” In Proceedings of the 13th Conference on Systems Administration (LISA-99), pages 229–238, Berkeley, CA, Nov.

7–12 1999. USENIX Association.

[5] L. T. Heberlein, G. V. Dias, K. N. Levitt, B. Mukherjee, J. Wood, and D. Wolber, “A Network Security Monitor,” In Proceedings of IEEE Symposium on Research in Security and Privacy, pages 296–304, 1990.

[6] D. Moore, C. Shannon, and J. Brown, “Code-Red: a case study on the spread and victims of an Internet worm,” in Proc. ACM/USENIX Internet Measurement Workshop, France, Nov. 2002.

http://www.caida.org/dynamic/analysis/security/nimda.

[8] D. Moore, V. Paxson, S. Savage, C. Shannon, S. Staniford, and N. Weaver, “Inside the Slammer Worm,” IEEE Magazine of Security and Privacy, 1(4): 33-39, July 2003.

[9] A. Wald, Sequential Analysis, J. Wiley & Sons, New York, 1947.

[10] R. Hogg and A. Craig, Introduction to Mathematical Statistics, The Macmillan Company, 1970.

[11] C. C. Zou, D. Towsley, W. Gong, and S. Cai. “Routing Worms: A Fast, Selective Attack

Worm based on IP Address Information.” In Proceedings of the 19th Workshop on Principles of Advanced and Distributed Simulation (PADS’05), June 2005.

[12] N. Weaver, V. Paxson, S. Staniford ,and R. Cunningham. “A Taxonomy of computer worms.” In Proceedings of the 2003 ACM Workshop on Rapid Malcode, pages 11–18. ACM Press, October 27, 2003.

[13] Type I and Type II errors. From Wikipedia, the free encyclopedia,

http://en.wikipedia.org/wiki/Type_I_and_type_II_errors

相關文件