Optimal Number of Clusters in Dense Wireless Sensor Networks: A Cross-Layer Approach

(1)

Optimal Number of Clusters in Dense Wireless

Sensor Networks: A Cross-Layer Approach

Li-Chun Wang, Senior Member, IEEE, Chung-Wei Wang, Student Member, IEEE, and

Chuan-Ming Liu, Member, IEEE

Abstract—Cluster-based sensor networks have the advantages of reducing energy consumption and link-maintenance cost. One fundamental issue in cluster-based sensor networks is determining the optimal number of clusters. In this paper, we suggest a physical (PHY)/medium access control (MAC)/network (NET) cross-layer analytical approach for determining the optimal number of clus-ters, with the objective of minimizing energy consumption in a high-density sensor network. Our cross-layer design can incorpo-rate many effects, including lognormal shadowing and a two-slope path loss model in the PHY layer, various MAC scheduling, and multihop routing schemes. Compared with the base-line case with one cluster per observation area (OA), a sensor network with the proposed optimal number of clusters can reduce the energy consumption by more than 80% in some cases. We also verify by simulations that the analytical optimal cluster number can still effectively function, regardless of the different densities of sensors in various OAs.

Index Terms—Cross-layer design, optimal number of clusters, wireless sensor networks.

I. INTRODUCTION

C

LUSTER-BASED sensor networks have many advan-tages. For example, with clustering, energy consumption can be improved, because only one representative node per cluster is required to be active, and the other nodes can enter the dormant mode [2]–[4]. Clustering architecture has important applications of high-density sensor networks, because it is much easier to manage a set of cluster representatives from each cluster than to manage whole sensor nodes. Environment mea-surement [5], target tracking [6], [7], intrusion detection [8], and pursuit-evasion games [9] are typical applications of high-density sensor networks due to the fault-tolerance requirement. One of the key challenges in deploying a high-density cluster-based sensor network is determining the optimal num-ber of clusters. For a high-density sensor network, the coverage area can be partitioned into many disjoint spatial coherence Manuscript received September 13, 2007; revised April 16, 2008. First published July 16, 2008; current version published February 17, 2009. This work was supported in part by the Ministry of Education ATU Plan and in part by the National Science Council, Taiwan, under Contract 96-2221-E-009-061. This work was presented in part at the IEEE International Conference on Networks, Sensors, and Control (ICNSC), Taipei, Taiwan, March 2004. The review of this paper was coordinated by Prof. V. Leung

L.-C. Wang and C.-W. Wang are with the Department of Communication Engineering, National Chiao Tung University, Hsinchu 300, Taiwan (e-mail: [email protected]; [email protected]).

C.-M. Liu is with the Department of Computer Science and Information Engineering, National Taipei University of Technology, Taipei 106, Taiwan (e-mail: [email protected]).

Color versions of one or more of the figures in this paper are available online at http://ieeexplore.ieee.org.

Digital Object Identifier 10.1109/TVT.2008.928637

regions [10], where the sensed information is highly spatial correlated. In this paper, we call such a spatial coherence region a basic observation area (OA). Intuitively, each basic OA with highly correlated sensed information requires only one cluster representative [11]–[14]. In general, the number of clusters in a basic OA can be determined from different aspects of the protocol layers.

1) From the physical (PHY) layer aspect, using more clus-ters can save more energy, because the transmission dis-tance between cluster representatives can be shortened. 2) From the medium access control (MAC) layer aspect,

having fewer clusters (or, equivalently, more nodes per cluster) decreases the average possibility of being a clus-ter representative for each sensor node and, thus, reduces energy consumption.

3) From the multihop routing aspect in the network (NET) layer, fewer clusters yield fewer hop counts to the data sink and result in less energy consumption.

Hence, optimizing the number of clusters in a sensor network becomes a cross-layer tradeoff design issue among the required transmission power in the PHY layer, the possibility of being a cluster representative in the MAC layer, and the hop counts in the relay path in the NET layer.

A. Related Work

The issue of optimizing the number of clusters in a wireless sensor network has been addressed by many researchers from different viewpoints.

1) First, from the viewpoint of propagation distance in the PHY layer, many authors [15]–[20] have discussed how to design the number of clusters in a sensor network. Duarte-Melo and Liu [15] concluded that the number of clusters should be as large as possible, because the distance between the cluster head and its members can be shortened. However, in [16] and [17], it was shown that a larger number of clusters also lead to more one-hop transmissions from the heads to the sink. Thus, the optimal number of clusters exists [18], [19]. Furthermore, Depedri et al. [20] discussed the effects of shadowing and path loss exponents on the optimal number of clusters. 2) From the aspect of the MAC layer, Duarte-Melo and Liu

[21] suggested that the design of the optimal number of clusters should include the impact of the contention mechanism. However, an explicit MAC protocol for achieving this goal was not presented.

(2)

3) From the routing aspect, Mhatre and Rosenberg [22] assumed that each node sends data to the corresponding cluster heads using multihop routing. They found that more clusters will result in more one-hop transmissions from the heads to the sink although the total routing traffics within each cluster is reduced because of fewer members. Furthermore, Zhou and Abouzeid [23] deter-mined the optimal number of clusters from the viewpoint of minimizing the routing overhead using an information-theoretical approach.

Current methods for determining the optimal number of clusters are based on each individual protocol layer aspect. To the best of our knowledge, a PHY/MAC/NET cross-layer analytical approach for determining the optimal number of clusters in a dense sensor network has received little notice in the literature.

B. Contributions and Organization of This Paper

The objective of this paper is to develop a PHY/MAC/NET cross-layer analytical method for calculating the optimal num-ber of clusters in each basic OA for a high-density sensor network. Based on the proposed analytical approach, an optimal number of clusters can be obtained without time-consuming search and labor-intensive field trials. Therefore, the deploy-ment of the energy-efficient cluster-based sensor network be-comes easier.

The contributions of this paper can be summarized here. • First, we propose a cross-layer analytical approach for

determining the optimal number of clusters. In the cluster number design problem, it is a concept in the average sense. Hence, we first suggest adopting the criterion of the minimal total average energy (instead of the lifetime in the extreme case) to calculate the optimal number of clusters per OA. The PHY/MAC/NET cross-layer analyt-ical design approach for the optimal number of clusters in high-density sensor networks has not been seen in the literature.

• Second, we show the existence of the optimal cluster number, regardless of the different densities of sensors in various OAs by simulations and analyses. We also take account of other randomness factors in our simulation platform (Fig. 6), including lognormal shadowing and a more realistic two-slope path loss model. The simulation results are shown to match the proposed analytical results quite well. To the best of our knowledge, this interesting finding of the optimal cluster number, regardless of the different densities of sensors in various OAs, has yet to be reported in the literature.

The rest of this paper is organized as follows: In Section II, we introduce the system model and formulate an optimization problem to find the best number of clusters in high-density sensor networks. Section III discusses how to determine the optimal number of clusters from a PHY/MAC cross-layer per-spective. In Section IV, we further consider the NET layer aspect. The numerical results are shown in Section V. Finally, we give our concluding remarks in Section VI.

Fig. 1. Sensor network divided into a grid of M = 16 basic OAs; in each of them, we form K = 4 clusters. The filled (unfilled) circle represents a cluster representative (dormant) node.

II. SYSTEMMODEL ANDPROBLEMFORMULATION

A. System Model

Consider a sensor network with N nodes in a grid of M basic OAs, each containing K square-shaped clusters. A basic

OA represents a spatial coherence region with highly correlated

information. Define dOA as the minimal distance that results

in uncorrelated information [10]. Fig. 1 shows an example of a sensor network with M = 16 OAs and four clusters per OA (i.e., K = 4). In any instant, only one sensor node per cluster is active to sense the surrounding information, whereas the other nodes enter the dormant mode. Furthermore, we suppose that the sensor nodes in each cluster can be synchronized by a certain clock synchronization mechanism [24].

To ease analysis, the grid model was used for sensor net-works [25]–[29]. In general, the shape of the clusters can be arbitrary (or close to circle) because of the randomness inherent in radio propagation. However, the grid model is still a good candidate for obtaining insights on system design [30].1

Now, we briefly introduce the operations of the cluster-based sensor network considered in this paper. First, the clusters are formed according to some cluster formation mechanisms [31]. One cluster head is selected in each cluster to schedule the subsequent cluster representatives from its cluster members. When receiving the schedule broadcast from the cluster head, all sensor nodes wake up to be the cluster representatives

1_{To design cluster-based sensor networks, we need some geometric shapes}

to tessellate the entire coverage area, such as triangle, square, and hexagon. Hexagon-shaped coverage is widely used for a base station in mobile cellular systems. Our approach can also be modified to hexagon-shaped clusters, which is not included in this paper due to the limitation on page length.

(3)

in turn. The cluster representatives periodically report sensed information to the sink based on a multihop delivery. Note that the clusters are periodically reformed, and the reformation period of the clusters is called a round in this paper.

B. Problem Formulation

In most of the literature, the network lifetime is defined as the time that elapsed from the start of the sensor network to the

death of the first node [32]. In this case, the researches cared for

the maximal energy consumption over all sensor nodes. Hence, the objective function of finding the optimal number of clusters can be expressed as follows:

Kopt = arg min 1≤K≤N

M

max

1≤i≤NEi(K)

where Ei(K) is the energy consumption of sensor i when the

number of clusters in each basic OA is K.

For a dense sensor network, we believe that the lifetime in the extreme case may not be the only performance measure to be considered. It is very likely that the remaining sensor nodes in a dense sensor network can still form a network, even if one sensor node does not function. Hence, our goal is to optimize the average energy consumption, instead of the lifetime in the extreme case (i.e., the duration for the first sensor running out its energy). In this paper, we formulate the Energy Minimizing

Problem for sensor networks as follows: Given a set of system

parameters, including the minimal uncorrelated distances dOA,

the number of sensor nodes N , and the number of basic OAs

M , find the optimal number of clusters (denoted by Kopt) in

each basic OA to minimize the average energy consumption [denoted by E(K)]. Formally

Kopt= arg min 1≤K≤N

M

E(K) (1)

subject to

0≤ Pt≤ Ptmax (PHY layer) 0≤ r ≤ rmax (PHY/MAC layer)

1≤ h ≤ hmax (NET layer)

where Pt, r, and h are the transmission power, average

retrans-mission times in the MAC layer, and average hop counts from the cluster representatives to the sink, respectively; and Ptmax,

rmax, and hmaxare the corresponding maximum values for Pt,

r, and h, respectively.

Clearly, the number of clusters K in a basic OA affects the performance of a sensor network from the PHY/MAC/NET cross-layer perspectives. Note that Pt, r, and h are all

func-tions of K. Specifically, we consider the PHY layer propaga-tion model with different path loss exponents and shadowing components.

III. PHY/MAC CROSS-LAYERASPECT

In this section, we discuss the impacts of the transmission power and the MAC scheduling policy on the number of clusters K from the PHY/MAC cross-layer perspectives.

A. Energy Consumption Model

To begin with, denote Ei→j as the energy consumption

of transmitting data from sensors i to j during each round, where i= j. Denote Ptand Peas the transmission power and

electronics power consumption, respectively. Then, according to [18] and [33], it follows that

Ei→j = ai→j[(Pe+ Pt)· t + Pe· t] = ai→j · (2Pe+ Pt)t

(2) where t and a_i→j are the duration in each transmission and the number of times link i→ j is established during each round, respectively. Here, the energy consumption for sensing is ignored, because it is much lower than that for transmission [34], [35]. Taking the expectation of Ei→j over all the links

from sensors i to j, the average energy consumption of each link during each round can be expressed as follows:

Ei→j = N i=1 N j=1,j=iEi→j N (N− 1) = (2Pe+ Pt)t· ai→j (3) where ai→j =Ni=1 N

j=1,j=iai→j/(N (N− 1)) is the

aver-age number of times each link is established during each round. Usually, t will be a given system parameter. In Sections III-B and C, we discuss how to obtain Ptand ai→j.

B. Impact of Transmission Power

Consider the two-slope path loss model. Let n1 and n2 be

the path loss exponents and X1 and X2 be the lognormally

shadowing components, respectively. At the distance of d, the received power Prcan be written as

Pr= ⎧ ⎨ ⎩ Pt L₀₁ d d₀₁ −n1 X1, d≤ dt Pt L02 d d02 _−n2 X2, d > dt (4)

where L01 and L02 are the path losses at reference distances

d01 and d02 minus antenna gain, respectively. In (4), the two different slopes (n1and n2) are adopted before and after

dis-tance threshold dt, respectively. In general, we have n1< n2.

This propagation model had been validated in outdoor sensor networks at 868 MHz [36].

Fig. 2 shows a square-shaped basic OA with four clusters (K = 4). To guarantee that any two cluster representatives in adjacent clusters can directly be connected, the one-hop transmission distance of each sensor node (which is denoted by d) must be larger than or equal to(5/K)dOA[2]. In this

paper, we assume that the cluster representatives are connected through the multihop communication method when the prop-agation distance is longer than (5/K)dOA. Hence, given a

(4)

Fig. 2. Square-shaped basic OA with four clusters (K = 4), where the maximum one-hop transmission distance between two neighboring cluster representatives is√5dOA/

√

K when the side length of a basic OA is dOA.

than dt. Then, Prcan uniquely be determined from one of the

two particular cases in (4). Without loss of generality, we have

Pr= Pt L0 d d0 −n X. (5) If dOA≤ (K/5)dt, L0= L01, d0= d01, n = n1, and X = X1. Otherwise, L0= L02, d0= d02, n = n2, and X =

X2. In Section V-D, we will evaluate the impact of the

two-slope path loss model on the determination of K.

From (5), the transmission power of a cluster representative that is needed to maintain the received power level of Prat a

distance of(5/K)dOAbecomes Pt= PrL0 √ 5dOA √ Kd0 n (6) where the shadowing effect is not considered here (i.e., X = 1) until Section III-E. Hence, from the PHY layer perspective, we know that, with larger values of K, the required transmission power of a cluster representative is lower. Note that, because the cluster representatives in adjacent clusters contend for the channel based on the IEEE 802.11 MAC protocol, only one cluster representative can transmit at any time instant. Thus, the intercluster interference does not occur; thus, we do not consider it in (6). The detail of channel contention is discussed in Section IV-C.

C. Impact of MAC Scheduling

The MAC layer scheduling policy affects how often cluster representatives i and j establish a link, i.e., ai→j. A

time-driven sensor network usually specifies the number of reports from each cluster during each round, which is denoted by R. For a sensor network with M K clusters,

M KR=N_i=1N_j=1,j=ia_i→j. In (3), we have a_i→j = N

i=1

N

j=1,j=iai→j/(N (N− 1)). Thus, it follows that

ai→j = M KR

N (N − 1). (7)

From (7), once R is specified, a_i→j in (3) can be obtained.

D. Optimal Number of ClustersKl

To determine the optimal number of clusters in a basic OA from both the MAC scheduling and PHY transmit power perspectives, we can substitute (6) and (7) into (3). Let η be the required receive power. Then, the average energy consumption of each link during each round can be expressed as

Ei→j(K) = _{N (N}M K_{− 1)}R 2Pe+ ηL0 √ 5dOA √ Kd0 n t. (8)

From (8), one can find that K affects the energy consumption in two ways. On the one hand, when K decreases, the number of sensor nodes in a cluster increases. Because a smaller value of K results in more members in a cluster and, thus, lower probability to be a cluster representative, energy consumption can be reduced. On the other hand, a smaller K can also increase the energy consumption of a sensor node due to the longer transmission distance in a larger coverage area. Because (∂2Ei→j(K)/∂2K) > 0, (8) is a convex function. Then, letting

(∂E_i→j(K)/∂K) = 0, we can obtain the optimal number of clusters Kl to minimize Ei→j from both the PHY and MAC

layer perspectives. Specifically, we can express Klas

Kl= ⎧ ⎨ ⎩ (n 2−1)β 2α 2 n , n > 2 1, n = 2 (9) where α = Pe, and β = ηL0( √ 5dOA/d0)n. E. Shadowing Effect

Now, we consider the shadowing effect in (5). Recall that shadowing component X is modeled as a lognormal random variable with zero mean and standard deviation σ. Thus, the probability density function of the received signal power Prcan

be expressed as [37] fPr(y) = 1 √ 2πσln 10₁₀ y · exp ⎧ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎩ − 10 log10y− 10 log10LPt0 √ 5dOA √ Kd0 _−n2 2σ2 ⎫ ⎪ ⎪ ⎪ ⎬ ⎪ ⎪ ⎪ ⎭ . (10)

Note that the logarithm of the lognormal random variable yields a normal random variable. Define

Pr(dBm)= 30 + 10 log10Pr (11) η(dBm)= 30 + 10 log10η (12) ξ(dBm)= 30 + 10 log10 Pt L0 d d0 _−n (13) where “dBm” represents the decibel value normalized to 1 mW,

(5)

mean received power without shadowing effect. Accordingly, the outage probability of Prcan be calculated as follows:

Pr{Pr< η} = Pr Pr(dBm)< η(dBm) = η(dBm) −∞ 1 σ√2πexp −z− ξ(dBm) 2 /2σ2 dz = Q ξ(dBm)− η(dBm) σ (14) where Q(x) = ∞ x 1 √ 2πexp −ζ2_/2 dζ. (15)

For the outage probability requirement θ, i.e., Pr{Pr< η} ≤

θ, it follows that

Q−1(θ)≤ ξ(dBm)− η(dBm)

σ . (16)

Thus, substituting (12) and (13) into (16), we can have

Pt≥ ηL0 √ 5dOA √ Kd0 n 10Q−1(θ)σ/10. (17) Let Pt= ηL0( √ 5dOA/ √ Kd0)n10Q −1_(θ)σ/10 in (17), and substitute (7) into (3). By taking the differential against K, for outage probability θ, we can express the optimal number of clusters subject to shadowing as

Kl= γ 2Pe 2 n , n > 2 1, n = 2 (18) where γ = ((n/2)− 1)ηL0( √ 5dOA/d0)n10Q −1_(θ)σ/10 . In the preceding discussion, we consider the outdoor prop-agation model validated in [36]. In various environments [38]–[42], Klcan also be evaluated by similar approaches.

IV. PHY/MAC/NET LAYERASPECT

Based on the PHY/MAC energy consumption model in the previous section, we now further include the effects of channel contention in the MAC layer and routing schemes in the NET layer.

A. Impact of Multihop

The multihop delivery relies on a number of middle cluster representatives to forward data to the destination. If a sensor node is far away from the sink, the multihop delivery becomes inevitable. The multihop routing scheme consumes less energy in each link than the one-hop delivery [43], but the total energy may also increase due to more relay links.

Assume that sensor i has an hi-hop routing path to the sink:

i→ γ1→ γ2→ · · · → γhi−1→ sink. In this routing path, the

energy consumption of link i→ j in each transmission is

E_i→j/a_i→j, because a_i→j is the number of times link i→ j is established, and Ei→jis the sum of the energy consumption

of link i→ j during each round. Denote E(i, sink) as the sum of the energy consumption from all the sensor nodes in this routing path. Then, we have

E(i, sink) = a_i→γ1· Ei→γ1 a_i→γ1 +Eγ1→γ2 aγ1→γ2 +· · · +Eγhi−2→γhi−1 aγ_hi−2→γ_hi−1 +Eγhi−1→sink aγ_hi−1→sink . (19) Usually, the energy consumption of receiving a packet at the sink can be ignored. Hence, we have Eγhi−1→sink=

aγhi−1→sink· (Pe+ Pt)t. Next, substituting (2) into (19), we

can obtain

E(i, sink) = ai→γ1· [(hi− 1)Pe+ hi(Pe+ Pt)] t. (20) Averaging over N routing paths, the average energy con-sumption for each route can be expressed as

E(i, sink) = N i=1 E(i, sink) N = N i=1 a_i→γ1 N · ⎧ ⎪ ⎪ ⎨ ⎪ ⎪ ⎩ Pe· N i=1 (hi−1) N +(Pe+Pt)· N i=1 hi N ⎫ ⎪ ⎪ ⎬ ⎪ ⎪ ⎭ t. (21) Recall ai→j =Ni=1 N j=1,j=iai→j/(N (N− 1)) in (3). If

the first relay node in the routing path from node i to the sink is determined, sensor i transmits to γ1 only, and ai→j =

0 for all j= γ1. In this case, ai→j is simplified to ai→j =

N

i=1ai→γ1/(N (N− 1)). Thus, it follows that

E(i, sink) = (N− 1)ai→j·Pe· (h − 1) + (Pe+ Pt)· h t

(22) where h≡N_i=1hi/N is the average hop counts for each

routing path.

From (22), we observe that a smaller value of K yields less energy consumption, because there are fewer hop counts from the network aspect. However, from the PHY layer aspect, a large K can reduce the transmission power due to the shorter distance. Thus, there exist an optimal number of clusters from the PHY layer transmit power, MAC layer scheduling, and NET layer hop count perspectives. In the next section, we discuss how to obtain h based on the shortest-path routing strategy.

B. Calculation of Hop Countsh

In this section, we show how to calculate the value of h in (22). Although the shortest-path routing is adopted here, the analytical approach can still be applied to other routing strategies. In the considered network, it is assumed that the sink is located at the center of the entire sensor network. As shown in Fig. 3, there are 16 basic OAs with 64 clusters. In this figure, the

(6)

Fig. 3. Sensor network with 64 clusters, where the sink is located at the center of the entire sensor network, and the number in each cluster indicates the required hop counts from the associated cluster to the sink.

Fig. 4. Top-left portion of the considered sensor network in Fig. 3, where

|V_(j)_{| is proportional to the length of DE.}

number in each cluster indicates the required hop counts to the sink. For example, the hop count of the top-left cluster is seven. Denote V (h) as the set of clusters (or cluster representatives) that can reach the sink within h hops. Thus,|V (1)| = 4, and

|V (2)| = 8 in Fig. 3. The relation between |V (h)| and K can

be calculated by the following lemma.

Lemma 1: |V (h)| = 4· h, 1≤ h ≤ √KM₂ 4· (√KM− h), √KM 2 ≤ h ≤ √ KM− 1. (23)

Proof: From Fig. 3, the pattern of the required hop counts

in each cluster is symmetric around the center. Thus, we can consider only the top-left portion, as shown in Fig. 4.

Consider the case of 1≤ h ≤ (√KM /2). Suppose that the

set of clusters associated with BC is V (1) and that the set of those clusters associated with DE is V (j). Thus, we have

|V _(j)_| |V ₍₁₎_| = DE BC. (24) Clearly,|V (1)| = 1. Thus |V _(j)_{| = j.} ₍₂₅₎

Extending to include all four quadrants, we obtain|V (h)| = 4h when 1≤ h ≤ (√KM /2).

When (√KM /2)≤ h ≤√KM− 1, one can also prove

that|V (h)| = 4(√KM− h).

Theorem 1: The average hop counts h from a cluster

repre-sentative to the sink can be calculated as

h = √ KM 2 , when √ KM is an even integer √ KM 2 (KM−1)+1 KM , otherwise. (26)

Proof: Recall that the total number of cluster

representa-tives is equal to the total number of clusters, i.e., KM . Thus, in an area with KM clusters, the largest hop count is equal to

√

KM− 1, as observed from Fig. 3. When√KM is an even

integer, we have

√ KM−1_!

h=1

|V (h)|= the total number of cluster representatives

= KM. (27)

In the Appendix, we derive the total hop counts among KM clusters as follows: √ KM_!−1 h=1 h· |V (h)| = ( √ KM )3 2 . (28)

Combining (27) and (28), we can express the average hop counts for each cluster representative to the sink as

h = √ KM−1 h=1 h· |V (h)| √ KM−1 h=1 |V (h)| = √ KM 2 . (29)

Applying the same technique to the case in which√KM is

an odd integer, one can also prove that

h = √ KM 2 (KM− 1) + 1 KM . (30)

C. Impact of Channel Contention in the MAC Layer

In this section, we extend (22) to incorporate the effect of channel contention in the MAC layer. Denote r as the average retries for transmission in the contention procedures before successfully acquiring a channel. Then, (22) can be extended as

E(i, sink) = (N−1)a_i→j(r+1)·Pe·(h−1)+(Pe+Pt)·h t.

(7)

The MAC parameter r in (31) is related to the channel quality, the distribution of sensor nodes, the number of clusters, and the handshaking scheme. In this paper, the carrier sense multiple access with collision avoidance (CSMA/CA) MAC protocol is employed under an error-free channel. Based on the CSMA/CA protocol of the IEEE 802.11 standard, we derive the value of r as follows:

Let W be the minimum window size. For the maximum backoff stage m, the maximum window size is equal to 2mW .

Denote pc as the collision probability when a packet is

trans-mitted on the channel. Assume that each cluster representative always has data to send. According to [44] and [45], in the context of the IEEE 802.11 MAC protocol, the probability that a cluster representative transmits data in a randomly chosen slot time can be expressed as

τ = 2(1− 2pc)

(1− 2pc)(W + 1) + pcW (1− (2pc)m)

. (32) If there are q− 1 other cluster representatives within the cov-erage area of a cluster representative, the collision probability

pcfor a particular cluster representative can be written as

pc= 1− (1 − τ)q−1. (33)

The value of q can be estimated from the number of sensor nodes covered within the transmission range d of a cluster representative. Recall that d = (5/K)dOAin Section III-B.

Consider the example shown in Fig. 5, where the cluster rep-resentative node i located at (x, y) is inside the center of the square-shaped cluster. There are a total of 25 clusters within the transmission area of cluster representative node i. We denote

Aj as the coverage area of the jth cluster and Bj as the

area in the jth cluster interfered by the transmission of cluster representative node i. Within Aj, only the cluster representative

node in Bjwill be interfered with cluster representative node i.

Let fX(x) and fY(y) be the probability density functions of

the sensors’ locations in the x- and y-axis, respectively. The total average number of contending nodes in 25 clusters can be approximated as follows: q = 25 ! j=1 " " Bjfx(x)fY(y)dxdy " " Ajfx(x)fY(y)dxdy . (34)

In the special case when the distribution of the sensor nodes’ locations (fX(x) and fY(y)) has a self-similar property [46]

(e.g., uniform distribution), it follows that

q = 25· π # 5 KdOA 2 5 √ KdOA 2 = 5π. (35)

Finally, the average retransmission times in the MAC layer can be obtained by substituting (33) into the following equation: r = ∞ ! r=0 r· (1 − pc)· (pc)r. (36)

Fig. 5. Transmission area of the cluster representative node located at the center of the cluster. A total of 25 clusters are covered. In this example, A2

is the coverage area of the vertical lines, and B2is the coverage area of the

horizontal lines.

Note that τ and pc can iteratively be solved from (32)

and (33).

In a more general case when the locations of the sensors are not self similar, q can be a function of K. In this case, r is also a function of K, as described in the constraint in (1). Moreover, if the nonsaturated traffic is considered, τ and pc

must be reevaluated according to the results in the current literature [47]–[51]. Hence, a new average retransmission times

r can be derived from (36).

D. Optimal Number of ClustersKp

Without loss of generality, we consider the case in which

√

KM is an even integer. Substituting (6), (7), and (29) into

(22), we have E(i, sink) = M K N R· (r + 1) Pe· √ KM 2 − 1 + Pe+ ηL0 √ 5dOA √ Kd0 n · √ KM 2 t (37)

where r is defined in (36). Taking a differential with respect to

K in (37), we can get ∂E(i, sink)(K) ∂K = M N(r + 1)R· 3√KM 2 − 1 Pe + 3− n 4 ηL0 √ 5dOA √ Kd0 n t. (38)

Because (∂2E(i, sink)(K)/∂2_{K) > 0, (37) is a convex}

function. Hence, we can obtain the optimal number of clusters in a basic OA Kpby finding the root of ∂E(i, sink)(K)/∂K =

(8)

Fig. 6. Simulation scenario.

V. NUMERICALRESULT

In this section, we present numerical results to illustrate the relation of the optimal number of clusters and the related parameters in the PHY, MAC, and NET layers. Furthermore, the analytical results are validated through some simulation tests.

A. Simulation Setup

In the experiment, we consider a 16 m × 16 m sensor network with 16 basic OAs (M = 16). The sensor nodes are uniformly distributed with four kinds of density, as shown in Fig. 6, where the densities of the sensor nodes are 90, 270, 450, and 630 nodes per OA in regions 1, 2, 3, and 4, respec-tively. Hence, we have N = 5760. Furthermore, we first assume that only one set of path loss parameters is needed to obtain the relation between the energy consumption and the number of clusters K. We will discuss the impact of the two-slope path loss model in Section V-D. Referring to [52], we adopt the following system parameters: R = 5.76× 106, t = 10 ms,

Pe= 5 mW, η = 12.43 pW, d0= 0.2 m, and L0= 52 dB in

(8) and (37).

B. PHY/MAC Layer

Fig. 7 shows the energy consumption per sensor node against the number of clusters per basic OA according to (8) and the simulation results. As shown, the proposed analytical model can match the simulation results quite well. Furthermore, the optimal number of clusters (which is denoted by the circles in the figure) in a basic OA is Kl= 5, 30, and 81 for n = 3, 4, and

5, respectively. Thus, a sensor network in an environment with a smaller path loss exponent prefers fewer clusters in a basic OA. Second, when K < Kl, Elink decreases as K increases,

because a smaller K makes the distance between adjacent clus-ter representatives longer. For n = 4, the energy consumption per sensor node for the optimal Kl= 30 is reduced to 16.4 mJ,

compared with 242.6 mJ for K = 1. In this example, the optimal Klcan reduce the energy consumption by 93%. Last,

when K > Kl, the energy consumption of the sensor node is

proportional to K, because the number of times of being the

Fig. 7. Energy consumption per sensor node for different numbers of clusters

K and path loss exponents n, where σ = 0 dB. The circle represents the

optimal number of clusters.

Fig. 8. Impact of dOAand σ on the optimal number of clusters Kl, where

θ = 0.1.

cluster representative increases. Compared with K = 60, the optimal Kl= 30 can decrease the energy consumption from

20.7 to 16.4 mJ, which is equivalent to an energy reduction of 20%.

Fig. 8 shows the impact of the edge length of the basic OA

dOA on the optimal number of clusters Kl, with σ = 2 and

4 dB. In this example, θ = 0.1. We find that, as σ increases,

Klincreases. To overcome more serious shadowing, a shorter

transmission distance is preferred, thereby requiring more clus-ters. Furthermore, the network with fewer basic OAs (or larger

dOA) prefers to have more clusters in a basic OA, because

more clusters can reduce the transmission distance. Finally, the impact of the shadowing effect and dOA on Kl is more

significant for a larger n than a smaller n. One can explain this phenomenon from (18). A larger n may amplify the shadowing effect on the optimal number of clusters.

Fig. 9 shows the percentage of energy reduction of the system using the optimal Kl, compared with the system using

(9)

Fig. 9. Energy saving percentage for the optimal Kl, compared with K = 1,

with the outage probability requirement θ = 0.1 for different shadowing standard deviations.

K = 1 for different shadowing standard deviations. As shown

in the figure, the energy saving percentage (1− (Elink(K = Kl)/Elink(K = 1)) increases as path loss exponent n and

shadowing standard deviation σ increase. For σ = 3 dB, the optimal Klcan provide energy savings of 48%, 83%, and 96%

for path loss exponents n = 3, 3.5, and 4, respectively.

C. PHY/MAC/NET Layer

Fig. 10 shows the relation between the number of clusters

K and the total energy consumption of all the sensor nodes

in the routing path from a PHY/MAC/NET cross-layer per-spective. From this figure, the proposed analytical model can also match the simulation results quite well. Furthermore, when the cluster representatives share the channel without contention (i.e., r = 0), the optimal numbers of clusters Kpin a basic OA

from the PHY/MAC/NET cross-layer perspectives are 1, 17, and 59 for n = 3, 4, and 5, respectively. Comparing Figs. 7 and 10, we notice that Kp< Kl. This is because Kpconsiders all

the PHY/MAC/NET effects. Specifically, fewer clusters yield fewer hop counts to the data sink and result in less energy consumption in the NET layer aspect. Moreover, for a channel with higher collision probability, e.g., r = 0.6545, the total energy consumption increases.

D. Impact of Two-Slope Path Loss Models

Now, we further relax the assumption in Figs. 7–10 when only one set of path loss parameters is needed to obtain the re-lation between the energy consumption and the number of clus-ters K. Now, we consider when two sets of path loss parameclus-ters are required to obtain K based on (4). Referring to [36], choose (n1= 3, L01 = 52 dB) and (n2= 4, L02 = 44 dB) as the two sets of parameters, respectively, and dt= 0.95 m. Hence, if

K≥ (5dOA/d2t) = 22, d≤ dt, the first set of path loss

param-eters (n1= 3, L01= 52 dB) will be chosen; otherwise, the second set of path loss parameters (n2= 4, L02 = 44 dB) will be selected. Fig. 11 shows the average energy consumption

Fig. 10. Energy consumption with h hops in different K and n without shadowing. The circle represents the optimal number of clusters.

Fig. 11. Energy consumption per sensor node for different numbers of clusters K. The circle represents the optimal number of clusters.

versus K for this case. In the figure, the optimal number of clusters in each basic OA is 16. Note that K will affect the separation distance between two cluster representatives in neighboring clusters. Thus, K will also influence which set of parameters will be selected in the two-slope path loss model. Specifically, a shorter separation distance (or larger K) is more likely to choose a smaller path loss exponent n, which results in better signal strength. That is, from the PHY layer perspective, more clusters in a basic OA are preferred. Compared with

Kl= 5 for n = 3 using the one-slope path loss model in Fig. 7,

the two-slope path model leads to a larger value of Kl.

VI. CONCLUSION

In this paper, we have presented an analytical approach to determine the optimal number of clusters in a basic OA from the PHY/MAC/NET cross-layer perspectives. Specifically, this cross-layer analytical model integrates the effects of the trans-mission distance, power, and shadowing in the PHY layer; the

(10)

possibility of being a cluster representative; and the retransmis-sion times in the MAC layer, as well as the number of hops in the NET layer. The closed-form expressions for the optimal number of clusters in a basic OA with various shadowing and path loss conditions are presented. We demonstrate that the sug-gested analytical optimal number of clusters can significantly improve the energy consumption. With its flexibility, the pro-posed cross-layer analytical model can facilitate the design of the optimal number of clusters for a sensor network in different radio environments. Furthermore, our simulation results show the existence of the optimal cluster number, regardless of the different densities of sensors in various OAs.

One interesting research topic that can be extended from this paper is the application of a similar methodology to determine the optimal number of clusters based on other MAC and routing strategies with different values of r and h. Furthermore, it is also worthwhile to investigate how to determine the optimal number of clusters when data aggregation and a random traffic model in the upper protocol layers are considered.

APPENDIX

PROOF OF(28)INTHEOREM1 Let Z =√KM /2. Then, we have

√ KM!−1 h=1 h· v(h) = Z ! h=1 h· 4h + 2Z!−1 h=Z+1 [h· 4(2Z − h)] = 4 Z ! h=1 h2+ 8Z 2Z_!−1 h=Z+1 h− 4 2Z_!−1 h=Z+1 h2 = 8 Z ! h=1 h2+ 8Z 2Z!−1 h=Z+1 h− 4 2Z!−1 h=1 h2. (39) Now, we apply the expression for the sum of the series and can derive √ KM_!−1 h=1 h· v(h)=8 6Z(Z + 1)(2Z + 1) + 8Z[(Z + 1) + (2Z− 1)] · (Z − 1) 2 −4 6(2Z− 1) [(2Z − 1) + 1] [2(2Z − 1)+ 1] = 4Z3 =( √ KM )3 2 . (40) REFERENCES

[1] L.-C. Wang and C.-W. Wang, “A cross-layer design of clustering archi-tecture for wireless sensor networks,” in Proc. IEEE ICNSC, Mar. 2004, pp. 547–552.

[2] Y. Xu, J. Heidemann, and D. Estrin, ”Geography-informed energy con-servation for Ad Hoc routing,” in Proc. ACM/IEEE Int. Conf. Mobile

Comput. Networking, Jul. 2001, pp. 70-84.

[3] A. Cerpa and D. Estrin, “ASCENT: Adaptive self-configuring sensor net-works topologies,” IEEE Trans. Mobile Comput., vol. 3, no. 3, pp. 272– 285, Jul./Aug. 2004.

[4] A. Jindal and K. Psounis, “A clustering method that uses lossy aggregation of data,” in Proc. ACM Sensys, Nov. 2004, pp. 269–270.

[5] K. Dasgupta, K. Kalpakis, and P. Namjoshi, “An efficient clustering-based heuristic for data gathering and aggregation in sensor networks,” in Proc.

WCNC, 2003, vol. 3, pp. 1948–1953.

[6] F. Zhao, J. Liu, J. Liu, L. Guibas, and J. E. Reich, “Collaborative signal and information processing: An information-directed approach,” Proc.

IEEE, vol. 91, no. 8, pp. 1199–1209, Aug. 2003.

[7] W.-P. Chen, J. C. Hou, and L. Sha, “Dynamic clustering for acoustic target tracking in wireless sensor networks,” IEEE Trans. Mobile Comput., vol. 3, no. 3, pp. 258–271, Jul.–Sep. 2004.

[8] Y. Liu, H. Man, and C. Comaniciu, “A game theoretic approach to effi-cient mixed strategies for intrusion detection,” in Proc. IEEE Int. Conf.

Commun., 2006, pp. 2201-2206.

[9] B. Sinopoli, C. Sharp, L. Schenato, S. Shaffert, and S. S. Sastry, “Distrib-uted control applications within sensor networks,” Proc. IEEE, vol. 91, no. 8, pp. 1235–1246, Aug. 2003.

[10] A. D’Costa, V. Ramachandran, and A. M. Sayeed, “Distributed classifica-tion of Gaussian space-time sources in wireless sensor networks,” IEEE

J. Sel. Areas Commun., vol. 22, no. 6, pp. 1026–1036, Aug. 2004.

[11] M. C. Vuran and I. F. Akyildiz, “Spatial correlation-based collaborative medium access control in wireless sensor networks,” IEEE/ACM Trans.

Netw., vol. 14, no. 2, pp. 316–329, Apr. 2006.

[12] S. Yoon and C. Shahabi, “The Clustered AGgregation (CAG) technique leveraging spatial and temporal correlations in wireless sensor networks,”

ACM Trans. Sens. Netw., vol. 3, no. 1, p. 3, Mar. 2007.

[13] H. Gupta, V. Navda, S. R. Das, and V. Chowdhary, “Efficient gathering of correlated data in sensor networks,” in Proc. ACM Int. Symp. Mobile Ad

Hoc Netw. Comput., 2005, pp. 405–407.

[14] G. Maierbacher and J. Barros, “Source-optimized clustering for distrib-uted source coding,” in Proc. GLOBECOM, 2006, pp. 1–5.

[15] E. J. Duarte-Melo and M. Liu, “Energy efficiency in many-to-one com-munications in wireless networks,” in Proc. Midwest Symp. Circuits Syst., Aug. 2002, pp. 615–618.

[16] F. G. Nocetti, J. S. Gonzalez, and I. Stojmenovic, “Connectivity based k-hop clustering in wireless networks,” Telecommun. Syst., vol. 22, no. 1–4, pp. 205–220, 2003.

[17] S. Basagni, M. Mastrogiovanni, A. Panconesi, and C. Petrioli, “Localized protocols for ad hoc clustering and backbone formation: A performance comparison,” IEEE Trans. Parallel Distrib. Syst., vol. 17, no. 4, pp. 292– 306, Apr. 2006.

[18] W. B. Heinzelman, A. Chandrakasan, and H. Balakrishnan, “An application-specific protocol architecture for wireless microsensor net-works,” IEEE Trans. Wireless Commun., vol. 1, no. 4, pp. 660–670, Oct. 2002.

[19] S. Bandyopadhyay and E. J. Coyle, “Minimizing communication costs in hierarchically-clustered networks of wireless sensors,” Comput. Netw., vol. 44, no. 1, pp. 1–16, Jan. 2004.

[20] A. Depedri, A. Zanella, and R. Verdone, “An energy efficient protocol for wireless ad hoc sensor networks,” in Proc. IEEE AINS, Jun./Jul. 2003, pp. 1–6.

[21] E. J. Duarte-Melo and M. Liu, “Analysis of energy consumption and life-time of heterogeneous wireless sensor networks,” in Proc. GLOBECOM, Nov. 2002, pp. 21–25.

[22] V. Mhatre and C. Rosenberg, “Design guidelines for wireless sensor networks: Communication, clustering and aggregation,” Ad Hoc Netw., vol. 2, no. 1, pp. 45–63, Jan. 2004.

[23] N. Zhou and A. Abouzeid, “Information-theoretic bounds for mobile ad-hoc networks routing protocols,” in Proc. Int. Conf. Inf. Netw., Oct. 2003, vol. 2662, pp. 651–661.

[24] F. Sivrikaya and B. Yener, “Time synchronization in sensor networks: A survey,” IEEE Netw., vol. 18, no. 4, pp. 45–50, Jul./Aug. 2004.

[25] D. Ganesan, D. Estrin, and J. Heidemann, “DIMENSIONS: Why do we need a new data handling architecture for sensor networks?” ACM

SIGCOMM Comput. Commun. Rev., vol. 33, no. 1, pp. 143–148,

Jan. 2003.

[26] J. N. Al-Karaki, A. E. Kamal, and R. Ul-Mustafa, “On the optimal cluster-ing in mobile ad hoc networks,” in Proc. Consum. Commun. Netw. Conf., Jan. 2004, pp. 71–76.

[27] S. Pattem and B. Krishnamachari, “The impact of spatial correlation on routing with compression in wireless sensor networks,” ACM Trans. Sens.

Netw., vol. V, no. N, pp. 1–31, Sep. 2006.

[28] M. Enachescu, A. Goel, R. Govindan, and R. Motwani, “Scale-free aggre-gation in sensor networks,” Theor. Comput. Sci., vol. 344, no. 1, pp. 15– 29, Nov. 2005.

(11)

[29] A. Scaglione and S. Servetto, “On the interdependence of routing and data compression in multi-hop sensor networks,” Wirel. Netw., vol. 11, no. 1/2, pp. 149–160, Jan. 2005.

[30] Principles of Wireless Networks: A Unified Approach, K. Pahlavan and P. Krishnamurthy, Eds. Englewood Cliffs: Prentice–Hall, 2002. [31] L.-C. Wang, C.-W. Wang, and C.-M. Liu, “Adaptive contention

window-based cluster head election mechanism for wireless sensor networks,” in

Proc. IEEE VTC, Sep. 2005, vol. 3, pp. 1819–1823.

[32] W. R. Heinzelman, A. Chandrakasan, and H. Balakrishnan, “Energy-efficient communication protocol for wireless microsensor networks,” in

Proc. IEEE Annu. Hawaii Int. Conf. Syst. Sci., Jan. 2000, pp. 3005–3014.

[33] E. Shih, S. Cho, F. S. Lee, B. H. Calhoun, and A. Chandrakasan, “Design considerations for energy-efficient radios in wireless microsensor net-works,” J. VLSI Signal Process., vol. 37, no. 1, pp. 77–94, 2004. [34] A. Sawides, S. Park, and M. B. Srivastava, “On modeling networks of

wireless microsensors,” in Proc. ACM SIGMETRICS Int. Conf. Meas.

Model. Comput. Syst., 2001, pp. 318–319.

[35] [Online]. Available: http://www.xbow.com/

[36] A. Martinez-Sala, J.-M. Molina-Garcia-Pardo, E. Egea-Lopez, J. Vales-Alonso, L. Juan-Llacer, and J. Garcia-Haro, “An accurate radio channel model for wireless sensor networks simulation,” J. Commun. Netw., vol. 7, no. 4, pp. 401–407, Dec. 2005.

[37] Wireless Communications: Principles and Practice, 2nd ed. T. S. Rappaport, Ed. Englewood Cliffs, NJ: Prentice–Hall, 2002.

[38] J. Zhao and R. Govindan, “Understanding packet delivery performance in dense wireless sensor networks,” in Proc. Conf. Embedded Netw. Sens.

Syst., 2003, pp. 1–13.

[39] S. Willis and C. Kikkert, “Radio propagation model for long-range ad hoc wireless sensor network,” in Proc. Int. Conf. Wirel. Netw., Commun.

Mobile Comput., Jun. 2005, vol. 1, pp. 826–832.

[40] A. Cerpa, J. L. Wong, L. Kuang, M. Potkonjak, and D. Estrin, “Statistical model of lossy links in wireless sensor networks,” in Proc. Inf. Process.

Sens. Netw., Apr. 2005, pp. 81–88.

[41] G. Zhou, T. He, S. Krishnamurthy, and J. A. Stankovic, “Impact of radio irregularity on wireless sensor networks,” in Proc. Int. Conf. Mobile Syst., 2004, pp. 125–138.

[42] M. Zuniga and B. Krishnamachari, “Analyzing the transitional region in low power wireless links,” in Proc. IEEE Sens. Ad Hoc Commun. Netw.

Conf., Oct. 2004, pp. 517–526.

[43] S. Lindsey and C. Raghavendra, “PEGASIS: Power-efficient gathering in sensor information systems,” in Proc. IEEE Aerosp. Conf., Mar. 2002, pp. 9–16.

[44] G. Bianchi, “Performance analysis of the IEEE 802.11 distributed coordi-nation function,” IEEE J. Sel. Areas Commun., vol. 18, no. 3, pp. 535–547, Mar. 2000.

[45] Y. C. Tay and K. C. Chua, “A capacity analysis for the IEEE 802.11 MAC protocol,” Wirel. Netw., vol. 7, no. 2, pp. 159–171, Mar./Apr. 2001. [46] Real Analysis: Measure Theory, Integration, and Hilbert Spaces,

E. M. Stein and R. Shakarchi, Eds. Princeton, NJ: Princeton Univ. Press, 2005.

[47] K. Duffy, D. Malone, and D. J. Leith, “Modeling the 802.11 distributed coordination function in non-saturated conditions,” IEEE Commun. Lett., vol. 9, no. 8, pp. 715–717, Aug. 2005.

[48] P. E. Engelstad and O. N. Osterbo, “Non-saturation and saturation analysis of IEEE 802.11e EDCA with starvation prediction,” in Proc. Int.

Work-shop Model. Anal. Simul. Wirel. Mobile Syst., 2005, vol. 9, pp. 224–233.

[49] O. Abu-Sharkh and A. H. Tewfik, “Performance analysis of multi-rate 802.11 WLANs under finite load and saturation conditions,” in Proc.

IEEE VTC, 2005, pp. 2652–2657.

[50] M. Ergen and P. V. Baris Dundar, “Throughput analysis of an extended service set in IEEE 802.11,” in Proc. GLOBECOM, Nov. 2004, vol. 2, pp. 1040–1045.

[51] P. C. Ng and S. C. Liew, “Offered load control in IEEE 802.11 multi-hop ad-hoc networks,” in Proc. MASS, Oct. 2004, pp. 80–89.

[52] C. Buratti, A. Giorgetti, and R. Verdone, “Simulation of an energy effi-cient carrier sensing multiple access protocol for clustered wireless sensor networks,” in Proc. IWWAN, May/Jun. 2004, pp. 325–329.

Li-Chun Wang (S’92–M’96–SM’06) received the

B.S. degree from National Chiao-Tung University, Hsinchu, Taiwan, in 1986, the M.S. degree from National Taiwan University, Taipei, Taiwan, in 1988, and the M.Sc. and Ph.D. degrees from Georgia Institute of Technology, Atlanta, in 1995 and 1996, respectively, all in electrical engineering.

In 1995, he was with Northern Telecom, Richardson, TX. From 1996 to 2000, he was with the Wireless Communications Research Department, AT&T Laboratories, where he was a Senior Techni-cal Staff Member. In August 2000, he joined the Department of Communication Engineering, National Chiao-Tung University, as an Associate Professor. In August 2005, he was promoted to Full Professor. He is the holder of three U.S. patents, with three more pending. His current research interests include cellular architectures, radio network resource management, and cross-layer optimization for cooperative and cognitive wireless networks.

Dr. Wang was a corecipient of the Jack Neubauer Best Paper Award from the IEEE Vehicular Technology Society in 1997.

Chung-Wei Wang (S’07) received the B.S. degree

in electrical engineering from Tamkang University, Taipei, Taiwan, in 2003. He is currently working toward the Ph.D. degree with the Department of Communication Engineering, National Chiao Tung University, Hsinchu, Taiwan.

His current research interests include cross-layer optimization and MAC protocol design in wireless sensor networks, ad hoc networks, and cognitive radio networks.

Chuan-Ming Liu (M’03) received the B.S. and

M.S. degrees in applied mathematics from National Chung-Hsing University, Taichung, Taiwan, in 1992 and 1994, respectively, and the Ph.D. degree in computer sciences from Purdue University, West Lafayette, IN, in 2002.

He is currently an Associate Professor with the Department of Computer Science and Information Engineering, National Taipei University of Tech-nology, Taipei, Taiwan. His research interests include parallel and distributed computation, data manage-ment and data dissemination in wireless environmanage-ments, ad hoc and sensor networks, and analysis and design of algorithms.

Prof. Liu has been a Member of the Upsilon Pi Epsilon Honor Society in Computer Science since 1998.