Firefly-inspired Heartbeat Synchronization in Overlay Networks


Ozalp Babaoglu, Univ. Bologna, Italy, babaoglu@cs.unibo.it
Toni Binci, Univ. Bologna, Italy, bincit@cs.unibo.it
Márk Jelasity, HAS & Univ. Szeged, Hungary, jelasity@inf.u-szeged.hu
Alberto Montresor, Univ. Trento, Italy, montresor@dit.unitn.it

∗ Authors are listed in alphabetical order. Partial support for this work was provided by the European Union within the 6th Framework Programme under contracts 001907 (DELIS) and 27748 (BIONETS).

First International Conference on Self-Adaptive and Self-Organizing Systems (SASO 2007)
0-7695-2906-2/07 $25.00 © 2007

Abstract

Heartbeat synchronization strives to have nodes in a distributed system generate periodic, local “heartbeat” events approximately at the same time. Many useful distributed protocols rely on the existence of such heartbeats for driving their cycle-based execution. Yet, solving the problem in environments where nodes are unreliable and messages are subject to delays and failures is non-trivial. We present a heartbeat synchronization protocol for overlay networks inspired by mathematical models of flash synchronization in certain species of fireflies. In our protocol, nodes send flash messages to their neighbors when a local heartbeat triggers. They adjust the phase of their next heartbeat based on incoming flash messages using an algorithm inspired by mathematical models of firefly synchronization. We report simulation results of the protocol in various realistic failure scenarios typical in overlay networks and show that synchronization emerges even when messages can have significant delay subject to large jitter.

1. Introduction

In cycle- or round-based distributed protocols (such as gossip protocols), it is often necessary that all nodes agree on when a new cycle starts. In other words, the local perceptions at nodes as to when cycles begin and end need to be synchronized so that we can talk about “cycles” of the system as a whole. For example, if the protocol requires periodic restarts (that is, all nodes need to be re-initialized), it is important that this event be synchronized [7, 9].

Heartbeat synchronization strives to have nodes in a distributed system generate periodic, local “heartbeat” events approximately at the same time. It differs from classical clock synchronization in that nodes are not interested in counting cycles and agreeing on the ID of the current cycle. Furthermore, there is no requirement regarding the length of a cycle with respect to real time as long as the length is bounded and all nodes agree on it eventually. What we are interested in guaranteeing is that all nodes start and end their cycles at the same time, with an error that is at least one, but preferably more, orders of magnitude smaller than the chosen cycle length.

This problem is rather difficult to solve in peer-to-peer overlay networks due to dynamism, failures and scale. In overlay networks, message delay can vary over a wide range [10] and churn can be significant with nodes leaving and joining the network continuously. In addition, overlay networks can be extremely large, containing millions of nodes. This implies that any proposed solution must be highly scalable. And finally, the solution needs to be decentralized for it to be usable in overlay networks where nodes have only partial information regarding the system as a whole.

Our approach to achieving robust, scalable and decentralized heartbeat synchronization is based on biological inspiration drawn from the flashing of fireflies. It is well known that in certain firefly species, male members gather in large numbers at dusk and are able to synchronize their flashes such that eventually the entire swarm flashes in unison. What is surprising is that global synchronization emerges despite the fact that each member can observe only some small neighborhood of the swarm and can modify its own flashing behavior based on this limited local information. Several mathematical models have been proposed to explain this phenomenon (see for example [13] and references therein). Decentralized synchronization protocols based on such models have been suggested before in the context of wireless sensor networks [16]. To our knowledge, mathematical models of firefly synchronization have not been applied to solve problems in large-scale overlay networks.

    loop
        wait until φ = 1
        P ← selectPeerList()
        send flash to all peers in P
    end loop
    (a) active thread

    loop
        receive flash
        processFlash()
    end loop
    (b) passive thread

Figure 1. The skeleton of the heartbeat synchronization protocol.
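As a rough illustration of the skeleton in Figure 1, the following Python sketch runs the active and passive threads of a few nodes in lock-step. Each node carries a phase φ that grows linearly from 0 to 1 over a cycle of length ∆, flashes when φ reaches 1, and then resets. The names `Node`, `select_peer_list`, `nudge_phase` and `simulate` are hypothetical. Instant, loss-free message delivery is a simplifying assumption. `nudge_phase` is only a placeholder for processFlash, not the adaptive model of [3].

```python
import random
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Node:
    node_id: int
    delta: float                # cycle length ∆, in time units
    phase: float = 0.0          # phase φ in [0, 1]
    view: List[int] = field(default_factory=list)  # partial view: ids of known peers
    flashes_sent: int = 0

    def advance(self, dt: float) -> bool:
        """Sawtooth dynamics dφ/dt = 1/∆; returns True when the node flashes."""
        self.phase += dt / self.delta
        if self.phase >= 1.0:
            self.phase = 0.0    # reset after the flash
            self.flashes_sent += 1
            return True
        return False


def select_peer_list(node: Node, rng: random.Random, fanout: int = 2) -> List[int]:
    """Hypothetical peer selection: a uniform random sample from the view."""
    return rng.sample(node.view, min(fanout, len(node.view)))


def nudge_phase(node: Node, eps: float = 0.1) -> None:
    """Placeholder processFlash (an assumption, not the model of [3]):
    advance the receiver's phase by a fixed amount, capped at 1."""
    node.phase = min(1.0, node.phase + eps)


def simulate(nodes: List[Node], process_flash: Callable[[Node], None],
             steps: int, dt: float, rng: random.Random) -> None:
    """Run both threads in lock-step; flashes arrive instantly and are never lost."""
    for _ in range(steps):
        # Active thread: every node whose phase reached 1 flashes.
        flashing = [n for n in nodes if n.advance(dt)]
        for sender in flashing:
            for peer_id in select_peer_list(sender, rng):
                # Passive thread: the receiver reacts to the flash.
                # Assumes node_id equals the node's index in `nodes`.
                process_flash(nodes[peer_id])


if __name__ == "__main__":
    rng = random.Random(42)
    nodes = [Node(i, delta=1.0, phase=rng.random(),
                  view=[j for j in range(5) if j != i]) for i in range(5)]
    simulate(nodes, nudge_phase, steps=200, dt=0.01, rng=rng)
    print([round(n.phase, 2) for n in nodes])
```

Passing the flash handler as a parameter mirrors the way the skeleton leaves processFlash open: a different update rule can be swapped in without touching the two loops.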
The main contribution of the paper is twofold. First, in Section 3 we introduce a novel protocol for heartbeat synchronization in overlay networks that is based on an adaptive mathematical model of emergent synchronization of firefly flashing [3]. Second, in Section 4 we present extensive large-scale event-based simulation studies of the protocol in realistic scenarios involving message loss and delay, and demonstrate that the protocol can indeed achieve synchronization to a sufficient degree.

2. System Model

We assume that nodes are connected through an existing routed network, such as the Internet, where every node can potentially communicate with every other node. To actually communicate, a node has to know the address of another node. This is achieved by maintaining a partial view (view for short) at each node that contains a set of node descriptors. Views can be interpreted as sets of edges between nodes, naturally defining a directed graph over the nodes that determines the topology of an overlay network.

The network is highly dynamic; new nodes may join at any time, and existing nodes may leave, either voluntarily or by crashing. Our approach does not require any mechanism specific to leaves: spontaneous crashes and voluntary leaves are treated uniformly. Thus, in the following, we limit our discussion to node crashes. Byzantine failures, with nodes behaving arbitrarily, are excluded from the present discussion.

Communication incurs unpredictable delays and is subject to failures. Single messages may be lost, and links between pairs of nodes may break. Nodes have access to local clocks that can measure the passage of real time with reasonable accuracy, that is, with small short-term drift.

3. The Synchronization Protocol

In this section we present an abstract protocol skeleton for firefly-inspired heartbeat synchronization and overview some of its possible instantiations based on different mathematical models of firefly flashing behavior. We briefly discuss the behavior of each model and argue that the most promising one is the adaptive model described in [3]. This model will be analyzed experimentally in Section 4.

3.1. The Protocol Skeleton

The protocol skeleton is shown in Figure 1. We assume that each node is an oscillator that can be characterized by its phase, φ, and the cycle length, ∆. The phase is a variable in the interval [0, 1] and its dynamics are defined by a sawtooth function of time t, where we have ∂φ/∂t = 1/∆, such that the phase increases linearly from 0 to 1 in ∆ time units. When the phase reaches 1, the node emits a “flash”, which results in a ping message being sent to a set of peer nodes. Subsequently, the phase is reset to 0. The cycle length ∆ can be initially different (or identical) at all nodes, depending on the implementation of processFlash under consideration.

When the node receives a flash, method processFlash is executed. This method is the heart of the synchronization algorithm. It is responsible for updating φ and possibly also ∆. That is, it can delay or advance the