|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object org.jgroups.stack.Protocol org.jgroups.protocols.FD
public class FD
Failure detection based on simple heartbeat protocol. Regularly polls members for liveness. Multicasts SUSPECT messages when a member is not reachable. The simple algorithms works as follows: the membership is known and ordered. Each HB protocol periodically sends an 'are-you-alive' message to its *neighbor*. A neighbor is the next in rank in the membership list, which is recomputed upon a view change. When a response hasn't been received for n milliseconds and m tries, the corresponding member is suspected (and eventually excluded if faulty).
FD starts when it detects (in a view change notification) that there are at least 2 members in the group. It stops running when the membership drops below 2.
When a message is received from the monitored neighbor member, it causes the pinger thread to 'skip' sending the next are-you-alive message. Thus, traffic is reduced.
Nested Class Summary | |
---|---|
protected class |
FD.Broadcaster
Task that periodically broadcasts a list of suspected members to the group. |
protected class |
FD.BroadcastTask
|
static class |
FD.FdHeader
|
protected class |
FD.Monitor
Task which periodically checks of the last_ack from ping_dest exceeded timeout and - if yes - broadcasts a SUSPECT message |
Field Summary | |
---|---|
protected FD.Broadcaster |
bcast_task
Transmits SUSPECT message until view change or UNSUSPECT is received |
protected long |
last_ack
|
protected Address |
local_addr
|
protected java.util.concurrent.locks.Lock |
lock
|
protected int |
max_tries
|
protected java.util.List<Address> |
members
|
protected java.util.concurrent.Future<?> |
monitor_future
|
protected int |
num_heartbeats
|
protected int |
num_suspect_events
|
protected java.util.concurrent.atomic.AtomicInteger |
num_tries
|
protected Address |
ping_dest
|
protected java.util.List<Address> |
pingable_mbrs
Members from which we select ping_dest. |
protected BoundedList<Address> |
suspect_history
|
protected long |
timeout
|
protected TimeScheduler |
timer
|
Fields inherited from class org.jgroups.stack.Protocol |
---|
down_prot, ergonomics, id, log, name, stack, stats, up_prot |
Constructor Summary | |
---|---|
FD()
|
Method Summary | |
---|---|
protected void |
computePingDest(Address remove)
Computes pingable_mbrs (based on the current membership and the suspected members) and ping_dest |
java.lang.Object |
down(Event evt)
An event is to be sent down the stack. |
int |
getCurrentNumTries()
|
java.lang.String |
getLocalAddress()
|
int |
getMaxTries()
|
java.lang.String |
getMembers()
|
int |
getNumberOfHeartbeatsSent()
|
int |
getNumSuspectEventsGenerated()
|
java.lang.String |
getPingableMembers()
|
java.lang.String |
getPingDest()
|
protected Address |
getPingDest(java.util.List<Address> mbrs)
|
long |
getTimeout()
|
void |
init()
Called after instance has been created (null constructor) and before protocol is started. |
boolean |
isMonitorRunning()
|
java.lang.String |
printSuspectHistory()
|
void |
resetStats()
|
protected void |
sendHeartbeatResponse(Address dest)
|
void |
setMaxTries(int max_tries)
|
void |
setTimeout(long timeout)
|
void |
startFailureDetection()
|
protected void |
startMonitor()
Requires lock to held by caller |
void |
stop()
This method is called on a Channel.disconnect() . |
void |
stopFailureDetection()
|
protected void |
stopMonitor()
Requires lock to be held by caller |
protected void |
unsuspect(Address mbr)
|
java.lang.Object |
up(Event evt)
An event was received from the layer below. |
protected void |
updateTimestamp(Address sender)
|
Methods inherited from class org.jgroups.stack.Protocol |
---|
destroy, dumpStats, enableStats, getConfigurableObjects, getDownProtocol, getDownServices, getId, getIdsAbove, getLevel, getName, getProtocolStack, getSocketFactory, getThreadFactory, getTransport, getUpProtocol, getUpServices, getValue, isErgonomics, printStats, providedDownServices, providedUpServices, requiredDownServices, requiredUpServices, resetStatistics, setDownProtocol, setErgonomics, setId, setLevel, setProtocolStack, setSocketFactory, setUpProtocol, setValue, setValues, start, statsEnabled |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
protected long timeout
protected int max_tries
protected int num_heartbeats
protected int num_suspect_events
protected final BoundedList<Address> suspect_history
protected Address local_addr
protected volatile long last_ack
protected final java.util.concurrent.atomic.AtomicInteger num_tries
protected final java.util.concurrent.locks.Lock lock
protected volatile Address ping_dest
protected final java.util.List<Address> members
protected final java.util.List<Address> pingable_mbrs
members
minus the suspected members
protected TimeScheduler timer
protected java.util.concurrent.Future<?> monitor_future
protected final FD.Broadcaster bcast_task
Constructor Detail |
---|
public FD()
Method Detail |
---|
public java.lang.String getLocalAddress()
public java.lang.String getMembers()
public java.lang.String getPingableMembers()
public java.lang.String getPingDest()
public int getNumberOfHeartbeatsSent()
public int getNumSuspectEventsGenerated()
public long getTimeout()
public void setTimeout(long timeout)
public int getMaxTries()
public void setMaxTries(int max_tries)
public int getCurrentNumTries()
public java.lang.String printSuspectHistory()
public void resetStats()
resetStats
in class Protocol
public void init() throws java.lang.Exception
Protocol
init
in class Protocol
java.lang.Exception
- Thrown if protocol cannot be initialized successfully. This will cause the
ProtocolStack to fail, so the channel constructor will throw an exceptionpublic void stop()
Protocol
Channel.disconnect()
. Stops work (e.g. by closing multicast socket).
Will be called from top to bottom. This means that at the time of the method invocation the
neighbor protocol below is still working. This method will replace the
STOP, STOP_OK, CLEANUP and CLEANUP_OK events. The ProtocolStack guarantees that
when this method is called all messages in the down queue will have been flushed
stop
in class Protocol
protected Address getPingDest(java.util.List<Address> mbrs)
public void stopFailureDetection()
public void startFailureDetection()
protected void startMonitor()
protected void stopMonitor()
public boolean isMonitorRunning()
public java.lang.Object up(Event evt)
Protocol
down_prot.down()
or c) the event (or another event) is sent up
the stack using up_prot.up()
.
up
in class Protocol
public java.lang.Object down(Event evt)
Protocol
down_prot.down()
. In case of a GET_ADDRESS event (which tries to
retrieve the stack's address from one of the bottom layers), the layer may need to send
a new response event back up the stack using up_prot.up()
.
down
in class Protocol
protected void sendHeartbeatResponse(Address dest)
protected void unsuspect(Address mbr)
protected void updateTimestamp(Address sender)
protected void computePingDest(Address remove)
remove
- The member to be removed from pingable_mbrs
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |