Hazelcast broken node detection

Question

Hazelcast broken node detection

Here is my simplified use case:

I have two built-in nodes, each of which runs in its own JVM, on the same physical machine. I run them and they form a simple cluster.
both nodes try to get the same lock
the first, to get a lock, holds it for 30 seconds.
If I kill a node that holds the lock, the cluster needs something between 5 and 10 seconds to conclude that the node is dead and release the lock

My question is: can this interval between killing a node that locks the lock and the cluster actually free the lock? I need it to be less than 1 second.

I tried some of the available properties that seemed to be related to this problem:

hazelcast.socket.connect.timeout.seconds
hazelcast.client.heartbeat.timeout
hazelcast.client.invocation.timeout.seconds

None of this helped; I did not notice a change in the behavior of the lock.

Update:

These two seem to be correct:

<property name="hazelcast.socket.connect.timeout.seconds">1</property>
<property name="hazelcast.connection.monitor.max.faults">1</property>

I have yet to find out if this will cause stability problems in a real use scenario. In this simple test, it works quite well.

+4

java concurrency hazelcast

stuhpa Feb 17 '16 at 15:15

source share

No one has answered this question yet.

See related questions:

7

Hazelcast Special Hosts

1

Hazelcast ITopic Catalogs and Listeners

1

Using (preferably Hazelcast) distributed synchronization primitives to efficiently synchronize access to a MySQL table

1

How does a pessimistic lock last indefinitely to keep track of who owns the cluster?

1

Change configuration between two Hazelcast Nodes behavior?

1

Hazelcast Distributed Lock with iMap

1

Hazelcast Large Cluster Resiliency

0

Cluster synchronization problem

0

Lock is disabled, causes lock in Neo4j

0

Hazelcast hosts refusing to join each other

Hazelcast broken node detection

More articles: