Playing with PostgreSQL and Pgpool: Avoiding session disconnection while fail over

Monday, July 25, 2016

Avoiding session disconnection while fail over

Your client session to Pgpool-II is disconnected whenever a failover or switchover happens. Pretty annoying. This is because Pgpool-II kills all of the child processes that are responsible for the client sessions. Pgpool-II 3.6 will mitigate this when all of the following conditions hold:

1. Pgpool-II operates in streaming replication mode
2. The failed DB node is not the primary (master) node
3. Your "load balance node" is not the failed node

1 & 2 are easy to understand. What about #3?

The load balance node is chosen when you connect to Pgpool-II: Pgpool-II assigns one of the DB nodes as the target for the session's read-only queries. The choice depends on several pgpool.conf settings:

load_balance_mode (of course this should be "on")
"backend_weight" parameters (for example "backend_weight1" for node 1)
database_redirect_preference_list
app_name_redirect_preference_list
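
To make the role of the weights concrete, here is a minimal sketch of weighted node selection. It is an illustration only, not Pgpool-II's actual code: the function names are hypothetical, and it ignores database_redirect_preference_list and app_name_redirect_preference_list entirely. It assumes the weights are normalized so that they sum to 1 (three equal weights give 0.333333 each) and that a node with weight 0 is never picked.

```python
import random


def normalize_weights(backend_weights):
    """Scale raw weights so that they sum to 1 (hypothetical helper)."""
    total = sum(backend_weights)
    if total <= 0:
        raise ValueError("at least one backend weight must be positive")
    return [w / total for w in backend_weights]


def choose_load_balance_node(backend_weights, rng=random):
    """Pick a node id at random, in proportion to its weight.

    Simplified assumption: only the weights matter. A node whose
    weight is 0 gets probability 0 and is therefore never returned.
    """
    probabilities = normalize_weights(backend_weights)
    node_ids = list(range(len(probabilities)))
    return rng.choices(node_ids, weights=probabilities, k=1)[0]


if __name__ == "__main__":
    print(normalize_weights([1, 1, 1]))
    # With node 1 weighted 0, only node 0 or node 2 can be chosen.
    print(choose_load_balance_node([1, 0, 1]))
```

Under this model, setting one node's weight to 0 removes it from the candidates for new sessions without affecting sessions that already have a load balance node.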

The DB node is chosen at an early stage of session establishment, and the assignment does not change until you exit the session. From Pgpool-II 3.6, you can check your load balance node with the "show pool_nodes" command.

$ psql -p 11000 test
test=# show pool_nodes;
 node_id | hostname | port  | status | lb_weight | role    | select_cnt | load_balance_node
---------+----------+-------+--------+-----------+---------+------------+-------------------
 0       | /tmp     | 11002 | 2      | 0.333333  | primary | 0          | false
 1       | /tmp     | 11003 | 2      | 0.333333  | standby | 0          | false
 2       | /tmp     | 11004 | 2      | 0.333333  | standby | 0          | true
(3 rows)

The "load_balance_node" column shows which DB node was chosen as the load balance node for this session. Here it is node 2.
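
If you want to read this column from a script, the psql-style table can be parsed with a few lines of Python. This is a rough sketch that assumes the column layout shown above; the function names and the embedded sample text are for illustration only, and a real script would obtain the output from psql or a database driver.

```python
SAMPLE_OUTPUT = """\
 node_id | hostname | port  | status | lb_weight | role    | select_cnt | load_balance_node
---------+----------+-------+--------+-----------+---------+------------+-------------------
 0       | /tmp     | 11002 | 2      | 0.333333  | primary | 0          | false
 1       | /tmp     | 11003 | 2      | 0.333333  | standby | 0          | false
 2       | /tmp     | 11004 | 2      | 0.333333  | standby | 0          | true
(3 rows)
"""


def parse_pool_nodes(output):
    """Turn the text table into a list of dicts keyed by column name."""
    # Only the header and the data rows contain "|"; the separator line
    # uses "+" and the "(3 rows)" footer has neither.
    lines = [line for line in output.splitlines() if "|" in line]
    header = [cell.strip() for cell in lines[0].split("|")]
    rows = []
    for line in lines[1:]:
        cells = [cell.strip() for cell in line.split("|")]
        rows.append(dict(zip(header, cells)))
    return rows


def find_load_balance_node(rows):
    """Return the node_id (as int) of the row flagged "true", else None."""
    for row in rows:
        if row["load_balance_node"] == "true":
            return int(row["node_id"])
    return None


if __name__ == "__main__":
    print(find_load_balance_node(parse_pool_nodes(SAMPLE_OUTPUT)))
```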

If a node other than node 2 goes down, and that node is not the primary, this session will not be disconnected. Here that means the session survives if node 1 goes down. Let's try that from another terminal:

$ pg_ctl -D data1 -m f stop
waiting for server to shut down.... done
server stopped

Ok, let's input something in the previous psql session:

test=# show pool_nodes;
 node_id | hostname | port  | status | lb_weight | role    | select_cnt | load_balance_node
---------+----------+-------+--------+-----------+---------+------------+-------------------
 0       | /tmp     | 11002 | 2      | 0.333333  | primary | 0          | false
 1       | /tmp     | 11003 | 3      | 0.333333  | standby | 0          | false
 2       | /tmp     | 11004 | 2      | 0.333333  | standby | 0          | true
(3 rows)

As you can see, the session was not disconnected, and the "status" column of node 1 has changed to "3", which means the node is down.
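
The three conditions from the beginning of this post can be condensed into a single predicate. The sketch below only restates those rules; the function and parameter names are hypothetical and it does not talk to Pgpool-II at all.

```python
def session_survives(streaming_replication, failed_node, primary_node,
                     load_balance_node):
    """Return True if a session should stay connected after a node fails.

    Restates the three conditions described in this post:
      1. Pgpool-II runs in streaming replication mode.
      2. The failed node is not the primary.
      3. The failed node is not this session's load balance node.
    """
    return (
        streaming_replication
        and failed_node != primary_node
        and failed_node != load_balance_node
    )


if __name__ == "__main__":
    # The scenario above: node 0 is primary, node 2 is the load balance
    # node, and node 1 is stopped.
    print(session_survives(True, failed_node=1, primary_node=0,
                           load_balance_node=2))
```

With the same session, a failure of node 0 (the primary) or node 2 (the load balance node) makes the predicate false, which corresponds to the cases where the session is still disconnected.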

Now suppose you want to perform maintenance on one of the DB nodes, say node 1. In this case, you could apply the following procedure:

1. Edit pgpool.conf to change the "backend_weight1" parameter to 0. This prevents new sessions from choosing node 1 as the load balance node.
2. Wait until all users who are using node 1 as the load balance node have exited their sessions.
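
The pgpool.conf edit in the first step could be scripted along these lines. This is only a sketch: the helper name is hypothetical, it works on the file contents as a string, and it assumes the simple "name = value" line format. The edited file still has to be picked up by Pgpool-II, which this sketch does not attempt.

```python
def set_backend_weight(conf_text, node_id, weight):
    """Return conf_text with backend_weight<node_id> set to weight.

    Hypothetical helper. Any trailing comment on the replaced line is
    dropped, and the setting is appended if it was not present.
    """
    key = f"backend_weight{node_id}"
    new_line = f"{key} = {weight}"
    result = []
    found = False
    for line in conf_text.splitlines():
        stripped = line.strip()
        # The exact name comparison keeps backend_weight1 from also
        # matching backend_weight10.
        name = stripped.split("=", 1)[0].strip()
        if not stripped.startswith("#") and name == key:
            result.append(new_line)
            found = True
        else:
            result.append(line)
    if not found:
        result.append(new_line)
    return "\n".join(result) + "\n"


if __name__ == "__main__":
    sample = "backend_weight0 = 1\nbackend_weight1 = 1\nbackend_weight2 = 1\n"
    print(set_backend_weight(sample, 1, 0), end="")
```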

8 comments:

Unknown, July 27, 2016 at 7:32 AM
This is why I always suggest putting PgBouncer in front of pgpool2. If pgpool acted as an actual pooling service, it would proxy and hide the back-end connections. Connections to pgpool are separate from connections to Postgres, so if a failover occurs, it should simply re-bind all connections to the new master, or make new connections as necessary without breaking connections to itself. The fact it doesn't do this pretty much requires the use of a secondary proxy like PgBouncer.

With PgBouncer in place, it acts as a connection aggregator that hides the fact pgpool is throwing away connections during a failover. This makes the process much more transparent. There may be a query delay during a failover, but no disconnection.

Of course, you can get the same effect with HAProxy, etcd, and Governor or Patroni. Pgpool is perfectly positioned to handle all of these roles, so the fact that it doesn't is somewhat frustrating. :(

Unknown, February 28, 2018 at 4:21 PM
Hi All,
I have tried PgBouncer on top of Pgpool-II, but connections are still getting disconnected when a failover occurs on the master node. So I don't think PgBouncer will help here, or am I missing some configuration that would help?

Any other ideas on it?

regards,
Nabeel

Steve, May 21, 2018 at 8:09 PM
I think you need to put PgBouncer behind PgPool-II, not in front of it, i.e.:

client --> pgpool-ii --> pgbouncer --> cluster