Keepalived-sync docs and config examples

pgaufillet · January 16, 2024, 9:40pm

Hi there!

After a few years of usage, I am now refreshing the keepalived configuration of my routers, especially regarding the synchronisation of config and DHCP lease files. It looks like Jaymin Patel committed a nice synchronization feature about 18 months ago (https://github.com/openwrt/packages/commit/33398a38aacc02c89c277f5106c079b9c61b97a2), but there is so little documentation that I don't see how to configure it.

Does anyone have any pointers or a config example for it?

Thank you for your help!

jempatel · January 17, 2024, 5:22am

Here are some detailed guides provided as part of the PR review that might be helpful.

keepalived-sync

github.com/openwrt/packages

Comment by jempatel to keepalived: high-availability files and data sync

openwrt:master ← jempatel:improve_keepalived-uci-sync

> @jempatel Sorry I'm busy at the moment, but I haven't forgotten the pullreques…t. I hope I will be able to do it next week. I first have to set up two systems with keepalived and your changes!
>
> Edit: It wouldn't be bad, if we want to speed things up, if you could give me the configuration you tested it with. Then I don't have to start from scratch.

Yes, I can share that.

[master.tar.gz](https://github.com/openwrt/packages/files/9687358/master.tar.gz) contains `/etc/config/network /etc/config/dhcp /etc/config/firewall /etc/config/keepalived /etc/keepalived/keys/id_rsa`

[backup.tar.gz](https://github.com/openwrt/packages/files/9687178/backup.tar.gz) contains `/etc/config/network /etc/config/keepalived`

I have 2 OpenWRT VMs with 3 network interfaces and one Ubuntu client in my test setup.

| Interface | Zone | Master | Backup | VIP |
|-----------|------|--------|--------|-----|
| eth0 | wan | 192.168.111.102/24 | 192.168.111.103/24 | 192.168.111.101/24 |
| eth2 | lan | 192.168.5.2/24 | 192.168.5.3/24 | 192.168.5.1/24 |
| eth1 | NA (HA Mgmt) | 100.100.100.1/24 | 100.100.100.2/24 | NA (Back to Back Directly Connected) |

Once master and backup are configured, the dhcp and firewall config is synchronized to the backup. In the current PR, config synchronization is only supported for

- rpcd
- system
- ucitrack
- firewall
- dhcp (dnsmasq)
- dropbear
- uhttpd
- luci
- base-files

To add any other uci config to the synchronization, a hotplug script needs to be added in `/etc/hotplug.d/keepalived`. To keep it simple, a default hotplug lib for keepalived is installed in `/lib/functions/keepalived/hotplug.sh`. All of the uci config files mentioned above use it.

An example keepalived hotplug script for dropbear:

```bash
#!/bin/sh

. /lib/functions/keepalived/hotplug.sh

# Sets service name, which should be started/stopped/restarted when set
set_service_name dropbear

# Reloads service dropbear if node becomes backup
set_reload_if_backup

# Reloads service dropbear after all service specific files are updated (files can be added with add_sync_file)
set_reload_if_sync

# When a file is added to the sync files list, it is updated at its actual location on the backup node once received from master
add_sync_file /etc/config/dropbear
add_sync_file /etc/dropbear/dropbear_ed25519_host_key
add_sync_file /etc/dropbear/dropbear_rsa_host_key

# If there are no user defined explicit callbacks, default callbacks from keepalived_hotplug would be called
# To override default callbacks, define functions and set the callback with
# set_master_cb <func>
# set_backup_cb <func>
# set_fault_cb <func>
# set_sync_cb <func>

# hotplug entrypoint
keepalived_hotplug
```
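For a service that is not in the list above, the same pattern could be generated rather than written by hand. The sketch below only prints a hotplug script built from the helpers named in the example (`set_service_name`, `set_reload_if_backup`, `set_reload_if_sync`, `add_sync_file`, `keepalived_hotplug`). The function name `gen_hotplug_script` and the service `myservice` are hypothetical, used for illustration only.

```shell
#!/bin/sh
# Hypothetical generator: prints a keepalived hotplug script for one service.
# $1: service name; remaining arguments: files to synchronize.
gen_hotplug_script() {
	service="$1"
	shift

	echo '#!/bin/sh'
	echo
	echo '. /lib/functions/keepalived/hotplug.sh'
	echo
	echo "set_service_name $service"
	echo 'set_reload_if_backup'
	echo 'set_reload_if_sync'
	echo
	# One add_sync_file line per file passed on the command line.
	for file in "$@"; do
		echo "add_sync_file $file"
	done
	echo
	echo 'keepalived_hotplug'
}

# Example: a made-up service "myservice" with a single UCI config file.
gen_hotplug_script myservice /etc/config/myservice
```

The output would then be saved as a file under `/etc/hotplug.d/keepalived` on both nodes; the file name is up to you.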

luci-app-keepalived

github.com/openwrt/luci

luci-app-keepalived: Add LuCI for keepalived

openwrt:master ← jempatel:luci-app-keepalived

opened 04:36AM - 08 Sep 22 UTC

jempatel

+1376 -0

LuCI Support for Keepalived

**We also need https://github.com/openwrt/package…s/pull/19329**
**We also need https://github.com/openwrt/packages/pull/19374**

Screenshots from the new Luci App Keepalived:

- Overview (Master): ![01-OpenWrt-Overview-LuCI-Master](https://user-images.githubusercontent.com/479974/189039554-5ad25c7a-c88e-4a7c-8695-4cdac26e28dd.png)
- Overview (Backup): ![02-OpenWrt-Overview-LuCI-Backup](https://user-images.githubusercontent.com/479974/189039571-d49c2eec-ee8d-4401-b583-8f97083e818d.png)
- Globals: ![03-OpenWrt-Globals-LuCI](https://user-images.githubusercontent.com/479974/189039650-7f8e8fff-605f-49a6-a826-3fb1597429e7.png)
- IP Address: ![04-OpenWrt-IP-Address-LuCI](https://user-images.githubusercontent.com/479974/189039671-d77ab039-3cb7-45d9-8254-76c4228443c2.png)
- IP Address (Add/Edit): ![05-OpenWrt-IP-Address-Add-Edit-LuCI](https://user-images.githubusercontent.com/479974/189039689-f5f11ea8-a1fc-4762-a3bd-1a53076184ce.png)
- Static IP Address (Add/Edit): ![06-OpenWrt-Static-IP-Address-Add-Edit-LuCI](https://user-images.githubusercontent.com/479974/189039701-25b8ed5a-f109-4626-8a16-1f9164b4d451.png)
- Route: ![07-OpenWrt-Route-LuCI](https://user-images.githubusercontent.com/479974/189039713-65296b13-68b4-441e-97cd-c315982fe022.png)
- Route (Add/Edit): ![08-OpenWrt-Route-Add-Edit-LuCI](https://user-images.githubusercontent.com/479974/189039733-184f67f4-99eb-4589-a9c2-d59ea86fd4a2.png)
- Static Route (Add/Edit): ![09-OpenWrt-Static-Route-Add-Edit-LuCI](https://user-images.githubusercontent.com/479974/189039748-8e831e8f-e2a8-41ec-8840-0b2bda070879.png)
- URLs: ![10-OpenWrt-URLs-LuCI](https://user-images.githubusercontent.com/479974/189039759-41029a93-89b1-4a08-b4b2-dff091e618cb.png)
- URLs (Add/Edit): ![11-OpenWrt-URLs-Add-Edit-LuCI](https://user-images.githubusercontent.com/479974/189039785-7430846e-a5c8-41ab-b377-848481d6fa83.png)
- Scripts: ![12-OpenWrt-Scripts-LuCI](https://user-images.githubusercontent.com/479974/189039800-f992c0b7-a268-4bd4-a476-98fe9ec8fcf6.png)
- Scripts (Add/Edit): ![13-OpenWrt-Scripts-Add-Edit-LuCI](https://user-images.githubusercontent.com/479974/189039828-d290cdcd-681f-4892-9887-c4d010b03b64.png)
- Track Scripts (Add/Edit): ![14-OpenWrt-Track-Scripts-Add-Edit-LuCI](https://user-images.githubusercontent.com/479974/189039848-49d624bf-7a8a-444c-a875-5ede6c2cdbf0.png)
- Interfaces: ![15-OpenWrt-Interfaces-LuCI](https://user-images.githubusercontent.com/479974/189039867-d61ae9ff-006a-49b6-b246-a2af2fbc67b7.png)
- Interfaces (Add/Edit): ![16-OpenWrt-Interfaces-Add-Edit-LuCI](https://user-images.githubusercontent.com/479974/189039883-bbaab629-0e00-4f96-b8b0-fd00eab6ba98.png)
- Instance: ![17-OpenWrt-Instance-LuCI](https://user-images.githubusercontent.com/479974/189039968-4e8ba02c-6117-469a-8751-d679cc933104.png)
- Instance (General): ![18-OpenWrt-Instance-General-LuCI](https://user-images.githubusercontent.com/479974/189039984-0102c2b6-26db-4a3d-8ee8-4dae47e14fa9.png)
- Instance (Peer): ![19-OpenWrt-Instance-Peer-LuCI](https://user-images.githubusercontent.com/479974/189040003-a40b6a8a-e34c-431b-a432-a2379492dd9c.png)
- Instance (Tracking): ![20-OpenWrt-Instance-Tracking-LuCI](https://user-images.githubusercontent.com/479974/189040013-51f4856c-196e-44fa-9905-15e3505afbb9.png)
- Instance (GARP): ![21-OpenWrt-Instance-GARP-LuCI](https://user-images.githubusercontent.com/479974/189040024-5e44595a-3415-4fc0-9743-d990463e5b4c.png)
- Instance (Advanced): ![22-OpenWrt-Instance-Advanced-LuCI](https://user-images.githubusercontent.com/479974/189040036-84a24087-d32d-499e-a1ae-0e6c19642cb5.png)
- Servers: ![23-OpenWrt-Servers-LuCI](https://user-images.githubusercontent.com/479974/189040041-288533a7-9be7-471f-836d-e056b29cc6a3.png)
- Real Servers (Add/Edit): ![24-OpenWrt-Real-Servers-Add-EditLuCI](https://user-images.githubusercontent.com/479974/189040050-c6609af3-f873-4053-9cad-5d28890d5681.png)
- Virtual Servers (General): ![25-OpenWrt-Virtual-Servers-General-LuCI](https://user-images.githubusercontent.com/479974/189040062-e51aed8b-bb81-4674-9690-42c458c787b1.png)
- Virtual Servers (Advanced): ![26-OpenWrt-Virtual-Servers-Advanced-LuCI](https://user-images.githubusercontent.com/479974/189040084-80d791fe-5f6c-4759-aefe-026fe50b3f69.png)
- Sync Group: ![27-OpenWrt-Sync-Group-LuCI](https://user-images.githubusercontent.com/479974/189040096-a8b0a3a6-056f-4add-bd50-77326cbf59d5.png)
- Sync Group (Add/Edit): ![28-OpenWrt-Sync-Group-Add-Edit-LuCI](https://user-images.githubusercontent.com/479974/189040111-81563454-1031-490f-93e1-c7e35a524e51.png)

pgaufillet · January 17, 2024, 7:32am

Thank you Jaymin

I am going to study that.

pgaufillet · February 11, 2024, 8:53am

Hi!

After a few weeks, it is time for a summary of what I have seen with Keepalived. First, luci-app-keepalived, keepalived-sync and keepalived work, and it is pretty cool. Thank you @jempatel for your great work.

Nevertheless, they could not fully cover my needs, and a few things could be improved:

Using sysupgrade -l in the rsync.sh script is too aggressive. I suggest synchronizing only the files from the user-defined sync_list. In my case, the routers have different roles in MASTER and BACKUP modes, and copying all configuration files prevents such setups. Using sysupgrade -l could be kept as an option. The workaround is to edit line 46 of the rsync.sh script locally:

	for sync_file in $sync_list $(sysupgrade -l); do
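The suggestion above could look roughly like this: the user-defined list is always synchronized, and the `sysupgrade -l` output is only appended on request. This is a sketch, not the actual rsync.sh code; `build_sync_files` and its second argument are hypothetical names.

```shell
#!/bin/sh
# Hypothetical sketch: names below are illustrative, not taken from rsync.sh.

# Print the files to synchronize, one per line.
# $1: space-separated user-defined sync list
# $2: "1" to also include the files reported by `sysupgrade -l` (default: 0)
build_sync_files() {
	user_list="$1"
	include_sysupgrade="${2:-0}"

	for f in $user_list; do
		echo "$f"
	done

	# Only append the sysupgrade list when asked to, and only if the tool exists.
	if [ "$include_sysupgrade" = "1" ] && command -v sysupgrade >/dev/null 2>&1; then
		sysupgrade -l
	fi
}

# By default only the user-defined files are considered.
for sync_file in $(build_sync_files "/etc/config/dhcp /tmp/dhcp.leases"); do
	echo "would sync: $sync_file"
done
```

With this shape, routers that play different roles in MASTER and BACKUP modes keep their own configuration files unless the option is switched on.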

Route specification needs to be extended with more options. In my case, metric is useful for masking the BACKUP default route while preserving the interaction with other subsystems like mwan3.
The configuration of keepalived has evolved in its latest version and is not completely supported: global_track, for example, has been replaced by other mechanisms, max_auto_priority is not supported, etc.
The luci-app-keepalived+keepalived-sync Wiki page does not describe the current usage. Without the links provided by @jempatel, I would probably have given up.
NOTIFY actions of type GROUP are not handled by the hotplug scripts, which raises errors in the logs.
The sync through ssh and rsync is very noisy and fills the logs with low-value information. It would be nice to provide a way to reduce the verbosity:

Sun Feb 11 09:26:45 2024 authpriv.info dropbear[16112]: Child connection from 192.168.1.1:46634
Sun Feb 11 09:26:45 2024 authpriv.notice dropbear[16112]: Pubkey auth succeeded for 'keepalived' with ssh-ed25519 key SHA256:5N2aUWNlcosUvHwRdobELPRuf4V0HizXCGuC27X/sLg from 192.168.1.1:46634
Sun Feb 11 09:26:45 2024 authpriv.info dropbear[16112]: Exit (keepalived) from <192.168.1.1:46634>: Exited normally
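Until the sync itself can be made quieter, the noise can at least be hidden when reading the log. A minimal sketch, assuming the peer router's address is known: it drops dropbear lines that mention that address. The function name `filter_sync_noise` is hypothetical, and the filter also hides genuine SSH logins coming from the peer, so it is a reading aid rather than a fix.

```shell
#!/bin/sh
# Hypothetical helper: hide dropbear log lines caused by connections from the sync peer.
# $1: IP address of the peer router (192.168.1.1 in the log lines above)
# Reads log lines on stdin and prints the remaining lines on stdout.
filter_sync_noise() {
	# Escape the dots so they match literally in the regular expression.
	peer_re="$(printf '%s' "$1" | sed 's/\./\\./g')"
	grep -v "dropbear\[[0-9]*]: .*${peer_re}:[0-9]"
}

# Typical use on the router:
#   logread | filter_sync_noise 192.168.1.1
```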

dnsmasq is stopped from time to time when it is not expected to be. In asymmetrical setups like mine, the BACKUP router still needs dnsmasq to remain operational (DNS resolution and concurrent fail-over DHCP service). The problem has probably been pointed out by @Blackfeather in Dnsmasq dies on boot after receiving SIGTERM.

I would be pleased to contribute to any of these topics if it can help. Comments and questions welcome

In addition, here is the alternate keepalived config file I finally use, for reference:

global_defs {
	script_user root
	enable_script_security
	process_names
	router_id router_1
}

vrrp_script rsync {
	script /etc/keepalived/scripts/rsync.sh
	interval 60
	weight 100
}

vrrp_instance VI_1 {
	authentication {
		auth_type PASS
		auth_pass XXXXXXXX
	}
	state BACKUP
	interface lan
	unicast_src_ip 192.168.1.1
	virtual_router_id 1
	priority 100
	advert_int 1
	debug 2
	garp_master_delay 1
	garp_master_refresh 1
	garp_master_repeat 1
	garp_master_refresh_repeat 1
	nopreempt
	notify_backup "/bin/busybox env -i ACTION=NOTIFY_BACKUP TYPE=INSTANCE NAME=VI_1 /sbin/hotplug-call keepalived"
	notify_master "/bin/busybox env -i ACTION=NOTIFY_MASTER TYPE=INSTANCE NAME=VI_1 /sbin/hotplug-call keepalived"
	notify_fault "/bin/busybox env -i ACTION=NOTIFY_FAULT TYPE=INSTANCE NAME=VI_1 /sbin/hotplug-call keepalived"
	notify_stop "/bin/busybox env -i ACTION=NOTIFY_STOP TYPE=INSTANCE NAME=VI_1 /sbin/hotplug-call keepalived"
	virtual_ipaddress {
		192.168.1.2/24 dev lan label lan:vip scope global
		192.168.100.2/24 dev dmz label dmz:vip scope global
		192.168.110.2/24 dev iot label iot:vip scope global
		192.168.120.2/24 dev wan label wan:vip scope global
	}
	virtual_routes {
		src 192.168.120.2 0.0.0.0/0 via 192.168.120.1 dev wan metric 5
	}
	track_script {
		rsync weight 100 
	}
}
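The four notify_* lines above all hand the event to `/sbin/hotplug-call keepalived` with `ACTION`, `TYPE` and `NAME` in the environment. As a rough illustration of how a script on the receiving side could dispatch on these variables (and quietly skip types it does not handle, such as GROUP), here is a minimal sketch. The function name `handle_keepalived_event` and the messages are hypothetical; this is not the code shipped in the package.

```shell
#!/bin/sh
# Hypothetical handler. ACTION, TYPE and NAME are the variables set on the
# notify_* lines of the configuration above.
handle_keepalived_event() {
	# Skip anything that is not an instance event instead of raising an error.
	case "$TYPE" in
		INSTANCE) ;;
		*)
			echo "ignoring ${ACTION:-none} for type ${TYPE:-unknown}"
			return 0
			;;
	esac

	case "$ACTION" in
		NOTIFY_MASTER) echo "$NAME is now MASTER" ;;
		NOTIFY_BACKUP) echo "$NAME is now BACKUP" ;;
		NOTIFY_FAULT)  echo "$NAME is in FAULT" ;;
		NOTIFY_STOP)   echo "$NAME stopped" ;;
		*)             echo "unhandled action: ${ACTION:-none}" ;;
	esac
}

# Simulate the event sent by the notify_master line.
ACTION=NOTIFY_MASTER
TYPE=INSTANCE
NAME=VI_1
handle_keepalived_event
```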

anichang · February 11, 2024, 11:29am

Thanks Pedro, I'm on that track too, but I had to do other stuff and couldn't take the job to the end. As you already have your hands on the topic, allow me to advise you to turn this text into a code patch, if you have some more time to commit to this topic.

Because what is written on the forum might end here, and it would be a pity to lose your work. If you produce a patch and attach it to an issue tracker instead (e.g. the one on the GitHub repo, since we are not permanent developers and can't access the main repo), there is a better chance that some of the actual developers will discuss the patch with you in the issue you opened and finally include your findings upstream.
I don't know the current practices of OpenWrt's developers, but a patch that is ready to be merged might be just a few clicks away from upstream for some of them. If you don't deliver your findings as a patch, they will have to spend more time implementing them and testing whether they are good or not...

Basically you need to git clone the OpenWrt repo, modify and test, make the patch and attach it to an issue. You can also use GitHub's features to fork, modify and open a pull request.
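As a rough sketch of the "make the patch" step, assuming a local clone that already contains the tested but uncommitted changes: the helper below commits them on a new branch and writes a patch file with `git format-patch`. The function name `make_patch`, the branch name and the commit message in the usage comment are hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: commit the pending changes of a clone on a new branch
# and export that commit as a patch file.
# $1: path to the clone   $2: new branch name
# $3: commit message      $4: output directory for the patch (absolute path)
# Prints the path of the generated patch file.
make_patch() {
	repo_dir="$1"
	branch="$2"
	message="$3"
	out_dir="$4"

	(
		cd "$repo_dir" || exit 1
		git checkout -q -b "$branch" &&
		git add -A &&
		# -s adds a Signed-off-by line using the configured git identity.
		git commit -q -s -m "$message" &&
		git format-patch -1 -o "$out_dir"
	)
}

# Typical use:
#   git clone https://github.com/openwrt/packages.git
#   ... modify and test ...
#   make_patch packages keepalived-sync-list "keepalived: sync only sync_list" /tmp/patches
```

The resulting file can be attached to an issue; the fork and pull request route works from the same branch.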

Regards

pgaufillet · February 13, 2024, 9:10am

You are right @anichang. Unfortunately I also have very little free time for it. Nevertheless, I'll do my best to propose something in the coming weeks.