What you’re looking for is a PR cluster (Performance Replication), which is the closest thing to “active/active”. That’s not an HA solution, though; it’s a remote copy of your data which reduces latency AND can provide its own token management. There are obvious shortcomings with this (no DR), but it’s a good solution for offloading load as well as reducing latency between different locations.
Vault performance replication isn’t truly active/active though: a performance secondary forwards writes to replicated data to its primary, so it becomes unable to service those write operations if the primary cluster goes offline.
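
For reference, here is a rough sketch of how a performance replication pair is usually wired up with the Vault CLI. It assumes Vault Enterprise on both clusters and a CLI session already authenticated against each one; the secondary id `pr-secondary-eu` and the `<wrapping_token>` placeholder are made up for illustration, so substitute your own values.

```shell
# On the primary cluster: enable performance replication.
vault write -f sys/replication/performance/primary/enable

# Still on the primary: generate an activation token for the secondary.
# "pr-secondary-eu" is a hypothetical id chosen for this example.
vault write sys/replication/performance/primary/secondary-token id="pr-secondary-eu"

# On the secondary cluster: activate it with the wrapping token returned above.
# Careful: enabling a secondary clears the data already stored in that cluster.
vault write sys/replication/performance/secondary/enable token="<wrapping_token>"

# On either cluster: check the replication mode and state.
vault read sys/replication/performance/status
```

The status output is the quickest way to confirm which side is the primary and whether the secondary is still connected to it, which matters given the write-forwarding behaviour discussed above.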