New Findings and Additional Testing
To ensure a completely clean test environment, I have now created an entirely new organization. In this new setup, I only went as far as creating a new host set and then used the CLI to register 250 random test servers with commands like:
Copy sh
boundary hosts create static -host-catalog-id hcst_ohIEspNaFp -name "Server002" -address 10.10.10.2 -addr https://boundary.xxxxx.xxx
boundary hosts create static -host-catalog-id hcst_ohIEspNaFp -name "Server003" -address 10.10.10.3 -addr https://boundary.xxxxx.xxx
boundary hosts create static -host-catalog-id hcst_ohIEspNaFp -name "Server004" -address 10.10.10.4 -addr https://boundary.xxxxx.xxx
boundary hosts create static -host-catalog-id hcst_ohIEspNaFp -name "Server005" -address 10.10.10.5 -addr https://boundary.xxxxx.xxx
boundary hosts create static -host-catalog-id hcst_ohIEspNaFp -name "Server006" -address 10.10.10.6 -addr https://boundary.xxxxx.xxx
boundary hosts create static -host-catalog-id hcst_ohIEspNaFp -name "Server007" -address 10.10.10.7 -addr https://boundary.xxxxx.xxx
# ... etc. up to Server250
After this, I proceeded to add 200 out of the 250 available test servers to the new host set. I am seeing exactly the same pattern as described in my earlier posts:
Once the host set exceeds approximately 60–80 servers, I immediately encounter a 500 error when either adding more servers with the “Add Hosts” function, or when viewing the host set under the “Hosts” tab in the UI.
Further clarification:
It is important to highlight that as long as I am on the “Details” tab, I am able to add as many servers as I want—200+, in this case—without any problem. The Error 500 only appears right after clicking “Add Host” under the “Add Existing Host” function, and also when I switch to the “Hosts” tab for the host set. All selected servers are still added as expected, but the subsequent display of a large host set fails with a 500 error in the UI.
To rule out any additional factors, I have not created any credentials or configured anything else in this test organization—only the host set and test servers were created.
For reference:
-
The CLI can handle large host sets with no issues at all.
-
The problem is reproducible in a completely clean setup.
-
I have tested with up to 250 hosts in a single host set.
-
I am using the standard Boundary UI.
Additionally, I can add that I am using the latest bitnamisecure/postgresql-ha
image (tag: latest
) for PostgreSQL HA, running with 3 pods. There are no errors or issues to be seen from the database side, either in this environment or in other environments using the same PostgreSQL cluster.
This should help rule out environmental or database-cluster-specific causes.(I hope)
Please let me know if further details or logs could be helpful.
postgres=# SELECT version();
version
PostgreSQL 17.6 on x86_64-pc-linux-gnu, compiled by gcc (Debian 12.2.0-14+deb12u1) 12.2.0, 64-bit
(1 row)
pgpool --version pgpool-II version 4.6.3 (chirikoboshi)