Issue resolved
Root cause - the broker service's initContainer to test zookeeper connectivity together with the broker itself were too resource heavy for the node types in the node group (t3.small, 2gb RAM).
Created a new nodegroup, with t3.medium nodes, 4gb RAM, and migrated the cluster to use that instead.