Maybe try AMD's Quark? It can convert fp32 and fp16 models to bf16; the tutorial is here:
https://quark.docs.amd.com/latest/supported_accelerators/ryzenai/tutorial_convert_fp32_or_fp16_to_bf16.html
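
For context on what the conversion does to each value, here is a rough plain-Python sketch. This is not Quark's API; the helper names `fp32_to_bf16_bits` and `bf16_bits_to_fp32` are made up for illustration. bf16 keeps fp32's sign bit and 8 exponent bits but only 7 mantissa bits, so converting amounts to rounding away the low 16 bits of the fp32 pattern (round-to-nearest-even is assumed here). fp16 inputs would be widened to fp32 first.

```python
import struct


def fp32_to_bf16_bits(x: float) -> int:
    """Return the 16-bit bf16 pattern for x (hypothetical helper).

    Assumes round-to-nearest-even; x must fit in fp32 range.
    """
    # Reinterpret the fp32 value as a 32-bit unsigned integer.
    bits = struct.unpack("<I", struct.pack("<f", x))[0]

    # NaN: keep the top bits and force a mantissa bit so it stays NaN.
    if (bits & 0x7F800000) == 0x7F800000 and (bits & 0x007FFFFF):
        return ((bits >> 16) | 0x0040) & 0xFFFF

    # Round to nearest, ties to even: add 0x7FFF plus the lowest kept bit,
    # then drop the low 16 bits.
    rounding_bias = 0x7FFF + ((bits >> 16) & 1)
    return ((bits + rounding_bias) >> 16) & 0xFFFF


def bf16_bits_to_fp32(b: int) -> float:
    """Widen a bf16 bit pattern back to a Python float (hypothetical helper)."""
    return struct.unpack("<f", struct.pack("<I", (b & 0xFFFF) << 16))[0]


if __name__ == "__main__":
    for value in (1.0, -2.0, 3.14159):
        b = fp32_to_bf16_bits(value)
        print(f"{value!r} -> 0x{b:04X} -> {bf16_bits_to_fp32(b)!r}")
```

The range stays the same as fp32 because the exponent width is unchanged; what you lose is precision (roughly 2 to 3 significant decimal digits remain).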