The problem was that the worker threads were actually disabled, via preprocessor defines that are not shown in the posted code. You still need to use worker threads, even with the new blk_mq interface.
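
To illustrate the failure mode, here is a minimal userspace sketch using pthreads. It is not the driver code and does not use the kernel's blk_mq API. All names in it (`USE_WORKER_THREAD`, `demo_queue`, `demo_submit`, `demo_worker`, `run_demo`) are hypothetical, made up for this example. The dispatch path only records a request and wakes the worker, so if the worker is compiled out by the define, nothing ever completes.

```c
#include <assert.h>
#include <pthread.h>
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical switch, standing in for the define mentioned above. */
#ifndef USE_WORKER_THREAD
#define USE_WORKER_THREAD 1
#endif

#define QUEUE_CAP 64

struct demo_queue {
    pthread_mutex_t lock;
    pthread_cond_t  wake;
    int  pending[QUEUE_CAP]; /* request ids waiting for the worker */
    int  head, tail;         /* next to take, next free slot */
    int  completed;          /* requests finished by the worker */
    bool stopping;
};

/* Dispatch path: only records the request and wakes the worker. */
static void demo_submit(struct demo_queue *q, int id)
{
    pthread_mutex_lock(&q->lock);
    assert(q->tail < QUEUE_CAP);
    q->pending[q->tail++] = id;
    pthread_cond_signal(&q->wake);
    pthread_mutex_unlock(&q->lock);
}

/* Worker: drains pending requests and marks them completed. */
static void *demo_worker(void *arg)
{
    struct demo_queue *q = arg;

    pthread_mutex_lock(&q->lock);
    for (;;) {
        while (q->head == q->tail && !q->stopping)
            pthread_cond_wait(&q->wake, &q->lock);
        if (q->head == q->tail)
            break;          /* asked to stop and nothing left to do */
        q->head++;          /* take one request */
        q->completed++;     /* the real work would happen here */
    }
    pthread_mutex_unlock(&q->lock);
    return NULL;
}

/* Submits n requests (n <= QUEUE_CAP) and returns how many completed. */
static int run_demo(int n)
{
    struct demo_queue q;
    pthread_t tid;
    bool started = false;
    int done;

    memset(&q, 0, sizeof q);
    pthread_mutex_init(&q.lock, NULL);
    pthread_cond_init(&q.wake, NULL);

#if USE_WORKER_THREAD
    started = pthread_create(&tid, NULL, demo_worker, &q) == 0;
#endif
    for (int i = 0; i < n; i++)
        demo_submit(&q, i);

    if (started) {
        pthread_mutex_lock(&q.lock);
        q.stopping = true;
        pthread_cond_signal(&q.wake);
        pthread_mutex_unlock(&q.lock);
        pthread_join(tid, NULL);
    }
    done = q.completed;     /* stays 0 when the worker is compiled out */
    pthread_cond_destroy(&q.wake);
    pthread_mutex_destroy(&q.lock);
    return done;
}
```

With the worker enabled, `run_demo(8)` returns 8. Building with `-DUSE_WORKER_THREAD=0` leaves every request sitting in the queue, which is the same kind of symptom as the one described here.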