I had the same problem. Following the advice from @staroselskii, I experimented with the tests and found that, in my case, the problem was caused by writing too many lines to stderr. Once I reduced the amount of logging to stderr, the pipeline completed correctly and ran the full set of tests.
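
In case it helps, here is a rough sketch of one way to cut down what goes to stderr. It assumes Python and the standard `logging` module, which may not match your setup. The helper name `quiet_stderr_logger` and the logger name `"tests"` are made up for illustration, and the `StringIO` buffer only stands in for stderr so the effect is easy to see.

```python
import io
import logging


def quiet_stderr_logger(name, stream=None, level=logging.WARNING):
    """Return a logger that drops every record below `level`.

    Hypothetical helper: with stream=None the handler writes to sys.stderr.
    """
    logger = logging.getLogger(name)
    logger.handlers.clear()      # remove handlers left over from earlier setup
    logger.propagate = False     # keep records away from the root logger
    logger.setLevel(level)
    handler = logging.StreamHandler(stream)
    handler.setFormatter(logging.Formatter("%(levelname)s %(message)s"))
    logger.addHandler(handler)
    return logger


buffer = io.StringIO()           # stand-in for stderr in this sketch
log = quiet_stderr_logger("tests", stream=buffer)

for i in range(1000):
    log.debug("step %d", i)      # filtered out, never reaches the stream

log.warning("one real problem")  # still reported
captured = buffer.getvalue()
```

The noisy debug lines are still in the code, so you can raise or lower `level` per run rather than deleting the log calls.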