I know it's late, but have you resolved the problem? As far as I can tell, the tutorial says that MFCC and the other methods apply to other models, not to wav2vec2.