79796943

Date: 2025-10-22 15:15:49
Score: 1
Natty:
Report link

maybe I'm late to the party, but I suggest to modify the following lines:

model = pipeline(model="facebook/wav2vec2-base-960h")
data = np.frombuffer(audio.get_raw_data())

to

model = pipeline("automatic-speech-recognition",model="facebook/wav2vec2-base-960h")
data = np.frombuffer(audio.get_raw_data(),dtype=np.int16)

That's the difference between my code and yours.

Reasons:
  • Has code block (-0.5):
  • Unregistered user (0.5):
  • Low reputation (1):
Posted by: Neosilver