LOG [sizzling] [sizzling] [sizzling] [sizzling] LOG [engine revving] ???? #195

Open
ghost opened this issue Feb 6, 2024 · 2 comments

@ghost

ghost commented Feb 6, 2024

LOG [sizzling] [sizzling] [sizzling] [sizzling] [sizzling] [sizzling] [sizzling] [sizzling] [sizzling] [sizzling] [sizzling] [sizzling] [sizzling] [sizzling] [sizzling] [sizzling] [sizzling] [sizzling] [sizzling] [sizzling] [sizzling] [sizzling] [sizzling] [sizzling] (engine revving) [splashing]
LOG [engine revving]

Why is it transcribing everything like this? The audio file contains speech, but this is all I get with whisper tiny.

@sooryranga
Contributor

Real-time transcription works for me, but I can't get it to work from a file. I get the same "engine revving" result when transcribing from a file.

@wojciechkrol

I encountered the same issue. I use the expo-av package for microphone recording, and I resolved it by changing the recording options so the output file is 16 kHz mono WAV, matching the configuration mentioned here.

import { Audio } from 'expo-av';

// Recording options that produce a 16 kHz mono .wav file
const AUDIO_WAV = {
  isMeteringEnabled: true,
  android: {
    ...Audio.RecordingOptionsPresets.HIGH_QUALITY.android,
    extension: '.wav',
    outputFormat: Audio.AndroidOutputFormat.DEFAULT,
    audioEncoder: Audio.AndroidAudioEncoder.DEFAULT,
    sampleRate: 16000, // Set sample rate to 16kHz
    numberOfChannels: 1, // Set number of channels to mono
  },
  ios: {
    ...Audio.RecordingOptionsPresets.HIGH_QUALITY.ios,
    extension: '.wav',
    outputFormat: Audio.IOSOutputFormat.LINEARPCM,
    audioQuality: Audio.IOSAudioQuality.MAX,
    sampleRate: 16000, // Set sample rate to 16kHz
    numberOfChannels: 1, // Set number of channels to mono
  },
  web: {
    mimeType: 'audio/wav',
    bitsPerSecond: 128000,
  },
};

const { recording } = await Audio.Recording.createAsync(AUDIO_WAV);

...

const uri = recording.getURI();
const { promise } = whisper.transcribe(uri, {
  language: 'en',
});
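
Not part of the original comment: if the transcription still comes back as "[sizzling]" noise, it may help to confirm that the recorded file really is a 16 kHz mono WAV before handing it to whisper. Below is a minimal sketch that reads the `fmt ` chunk of a WAV header from a `Uint8Array`. The function names `readWavFormat` and `isMono16kWav` are hypothetical helpers, not part of expo-av or whisper, and how you load the file bytes in your app is left to you.

```javascript
// Hypothetical helper: read the "fmt " chunk of a RIFF/WAVE file.
// `bytes` is a Uint8Array holding at least the start of the file.
function readWavFormat(bytes) {
  const view = new DataView(bytes.buffer, bytes.byteOffset, bytes.byteLength);
  const tag = (offset) =>
    String.fromCharCode(
      bytes[offset], bytes[offset + 1], bytes[offset + 2], bytes[offset + 3]
    );

  // A WAV file starts with "RIFF" <size> "WAVE".
  if (bytes.byteLength < 12 || tag(0) !== 'RIFF' || tag(8) !== 'WAVE') {
    return null; // not a WAV container
  }

  // Walk the chunk list until the "fmt " chunk is found.
  let offset = 12;
  while (offset + 8 <= bytes.byteLength) {
    const id = tag(offset);
    const size = view.getUint32(offset + 4, true); // little-endian
    if (id === 'fmt ' && offset + 8 + 16 <= bytes.byteLength) {
      return {
        audioFormat: view.getUint16(offset + 8, true), // 1 means PCM
        channels: view.getUint16(offset + 10, true),
        sampleRate: view.getUint32(offset + 12, true),
        bitsPerSample: view.getUint16(offset + 22, true),
      };
    }
    offset += 8 + size + (size % 2); // chunks are padded to an even size
  }
  return null; // no "fmt " chunk found
}

// Hypothetical helper: true when the header says 16 kHz mono,
// the settings used in the recording options above.
function isMono16kWav(bytes) {
  const fmt = readWavFormat(bytes);
  return fmt !== null && fmt.channels === 1 && fmt.sampleRate === 16000;
}
```

If `readWavFormat` returns `null`, the file is probably not a WAV container at all (for example, a compressed recording saved with a `.wav` extension), which would be consistent with the behavior described in this issue.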
