I just basically stumbled my way through the menus until I found a basic work process for this that seems to have proven successful for me. -- at least so far. But for anyone else who may be interested in more formal, detailed guides, I did find this online:
+ Reply to Thread
Results 31 to 40 of 40
I have been using Whisper AI for about a year and a GPU speeds things up a lot.
If you want to try Whisper AI with GPU anyway, and you have a Google Drive working, try using Google Colaboratory (Jupyter). You will need to install Whisper AI again but you have done it already on your PC so things will be easy for you. You will be able to use free GPU up to a certain size. Try it to transcribe a short video starting with the smallest models and move upwards till medium or large.
I haven't used it because my Google Drive is corrupted. Let us know if it works for you.
You might find this video helpful
Last edited by Subtitles; 20th Sep 2023 at 05:31.
Right now, on CPU only, I have a Whisper job that's been running for 10 hours -- so far -- yet the Whisper log has only reached about 400 bytes in size. Should I take that as an indication that Whisper has stalled and given up ? Does the log only get written to at the very end ? Any previous job here of about the same size was completed overnight.
Are you using Subtitle Edit for this job or the Whisper AI command line?
Which model have you selected to do the transcription?
You can try the simplest model and see how it goes and go up higher once it finishes.
model medium and large are not going to work on your PC. You will see the difference when you install the GPU.
It just seems to be spinning its wheels on this job. Nothing has shown up yet in the upper left processing window, which I don't think was the case for the previous jobs.
Doing this under Subtitle Edit, which then hands off to Whisper. Using the Large model, as was the case for the previous 6 jobs -- all approx. the same size, and either with original French or German language. (No, wait -- there was one that was Japanese.) All of those succeeded, taking around 8 hours ea. to complete. Results ranged from satisfactory to quite good. I wasn't seeing any clear reason to deviate from this template that had worked several times, the only variable being the spoken language. But all of those had pretty clean soundtracks, with good recording and nothing much to interfere with that. I had not previewed the sound for this one so I don't know, but will go back to see where it stands.
After 12 hours and zilch to show for it, I pulled the plug on that job. May try it again after that video card arrives in a few days. Based on a quick check, I noted no obvious defects in the audio -- no people talking over each other, or poor recording, or obscuring background sound. So that was officially a first failure, once I'd worked out the rudiments of getting Whisper AI going.
For the sake of comparison, has anyone experimented with Deepl ? If so, was it any good for the translating ? I had their Win standalone app installed, but only went to check it out for the first time a few days ago. At first the app could not be found (it had installed itself far under Users in C:, normally a place I would never install anything), and then when I tried to run it from that location it promptly uninstalled itself. Always possible that I made some mistake . . . . Seemingly a dead end, nonetheless.
Have you seen that bar graph which shows all the languages, and relatively how well Whisper performed in translating them ?
If not I'm sure I can find the link and post it here.
I prefer to use the term trascription and not translation simply because I can check the final job while listening even in different languages.
For translation there are several options, including running Whisper again.
Last edited by Subtitles; 22nd Sep 2023 at 04:42.