VideoHelp Forum

+ Reply to Thread
Page 2 of 2
FirstFirst 1 2
Results 31 to 40 of 40
Thread
  1. Member Seeker47's Avatar
    Join Date
    Jul 2005
    Location
    drifting, somewhere on the Sea of Cynicism
    Search Comp PM
    I just basically stumbled my way through the menus until I found a basic work process for this that seems to have proven successful for me. -- at least so far. But for anyone else who may be interested in more formal, detailed guides, I did find this online:

    https://www.notta.ai/en/blog/how-to-use-whisper
    When in Las Vegas, don't miss the Pinball Hall of Fame Museum http://www.pinballmuseum.org/ -- with over 150 tables from 6+ decades of this quintessentially American art form.
    Quote Quote  
  2. Member
    Join Date
    Mar 2021
    Location
    Israel
    Search Comp PM
    Originally Posted by Seeker47 View Post
    I just basically stumbled my way through the menus until I found a basic work process for this that seems to have proven successful for me. -- at least so far. But for anyone else who may be interested in more formal, detailed guides, I did find this online:

    https://www.notta.ai/en/blog/how-to-use-whisper
    It is a good tutorial. Thanks for sharing.
    I have been using Whisper AI for about a year and a GPU speeds things up a lot.
    If you want to try Whisper AI with GPU anyway, and you have a Google Drive working, try using Google Colaboratory (Jupyter). You will need to install Whisper AI again but you have done it already on your PC so things will be easy for you. You will be able to use free GPU up to a certain size. Try it to transcribe a short video starting with the smallest models and move upwards till medium or large.
    I haven't used it because my Google Drive is corrupted. Let us know if it works for you.
    You might find this video helpful
    https://www.youtube.com/watch?v=wrSelk44_Js&ab_channel=MathsChelsea
    Last edited by Subtitles; 20th Sep 2023 at 05:31.
    Quote Quote  
  3. Member Seeker47's Avatar
    Join Date
    Jul 2005
    Location
    drifting, somewhere on the Sea of Cynicism
    Search Comp PM
    Originally Posted by Subtitles View Post
    Originally Posted by Seeker47 View Post
    I just basically stumbled my way through the menus until I found a basic work process for this that seems to have proven successful for me. -- at least so far. But for anyone else who may be interested in more formal, detailed guides, I did find this online:

    https://www.notta.ai/en/blog/how-to-use-whisper
    It is a good tutorial. Thanks for sharing.
    I have been using Whisper AI for about a year and a GPU speeds things up a lot.
    If you want to try Whisper AI with GPU anyway, and you have a Google Drive working, try using Google Colaboratory (Jupyter). You will need to install Whisper AI again but you have done it already on your PC so things will be easy for you. You will be able to use free GPU up to a certain size. Try it to transcribe a short video starting with the smallest models and move upwards till medium or large.
    I haven't used it because my Google Drive is corrupted. Let us know if it works for you.
    You might find this video helpful
    https://www.youtube.com/watch?v=wrSelk44_Js&ab_channel=MathsChelsea
    Thanks for the suggestion. I've ordered a video card that has 4 GB of VRAM (probably not enough . . . ?), which might fit and work in this computer setup. So I expect to be giving that a try. I've only seen mentions of Google Drive, no exposure to that at all.

    Right now, on CPU only, I have a Whisper job that's been running for 10 hours -- so far -- yet the Whisper log has only reached about 400 bytes in size. Should I take that as an indication that Whisper has stalled and given up ? Does the log only get written to at the very end ? Any previous job here of about the same size was completed overnight.
    When in Las Vegas, don't miss the Pinball Hall of Fame Museum http://www.pinballmuseum.org/ -- with over 150 tables from 6+ decades of this quintessentially American art form.
    Quote Quote  
  4. Member
    Join Date
    Mar 2021
    Location
    Israel
    Search Comp PM
    Are you using Subtitle Edit for this job or the Whisper AI command line?
    Which model have you selected to do the transcription?
    You can try the simplest model and see how it goes and go up higher once it finishes.
    model medium and large are not going to work on your PC. You will see the difference when you install the GPU.
    Quote Quote  
  5. Member Seeker47's Avatar
    Join Date
    Jul 2005
    Location
    drifting, somewhere on the Sea of Cynicism
    Search Comp PM
    Originally Posted by Subtitles View Post
    Are you using Subtitle Edit for this job or the Whisper AI command line?
    Which model have you selected to do the transcription?
    You can try the simplest model and see how it goes and go up higher once it finishes.
    model medium and large are not going to work on your PC. You will see the difference when you install the GPU.

    It just seems to be spinning its wheels on this job. Nothing has shown up yet in the upper left processing window, which I don't think was the case for the previous jobs.

    Doing this under Subtitle Edit, which then hands off to Whisper. Using the Large model, as was the case for the previous 6 jobs -- all approx. the same size, and either with original French or German language. (No, wait -- there was one that was Japanese.) All of those succeeded, taking around 8 hours ea. to complete. Results ranged from satisfactory to quite good. I wasn't seeing any clear reason to deviate from this template that had worked several times, the only variable being the spoken language. But all of those had pretty clean soundtracks, with good recording and nothing much to interfere with that. I had not previewed the sound for this one so I don't know, but will go back to see where it stands.
    When in Las Vegas, don't miss the Pinball Hall of Fame Museum http://www.pinballmuseum.org/ -- with over 150 tables from 6+ decades of this quintessentially American art form.
    Quote Quote  
  6. Member Seeker47's Avatar
    Join Date
    Jul 2005
    Location
    drifting, somewhere on the Sea of Cynicism
    Search Comp PM
    After 12 hours and zilch to show for it, I pulled the plug on that job. May try it again after that video card arrives in a few days. Based on a quick check, I noted no obvious defects in the audio -- no people talking over each other, or poor recording, or obscuring background sound. So that was officially a first failure, once I'd worked out the rudiments of getting Whisper AI going.
    When in Las Vegas, don't miss the Pinball Hall of Fame Museum http://www.pinballmuseum.org/ -- with over 150 tables from 6+ decades of this quintessentially American art form.
    Quote Quote  
  7. Member
    Join Date
    Mar 2021
    Location
    Israel
    Search Comp PM
    Originally Posted by Seeker47 View Post
    After 12 hours and zilch to show for it, I pulled the plug on that job. May try it again after that video card arrives in a few days. Based on a quick check, I noted no obvious defects in the audio -- no people talking over each other, or poor recording, or obscuring background sound. So that was officially a first failure, once I'd worked out the rudiments of getting Whisper AI going.
    Start again and use model small. At least you will get somthing even if it is not very accurate.
    Quote Quote  
  8. Member Seeker47's Avatar
    Join Date
    Jul 2005
    Location
    drifting, somewhere on the Sea of Cynicism
    Search Comp PM
    Originally Posted by Subtitles View Post
    Originally Posted by Seeker47 View Post
    After 12 hours and zilch to show for it, I pulled the plug on that job. May try it again after that video card arrives in a few days. Based on a quick check, I noted no obvious defects in the audio -- no people talking over each other, or poor recording, or obscuring background sound. So that was officially a first failure, once I'd worked out the rudiments of getting Whisper AI going.
    Start again and use model small. At least you will get somthing even if it is not very accurate.
    I'll probably defer on this until that video card comes in -- enough other things going on in the interim anyway.

    For the sake of comparison, has anyone experimented with Deepl ? If so, was it any good for the translating ? I had their Win standalone app installed, but only went to check it out for the first time a few days ago. At first the app could not be found (it had installed itself far under Users in C:, normally a place I would never install anything), and then when I tried to run it from that location it promptly uninstalled itself. Always possible that I made some mistake . . . . Seemingly a dead end, nonetheless.
    When in Las Vegas, don't miss the Pinball Hall of Fame Museum http://www.pinballmuseum.org/ -- with over 150 tables from 6+ decades of this quintessentially American art form.
    Quote Quote  
  9. Member Seeker47's Avatar
    Join Date
    Jul 2005
    Location
    drifting, somewhere on the Sea of Cynicism
    Search Comp PM
    @Subtitles,

    Have you seen that bar graph which shows all the languages, and relatively how well Whisper performed in translating them ?
    If not I'm sure I can find the link and post it here.
    When in Las Vegas, don't miss the Pinball Hall of Fame Museum http://www.pinballmuseum.org/ -- with over 150 tables from 6+ decades of this quintessentially American art form.
    Quote Quote  
  10. Member
    Join Date
    Mar 2021
    Location
    Israel
    Search Comp PM
    Originally Posted by Seeker47 View Post
    @Subtitles,

    Have you seen that bar graph which shows all the languages, and relatively how well Whisper performed in translating them ?
    If not I'm sure I can find the link and post it here.
    Link
    https://github.com/openai/whisper

    I prefer to use the term trascription and not translation simply because I can check the final job while listening even in different languages.
    For translation there are several options, including running Whisper again.
    Last edited by Subtitles; 22nd Sep 2023 at 04:42.
    Quote Quote  



Similar Threads