I'm trying to extrat subs from a video that displayes the sub like karaoke: characters are displayed in sequence -one character after one. The speed of the sequence varies. I am missing quite a bit of the subs. Is there a way for me to set the intervals for sub recognition to be shorter? Or any other tips on how to capture such subs?

Thanks!