Oops, some formatting errors. My bad. I fixed it by importing it into Subtitle Edit and exporting it again as .srt. Try this one: test.srt
Edit: There will be some random lines left empty because the OCR went kaput, but I'm gonna look into it. There are like 8 lines out of 310 empty. Gonna have to try different image processing techniques.
-
--[----->+<]>.++++++++++++.---.--------.
[*drm mass downloader: widefrog*]~~~[*how to make your own mass downloader: guide*] -
thank you
now it opens, but like that... (gibberish)
it happened to me too, I didn't find a way to make it work with Subtitle Workshop
and by the way, what program did you use to make the srt? -
It works for me... in Subtitle Edit. Isn't Subtitle Workshop kinda dead anyway? It's been some time since the last update.
[Attachment 77034 - Click to enlarge]
As for how I got it, I just used Python and Tesseract OCR and coded my own stuff. Subtitle Edit was used to export all 310 png images and their corresponding timestamps, and then in Python I ran Tesseract OCR on each image. Leaving the images unaltered gave a bunch of gibberish, so I used binarization to improve the success rate. That left 8 lines empty; I played around with some other techniques and now there are only 3 or 4 empty. I don't think I can do better than that with what I currently know anyway. I'm sure there are users here who are experts when it comes to this stuff.
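For anyone wanting to try the same approach, here is a minimal sketch of the binarize-then-OCR step described above. It is not the script from this thread: the post doesn't say which binarization method was used, so Otsu's threshold is swapped in as one common choice, and the function names, the `lang` default and the `--psm 6` page segmentation mode are all assumptions. It needs Pillow and pytesseract installed, plus the Tesseract language data for whatever language you pass in (`heb` for Hebrew).

```python
def otsu_threshold(histogram):
    """Pick the grey level that best separates dark and light pixels (Otsu's method).

    `histogram` is a list of pixel counts per grey level, e.g. 256 entries.
    """
    total = sum(histogram)
    sum_all = sum(level * count for level, count in enumerate(histogram))
    best_level, best_variance = 0, -1.0
    weight_bg = 0   # number of pixels at or below the candidate level
    sum_bg = 0      # sum of grey values of those pixels
    for level, count in enumerate(histogram):
        weight_bg += count
        sum_bg += level * count
        if weight_bg == 0:
            continue
        weight_fg = total - weight_bg
        if weight_fg == 0:
            break
        mean_bg = sum_bg / weight_bg
        mean_fg = (sum_all - sum_bg) / weight_fg
        # Between-class variance (unnormalised); larger means a cleaner split.
        variance = weight_bg * weight_fg * (mean_bg - mean_fg) ** 2
        if variance > best_variance:
            best_variance, best_level = variance, level
    return best_level


def ocr_subtitle_image(path, lang="eng"):
    """Binarize one exported subtitle image and run Tesseract on it.

    Assumption: `path` points to a PNG exported by Subtitle Edit.
    """
    # Imported here so the threshold helper works without these packages.
    from PIL import Image
    import pytesseract

    gray = Image.open(path).convert("L")
    threshold = otsu_threshold(gray.histogram())
    # Pixels above the threshold become white, everything else black.
    binary = gray.point(lambda p: 255 if p > threshold else 0)
    # --psm 6 treats the image as a single uniform block of text (an assumption).
    return pytesseract.image_to_string(binary, lang=lang, config="--psm 6").strip()
```

Tesseract generally does better with dark text on a light background, so if the exported images have light text on a dark background, inverting them before OCR may be worth a try.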
-
Ok, so I fixed some other problems and added a helpful "warning" so you could see what lines to add manually. Now keep in mind, just because the other lines "seem" to be ok, that doesn't mean they are, so a fast proofread could be good.
[Attachment 77036 - Click to enlarge]
The final srt file (with 8 lines missing) is this: output.srt. I dunno if the other programs could already do this automatically, but for me it was easier to write the script than to read a bunch of documentation. I tried it in Subtitle Edit and I couldn't see any option to process the images through specific methods.
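To illustrate the "warning" idea, here is a rough sketch of how OCR results could be written out as .srt with a visible placeholder wherever the OCR came back empty. This is not the actual script; the `(start_ms, end_ms, text)` tuple layout, the function names and the placeholder text are hypothetical.

```python
def format_timestamp(ms):
    """Convert milliseconds to the SRT timestamp form HH:MM:SS,mmm."""
    hours, rem = divmod(ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    seconds, millis = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d},{millis:03d}"


def build_srt(cues, placeholder="[OCR FAILED - ADD MANUALLY]"):
    """Build SRT text from (start_ms, end_ms, text) tuples.

    Returns the SRT text and the list of cue numbers whose OCR text was
    empty, so those lines can be filled in by hand.
    """
    blocks = []
    missing = []
    for index, (start_ms, end_ms, text) in enumerate(cues, start=1):
        text = text.strip()
        if not text:
            missing.append(index)
            text = placeholder  # visible marker inside the subtitle file
        timing = f"{format_timestamp(start_ms)} --> {format_timestamp(end_ms)}"
        blocks.append(f"{index}\n{timing}\n{text}\n")
    # Cues are separated by one blank line.
    return "\n".join(blocks), missing
```

Searching the resulting file for the placeholder text then shows exactly which lines still need manual work. The other lines still need a proofread, since non-empty OCR output can be wrong too.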
-
I'm just curious if there is a library for recognizing Hebrew characters in Subtitle Edit and Subtitle Workshop. At least Subtitle Edit recognizes the graphics dvb_subtitles nightmare pretty well, but it fails at the conversion to text.
BTW: ffmpeg and MKVToolNix fail to extract the graphics version, so they're not worth trying. -
2nHxWW6GkN1l916N3ayz8HQoi did a great job... as he said, he did it with a script...
I tried the OCR in TS Doctor... it seems decent too...
about Subtitle Edit, in the past I got a database of Hebrew characters for it, but that was a long time ago, I don't have it anymore, and I don't remember where I got it from...
thanks for the info noemi7 -
If by extracting you mean, you want the subtitle data in some external files completely separated from the video container data, you can do this by using subtitle edit. Though I have no idea how those files would be useful on their own or how they could be "imported" to another video container like normal text based subtitles.
[Attachment 77041 - Click to enlarge]
Edit: I checked and the sub/idx pair can actually be imported into another mkv file. Good to know.
Last edited by 2nHxWW6GkN1l916N3ayz8HQoi; 15th Feb 2024 at 14:09.
-
Funny thing is that you can import them to mkv directly from a ts as dvb_subtitles but mkvextract doesn't support their extraction.
-
Yet if you have an MKV file with PGS subs in it, my Zidoo UHD3000 media player shows them, and if I use Inviska MKV to extract the PGS as a standalone subtitle, it comes out as a .SUP file, a graphics format that can then be OCRed in Subtitle Edit to produce an .srt or .ass sub.
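For the PGS case, the same extraction can be scripted with the MKVToolNix command-line tools. The sketch below assumes `mkvmerge` and `mkvextract` are on the PATH; the function names and output file names are hypothetical. It reads the track list from `mkvmerge -J`, picks the tracks whose codec ID is `S_HDMV/PGS`, and builds the matching `mkvextract` call. As noted in this thread, this will not help with dvb_subtitles tracks.

```python
import json
import subprocess


def pgs_track_ids(identify_json):
    """Return the IDs of PGS subtitle tracks from `mkvmerge -J` output."""
    info = json.loads(identify_json)
    return [
        track["id"]
        for track in info.get("tracks", [])
        if track.get("properties", {}).get("codec_id") == "S_HDMV/PGS"
    ]


def mkvextract_command(mkv_path, track_id, out_path):
    """Build the argument list for extracting one track to a file."""
    return ["mkvextract", mkv_path, "tracks", f"{track_id}:{out_path}"]


def extract_pgs(mkv_path):
    """Extract every PGS track to trackN.sup (hypothetical naming)."""
    result = subprocess.run(
        ["mkvmerge", "-J", mkv_path],
        capture_output=True, text=True, check=True,
    )
    for track_id in pgs_track_ids(result.stdout):
        command = mkvextract_command(mkv_path, track_id, f"track{track_id}.sup")
        subprocess.run(command, check=True)
```

The resulting .sup files can then be opened in Subtitle Edit for OCR, as described above.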
SONY 75" Full array 200Hz LED TV, Yamaha A1070 amp, Zidoo UHD3000, BeyonWiz PVR V2 (Enigma2 clone), Chromecast, Windows 11 Professional, QNAP NAS TS851 -
Inviska (which uses mkvextract) cannot extract subtitles from the video uploaded by benhouli after it is re-muxed as mkv. Are DVB subtitles coded differently in Australia? I'm in Europe, and the streams I capture have text-based or hardcoded subtitles. I found a few channels that have a graphics DVB subtitles PID, but in fact there is no data in it.