VideoHelp Forum




  1. Member
    Join Date
    May 2015
    Location
    Perth, Western Australia.
    I am trying to convert teletext from TV tuner recordings to an .srt file, and it seems the way the captions are formatted differs between shows. I have run into this with a few shows:
    [Screenshot: videohelp_forums_01.png]

    I was hoping there was a program that could go through and combine multiple entries which share text, like the ones above, into a single entry, giving it the starting time of the first combined entry and the ending time of the last. (What is each entry actually called in a subtitle file?) Failing that, is there a program where I could script that sort of functionality? Having the captions like the above causes jumpy playback, because new words are added in quick succession rather than a section of text at a time. It also makes fixing the timing very difficult, which would be much easier with a single entry as I described.

    Any other advice would be appreciated. For example, is there a better way to record teletext (in Australia)? I am currently using MediaPortal.

    Thanks for any help!
  2. I'm a MEGA Super Moderator Baldrick's Avatar
    Join Date
    Aug 2000
    Location
    Sweden
    How do you convert the teletext to srt?

    I don't think any subtitle program can merge multiple entries.
  3. Member
    Join Date
    May 2015
    Location
    Perth, Western Australia.
    Originally Posted by Baldrick
    How do you convert the teletext to srt?

    I don't think any subtitle program can merge multiple entries.
    I am currently using CCExtractorGUI, then opening the .srt file for editing in Subtitle Workshop (which is where the screenshot in the OP was taken). As for merging multiple entries, the program doesn't have to literally merge them; I just need the end result I described. That is: find successive entries that share text, as in the OP screenshot, and create a new entry containing their text (with all duplicated text removed), the starting time of the first combined entry, and the ending time of the last. The old entries can then be deleted.

    I could do this manually by following the process I just described, but going through the entire file that way would take a LONG time. So I'm wondering whether there is any way to automate this process, or a better way to record/convert teletext to subtitles that avoids the problem completely.
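
    The process described above could be scripted roughly as follows. This is only a minimal sketch under stated assumptions, not a feature of any of the programs mentioned: it assumes each successive entry repeats the previous entry's text and adds words to the end of it (the screenshot contents are not reproduced here, so that pattern is a guess), and the names `Cue`, `parse_srt`, `merge_growing` and `to_srt` are hypothetical helpers invented for illustration.

```python
import re
from dataclasses import dataclass
from typing import List


@dataclass
class Cue:
    start: str  # SRT timestamp, e.g. "00:00:01,000"
    end: str
    text: str


# Matches the "start --> end" timing line of an SRT entry.
TIMING = re.compile(r"(\d{2}:\d{2}:\d{2},\d{3})\s*-->\s*(\d{2}:\d{2}:\d{2},\d{3})")


def parse_srt(data: str) -> List[Cue]:
    """Split SRT text into cues; blocks without a timing line are skipped."""
    cues = []
    for block in re.split(r"\n\s*\n", data.strip()):
        lines = block.splitlines()
        for i, line in enumerate(lines):
            match = TIMING.search(line)
            if match:
                text = "\n".join(lines[i + 1:]).strip()
                cues.append(Cue(match.group(1), match.group(2), text))
                break
    return cues


def _normalise(text: str) -> str:
    # Collapse whitespace so that moved line breaks do not block a match.
    return " ".join(text.split())


def merge_growing(cues: List[Cue]) -> List[Cue]:
    """Merge runs of cues where each cue's text starts with the previous text.

    Assumption: the captions build up by appending words. The merged cue
    keeps the first start time, the last end time and the longest text.
    """
    merged: List[Cue] = []
    for cue in cues:
        previous = _normalise(merged[-1].text) if merged else ""
        if previous and _normalise(cue.text).startswith(previous):
            merged[-1] = Cue(merged[-1].start, cue.end, cue.text)
        else:
            merged.append(Cue(cue.start, cue.end, cue.text))
    return merged


def to_srt(cues: List[Cue]) -> str:
    """Serialise cues back to SRT text, renumbering from 1."""
    return "\n".join(
        f"{number}\n{cue.start} --> {cue.end}\n{cue.text}\n"
        for number, cue in enumerate(cues, 1)
    )
```

    To use it on a file, read the .srt into a string, pass it through `parse_srt` and `merge_growing`, and write the result of `to_srt` back out. The prefix test is a plain character comparison, so it would need tightening if a show's captions are corrected or re-wrapped as they grow.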
  4. Member
    Join Date
    Jul 2011
    Location
    Denmark
    Try Subtitle Edit, Tools -> Merge lines with same text...
  5. I'm a MEGA Super Moderator Baldrick's Avatar
    Join Date
    Aug 2000
    Location
    Sweden
    Originally Posted by Nikse
    Try Subtitle Edit, Tools -> Merge lines with same text...


    It supports everything!
  6. Member
    Join Date
    May 2015
    Location
    Perth, Western Australia.
    Originally Posted by Nikse
    Try Subtitle Edit, Tools -> Merge lines with same text...
    Thanks, this does the trick. It's not perfect: lines that share only part of their text, in different positions, are not merged.
    E.g. "Hello how" and "how are you" are left with the duplicate "how" in my use, but it is still a massive reduction in workload. If anyone knows how to solve this smaller issue, that would be appreciated. I can't see any tools that would do it.
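
    One way the leftover overlap could be handled in a script is to look for the longest run of words that ends the first entry and also begins the second, and keep it only once. This is a hedged sketch rather than a known feature of any subtitle tool: `join_overlapping` is a hypothetical helper, the comparison is word-based and ignores case (an assumption), and line breaks within an entry are collapsed to single spaces.

```python
def join_overlapping(first: str, second: str) -> str:
    """Join two caption texts, dropping words duplicated at the seam.

    The longest run of words that ends `first` and starts `second`
    is kept once. Texts with no shared run are simply concatenated.
    """
    a, b = first.split(), second.split()
    # Try the largest possible overlap first, then shrink.
    for size in range(min(len(a), len(b)), 0, -1):
        tail = [word.lower() for word in a[-size:]]
        head = [word.lower() for word in b[:size]]
        if tail == head:
            return " ".join(a + b[size:])
    return " ".join(a + b)


print(join_overlapping("Hello how", "how are you"))
```

    Deciding which neighbouring entries belong together is a separate question that this sketch does not answer; it only shows how the text of two entries already chosen for merging could be combined. Punctuation attached to a word (for example "how," versus "how") would prevent a match unless it is stripped first.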

    Thanks a lot!
    Last edited by callmeclean; 6th May 2015 at 06:15.


