VideoHelp Forum
  1. I've been experimenting with downloading PBS videos and closed captions as described here:
    http://forum.videohelp.com/threads/363819-Some-basic-questions-about-using-ffmpeg-to-d...oad-pbs-videos
    Judging by the srt text files for some of the TV programs, the captions look OK (I didn't verify by playing the video together with the subtitles).
    However, all downloads of Newshour seem to have the subtitles doubled up. See the example at the end of this post.
    My question is: what is the purpose of this, and how could it be fixed? I would think that if you watch the live stream of Newshour with captions, this problem would not appear, so the live player software must somehow correct for it. Any ideas?
    I have attached the srt subtitle file and also the corresponding 2 minute video file at the end of this post.


    1
    00:00:11,999 --> 00:00:18,998
    >> Ifill: BOSTON REMEMBERS THE
    LIVES LOST, AND THOSE STILL
    HEALING, ONE YEAR AFTER TWIN

    2
    00:00:18,999 --> 00:00:19,998
    LIVES LOST, AND THOSE STILL
    HEALING, ONE YEAR AFTER TWIN
    BOMBS STRUCK AT THE MARATHON'S

    3
    00:00:19,999 --> 00:00:21,998
    HEALING, ONE YEAR AFTER TWIN
    BOMBS STRUCK AT THE MARATHON'S
    FINISH LINE.

    4
    00:00:21,999 --> 00:00:22,998
    BOMBS STRUCK AT THE MARATHON'S
    FINISH LINE.
    GOOD EVENING, I'M GWEN IFILL.

    5
    00:00:22,999 --> 00:00:24,998
    FINISH LINE.
    GOOD EVENING, I'M GWEN IFILL.
    >> Woodruff: AND I'M JUDY

    6
    00:00:24,999 --> 00:00:25,998
    GOOD EVENING, I'M GWEN IFILL.
    >> Woodruff: AND I'M JUDY
    WOODRUFF.

    7
    00:00:25,999 --> 00:00:27,998
    >> Woodruff: AND I'M JUDY
    WOODRUFF.
    ALSO AHEAD TONIGHT, THE FIRST

    8
    00:00:27,999 --> 00:00:28,998
    WOODRUFF.
    ALSO AHEAD TONIGHT, THE FIRST
    SIGN OF A MILITARES

    9
    00:00:28,999 --> 00:00:30,998
    ALSO AHEAD TONIGHT, THE FIRST
    SIGN OF A MILITARES
    KIEV, AGAINST MOUNTING UNREST IN

    10
    00:00:30,999 --> 00:00:32,375
    SIGN OF A MILITARES
    KIEV, AGAINST MOUNTING UNREST IN
    UKRAINE'S EASTERN REGIONS.

    11
    00:00:32,459 --> 00:00:33,998
    KIEV, AGAINST MOUNTING UNREST IN
    UKRAINE'S EASTERN REGIONS.
    IT SAYS IT'S RECAPTURED AN

    12
    00:00:33,999 --> 00:00:34,998
    UKRAINE'S EASTERN REGIONS.
    IT SAYS IT'S RECAPTURED AN
    AIRPORT FROM PRO-RUSSIAN

    13
    00:00:34,999 --> 00:00:36,998
    IT SAYS IT'S RECAPTURED AN
    AIRPORT FROM PRO-RUSSIAN
    SEPARATISTS.

    14
    00:00:36,999 --> 00:00:38,998
    AIRPORT FROM PRO-RUSSIAN
    SEPARATISTS.
    >> Ifill: PLUS, THE SECOND OF

    15
    00:00:38,999 --> 00:00:40,998
    SEPARATISTS.
    >> Ifill: PLUS, THE SECOND OF
    JEFFREY BROWN'S REPORTS FROM THE

    16
    00:00:40,999 --> 00:00:41,998
    >> Ifill: PLUS, THE SECOND OF
    JEFFREY BROWN'S REPORTS FROM THE
    ASIAN NATION OF MYANMAR.

    17
    00:00:41,999 --> 00:00:43,998
    JEFFREY BROWN'S REPORTS FROM THE
    ASIAN NATION OF MYANMAR.
    TONIGHT, THE STRUGGLE TO

    18
    00:00:43,999 --> 00:00:45,998
    ASIAN NATION OF MYANMAR.
    TONIGHT, THE STRUGGLE TO
    PRESERVE IT'S GRAND ARCHITECTURE

    19
    00:00:45,999 --> 00:00:46,998
    TONIGHT, THE STRUGGLE TO
    PRESERVE IT'S GRAND ARCHITECTURE
    AND CULTURAL HISTORY, IN THE

    20
    00:00:46,999 --> 00:00:53,998
    PRESERVE IT'S GRAND ARCHITECTURE
    AND CULTURAL HISTORY, IN THE
    FACE OF RAPID CHANGE.

    21
    00:00:53,999 --> 00:00:54,998
    AND CULTURAL HISTORY, IN THE
    FACE OF RAPID CHANGE.
    >> THE CITY ONCE CALLED RANGOON

    22
    00:00:54,999 --> 00:00:55,998
    FACE OF RAPID CHANGE.
    >> THE CITY ONCE CALLED RANGOON
    IS OFTEN SAID TO BE FROZEN IN

    23
    00:00:55,999 --> 00:00:55,997
    >> THE CITY ONCE CALLED RANGOON
    IS OFTEN SAID TO BE FROZEN IN
    TIME.

    24
    00:00:55,998 --> 00:00:57,998
    IS OFTEN SAID TO BE FROZEN IN
    TIME.
    THAT'S CHANGING AND QUICKLY.

    25
    00:00:57,999 --> 00:00:59,998
    TIME.
    THAT'S CHANGING AND QUICKLY.
    THE KEY QUESTION IS HOW TO

    26
    00:00:59,999 --> 00:01:00,998
    THAT'S CHANGING AND QUICKLY.
    THE KEY QUESTION IS HOW TO
    PRESERVE SOMETHING OF THE PAST

    27
    00:01:00,999 --> 00:01:02,998
    THE KEY QUESTION IS HOW TO
    PRESERVE SOMETHING OF THE PAST
    WHILE MOVING INTO A 21s

    28
    00:01:02,999 --> 00:01:07,998
    PRESERVE SOMETHING OF THE PAST
    WHILE MOVING INTO A 21s
    21st CENTURY FUTURE.

    29
    00:01:07,999 --> 00:01:08,998
    WHILE MOVING INTO A 21s
    21st CENTURY FUTURE.
    >> Ifill: THOSE ARE JUST SOME OF

    30
    00:01:08,999 --> 00:01:09,998
    21st CENTURY FUTURE.
    >> Ifill: THOSE ARE JUST SOME OF
    THE STORIES WE'RE COVERING ON

    31
    00:01:09,999 --> 00:01:15,998
    >> Ifill: THOSE ARE JUST SOME OF
    THE STORIES WE'RE COVERING ON
    TONIGHT'S PBS NEWSHOUR.

    32
    00:01:15,999 --> 00:01:20,998
    >> Woodruff: THE CITY OF BOSTON
    PAUSED ON THIS FIRST ANNIVERSARY
    OF THE BOMBINGS THAT ERUPTED AT

    33
    00:01:20,999 --> 00:01:21,998
    PAUSED ON THIS FIRST ANNIVERSARY
    OF THE BOMBINGS THAT ERUPTED AT
    THE FINISH LINE OF THE BOSTON

    34
    00:01:21,999 --> 00:01:24,998
    OF THE BOMBINGS THAT ERUPTED AT
    THE FINISH LINE OF THE BOSTON
    MARATHON.

    35
    00:01:24,999 --> 00:01:25,998
    THE FINISH LINE OF THE BOSTON
    MARATHON.
    SURVIVORS, PUBLIC OFFICIALS AND

    36
    00:01:25,999 --> 00:01:26,998
    MARATHON.
    SURVIVORS, PUBLIC OFFICIALS AND
    THE GENERAL PUBLIC HONORED THE

    37
    00:01:26,999 --> 00:01:27,998
    SURVIVORS, PUBLIC OFFICIALS AND
    THE GENERAL PUBLIC HONORED THE
    DEAD AND OFFERED HOPE FOR THE

    38
    00:01:27,999 --> 00:01:33,998
    THE GENERAL PUBLIC HONORED THE
    DEAD AND OFFERED HOPE FOR THE
    FUTURE.

    39
    00:01:33,999 --> 00:01:36,998
    DEAD AND OFFERED HOPE FOR THE
    FUTURE.
    >> Woodruff: THE DAY BEGAN WITH

    40
    00:01:36,999 --> 00:01:37,998
    FUTURE.
    >> Woodruff: THE DAY BEGAN WITH
    A QUIET WREATH-LAYING AT THE

    41
    00:01:37,999 --> 00:01:38,998
    >> Woodruff: THE DAY BEGAN WITH
    A QUIET WREATH-LAYING AT THE
    SITE OF THE TWO BLASTS ON

    42
    00:01:38,999 --> 00:01:39,998
    A QUIET WREATH-LAYING AT THE
    SITE OF THE TWO BLASTS ON
    BOYLSTON STREET.

    43
    00:01:39,999 --> 00:01:42,998
    SITE OF THE TWO BLASTS ON
    BOYLSTON STREET.
    THE BROTHER AND SISTER OF THE

    44
    00:01:42,999 --> 00:01:43,998
    BOYLSTON STREET.
    THE BROTHER AND SISTER OF THE
    YOUNGEST VICTIM, 8-YEAR-OLD

    45
    00:01:43,999 --> 00:01:44,998
    THE BROTHER AND SISTER OF THE
    YOUNGEST VICTIM, 8-YEAR-OLD
    MARTIN RICHARD, LAID ONE OF THE

    46
    00:01:44,999 --> 00:01:47,998
    YOUNGEST VICTIM, 8-YEAR-OLD
    MARTIN RICHARD, LAID ONE OF THE
    WREATHS.

    47
    00:01:47,999 --> 00:01:49,998
    MARTIN RICHARD, LAID ONE OF THE
    WREATHS.
    A YEAR AGO, THE SCENE WAS VERY

    48
    00:01:49,999 --> 00:01:50,998
    WREATHS.
    A YEAR AGO, THE SCENE WAS VERY
    DIFFERENT, AS RUNNERS MADE THEIR

    49
    00:01:50,999 --> 00:01:52,998
    A YEAR AGO, THE SCENE WAS VERY
    DIFFERENT, AS RUNNERS MADE THEIR
    WAY TO THE FINISH LINE, THE TWO

    50
    00:01:52,999 --> 00:01:53,998
    DIFFERENT, AS RUNNERS MADE THEIR
    WAY TO THE FINISH LINE, THE TWO
    PRESSURE COOKER BOMBS EXPLODED

    51
    00:01:53,999 --> 00:01:57,998
    WAY TO THE FINISH LINE, THE TWO
    PRESSURE COOKER BOMBS EXPLODED
    WITHIN MOMENTS OF EACH OTHER.
    Last edited by jimdagys; 16th Apr 2014 at 15:11.
  2. Member DB83
    Join Date: Jul 2007
    Location: United Kingdom
    It may be more useful to post the originally downloaded subtitle file.

    The converted file is so out of sync as to be useless.

    I cannot access the original video from this source, but my experience of closed-caption titles is that they are usually displayed one line at a time to give the viewer a fair chance to read them.
  3. Here is the original dfxp file. I loaded it into Subtitle Edit to get the srt. Converting to srt does not change the doubling up.
  4. This kind of subtitle is probably from a live event or sports broadcast.
  5. Thank you very much for your answer. I was looking for some reason, i.e. some difference between "Newshour" and "This Old House", and I think you got it. I guess the live stream player has some ability to correct this kind of problem. I will look at the captions of other PBS shows and see whether any others are doubled up. If it is only "Newshour", then it is no big deal.
    I'm still curious how the live stream player can correct doubled-up captions. My internet connection is so poor that I have no way to watch the streamed version of any PBS show and look at the captions.
    Last edited by jimdagys; 16th Apr 2014 at 19:17.
  6. Member DB83
    Join Date: Jul 2007
    Location: United Kingdom
    Yes, there appears to be doubling up, but the timings of the two files are different.

    The doubling is not the critical issue here, since that is probably how it looks in real time, with the duplicated lines performing a 'blending', whereas normal subtitles for a DVD etc. are always different.

    The conversion is not precise enough. The original is http-based (I think that is the correct term), so, again, I guess that the web player can read the timings correctly.
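
    As a rough way to check the converted timings, here is a minimal Python sketch that flags cues whose end time is not later than their start time. The function names are made up for illustration, and the sample is cut down from cues 22 to 24 of the srt in the first post (text lines shortened), where cue 23 ends 2 ms before it starts.

```python
import re

# Matches an SRT timing line such as "00:00:55,999 --> 00:00:55,997".
TIMING = re.compile(
    r"(\d+):(\d{2}):(\d{2}),(\d{3})\s*-->\s*(\d+):(\d{2}):(\d{2}),(\d{3})"
)

def to_ms(h, m, s, ms):
    """Convert the four timestamp fields to milliseconds."""
    return ((int(h) * 60 + int(m)) * 60 + int(s)) * 1000 + int(ms)

def find_bad_timings(srt_text):
    """Return (start_ms, end_ms) for every cue that ends at or before its start."""
    bad = []
    for match in TIMING.finditer(srt_text):
        fields = match.groups()
        start, end = to_ms(*fields[:4]), to_ms(*fields[4:])
        if end <= start:
            bad.append((start, end))
    return bad

# Abbreviated from cues 22-24 of the Newshour srt (timings unchanged).
sample = """22
00:00:54,999 --> 00:00:55,998
FACE OF RAPID CHANGE.

23
00:00:55,999 --> 00:00:55,997
TIME.

24
00:00:55,998 --> 00:00:57,998
THAT'S CHANGING AND QUICKLY.
"""

print(find_bad_timings(sample))  # prints [(55999, 55997)]
```

    This only detects the problem; it does not say whether the fault lies in the dfxp itself or in the conversion.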
  7. Ok, I got the Nature program about Honey Badgers.

    Link to subtitle file:
    HTML Code:
    http://ga.video.cdn.pbs.org/captions/nature/fde0b61f-e674-427a-a9b5-c5b8eb8c520c/captions/148970_Encoded.dfxp
    Link to video:
    Code:
    http://ga.video.cdn.pbs.org/videos/nature/fde0b61f-e674-427a-a9b5-c5b8eb8c520c/117230/hd-mezzanine-16x9/naat3106_mezz16x9-16x9-mp4-baseline.mp4
    The subtitles look fine (no doubling). And they play fine with the video. I checked from beginning to end of the show and no problem. I converted the dfxp file to srt with Subtitle Edit. (Same as with converting "Newshour" subtitles.) Anybody have some more ideas about the messed up subtitles in "Newshour"?
    Link to srt file and video clip at end of this post.

    1
    00:00:07,999 --> 00:00:09,999
    NARRATOR:
    Out on the African plains,


    2
    00:00:09,999 --> 00:00:11,999
    even the king of beasts
    makes way


    3
    00:00:11,999 --> 00:00:14,999
    for one relentless
    little warrior.


    4
    00:00:14,999 --> 00:00:18,751
    Meet the savannah's
    true tough guy,


    5
    00:00:18,751 --> 00:00:19,999
    the honey badger.


    6
    00:00:19,999 --> 00:00:23,999
    Known to be absolutely fearless,


    7
    00:00:23,999 --> 00:00:27,999
    honey badgers never give up,
    never give in.


    8
    00:00:27,999 --> 00:00:29,999
    MAN: The honey badger to me


    9
    00:00:29,999 --> 00:00:31,999
    is so brave and so courageous
    and so determined


    10
    00:00:31,999 --> 00:00:33,999
    that you can't help
    but love him.


    11
    00:00:33,999 --> 00:00:35,999
    NARRATOR: But he's simply
    impossible to live with.


    12
    00:00:35,999 --> 00:00:38,999
    MAN: The first honey badger
    I took in was Stoffel.


    13
    00:00:38,999 --> 00:00:40,999
    He caused chaos in the house.


    14
    00:00:40,999 --> 00:00:42,999
    I mean, he'd come
    and break through the window,


    15
    00:00:42,999 --> 00:00:44,999
    go to the kitchen,
    kick the fridge open.


    16
    00:00:44,999 --> 00:00:47,999
    And you don't dare stop him.


    17
    00:00:50,999 --> 00:00:53,999
    NARRATOR: And trying to follow
    honey badgers out in the bush


    18
    00:00:53,999 --> 00:00:55,999
    isn't any easier.


    19
    00:00:55,999 --> 00:00:57,999
    MAN: It's actually
    quite frustrating.


    20
    00:00:57,999 --> 00:01:00,999
    I see these animals literally
    every single night.


    21
    00:01:00,999 --> 00:01:03,999
    They come in,
    cause chaos inside camp.


    22
    00:01:03,999 --> 00:01:05,999
    As soon as morning hits,
    they just disappear.


    23
    00:01:06,999 --> 00:01:10,999
    NARRATOR: But instead of
    courage or ferocity,


    24
    00:01:10,999 --> 00:01:12,999
    their sharpest weapon
    might just be


    25
    00:01:12,999 --> 00:01:15,999
    one of the best brains
    in all the wild.


    26
    00:01:15,999 --> 00:01:16,999
    MAN: Stoffel!
    Last edited by jimdagys; 17th Apr 2014 at 01:10.
  8. Member Cornucopia
    Join Date: Oct 2001
    Location: Deep in the Heart of Texas
    ???

    They aren't "Doubling up" - They are meant to be SCROLLING! If you watch it while they play, that's what should be going on.

    Example:

    The phrase:

    "Now is the time for
    all good men to come
    to the aid of their country."

    Which is split up into 3 lines to keep it centered/clustered, would look like:

    Code:
    Time 1 to Time 2
    "Now is the time for
    
    Time 2 to Time 3
    "Now is the time for
    all good men to come
    
    Time 3 to Time 4
    all good men to come
    to the aid of their country."
    
    Time 4 to Time 5
    to the aid of their country."
    
    Time 5 and onward
    <BLANK>
    Works just like scrolling credits or closed-captions.
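
    The same scrolling can be sketched in a few lines of Python. This is only an illustration of the idea, not how any particular caption encoder works: `roll_up` and its `window` parameter are hypothetical names, and `window=2` matches the two-line display in the example above (the Newshour file appears to use three lines).

```python
def roll_up(lines, window=2):
    """Yield the list of lines visible at each step of a roll-up display."""
    visible = []
    for line in lines:
        # A new line enters at the bottom; the oldest scrolls off the top.
        visible.append(line)
        visible = visible[-window:]
        yield list(visible)
    # After the last line has appeared, keep scrolling until one line is left.
    while len(visible) > 1:
        visible = visible[1:]
        yield list(visible)

phrase = [
    '"Now is the time for',
    "all good men to come",
    'to the aid of their country."',
]

for step, shown in enumerate(roll_up(phrase), start=1):
    print(f"Time {step} to Time {step + 1}")
    print("\n".join(shown))
    print()
```

    Each yielded list corresponds to one cue in the converted file, which is why consecutive cues share lines.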

    Nothing wrong with the file except your expectations about it.

    Scott
    Last edited by Cornucopia; 17th Apr 2014 at 02:56.
    "When will the rhetorical questions end?!" - George Carlin
  9. I agree, this is one of the multiple ways that closed captions can be displayed.

    However, I would say that the SRT conversion process did not work correctly.

    IIRC, Subtitle Workshop could remove the duplicates very easily; it was a built-in function.

    There used to be a guy named McPoodle who had a website dedicated to CC tools; I don't know if it's still there. Lots of good stuff there.
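
    For anyone who wants to strip the repeated lines by hand, here is a minimal Python sketch. It is not Subtitle Workshop's method, just one plausible approach with made-up function names: for each cue, drop the leading lines that were already at the bottom of the previous cue and keep only the new ones. It assumes a well-formed srt and would also drop dialogue that is genuinely repeated word for word.

```python
import re

def parse_srt(text):
    """Parse SRT text into a list of (timing_line, [text_lines]) tuples."""
    cues = []
    for block in re.split(r"\n\s*\n", text.strip()):
        lines = [ln.strip() for ln in block.splitlines() if ln.strip()]
        # Locate the timing line; a cue index may or may not precede it.
        for i, ln in enumerate(lines):
            if "-->" in ln:
                cues.append((ln, lines[i + 1:]))
                break
    return cues

def overlap(prev, cur):
    """Length of the longest suffix of prev that is also a prefix of cur."""
    for k in range(min(len(prev), len(cur)), 0, -1):
        if prev[-k:] == cur[:k]:
            return k
    return 0

def remove_roll_up(text):
    """Return SRT text in which each cue keeps only its newly added lines."""
    out, prev = [], []
    for timing, lines in parse_srt(text):
        new = lines[overlap(prev, lines):]
        prev = lines
        if new:  # skip cues that only repeat earlier lines
            out.append((timing, new))
    return "\n\n".join(
        f"{n}\n{timing}\n" + "\n".join(new)
        for n, (timing, new) in enumerate(out, start=1)
    ) + "\n"

# First three cues of the Newshour srt from the first post.
sample = """1
00:00:11,999 --> 00:00:18,998
>> Ifill: BOSTON REMEMBERS THE
LIVES LOST, AND THOSE STILL
HEALING, ONE YEAR AFTER TWIN

2
00:00:18,999 --> 00:00:19,998
LIVES LOST, AND THOSE STILL
HEALING, ONE YEAR AFTER TWIN
BOMBS STRUCK AT THE MARATHON'S

3
00:00:19,999 --> 00:00:21,998
HEALING, ONE YEAR AFTER TWIN
BOMBS STRUCK AT THE MARATHON'S
FINISH LINE.
"""

print(remove_roll_up(sample))
```

    The timings are left untouched, so each new line is shown only for the length of its own cue; some of those are about one second long and may need to be extended to stay readable.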