I've been using Subtitle Edit to convert SRT files to TTML (Timed Text). The issue I'm having is that I will run into files that won't convert in the batch because they contain certain hexadecimal characters that aren't supported by TTML (which is XML). I can go through and manually remove the unsupported hexadecimal characters and convert again. However, doing a find and replace for several hexadecimal characters in a single SRT file is very time consuming.
My question is, is there anything I can do to automate the removal of these unsupported hexadecimal characters in the SRT files before converting to TTML. Any help would be greatly appreciated.
Thanks!
+ Reply to Thread
Results 1 to 4 of 4
-
Last edited by Kodae; 6th Mar 2015 at 08:19.
-
I would do this in Notepad++ with a regex expression or multiple replace commands in a macro.
-
Could be just the codepage/character encoding needs to be converted (e.g. UTF-16<->UTF-8<->ASCII<->ANSI<->Windows Latin1, etc).
Scott -
I thought Kodae was referring the Famous Five, though perhaps (s)he will give us an example.
" "
' '
< <
> >
& &