VideoHelp Forum
  1. I couldn't post this under Videos, so I posted it here.

    I'm trying to download Unicode subtitles, but they are being saved as ANSI, which results in all kinds of funny characters when they are displayed or opened in Notepad++.
    How can I solve this problem?

    Thanks anyway
  2. Use Notepad (not ++) and select Unicode as the encoding when saving.

    [Attached image: Unicode.jpg]
  3. I'm not writing the subtitles myself; I download them from the internet. Whatever editor I open them in shows the same funny characters.
  4. DECEASED
    Join Date
    Jun 2009
    Location
    Heaven
    Probably what you have ARE NOT "Unicode" subtitle files, but plain-ANSI text written for a different "codepage".

    OR, equally possible, there is a problem in the Web server of the site you download from, and/or in the Web browser you're using.

    If you upload a sample file to this forum, we may take a look at it.
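
    To illustrate the "different codepage" point, here is a minimal Python sketch. The sample string is hypothetical, and GBK and Latin-1 are only assumed stand-ins for the codepage the text was written in and the one a viewer guesses:

```python
# Hypothetical sample text; any Chinese string would do.
original = "你好，世界"

# The bytes a Simplified Chinese (GBK / codepage 936) editor would save.
raw = original.encode("gbk")

# What a Western-European viewer makes of the very same bytes.
# latin-1 is used because it maps every byte, so decoding never fails.
garbled = raw.decode("latin-1")

# Decoding with the codepage the text was written for recovers it.
recovered = raw.decode("gbk")

print(repr(garbled))
print(recovered)
```

    The bytes on disk never change; only the codepage used to interpret them does, which is why the same file can look fine in one program and garbled in another.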
  5. There you go

    This is just a sample. I have been trying different subs from the internet, and I see all of them being saved as ANSI.
  6. I'm a Super Moderator johns0
    Join Date
    Jun 2002
    Location
    canada
    Thread moved to the subtitle forum.
    I think,therefore i am a hamster.
  7. DECEASED
    Thanks for the upload

    Hmmm, it looks like that SRT, as I said, is "ANSI plus Simplified Chinese", plus a UTF-8 header (a.k.a. BOM) at the beginning. I used EditPlus to reload it under the Simplified Chinese codepage, and saved the result as a Unicode file, which appears fine in WordPad. But since I don't understand Chinese, I really don't know if what I got is just nonsense written with Chinese characters. Please check the modified (hopefully fixed) .SRT.
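
    The same repair can be scripted. This is only a sketch of the steps described above, not what EditPlus does internally; the function name `reencode_subtitle`, the file paths, and the default `gbk` codepage are assumptions for illustration:

```python
import codecs
from pathlib import Path

def reencode_subtitle(src, dst, codepage="gbk"):
    """Re-save a codepage-encoded subtitle as UTF-16 ("Unicode" in Notepad)."""
    data = Path(src).read_bytes()
    # Drop a misleading UTF-8 BOM that was prepended to non-UTF-8 text.
    if data.startswith(codecs.BOM_UTF8):
        data = data[len(codecs.BOM_UTF8):]
    # Raises UnicodeDecodeError if the codepage guess is wrong.
    text = data.decode(codepage)
    # The "utf-16" codec writes its own BOM in the machine's byte order.
    Path(dst).write_bytes(text.encode("utf-16"))
    return text
```

    A call would look like `reencode_subtitle("movie.srt", "movie.fixed.srt")`, where both file names are placeholders. Working on raw bytes keeps the original line endings untouched.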
  8. Problem solved
    Thanks for trying to help me guys, appreciated
    In MPC, go to Subtitles - Styles and, in the Font group box, select GB2312 (134).
    I have no idea what that stands for, but it seems to work on all subtitles for now.
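
    For reference, the numbers in parentheses appear to be Windows GDI character-set identifiers (the `lfCharSet` values used when a font is selected). The sketch below maps the three identifiers mentioned in this thread to the ANSI codepage each one usually implies; the helper name `decode_with_charset` is hypothetical, and how MPC uses the value internally is an assumption:

```python
# Windows GDI charset identifiers and the ANSI codepage each usually implies.
CHARSET_TO_CODEPAGE = {
    134: "cp936",   # GB2312_CHARSET: Simplified Chinese
    136: "cp950",   # CHINESEBIG5_CHARSET: Traditional Chinese
    178: "cp1256",  # ARABIC_CHARSET
}

def decode_with_charset(data: bytes, charset_id: int) -> str:
    """Decode subtitle bytes with the codepage implied by a charset id."""
    return data.decode(CHARSET_TO_CODEPAGE[charset_id])
```

    Picking the wrong identifier for a codepage-encoded file is equivalent to decoding its bytes with the wrong codepage.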
    [Attached image: Capture.PNG]

    Last edited by Maxand; 2nd Jul 2012 at 04:04.
  9. DECEASED
    Originally Posted by Maxand
    Thanks for trying to help me guys, but it's still not working.
    @El Heggunte, the file you uploaded looks like Chinese but it's not.
    I'm sorry for that.

    This is so confusing, I download these subtitles from opensubtitles.org and other sites
    Arabic works, Chinese doesn't, and they are both Unicode

    I also tried downloading these subtitle files using different browsers, which shouldn't matter because they are zip files.
    I noticed that when I want to display Arabic subtitles I need to go to Subtitles - Styles and select Arabic(178) (under the Font group box).
    But when I do the same for Chinese (Chinesebig5(136)) it doesn't work; it just displays Chinese-like funny characters.
    I presume I know what you mean by "funny characters" (some Greek letters, some mathematical symbols, and a lot of ??? and/or "blank boxes").
    Apparently something goes very wrong with the Chinese subtitles BEFORE they are zipped by those sites. As I tried to say, a UTF-8 BOM at the beginning of a text file IS NOT what transforms "codepaged ANSI" into actual UTF-8. Unfortunately, there are a LOT of programmers who still haven't learned what UTF-8 or Unicode really means.
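
    One way to check whether a BOM is telling the truth is to test the rest of the file with a strict UTF-8 decode. A minimal sketch follows; the function name `classify` and the sample text are made up for illustration:

```python
import codecs

def classify(data: bytes) -> str:
    """Report whether a UTF-8 BOM (if any) matches what the bytes really are."""
    has_bom = data.startswith(codecs.BOM_UTF8)
    body = data[len(codecs.BOM_UTF8):] if has_bom else data
    try:
        body.decode("utf-8")  # strict decoding: fails on non-UTF-8 bytes
        is_utf8 = True
    except UnicodeDecodeError:
        is_utf8 = False
    if has_bom:
        return "UTF-8 with BOM" if is_utf8 else "BOM present, but body is not UTF-8"
    return "UTF-8 without BOM" if is_utf8 else "not UTF-8"

sample = "你好"  # hypothetical sample text
print(classify(codecs.BOM_UTF8 + sample.encode("utf-8")))
print(classify(codecs.BOM_UTF8 + sample.encode("gbk")))
```

    This is a heuristic only: plain ASCII passes as valid UTF-8, and a codepage-encoded file can occasionally happen to form valid UTF-8 sequences.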
  10. Hi El Heggunte,
    I edited my last post and it works now. Could you explain what that option in the picture stands for?
  11. DECEASED
    Originally Posted by Maxand
    Hi El Heggunte,
    I edited my last post and it works now. Could you explain what that option in the picture stands for?
    It stands for "Simplified Chinese".

    Just for the record, there IS a difference between "character set" and "(type of) encoding". The file I uploaded IS real Unicode (UTF-16 little-endian, which uses 2 bytes for each of these characters).
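
    The distinction can be seen by encoding the same characters in several ways. A small sketch, with a hypothetical two-character sample:

```python
text = "中文"  # hypothetical sample: the same two characters throughout

sizes = {}
for encoding in ("gb2312", "utf-8", "utf-16-le"):
    encoded = text.encode(encoding)
    sizes[encoding] = len(encoded)
    print(f"{encoding:10} {len(encoded)} bytes: {encoded.hex(' ')}")
    # Each byte sequence decodes back to the same characters.
    assert encoded.decode(encoding) == text
```

    The characters stay the same while the bytes differ, so a program has to know (or be told) which encoding was used before it can show the text correctly.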
  12. How is it possible that when I choose that option (GB2312 (134)) it displays both Chinese AND Arabic subtitles fine?!
    Also, when I unzip those subtitle files and open them in Notepad++ or any other editor, their format is shown as ANSI, yet MPC displays them fine!
    How can one edit those files, if one wanted to, when they show funny characters in editors?
    Strange!
  13. DECEASED
    Originally Posted by Maxand
    How is it possible that when I choose that option (GB2312 (134)) it displays both Chinese AND Arabic subtitles fine?!
    Also, when I unzip those subtitle files and open them in Notepad++ or any other editor, their format is shown as ANSI, yet MPC displays them fine!
    How can one edit those files, if one wanted to, when they show funny characters in editors?
    Strange!
    VERY-strange indeed

    FWIW, I still use VSFilter version 2.33 (it works for me, I don't like fancy .ASS subtitles, so I won't upgrade it anytime soon).

    --- EDIT ---

    Hmmm, with VSFilter 2.33, UTF-8 and Unicode work well for SSA files only.

    Maybe it's time for an overdue update, unfortunately (argh)
    Last edited by El Heggunte; 2nd Jul 2012 at 06:30. Reason: argh :-)