[ID3 Dev] Unicode

Jud White jwhite at cdtag.com
Sun Feb 11 18:32:02 PST 2007


Just tested this.. if you're writing the BOM reversed (should be 0xFF 
0xFE) you'll get oriental characters in iTunes.

Jud White wrote:
> Mark,
>
> This isn't a UCS-2 vs UTF-16 issue.  The differences in these two only 
> occur over 0xffff.  Also it's not an issue with BOM since iTunes can 
> cope without BOM.
>
> I was able to reproduce this behavior by writing a text encoding byte 
> of "Unicode" (0x01) but writing the actual string in UTF-8.  Maybe 
> your implementation is doing something similar?
>
> -Jud
>
>
>
> Mark Smith wrote:
>> I'm getting a bit exasperated with trying to handle Unicode 
>> correctly. In my library, I'm handling all strings as UTF8 
>> internally, but since the 2.3 spec (as I've understood it) only 
>> allows for iso 8559-1 and UCS-2 (for the moment I'm treating UCS-2 as 
>> if it were UTF-16), I'm writing out as UTF-16, where necessary.
>>
>> What I'm finding is that if I write out a TALB frame as "Erét" (thats 
>> E - r - e with acute accent - t, if your mail client displays 
>> something else) as UTF-16, iTunes and the other two tagging apps I've 
>> checked out display it in an oriental font.
>>
>> So the question is, am I wrong, or are other people just not 
>> bothering to deal with anything but english?
>>
>> Any insights gratefully recieved....
>>
>> Thanks,
>>
>> Mark
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: id3v2-unsubscribe at id3.org
>> For additional commands, e-mail: id3v2-help at id3.org
>>
>>
>>
>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: id3v2-unsubscribe at id3.org
> For additional commands, e-mail: id3v2-help at id3.org
>
>
>



---------------------------------------------------------------------
To unsubscribe, e-mail: id3v2-unsubscribe at id3.org
For additional commands, e-mail: id3v2-help at id3.org



More information about the ID3v2 mailing list