[ID3 Dev] Unicode

Jud White jwhite at cdtag.com
Sun Feb 11 20:34:00 PST 2007


Mark,

What you have is Big Endian.  The year should be 01 FF FE 31 00 39 00 37 
00 35 00 (null terminator excluded)

-Jud

Mark Smith wrote:
> Judd, thanks for taking the trouble to look at this.
>
> This is the content of the TYER frame (excluding the frame header) 
> that I'm testing with:
>
> 0x01 0xFF 0xFE 0x00 0x31 0x00 0x39 0x00 0x37 0x00 0x35 0x00 0x00
>
> which is what I think it should be...(1975 is the year), but this 
> shows up as 16665 in iTunes (I stopped testing other frames that 
> resulted in iTunes renaming files and folders).
>
> I'm working on Mac OS X, which may be a factor.
>
> Thanks, again.
>
> Best,
>
> Mark
>
> On 12 Feb 2007, at 02:32, Jud White wrote:
>
>> Just tested this.. if you're writing the BOM reversed (should be 0xFF 
>> 0xFE) you'll get oriental characters in iTunes.
>>
>> Jud White wrote:
>>> Mark,
>>>
>>> This isn't a UCS-2 vs UTF-16 issue.  The differences in these two 
>>> only occur over 0xffff.  Also it's not an issue with BOM since 
>>> iTunes can cope without BOM.
>>>
>>> I was able to reproduce this behavior by writing a text encoding 
>>> byte of "Unicode" (0x01) but writing the actual string in UTF-8.  
>>> Maybe your implementation is doing something similar?
>>>
>>> -Jud
>>>
>>>
>>>
>>> Mark Smith wrote:
>>>> I'm getting a bit exasperated with trying to handle Unicode 
>>>> correctly. In my library, I'm handling all strings as UTF8 
>>>> internally, but since the 2.3 spec (as I've understood it) only 
>>>> allows for iso 8559-1 and UCS-2 (for the moment I'm treating UCS-2 
>>>> as if it were UTF-16), I'm writing out as UTF-16, where necessary.
>>>>
>>>> What I'm finding is that if I write out a TALB frame as "Erét" 
>>>> (thats E - r - e with acute accent - t, if your mail client 
>>>> displays something else) as UTF-16, iTunes and the other two 
>>>> tagging apps I've checked out display it in an oriental font.
>>>>
>>>> So the question is, am I wrong, or are other people just not 
>>>> bothering to deal with anything but english?
>>>>
>>>> Any insights gratefully recieved....
>>>>
>>>> Thanks,
>>>>
>>>> Mark
>>>> ---------------------------------------------------------------------
>>>> To unsubscribe, e-mail: id3v2-unsubscribe at id3.org
>>>> For additional commands, e-mail: id3v2-help at id3.org
>>>>
>>>>
>>>>
>>>
>>>
>>>
>>> ---------------------------------------------------------------------
>>> To unsubscribe, e-mail: id3v2-unsubscribe at id3.org
>>> For additional commands, e-mail: id3v2-help at id3.org
>>>
>>>
>>>
>>
>>
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: id3v2-unsubscribe at id3.org
>> For additional commands, e-mail: id3v2-help at id3.org
>>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: id3v2-unsubscribe at id3.org
> For additional commands, e-mail: id3v2-help at id3.org
>
>
>



---------------------------------------------------------------------
To unsubscribe, e-mail: id3v2-unsubscribe at id3.org
For additional commands, e-mail: id3v2-help at id3.org



More information about the ID3v2 mailing list