<DIV>Thanks again for help, </DIV>
<DIV> </DIV>
<DIV>John.<BR><BR><B><I>Ben Bennett <fiji@ayup.limey.net></I></B> wrote:</DIV>
<BLOCKQUOTE class=replbq style="PADDING-LEFT: 5px; MARGIN-LEFT: 5px; BORDER-LEFT: #1010ff 2px solid">On Wed, Aug 24, 2005 at 01:42:19PM +0100, Ion Todirel wrote:<BR>> Ben, look on this frame (or others):<BR>> <BR>> <COMM><BR>> Text encoding $xx<BR>> Language $xx xx xx<BR>> Short content descrip. <TEXT encoding to according string>$00 (00)<BR>> The actual text <FULL encoding to according string text><BR>> <BR>> $00 is a byte (new byte(); (0)) separator between "Short content descrip." and "Text"?<BR><BR>The reason the terminator is specified as $00 (00) is that the<BR>terminator may be one byte or it may be two depending on the encoding.<BR>Section 4 of the "Main Structure" document<BR>(http://www.id3.org/id3v2.4.0-structure.txt) outlines the rules:<BR>$00 ISO-8859-1 [ISO-8859-1]. Terminated with $00.<BR>$01 UTF-16 [UTF-16] encoded Unicode [UNICODE] with BOM. All<BR>strings in the same frame SHALL have the same byteorder.<BR>Terminated with $00 00.<BR>$02
UTF-16BE [UTF-16] encoded Unicode [UNICODE] without BOM.<BR>Terminated with $00 00.<BR>$03 UTF-8 [UTF-8] encoded Unicode [UNICODE]. Terminated with $00.<BR><BR>So for encodings $00 and $03 the terminator is $00, and for $01 and<BR>$02 (the wide character Unicode encodings) the terminator is $00 $00.<BR><BR>> For some frames is not specified String Encoding, for example for frame "WXXX":<BR>> <BR>> <"WXXX"><BR>> Text encoding $xx<BR>> Description <TEXT encoding to according string>$00 (00)<BR>> URL <TEXT string><BR>> <BR>> URL is Encoded with Latin1Encoding ("ISO-8859-1") ?<BR>> <BR>> or for "UFID" frame, what encoding shuld use to decode "Identifier" from byte to string? "ISO-8859-1" ?<BR><BR>From the same section of the spec:<BR>If nothing else is said, strings, including numeric strings and URLs<BR>[URL], are represented as ISO-8859-1 [ISO-8859-1] characters in the<BR>range $20 - $FF. Such strings are represented in frame descriptions<BR>as
<TEXT string>, or <FULL string text>if newlines are allowed. If<BR>nothing else is said newline character is forbidden. In ISO-8859-1 a<BR>newline is represented, when allowed, with $0A only.<BR><BR>Note that in the WXXX case it should be URL encoded anyway.<BR><BR>Also note the rules about newlines and the distinction in the spec<BR>between <TEXT string>and <FULL string text>.<BR><BR>The spec is unlear about what to do with newlines when the encoding is<BR>not ISO-8859-1.<BR><BR>-ben<BR><BR>---------------------------------------------------------------------<BR>To unsubscribe, e-mail: id3v2-unsubscribe@id3.org<BR>For additional commands, e-mail: id3v2-help@id3.org<BR><BR></BLOCKQUOTE><p>
<hr size=1><font size="2" face="Verdana, Arial, Helvetica, sans-serif"><a href="http://us.rd.yahoo.com/mail/uk/taglines/default/messenger/*http://uk.messenger.yahoo.com
">Yahoo! Messenger</a>
<font color="red">NEW</font> - crystal clear PC to PC
<a href="http://us.rd.yahoo.com/mail/uk/taglines/default/messenger/*http://uk.messenger.yahoo.com
">calling worldwide with voicemail
</a></font>