-
Key: CORBA3-41
-
Legacy Issue Number: 4725
-
Status: closed
-
Source: International Business Machines ( Richard Sitze)
-
Summary:
[3, Chapters 13.10.1.9, and 13.10.1.12] apparently indicate that both TCS-C
and TCS-W can be byte-oriented or non-byte-oriented.Assume the following configurations for two communicating ORBs.
CNCS-C = windows-1252, SNCS-C = ISO-8859-1, and CCCS-C = SCCS-C =
{UTF-16}.
The execution of the OMG code set negotiation algorithm [3, Chapter
13.10.2.6] in this case will result in the value of the TCS-C as UTF-16!An IDL string will then be marshalled as UTF-16 encoded data, which may
have embedded single-octet NULLs. This point should be mentioned explicitly
somewhere in [3, Chapter 15.3.1.6], especially when IDL string data types
are not allowed to contain any embedded NULLs [3, Chapter 3.10.3.2]. [3,
Chapter 15.3.1.6] states that "Both the string length and contents include
a terminating null". If TCS-C is selected to be UTF-16, this 'null' should
be a null of two-octet size. [3, Chapter 15.3.1.6] should be explicit in
stating that the concrete representation of the 'terminating null' is
dependent on the TCS-C.Similarly, for the following configuration
{UTF-8}
CNCS-W = UCS-2, SNCS-W = UCS-4, and CCCS-W = SCCS-W =TCS-W will be selected as UTF-8!
Are these configurations valid? Regardless of the answer to this question,
[3, Chapters 13.10.1.9, and 13.10.1.12] should clarify the issue of the
orthogonality of TCS with respect to the byte-oriented and
non-byte-oriented code sets with appropriate examples. -
Reported: CORBA 2.6 — Tue, 4 Dec 2001 05:00 GMT
-
Disposition: Resolved — CORBA 3.0.2
-
Disposition Summary:
see above
-
Updated: Fri, 6 Mar 2015 20:58 GMT