This post originated from an RSS feed registered with Agile Buzz
by James Robertson.
Original Post: Microsoft and XML Validity
Feed Title: Cincom Smalltalk Blog - Smalltalk with Rants
Feed URL: http://www.cincomsmalltalk.com/rssBlog/rssBlogView.xml
Feed Description: James Robertson comments on Cincom Smalltalk, the Smalltalk development community, and IT trends and issues in general.
The short answer is that we do not implement RFC 3023 currently. The RSS platform uses MSXML (in XML conforming mode) to fetch and parse the data, so the behavior is inherited from MSXML. Since MSXML is used by most products that we ship, it means the platform is consistent. And nearly every other stack in the industry ignores RFC 3023 as well, so it's not a widely accepted interop point at the moment.
If an XML entity is in a file, the Byte-Order Mark and encoding declaration are used (if present) to determine the character encoding
Is something I implemented in BottomFeeder a long time ago. At the time, I found that paying attention to the declared encoding declaration helped a lot. I guess the MSXML parser isn't built that way, and I'd also guess that the decision was based on the sources they ran into during construction of the parser. I was looking at RSS feeds, and the declaration seemed like an easy thing to do.