It was thus said that the Great Hans Franke once
stated:
Rather then restricting the encodeing of the XML
file to a
specific charset, we need to restrict the USAGE within the
standard to certain characters, regardless of the encodeing.
Unless otherwise noted, XML files are assumed to be encoded in UTF-8,
*but* an XML parser is required to abort at the first error in the XML file.
If a parser is reading an XML file without an explicit character set
encoding scheme (which means it's assuming UTF-8) and it reads a character
that is illegal (say the file was encoded in ISO-8859-3) it gives up
(usually with an "illegal character at such-n-such position" error).