Reading XML document with blank lines at top

Ryan S · May 11, 2004

I am trying to read an XML document generated by a web server using the
XmlTextReader class, but the document generated appears to have some blank
lines at the top that are causing problems.

If I connect directly to the URL, when I call the ".Read" method on the
XmlTextReader, I get the message:

"The XML declaration is unexpected. Line 8, Column 3"

If I open the URL in my browser and copy and paste the XML stuff (starting
with <?xml...) to a text file, save it locally and then access that file
using the XmlTextReader, all is well. It appears as if a bunch (7,
actually) of blank lines are prepended to the top of the file from the web
server so that they appear before the <?xml... declaration.

Is this an example of a non-well-formed document?

Is there a way to have the XmlTextReader skip over these lines, or is there
a way for me to intercept the stream and only start parsing at the
appropriate point?

Thanks,
Ryan

BTW, I am specifically trying to call the APIs from BMC's Patrol Express
product which return an XML document.

Sven Groot · May 12, 2004

Ryan said:
I am trying to read an XML document generated by a web server using
the XmlTextReader class, but the document generated appears to have
some blank lines at the top that are causing problems.

If I connect directly to the URL, when I call the ".Read" method on
the XmlTextReader, I get the message:

"The XML declaration is unexpected. Line 8, Column 3"

If I open the URL in my browser and copy and paste the XML stuff
(starting with <?xml...) to a text file, save it locally and then
access that file using the XmlTextReader, all is well. It appears
as if a bunch (7, actually) of blank lines are prepended to the top
of the file from the web server so that they appear before the
<?xml... declaration.

Is this an example of a non-well-formed document?

It should work, as far as I know.

Are you sure they're blank lines, and not HTTP headers? Headers would
confuse the hell out of an XmlReader, and would also not show up if you
copy/paste the URL into a browser.

Ryan S · May 12, 2004

Sven,

It does not appear that they are HTTP headers. When I view the source
(using both IE and Mozilla), I definitely see 7 blank lines above the <?xml
.... > line. I also packet sniffed the transaction between the server and my
browser, and I definitely see a bunch of CrLf characters being sent just prior
to <?xml. I also see some HTTP headers like Server:, Content-Type: and
others. Will these headers cause trouble? If so, any suggestions on
passing over them to get to the XML stuff?

-Ryan

Oleg Tkachenko [MVP] · May 12, 2004

Ryan said:
If I open the URL in my browser and copy and paste the XML stuff (starting
with <?xml...) to a text file, save it locally and then access that file
using the XmlTextReader, all is well. It appears as if a bunch (7,
actually) of blank lines are prepended to the top of the file from the web
server so that they appear before the <?xml... declaration.

Is this an example of a non-well-formed document?

Yep. In a well-formed XML document nothing (except a Unicode byte-order
mark) can precede the XML declaration. It must be the very first thing in
the document.
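
The rule above can be checked with any conforming parser. The sketch below is in Python rather than .NET, purely for illustration (the thread itself uses XmlTextReader). The sample document and the helper name parses_ok are hypothetical, since the real server output is not shown in the thread. It parses the same document twice: once as-is, and once with seven CRLF pairs prepended to stand in for the blank lines Ryan saw.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document; the real payload is not shown in the thread.
DOC = '<?xml version="1.0"?><report><status>ok</status></report>'


def parses_ok(text):
    """Return True if `text` parses as well-formed XML, False otherwise."""
    try:
        ET.fromstring(text)
        return True
    except ET.ParseError:
        return False


# Same document, with and without blank lines before the XML declaration.
clean = parses_ok(DOC)
padded = parses_ok("\r\n" * 7 + DOC)
```

With the declaration at the very start, the document parses. With the blank lines in front, the parser rejects it, which matches the error Ryan reported.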

Ryan S · May 12, 2004

Thanks for everyone's help and comments. I solved my problem by reading the
HTTP stream and dumping it, starting with "<?xml", into a temp file, which I
then load with DataSet.ReadXml.

-Ryan
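
The same workaround can be done in memory, without a temp file. What follows is a minimal sketch in Python, not the .NET code Ryan used (which is not shown in the thread). The function name strip_to_xml_declaration and the sample response bytes are hypothetical. It drops everything before the first "<?xml" and hands the remainder to a parser.

```python
import xml.etree.ElementTree as ET

XML_DECL = b"<?xml"


def strip_to_xml_declaration(body):
    """Return `body` starting at the first '<?xml'; raise ValueError if absent."""
    start = body.find(XML_DECL)
    if start == -1:
        raise ValueError("no XML declaration found in response body")
    return body[start:]


# Hypothetical response body: seven CRLF pairs, then the document.
raw = (
    b"\r\n" * 7
    + b'<?xml version="1.0" encoding="utf-8"?>'
    + b"<report><status>ok</status></report>"
)

# Parse only the part of the body that starts at the declaration.
root = ET.fromstring(strip_to_xml_declaration(raw))
status = root.findtext("status")
```

The byte search assumes an ASCII-compatible encoding such as UTF-8; a UTF-16 response would need to be decoded before searching. A better long-term fix is to have the server stop emitting the leading blank lines, since the document is not well-formed as sent.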
