datatable from xml doc and massive time differences on desktop / server

Hermit Dave · Jul 11, 2008

Hello All,

I have an xml document which can contain massive amounts of data with no
fixed child node sequence.
We have some 9000 odd fields and any ones can be a part of the xml.

The way i build the data table is that i process one record at a time and i
keep a list of columns in hash table (with ordinal position) and in a string
collection (for retrieval in correct order) and for each field, if the
column does not exists, i add the column and for that ordinal position, i
set the value in the data table.

For xml doc contain 85000 odd records with say 5 child fields, on my desktop
i can process the whole thing and display paged in list in 4 mins and 40
secs, however on the server it takes around 12 mins.
Same input file, same code, same binaries.

The desktop which performs faster is Core 2 Duo 2.4 GHz with 2 GB of RAM and
running XP
The Server runs 4 x 2.6 Dual core Opteron with 16 GB of RAM running Windows
2003 Server Enterprise edition.

What do you guys think of it ? TIA

Hermit

PS: Its code in C# on vs.net 2003 using standard frameworkclasses
(datatable, xmltextreader, xmldocument etc)

Pavel Minaev · Jul 12, 2008

Hello All,

I have an xml document which can contain massive amounts of data with no
fixed child node sequence.
We have some 9000 odd fields and any ones can be a part of the xml.

The way i build the data table is that i process one record at a time andi
keep a list of columns in hash table (with ordinal position) and in a string
collection (for retrieval in correct order) and for each field, if the
column does not exists, i add the column and for that ordinal position, i
set the value in the data table.

For xml doc contain 85000 odd records with say 5 child fields, on my desktop
i can process the whole thing and display paged in list in 4 mins and 40
secs, however on the server it takes around 12 mins.
Same input file, same code, same binaries.

The desktop which performs faster is Core 2 Duo 2.4 GHz with 2 GB of RAM and
running XP
The Server runs 4 x 2.6 Dual core Opteron with 16 GB of RAM running Windows
2003 Server Enterprise edition.

What do you guys think of it ? TIA

Where is the source XML read from on your desktop, and on your server?
It could well be an I/O bottleneck, not a problem with your code as
such. Perhaps the server reads it from an SMB share, or something like
that?

Hermit Dave · Jul 14, 2008

Pavel,

Well the desktop is a standard ide / sata drive.
there server uses SAN.

I dont think its I/O issue as my team leader tried a hard way of assuming a
column exists for a given node (and catching any exceptions and handling it
correctly) and the server timings dropped to about 2 and half mins.
I guess it was something else. Maybe something in the code was being
optimised lot better for intel architecture

Regards,

Hermit

Hello All,

I have an xml document which can contain massive amounts of data with no
fixed child node sequence.
We have some 9000 odd fields and any ones can be a part of the xml.

The way i build the data table is that i process one record at a time and
i
keep a list of columns in hash table (with ordinal position) and in a
string
collection (for retrieval in correct order) and for each field, if the
column does not exists, i add the column and for that ordinal position, i
set the value in the data table.

For xml doc contain 85000 odd records with say 5 child fields, on my
desktop
i can process the whole thing and display paged in list in 4 mins and 40
secs, however on the server it takes around 12 mins.
Same input file, same code, same binaries.

The desktop which performs faster is Core 2 Duo 2.4 GHz with 2 GB of RAM
and
running XP
The Server runs 4 x 2.6 Dual core Opteron with 16 GB of RAM running
Windows
2003 Server Enterprise edition.

What do you guys think of it ? TIA

Where is the source XML read from on your desktop, and on your server?
It could well be an I/O bottleneck, not a problem with your code as
such. Perhaps the server reads it from an SMB share, or something like
that?

XML and strongly typed datasets	1	Aug 17, 2007
OutOfMemoryException on DataTables	13	Mar 5, 2006
Add Table to XML file using Dataset...	1	Dec 17, 2008
Digitally Sign an XML Doc with X509Certificate Solution :)	2	Apr 29, 2009
reading xml from database..	3	Mar 24, 2007
Problem with Count Expression and DataTable	3	Aug 1, 2006
XML vs SQL Server	22	Nov 28, 2009
Binding XML Data to WPF	1	Mar 31, 2010

datatable from xml doc and massive time differences on desktop / server

Hermit Dave

Pavel Minaev

Hermit Dave

Ask a Question

Similar Threads