A very simple question about binary/text format difference

  • Thread starter Thread starter Oriane
  • Start date Start date
O

Oriane

Hi folks,

what makes that the SOAP/Xml binary encoding is smaller than the "text" one
? Is is only because the XML tags are more or less suppressed ?

More generally, I don't understand how gzip can reduce text file size since
I can't see how a character can be store in less than 1 byte. Or is that a
text file in an OS has such a big overhead ?


Regards
 
Oriane,

Binary serialization is smaller for two main reasons. One is the lack of
the XML tags. The other is that some data fields will be smaller in their
binary format. It is between 2 and 10 times smaller on average.

For compression, check out this article. Essentially it is just re-encoding
the data in a more efficient format.
http://www.igeek.com/articles/Software/Compression.txt

Hope this helps,


Steve
 
Hi Steve,

PlatinumBay said:
Oriane,

Binary serialization is smaller for two main reasons. One is the lack of
the XML tags.
Do you mean that the Xml tags are simply suppressed ? Or that theye are
replaced by something less bverbose ?
The other is that some data fields will be smaller in their binary format.
It is between 2 and 10 times smaller on average.
What sort of data field is smaller ? Numeric ones ?
Hope this helps,
Yes indeed.

Regards
 
Back
Top