HttpUtility.HtmlEncode and HtmlDecode problems

Guest · Jul 26, 2005

In working with the System.Web.HttpUtility class, I've come across some
inconsistencies in encoding and decoding. If I start with the following code:

string s = @"&amp; &quot; &apos; &lt; &gt; &reg; &trade; &copy; &eacute;";
s = System.Web.HttpUtility.HtmlDecode(s);
s = System.Web.HttpUtility.HtmlEncode(s);

The value of the string after the call to HtmlDecode is:
& \" &apos; < > ® ™ © é

This indicates that it doesn't correctly decode the (&apos;) to be a single
quote ('); otherwise, it decodes all characters correctly. Further, the
value of the string after the call to HtmlEncode is:
&amp; &quot; &amp;apos; &lt; &gt; &#174; ™ &#169; &#233;

This indicates that it doesn't re-encode all the characters correctly
(notice the &apos; double encoding due to the initial decoding error, as well
as the inability to encode the TM symbol at all).

Does anyone have any insight into why this is happening?

Joerg Jooss · Jul 26, 2005

These methods are somewhat broken and famous for not encoding certain
characters like the apostrophe.

See also
http://lab.msdn.microsoft.com/productfeedback/viewfeedback.aspx?feedbackid=7cf0356f-d2ff-47eb-858c-faf6226dee03

Cheers,
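
For anyone who needs a workaround, here is a minimal sketch of one possible approach. It is not an official fix. One likely reason for the decode behavior is that &apos; is an XML entity that HTML 4 does not define, so the sketch rewrites it as the numeric reference &#39; before decoding. For encoding, it post-processes the HtmlEncode result and turns any remaining non-ASCII character (such as the TM symbol) into a numeric reference. HtmlEntityHelper, DecodeWithApos and EncodeNonAscii are hypothetical names made up for this example, and the code assumes System.Web.HttpUtility is available in your project.

```csharp
using System;
using System.Globalization;
using System.Text;
using System.Web;

// Hypothetical helper for illustration only; not part of the framework.
static class HtmlEntityHelper
{
    // Decodes HTML text, treating &apos; as a single quote.
    public static string DecodeWithApos(string html)
    {
        if (html == null) return null;
        // Rewrite &apos; as a numeric reference first. Text such as
        // "&amp;apos;" does not contain the substring "&apos;", so it is
        // left alone and still decodes to the literal text "&apos;".
        return HttpUtility.HtmlDecode(html.Replace("&apos;", "&#39;"));
    }

    // Encodes text, then replaces every remaining non-ASCII character
    // with a decimal numeric character reference.
    public static string EncodeNonAscii(string text)
    {
        if (text == null) return null;
        string encoded = HttpUtility.HtmlEncode(text);
        var sb = new StringBuilder(encoded.Length);
        for (int i = 0; i < encoded.Length; i++)
        {
            char c = encoded[i];
            if (c <= 127)
            {
                sb.Append(c);
                continue;
            }
            int codePoint = c;
            // Combine a surrogate pair into a single code point.
            if (char.IsHighSurrogate(c) && i + 1 < encoded.Length
                && char.IsLowSurrogate(encoded[i + 1]))
            {
                codePoint = char.ConvertToUtf32(c, encoded[i + 1]);
                i++;
            }
            sb.Append("&#")
              .Append(codePoint.ToString(CultureInfo.InvariantCulture))
              .Append(';');
        }
        return sb.ToString();
    }
}

static class Program
{
    static void Main()
    {
        Console.WriteLine(HtmlEntityHelper.DecodeWithApos("&amp; &apos; &lt;")); // & ' <
        Console.WriteLine(HtmlEntityHelper.EncodeNonAscii("\u2122"));            // &#8482;
    }
}
```

Doing the &apos; replacement before decoding rather than after is deliberate: replacing afterwards would also change text that was meant to read literally as "&apos;".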
