Second Try - Regex Question

  • Thread starter Thread starter Just Me
  • Start date Start date
J

Just Me

I need a regex to do this.


Ignore < possibleWhiteSpace htmlTag

Replace whitespace anything >

With >

Basically I need to remove anything following the html tag up to and
including the closing tag

Any help is appreciated.
 
I need a regex to do this.


Ignore < possibleWhiteSpace htmlTag

Replace whitespace anything >

With >

Basically I need to remove anything following the html tag up to and
including the closing tag

Any help is appreciated.

Hmmm are you trying to do this?

< TAG > becomes < TAG>?

You can try this to match the entire tag (and the parts within the tag):

\<(?<leading>\s)*(?<tag>\w)+(?<trailing>\s)*\>

The regex above uses named groups so that you can references parts of
the matches in code. Take a look at RegEx.Match.Groups for details.

If you want to do pure search, replace, this should work:

RegEx.Replace(MyHTML, "(\s)*\>", ">")

(\s)+ matches zero or more spaces. \> matches the trailing tag.

I hope that's what you want.
 
parsing ""(\s)*>", ">")" - Too many )'s. // Error from regex.


[\w|\d]*=.*> This almost works, but it leaves out all the text contained by
the element.
 
Done it.

[A-Za-z0-9]*=.*?>| [A-Za-z0-9]*=.*?\sp


Just Me said:
parsing ""(\s)*>", ">")" - Too many )'s. // Error from regex.


[\w|\d]*=.*> This almost works, but it leaves out all the text contained
by the element.



Spam Catcher said:
Hmmm are you trying to do this?

< TAG > becomes < TAG>?

You can try this to match the entire tag (and the parts within the tag):

\<(?<leading>\s)*(?<tag>\w)+(?<trailing>\s)*\>

The regex above uses named groups so that you can references parts of
the matches in code. Take a look at RegEx.Match.Groups for details.

If you want to do pure search, replace, this should work:

RegEx.Replace(MyHTML, "(\s)*\>", ">")

(\s)+ matches zero or more spaces. \> matches the trailing tag.

I hope that's what you want.
 
Back
Top