J
Juan Gabriel Del Cid
I have the sudden need to split a text that may have any of the
Ok, lets supose you left out grouping functionality (i.e. qoutes and double
coutes are not grouping operators). If this were the case, this regular
expression will spilt the for you:
Regex splitter = new Regex("[\\s,;]+");
string []splitItems = splitter.Split(myString);
This is without grouping. When you throw in grouping functionality, you need
a parser. Regular expressions wont cut it. You need to think of:
- unballanced grouping chars (e.g. an unclosed quote)
- escaping grouping chars (e.g. if you want the name O'Neal in a word)
- double quotes inside single quotes and viceversa
For this to work you need to write a parser. It's really not that hard, but
it's not as easy as a regex,
.
Hope this helps,
-JG
following tokens :
- Words with quotes or double quotes.
- Words with no quotes at all.
- Numbers with and without decimal points,
no commas allowed, but may contain parenthesis
which I would like to keep apart to drop later.
They may be separated by comas, spaces or semicolon.
Ok, lets supose you left out grouping functionality (i.e. qoutes and double
coutes are not grouping operators). If this were the case, this regular
expression will spilt the for you:
Regex splitter = new Regex("[\\s,;]+");
string []splitItems = splitter.Split(myString);
This is without grouping. When you throw in grouping functionality, you need
a parser. Regular expressions wont cut it. You need to think of:
- unballanced grouping chars (e.g. an unclosed quote)
- escaping grouping chars (e.g. if you want the name O'Neal in a word)
- double quotes inside single quotes and viceversa
For this to work you need to write a parser. It's really not that hard, but
it's not as easy as a regex,

Hope this helps,
-JG