Awe Forget it

  • Thread starter Thread starter Just Me
  • Start date Start date
J

Just Me

This is a bunch of bull cr*p. I have tried copying tables out on the web
and there are so many variations that its not feasable to write a single
regex for every situation.

So, I give up.
 
Hello...

Please try to keep your related posts together in one thread ;)

And now a sugestion:

Try the HTML-DOM and look at the tags there... They have a property of inner
text, which can be used to extract the text out of any HTML-Node or even the
whole document... Or you can examine all tables or table.row or tabledata
fields and extract the information from there. But scrapping information
from websites is usually quite hard ;)
 
This will get all the tables: Set IgnoreCase and SingleLine options. Use
groups.

<table .*?</table>
 
Just Me said:
This is a bunch of bull cr*p. I have tried copying tables out on the web
and there are so many variations that its not feasable to write a single
regex for every situation.

Well shux, why don't you just read the file one char at a time and use use
"if" statements and comparison operators?
It won't be a minimal task, but it won't be that tough, either.
 
Back
Top