Parsing web pages

         

zxk105

9:41 pm on Nov 2, 2004 (gmt 0)

Is there a way to parse a web page (.html) from a different web site and grab only the information you would like from that page? Can this be done with ASP.NET?

ska_demon

3:28 pm on Nov 3, 2004 (gmt 0)

I did something similar recently. I downloaded a spider program that let me search for data inside HTML tags. For example, I would tell my bot to go through all the pages on www.site-i-wanna-nick-stuff-from.com and get <head>data</head>. This would return just the data from the <head>. I was only grabbing titles, descriptions and keywords to build my own datafeeds.
I think it was called winbot or winwebbot or something.
Ska
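
For anyone who would rather do this in code than with a downloaded spider, here is a rough Python sketch of the same idea: read a page and keep only the title, description and keywords from the <head>. It is only an illustration, not what the spider program does internally, and it is in Python rather than ASP.NET (the same fetch-then-parse approach should carry over). The names HeadDataParser, extract_head_data and fetch_head_data, and the sample page, are made up for this example.

```python
from html.parser import HTMLParser
from urllib.request import urlopen


class HeadDataParser(HTMLParser):
    """Collects the <title> text and the description/keywords <meta> tags."""

    def __init__(self):
        super().__init__()
        self.in_title = False
        self.data = {"title": "", "description": "", "keywords": ""}

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self.in_title = True
        elif tag == "meta":
            attrs = dict(attrs)
            name = (attrs.get("name") or "").lower()
            if name in ("description", "keywords"):
                self.data[name] = attrs.get("content") or ""

    def handle_endtag(self, tag):
        if tag == "title":
            self.in_title = False

    def handle_data(self, text):
        # Only text that appears between <title> and </title> is kept.
        if self.in_title:
            self.data["title"] += text


def extract_head_data(html):
    """Return a dict with the title, description and keywords found in html."""
    parser = HeadDataParser()
    parser.feed(html)
    parser.close()
    parser.data["title"] = parser.data["title"].strip()
    return parser.data


def fetch_head_data(url):
    """Download a page and extract its head data (hypothetical helper, needs network)."""
    with urlopen(url) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        html = response.read().decode(charset, errors="replace")
    return extract_head_data(html)


# Made-up sample page so the sketch runs without touching a real site.
sample = """<html><head>
<title>Widgets for sale</title>
<meta name="description" content="Cheap widgets">
<meta name="keywords" content="widgets, cheap">
</head><body><p>Hello</p></body></html>"""

result = extract_head_data(sample)
print(result)
```

To run it against a live page you would call fetch_head_data with the page's URL, assuming the site allows that kind of access.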

zxk105

5:04 pm on Nov 3, 2004 (gmt 0)

I can't seem to find the site you are talking about. Do you maybe have a specific URL for me?