What I need to do now is grab the content of these pages and somehow determine which image on each page is the most significant.
This will allow me to grab and resize the image for a thumbnail for a link to the site.
For example, if they were news articles, it would be the story's picture, not the newspaper's logo, advertisements, or other images.
I'm dealing with a list of different and changing websites.
Are there any scripts out there that do this? Does anybody have any recommendations?
"...and somehow determine which image on each page is the most significant."
You will need to define more precisely what makes an image on a given page the most significant.
I believe you are going to end up writing some custom code for this particular project.
If you only need a picture that shows what the website is about, I'd suggest taking thumbnails of the whole pages instead, which is easy to code.
I decided to take a different approach and write my own script that only works for the ten most frequent sites on my list. They provide most of the results anyway.
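For anyone trying the same thing, a per-site approach might be sketched roughly like this in Python. This is not the actual script, only an illustration: the host names, the class and id values, and the function names are all made up, and each real site needs its own rule worked out by looking at its markup.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse

# Hypothetical rules: host name -> (attribute, value) identifying the story image.
SITE_RULES = {
    "news.example.com": ("class", "story-photo"),
    "blog.example.org": ("id", "lead-image"),
}


class FirstMatchingImage(HTMLParser):
    """Finds the first <img> whose given attribute contains the given token."""

    def __init__(self, attr, value):
        super().__init__()
        self.attr = attr
        self.value = value
        self.src = None

    def handle_starttag(self, tag, attrs):
        if tag != "img" or self.src is not None:
            return
        a = dict(attrs)
        # Split on whitespace so class="story-photo large" still matches.
        tokens = (a.get(self.attr) or "").split()
        if self.value in tokens and a.get("src"):
            self.src = a["src"]


def image_for_known_site(page_url, html):
    """Return the absolute image URL for a known site, or None otherwise."""
    host = (urlparse(page_url).hostname or "").lower()
    rule = SITE_RULES.get(host)
    if rule is None:
        return None  # not one of the supported sites
    parser = FirstMatchingImage(*rule)
    parser.feed(html)
    if parser.src is None:
        return None  # the site's markup changed or the page has no story image
    return urljoin(page_url, parser.src)
```

Pages from sites that are not in the table return None, so they can fall back to a default thumbnail or be skipped. The rules will need updating whenever one of the sites changes its layout.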