Googlebot crawling 7 versions of same page - alternatives? - General Search Engine Marketing Issues forum at WebmasterWorld

I can't give the details of the actual company I am working with so my example company sells plants. "plants" is a keyword they want to rank for so it has a category page called "plants" that lists every single plant for sale on the site, 10 to a page, 100+ pages. All of the plants on this page are crawled. Example URL of a plant is www.example.com/plantid-1293022

This same plant is green, and my company wants to rank highly for "Green plants" so it has a different category called "Green plants" which lists all of the plants that we sell that are Green. The URL for this same plant changes slightly to include the filter www.example.com/plantid-1293022?ColourIdentifier=Green

This same plant could also be described as an "Evergreen", and my company wants to rank highly for "Evergreen Plants" etc etc, URL is now www.example.com/plantid-1293022?TypeIdentifier=Evergreen.

Each URL is essentially the same as it's exactly the same content. We already have <link rel="canonical" href="www.example.com/plantid-1293022" /> in the <head> tags so that Google will show the pretty URL on the rankings. My problem is this...

I want Google to rank our website highly for "Plants", "Green Plants" and "Evergreen" - but also for the individual plants themsleves - so it is important that these category pages are crawled. However once on these category pages Googlebot is having to visit individual plants, eg www.example.com/plantid-1293022, 6 or 7 times because it appears as different URL's for so many categories. This means that Googlebot is crawling the same page 6 times when only once would do.

We are getting an error in Webmaster tools saying "Googlebot encountered an extremely high number of URLs on your site" and this is because of it crawling so many pages with same content but different URL's.

If this can be stopped then Googlebot will crawl more pages, so we should then have a larger number indexed in Google.

I'm looking for a solution that would solve the problem of recrawling the same pages, so that more unique pages can be crawled - any ideas would be welcome.

Firstly I am thinking of making each URL the same and passing info via a cookie rather than the URL. I'm SEO, not Dev, so I could only manage to get this implemented if this had solid SEO reasons - would this have benefits for SEO? I'm thinking against this because surely, even if each URL was identical, it would still get crawled 6 or 7 times because it's listed on 6 or 7 different category pages? Or does Googlebot recognise that it has already crawled that URL and so doesn't do it again?

Is there a way that I can keep the category page, but not have to use nofollows to stop Googlebot crawling the plant pages which would waste all of the link juice?

Is there a method/way of using robots.txt to follow some links but not others?

Any helps with this would be greatly appreciated.

Thanks all

ChainsawDR