Forum Moderators: Robert Charlton & goodroi
www.example.com/gadgets www.example.com/widgets www.example.com/new-products www.example.com/gadgets/24981-left-handed-widget www.example.com/widgets/24981-left-handed-widget www.example.com/new-products/24981-left-handed-widget www.example.com/24981-left-handed-widget Home > Gadgets > Left-Handed > Product 24981 that this particular visitor took to get to that page (tracked using cookies or database entries to reconstruct the breadcrumb links). www.example.com/24981-left-handed-widget if a user requests www.example.com/24981-left or www.example.com/24981-left-handed-widgets-are-great-buy-one-now the site should issue a 301 redirect to the correct URL for that product. Having got that functionality in place you can then deliberately post links like example.com/24981 to Twitter and other places that need 'short' URLs (Yes, I know the shortening rules have recently changed on Twitter) knowing that your site will redirect the user to the correct place, without having to rely on a third-party site for URL-shortening services. With an extra few lines of database wizardry you can then also track incoming traffic for those 'short' URLs.
The best way to think about it is that the number of pages that we crawl is roughly proportional to your PageRank
the low PageRank pages on your site are competing against a much larger pool of pages with the same or higher PageRank.
Imagine we crawl three pages from a site, and then we discover that the two other pages were duplicates of the third page. We'll drop two out of the three pages and keep only one, and that's why it looks like it has less good content. So we might tend to not crawl quite as much from that site.
Eric Enge: Can you talk a little bit about Session IDs? Matt Cutts: Don't use them.
(on paid affiliate links:) "... we usually would not count those as an endorsement"
"... if you are trying to block something out from robots.txt, often times we'll still see that URL and keep a reference to it in our index. So it doesn't necessarily save your crawl budget"
We absolutely do process PDF files. I am not going to talk about whether links in PDF files pass PageRank. But, a good way to think about PDFs is that they are kind of like Flash in that they aren't a file format that's inherent and native to the web, but they can be very useful.
what do endorsement means here?
and how Google know the affiliate link as an affiliate?
If SEs don't like that - tough luck
What if a product naturally belongs to Widgets, Gadgets, and SuperGadgets on Sale categories?
Don't give me any crap about not having keywords in the URI either.
...we might put out a call for people to report more about link spam in the coming months.
g1smd, but then you are missing a whole lot of situations with your single URL. What if a product naturally belongs to Widgets, Gadgets, and SuperGadgets on Sale categories?
With your single URL, you have no clue which particular section user is on.
And how would you display breadcrumbs for this product if categories are not inclusive? Would you display as 3 separate breadcrumbs? sounds silly.