Forum Moderators: phranque
Now my second "big" website just turned 22 (months) and is now fully indexed by google. All of its 900+ articles are contained in a single directory; this is different to my older project, in which I had one subdirectory for every main point of the navigation. With this first website it also took almost precisely 22 months until all contents were properly indexed and started to rank well. As far as I can tell, google treats both websites in a very similar way.
More likely to affect the crawl are things like
- navigation structure - how many clicks to get from the home page to all other pages and how easy is it to grab those links>
- duplicate content - does the crawler need to make six requests and get 301s and such to figure out which is the one canonical URL? If it has to do that for every page, then it will only index 1/5 as many pages as it might in a crawl session.
- server response time - is the server timing out before the crawler gets the page it needs?
I would expect factors like that to have a much bigger impact than where files are located on the hard drive.
I should have said that my feeling is that having more precise keywords in the URL should affect the relevance and possibly the ranking on some longer tail searches, but it shouldn't really affect indexing and crawlability.
With respect to the ranking on more common terms, I don't know. I do think the URL can have some excellent value, but of course in the context of other on page factors. Where I've gotten the most value out of a keyword-rich URL is on content-light pages (i.e. a page that is mostly a single image or something like that).