tedster

msg:3484520 | 12:36 am on Oct 23, 2007 (gmt 0) |
I knew I forgot some common issues: Domain Root vs. index.html - another kind of duplicate [webmasterworld.com] ...and of course, the classic "canonical" troubles: Why "www" & "no-www" Are Different [webmasterworld.com] < These and other duplicate issues are discussed in threads in our Hot Topics [webmasterworld.com] thread, which is always pinned to the top of this forum's index page. >
|
Robert Charlton

msg:3484621 | 3:12 am on Oct 23, 2007 (gmt 0) |
There also can be http vs https issues. And, in an extensionless environment, where there are no .html, .asp, etc file extensions on page filenames, there can be confusion between pages and directories if you set up your pages with trailing slashes. Use trailing slashes for directory urls... no slashes when an extentionless page file is referenced.
|
nervo

msg:3485091 | 3:31 pm on Oct 23, 2007 (gmt 0) |
So, what would be better for internal linking (and why) http://www.example.com/dir/ or http://www.example.com/dir/index.html Thanks! [edited by: tedster at 3:56 pm (utc) on Oct. 23, 2007] [edit reason] switch to example.com - it can never be sold [/edit]
|
tedster

msg:3485128 | 4:00 pm on Oct 23, 2007 (gmt 0) |
For a new site, I would use the first url ending with a slash, and make sure that the index.html version is 301 redirected. For an established site, I would study what Google already has indexed. If the index.html version is already indexed, then use those versions of the url and 301 redirect in the other direction. Consistency is the most important goal, here. Do it the same way across the entire site.
|
nervo

msg:3485197 | 5:12 pm on Oct 23, 2007 (gmt 0) |
Tedster, thanks for your explanation! However, I'm afraid I made a mess with an 6-month old site where Google indexed the homepage as www.example.com/ but, while majority of the site directories are indexed with domain root, others are indexed with www.example.com/dir/index.html. Does that require different approach?
|
tedster

msg:3485220 | 5:22 pm on Oct 23, 2007 (gmt 0) |
With mixed results already indexed, I would usually choose the directory style url without the file name. There may be a short period of adjustment as Google sorts out the changes, but then going forward everything would be optimal.
|
pageoneresults

msg:3485224 | 5:25 pm on Oct 23, 2007 (gmt 0) |
| There also can be http vs https issues. |
| I'd like to emphasize this particular issue as those of us on Windows seem to be more susceptible to this. I've seen Google index both http and https versions of a site that was improperly configured. If you do a site: search for your domain and that first listing is https: you have a potential problem.
|
nervo

msg:3485257 | 5:44 pm on Oct 23, 2007 (gmt 0) |
Thanks again! One other thing: Is there a difference for both internal and external linking in using these: www.example.com www.example.com/
|
Tonearm

msg:3486500 | 7:14 pm on Oct 24, 2007 (gmt 0) |
| There also can be http vs https issues. |
| I had no idea this was a duplicate content issue. So any page should only be able to be accessed via http or https?
|
jd01

msg:3486505 | 7:22 pm on Oct 24, 2007 (gmt 0) |
www.example.com www.example.com/ |
| Without the trailing / you force your server to add it on... IMO: Better to add it yourself in the link to conserve cpu cycles. Justin
|
Shurik

msg:3486641 | 9:29 pm on Oct 24, 2007 (gmt 0) |
On IIS there is absolutely no way to prevent assess to directories as files www.sample.com/directory and www.sample.com/directory/ are treated the same. IIS silently converts 1 into 2 – very annoying! Robots.txt is of no use too.
|
|