homepage Welcome to WebmasterWorld Guest from 54.237.98.229
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member

Home / Forums Index / Google / Google News Archive
Forum Library, Charter, Moderator: open

Google News Archive Forum

    
Difference between / /index.htm and /index.html
maggi

10+ Year Member



 
Msg#: 15710 posted 11:39 am on Jul 31, 2003 (gmt 0)

I give a lot images to other sites which are linking back to my site.
I might link to
1. [mysite...]
2. [mysite...]
3. [mysite...]
I asume, that 1 and 2 are identical seen by Google,
but how does Google see aproach 3?
Will it add the values for 1,2,3 to a common PR?

When liking to index.htm I might fill that page with specific keyywords in url and page.
Will Goggle see /index.htm as a page equal like any other page?
Or will I get some benefit from a naming similar as the index.html

Sometimes Google is showing in the listing 2 pages from a site:
Central Homepage
[mysite...]
Keyword Page
[mysite...]
I am interest to understand why Google is showing 2 entries from the same website.
Thanks a lot, Maggi

 

Macro

WebmasterWorld Senior Member 10+ Year Member



 
Msg#: 15710 posted 1:46 pm on Jul 31, 2003 (gmt 0)

I'd be interested in knowing this as well.

Presumably you could have index.htm and index.html with completely different keywords and link to index.html from other pages in your site but have other sites link to your index.htm page.

Perhaps Google gives them an equal PR in which case quickly put up a default.htm :-)

doc_z

WebmasterWorld Senior Member 10+ Year Member



 
Msg#: 15710 posted 2:34 pm on Jul 31, 2003 (gmt 0)

I asume, that 1 and 2 are identical seen by Google

Assuming that 1 and 2 have the same content, Google normally doesn't distinguish between them. You can verify this by comparing link:www.mysite.com with link:www.mysite.com/index.html. If the same results are shown, Google treats these pages as the same page and PR is added. Otherwise, PR as well as anchor text is splitted between the two pages.
In any case, linking only to www.mysite.com is the safe way.

Will Goggle see /index.htm as a page equal like any other page?
Or will I get some benefit from a naming similar as the index.html

There is no benefit for index.htm. It is handled as any other page.

Gus_R

10+ Year Member



 
Msg#: 15710 posted 2:43 pm on Jul 31, 2003 (gmt 0)

What about / and /index1.htm in example?
Google recognizes any default entry page or manages a common name list? (index.htm, default.htm ...)

Gus

AthlonInside

WebmasterWorld Senior Member 10+ Year Member



 
Msg#: 15710 posted 2:55 pm on Jul 31, 2003 (gmt 0)

Google threated 3 of them as TOTALLY UNIQUE pages.

You should always stick to one and the one that is best is www.yoursite.com/. Try to avoid linking to index.html or index.htm, even in your site itself. In your site, use <a href="/"> instead of <a href="index.html">.

You can do a 301 redirect to tell Google they are all the same. Use .htaccess to rewrite your URL from anything ELSE (index.htm, index.html ...) to www.yoursite.com/.

doc_z

WebmasterWorld Senior Member 10+ Year Member



 
Msg#: 15710 posted 3:39 pm on Jul 31, 2003 (gmt 0)

You should always stick to one and the one that is best is www.yoursite.com/. Try to avoid linking to index.html or index.htm, even in your site itself. In your site, use <a href="/"> instead of <a href="index.html">.

Of course, this is the safe way. However, for none of my sites I have a problem linking to the index page. In all of these cases, Google doesn't distinguish between www.mysite.com, mysite.com, www.mysite.com/index.html. Moreover, one of my sites is reachable via www.mysite.com and www.mysite.de. Even in this case, Google doesn't distinguish between them, i.e. link:www.mysite.com shows the same results as link:www.mysite.de (and they have the same PR). Unfortunately, this doesn't work always.

antoshka



 
Msg#: 15710 posted 8:23 pm on Jul 31, 2003 (gmt 0)

whether your site displays

/
/index.htm
/index.html

as different pages TOTALLY depends on your web server settings. it is easy to set it up so that on / request it will respond with /index.txt page.

i don't know how Google treats these 3 pages, but probably as 3 different pages. and also, probably, they have each their own PR.

[edited by: engine at 7:49 am (utc) on Aug. 1, 2003]
[edit reason] No sigs, thanks. See TOS [webmasterworld.com] [/edit]

jon80

10+ Year Member



 
Msg#: 15710 posted 8:29 pm on Jul 31, 2003 (gmt 0)

I don't think you will get away with advertising your url like that.
It's against the terms and conditions.

maggi

10+ Year Member



 
Msg#: 15710 posted 9:39 am on Aug 1, 2003 (gmt 0)

Ok, you convinced me.
As a first step I tried the mod_rewrite example for the trailing /
from [engelschall.com...]

RewriteCond %{HTTP_HOST}!^fully\.qualified\.domain\.name [NC]
RewriteCond %{HTTP_HOST}!^$
RewriteCond %{SERVER_PORT}!^80$
RewriteRule ^/(.*) [fully.qualified.domain.name:%{SERVER_PORT}...] [L,R]
RewriteCond %{HTTP_HOST}!^fully\.qualified\.domain\.name [NC]
RewriteCond %{HTTP_HOST}!^$
RewriteRule ^/(.*) [fully.qualified.domain.name...] [L,R]

It is working,
but now the images are not shown, due to
[domain.com...]

Do I have to change all images to
[domain.com...]
Thanks, Maggi

g1smd

WebmasterWorld Senior Member g1smd us a WebmasterWorld Top Contributor of All Time 10+ Year Member



 
Msg#: 15710 posted 12:33 am on Aug 2, 2003 (gmt 0)

If a browser or agent requests / then your server settings decide whether index.htm or index.html is the one served up first if both happen to exist. These two are treated as separate pages. Some freehosts have a "This is your new homepage." default file at index.htm. The new user should delete or replace that page, but if they do not, but instead use index.html then that gets served in preference.

claus

WebmasterWorld Senior Member 10+ Year Member



 
Msg#: 15710 posted 2:34 am on Aug 2, 2003 (gmt 0)

- and if you're on a MS server, the same file will probably be named "default.htm" or "default.html" in stead. Variations on "default" and "index" with suffixes "asp", "jsp", "php", "shtml", "sht", "stm", "cgi", "pl" etc. are also possible...

It's no more than a server setting - tell the server which type of file it must show when you request www.example.com or any subdirectory or subdomain, and it will do so.

For linking purposes, avoid including "index.(s)htm(l)" or "default.(s)htm(l)", just cut the link at the "/". That way, your links will not become obsolete when you decide to go from ASP to PHP or HTML, or PERL or whatever.

/claus

Herenvardo

10+ Year Member



 
Msg#: 15710 posted 2:19 pm on Aug 4, 2003 (gmt 0)

>> Sometimes Google is showing in the listing 2 pages from a site

maggi: a search engine has no way to determine what pages make a site and what ones are another site. It is usually confused the term site with the term domain, but they are not the same: a site can espand many domains, like:
www.mysite.com
www.mysite.uk (for england)
www.mysite.es (for spain)
mail.mysite.com (it could be a mail service)
mail.mysite.uk
and we could continue. And then, there are opposite cases: sites placed in subdomains. It normally happens in free hosting services: the user gets some megs freely to "hang" her/his personal webpage and an address that is a subdomain or a subdir of the hosters' domain.
Since google has no way to know if two or more pages of the same domain are really part of the same site, it has to give all the results. Even so, when two or more matches are found from the same domain, there only appear two: the most relevant one and the root of the domain. Then is displayed a link to get more results from that site.
Of course, it would be better if g was able to determine a site limit, but even so, i think that google has find a good solution :)
regards,
herenvardo

Macro

WebmasterWorld Senior Member 10+ Year Member



 
Msg#: 15710 posted 7:33 pm on Aug 5, 2003 (gmt 0)

I'm afraid I don't understand some of this thread, especially the mod_rewrite bit. I assume it's about the server telling the browser which page to go to if the page is not specified.

However, consider the below, it's true:

I have an index.htm. Type in mysite.com/ and it takes you to index.htm and it's got a Google PR of 6.

If I put up a new page called index.html and create one link to it from within mysite.com Google picks it up and Index.html soon gets a PR of 6, even though it has different content and only one link from a PR5 page.

If I then put up a page called default.htm with yet another set of keywords and different content that page too gets an almost immediate PR of 6.

In the interim people typing mysite.com/ still get to index.htm

So you can "leverage" the PR of your homepage by adding other "default" type home pages like index.html, index.htm and default.htm?

Or am I completely wrong on this?

doc_z

WebmasterWorld Senior Member 10+ Year Member



 
Msg#: 15710 posted 7:52 pm on Aug 5, 2003 (gmt 0)

If I put up a new page called index.html and create one link to it from within mysite.com Google picks it up and Index.html soon gets a PR of 6, even though it has different content and only one link from a PR5 page.

It's an estimated - and not a real - PR.

Macro

WebmasterWorld Senior Member 10+ Year Member



 
Msg#: 15710 posted 12:38 pm on Aug 7, 2003 (gmt 0)

doc_z, I thought that the estimated PR is usually one number "less" than the PR of the page that is linking to it. Maybe someone else here knows different? Thanks.

Gus_R

10+ Year Member



 
Msg#: 15710 posted 1:54 pm on Aug 7, 2003 (gmt 0)

From dominic it's common to see white/grey bar for pages not indexed, I don't know if estimated pr is currently used.

doc_z

WebmasterWorld Senior Member 10+ Year Member



 
Msg#: 15710 posted 8:26 pm on Aug 7, 2003 (gmt 0)

"less" than the PR of the page that is linking to it

As far as I know (I never exanmined this in detail), guessed PR was calculated just from the PR of the domain and not from the page that is linking to it.

I did a little experiment: On one of my PR6 sites with an index.html (default) page I created an (orphan) index.htm page. It's got immediately PR6. Thus it's an estimated PR.

From dominic it's common to see white/grey bar for pages not indexed, I don't know if estimated pr is currently used.

Yes, in general that's true. However, this seems to be an exception.

g1smd

WebmasterWorld Senior Member g1smd us a WebmasterWorld Top Contributor of All Time 10+ Year Member



 
Msg#: 15710 posted 6:51 pm on Aug 10, 2003 (gmt 0)

>> I thought that the estimated PR is usually one number "less" than the PR of the page that is linking to it. <<

I believe that is true only if the page is one folder level lower than the main page.

Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Google / Google News Archive
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved