Forum Moderators: Robert Charlton & goodroi

Message Too Old, No Replies

strange links scanned from google - extra slashes

..funny thing.... no such links exist

         

omoutop

7:52 am on Feb 9, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Hi all and thanks for any help in advance.

Recently it came to my attention something strange about some links the google had scanned from my site.

While the original link was:
http://www.example.com/some-page.htm
the google sitemap suggested it was:
http://www.example.com/folder////some-page.htm

Now half the site is made with php (dynamic creation of links) and apache's mod_rewrite (to construct the htm pages and virtual folders out of the php pages).

According to the keywords ("some", "page", "folder"), I know in what page the "strange" link should appear.
After close examination of the dynamic page, and after thorough tests (using all sorts of possible outcomes), I didn't observed any strange behavior, or error in creating these links.

Now, the error link scanned by the Google can only be created by one (1) specific page only, unique for the specific keywords. Yet such link isn't appearing at all on the page (and not in the source code also - i have checked for possible blank links).

Can anyone have any suggestions on what to look for? Or if this is some mis-behavior of Google? Or anything else that might prove usufull? Again thanks for any info/insight that you might share with me.

[edited by: tedster at 9:28 am (utc) on Feb. 9, 2007]
[edit reason] use example.com [/edit]

bwnbwn

3:09 pm on Feb 9, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



we get them as well in dynamic sites and static sites got to do something with the bot error I quit worring about it. They will go away

g1smd

4:17 pm on Feb 9, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Check out any redirects that you have in place.

A redirect "generates" a target URL.

If that target URL responds "200 OK" then the content at that URL gets indexed.

MThiessen

4:24 pm on Feb 9, 2007 (gmt 0)

10+ Year Member



http://www.example.com/folder////some-page.htm

I had this problem once, drove me crazy till I found out why it happened.

It can from internal relational link errors.

for instance doing a .././ when all I needed was a ../ Since I switched later to a base tag, I couldn't find the problem because all the links with bad relation links resolved, making them NOT look broken. They looked broken to google though.

I isolated this using the Dreamweaver sitemap plugin. It identified every single one of them and when I fixed it, google stopped showing the // in the links.

Check all your relational links.

omoutop

11:27 am on Feb 12, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



g1smd: we don't have ny redirects in the static pages (although we got plenty in the dynamic ones).

MThiessen: this sounds logical.... i will follow your advise and check the internal links of the static pages (the dynamic pages use absolute paths for their links since we rely heavily on mod_rewrite)

Thanks again for the hints.

g1smd

1:17 pm on Feb 12, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



How about redirects in your .htaccess file?

omoutop

1:24 pm on Feb 12, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



well we have some htaccess redirects, but since you mentioned it, i have checked them and none refers to my problematic pages... so I guess I am clean from that side.

MrSpeed

2:56 pm on Feb 12, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Run Xenu's Link Sleuth on your site to see if there are any "leaks" internally causing it. No matter how clean I think my rewrites are I find something.

omoutop

7:12 am on Feb 13, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



I run Xenu before postinmg here :)
thats why i am so confused..... currently i am in the procss of changing all the relative links to absolute paths