Welcome to WebmasterWorld Guest from 54.147.63.124

Forum Moderators: ergophobe

Message Too Old, No Replies

Joomla URL problem with Google crawler

     
9:41 pm on Apr 8, 2011 (gmt 0)

Junior Member

joined:Mar 30, 2011
posts: 153
votes: 0


Google crawler 'finds' wierd URLs such as :

URLs were here (minus domain) to show the wierd structure but I cant get them to appear properly.

Over 100,000 of them now in Google webmaster tools list !

Joomla is installed in subdir /beijing and that is the live_site setting. SEF is all off. The site works fine for human visitors.

Help !

[edited by: bramley at 9:52 pm (utc) on Apr 8, 2011]

9:47 pm on Apr 8, 2011 (gmt 0)

Senior Member

WebmasterWorld Senior Member 10+ Year Member

joined:Dec 15, 2003
posts:2606
votes: 0


ew, have you tried crawling one of the pages that is generating these links in Google as googlebot?

Without seeing your site my guess would be it is crawling js links in a weird way OR one of your components is doing something weird.
9:51 pm on Apr 8, 2011 (gmt 0)

Junior Member

joined:Mar 30, 2011
posts: 153
votes: 0


Thanks - the urls are shown here [forum.joomla.org...]

It adds spurious directories and finally the correct index.php?...

I have a few links that load things with ajax but these are either in addition to an href or just an onclick; but I'll think about it ...
9:54 pm on Apr 8, 2011 (gmt 0)

Junior Member

joined:Mar 30, 2011
posts: 153
votes: 0


Re 'as googlebot' - how to do that ?
10:15 pm on Apr 8, 2011 (gmt 0)

Senior Member

WebmasterWorld Senior Member 10+ Year Member

joined:Dec 15, 2003
posts:2606
votes: 0


In G's webmaster tools in the left hand nav

-> Diagnostics -> Fetch A Page AS Googlebot
10:18 pm on Apr 8, 2011 (gmt 0)

Senior Member

WebmasterWorld Senior Member 10+ Year Member

joined:Dec 15, 2003
posts:2606
votes: 0


I checked out that Joomla forum posting

Are you still getting the jfolder errors too?

Oddly enough I ran into that today and fixed it so if you still get that message let me know, I'll send you the fix.
10:52 pm on Apr 8, 2011 (gmt 0)

Junior Member

joined:Mar 30, 2011
posts: 153
votes: 0


Hi Demaestro,

Back from dinner.

jfolders errors? what do you mean ?

ps: I'll try the WMT also
10:56 pm on Apr 8, 2011 (gmt 0)

Senior Member

WebmasterWorld Senior Member 10+ Year Member

joined:Dec 15, 2003
posts:2606
votes: 0


Oh maybe I misread it. in that other thread Dongle mentions Jfolder error I thought it was you.

It must have stood out to me since I had the same issue this morning with a site.
11:13 pm on Apr 8, 2011 (gmt 0)

Junior Member

joined:Mar 30, 2011
posts: 153
votes: 0


Tried the main page in Fetch as googlebot, and the links there look fine
11:13 pm on Apr 8, 2011 (gmt 0)

Junior Member

joined:Mar 30, 2011
posts: 153
votes: 0


Let me look at that thread ...
11:15 pm on Apr 8, 2011 (gmt 0)

Junior Member

joined:Mar 30, 2011
posts: 153
votes: 0


That's something else I think
11:48 pm on Apr 8, 2011 (gmt 0)

Junior Member

joined:Mar 30, 2011
posts: 153
votes: 0


i found some js ajax calls with relative urls - made absolute now.

would the googlebot really try to retrive them?
2:00 am on Apr 9, 2011 (gmt 0)

Junior Member

joined:Mar 30, 2011
posts: 153
votes: 0


Found someone with a similar problem :

[dev.anything-digital.com...]

But the fix doesnt match the file in my Joomla (caused by a Joomla bug)