homepage Welcome to WebmasterWorld Guest from
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member
Visit PubCon.com
Home / Forums Index / Code, Content, and Presentation / Content Management
Forum Library, Charter, Moderators: ergophobe

Content Management Forum

Joomla URL problem with Google crawler

 9:41 pm on Apr 8, 2011 (gmt 0)

Google crawler 'finds' wierd URLs such as :

URLs were here (minus domain) to show the wierd structure but I cant get them to appear properly.

Over 100,000 of them now in Google webmaster tools list !

Joomla is installed in subdir /beijing and that is the live_site setting. SEF is all off. The site works fine for human visitors.

Help !

[edited by: bramley at 9:52 pm (utc) on Apr 8, 2011]



 9:47 pm on Apr 8, 2011 (gmt 0)

ew, have you tried crawling one of the pages that is generating these links in Google as googlebot?

Without seeing your site my guess would be it is crawling js links in a weird way OR one of your components is doing something weird.


 9:51 pm on Apr 8, 2011 (gmt 0)

Thanks - the urls are shown here [forum.joomla.org...]

It adds spurious directories and finally the correct index.php?...

I have a few links that load things with ajax but these are either in addition to an href or just an onclick; but I'll think about it ...


 9:54 pm on Apr 8, 2011 (gmt 0)

Re 'as googlebot' - how to do that ?


 10:15 pm on Apr 8, 2011 (gmt 0)

In G's webmaster tools in the left hand nav

-> Diagnostics -> Fetch A Page AS Googlebot


 10:18 pm on Apr 8, 2011 (gmt 0)

I checked out that Joomla forum posting

Are you still getting the jfolder errors too?

Oddly enough I ran into that today and fixed it so if you still get that message let me know, I'll send you the fix.


 10:52 pm on Apr 8, 2011 (gmt 0)

Hi Demaestro,

Back from dinner.

jfolders errors? what do you mean ?

ps: I'll try the WMT also


 10:56 pm on Apr 8, 2011 (gmt 0)

Oh maybe I misread it. in that other thread Dongle mentions Jfolder error I thought it was you.

It must have stood out to me since I had the same issue this morning with a site.


 11:13 pm on Apr 8, 2011 (gmt 0)

Tried the main page in Fetch as googlebot, and the links there look fine


 11:13 pm on Apr 8, 2011 (gmt 0)

Let me look at that thread ...


 11:15 pm on Apr 8, 2011 (gmt 0)

That's something else I think


 11:48 pm on Apr 8, 2011 (gmt 0)

i found some js ajax calls with relative urls - made absolute now.

would the googlebot really try to retrive them?


 2:00 am on Apr 9, 2011 (gmt 0)

Found someone with a similar problem :


But the fix doesnt match the file in my Joomla (caused by a Joomla bug)

Global Options:
 top home search open messages active posts  

Home / Forums Index / Code, Content, and Presentation / Content Management
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved