homepage Welcome to WebmasterWorld Guest from 54.197.94.241
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member
Visit PubCon.com
Home / Forums Index / Code, Content, and Presentation / Content Management
Forum Library, Charter, Moderators: ergophobe

Content Management Forum

    
Joomla URL problem with Google crawler
bramley




msg:4294395
 9:41 pm on Apr 8, 2011 (gmt 0)

Google crawler 'finds' wierd URLs such as :

URLs were here (minus domain) to show the wierd structure but I cant get them to appear properly.

Over 100,000 of them now in Google webmaster tools list !

Joomla is installed in subdir /beijing and that is the live_site setting. SEF is all off. The site works fine for human visitors.

Help !

[edited by: bramley at 9:52 pm (utc) on Apr 8, 2011]

 

Demaestro




msg:4294397
 9:47 pm on Apr 8, 2011 (gmt 0)

ew, have you tried crawling one of the pages that is generating these links in Google as googlebot?

Without seeing your site my guess would be it is crawling js links in a weird way OR one of your components is doing something weird.

bramley




msg:4294398
 9:51 pm on Apr 8, 2011 (gmt 0)

Thanks - the urls are shown here [forum.joomla.org...]

It adds spurious directories and finally the correct index.php?...

I have a few links that load things with ajax but these are either in addition to an href or just an onclick; but I'll think about it ...

bramley




msg:4294400
 9:54 pm on Apr 8, 2011 (gmt 0)

Re 'as googlebot' - how to do that ?

Demaestro




msg:4294409
 10:15 pm on Apr 8, 2011 (gmt 0)

In G's webmaster tools in the left hand nav

-> Diagnostics -> Fetch A Page AS Googlebot

Demaestro




msg:4294410
 10:18 pm on Apr 8, 2011 (gmt 0)

I checked out that Joomla forum posting

Are you still getting the jfolder errors too?

Oddly enough I ran into that today and fixed it so if you still get that message let me know, I'll send you the fix.

bramley




msg:4294423
 10:52 pm on Apr 8, 2011 (gmt 0)

Hi Demaestro,

Back from dinner.

jfolders errors? what do you mean ?

ps: I'll try the WMT also

Demaestro




msg:4294429
 10:56 pm on Apr 8, 2011 (gmt 0)

Oh maybe I misread it. in that other thread Dongle mentions Jfolder error I thought it was you.

It must have stood out to me since I had the same issue this morning with a site.

bramley




msg:4294437
 11:13 pm on Apr 8, 2011 (gmt 0)

Tried the main page in Fetch as googlebot, and the links there look fine

bramley




msg:4294438
 11:13 pm on Apr 8, 2011 (gmt 0)

Let me look at that thread ...

bramley




msg:4294439
 11:15 pm on Apr 8, 2011 (gmt 0)

That's something else I think

bramley




msg:4294444
 11:48 pm on Apr 8, 2011 (gmt 0)

i found some js ajax calls with relative urls - made absolute now.

would the googlebot really try to retrive them?

bramley




msg:4294477
 2:00 am on Apr 9, 2011 (gmt 0)

Found someone with a similar problem :

[dev.anything-digital.com...]

But the fix doesnt match the file in my Joomla (caused by a Joomla bug)

Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Code, Content, and Presentation / Content Management
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved