homepage Welcome to WebmasterWorld Guest from
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member

Home / Forums Index / Search Engines / Sitemaps, Meta Data, and robots.txt
Forum Library, Charter, Moderators: goodroi

Sitemaps, Meta Data, and robots.txt Forum

Persistent Crawl Errors

Msg#: 4094329 posted 10:12 pm on Mar 9, 2010 (gmt 0)

I have a Joomla site that has Search Engine Friendly URLs turned on but Google keeps on picking up urls with the index.php in it and sometimes with index.php/index.php in it. I already submitted them for deletion and they where processed but they have returned.

For example

404 (Not found)1 pagesMar 4, 2010
404 (Not found)1 pagesMar 5, 2010
404 (Not found)1 pagesMar 5, 2010
404 (Not found)1 pagesMar 5, 2010

The sitemap is at http://example.com/sitemap.xml and doesn't show any urls with index.php in it so I cannot figure out where it is coming from.

The main site is http://example.com

Any help or suggestion would be much appreciated.

[edited by: goodroi at 2:26 pm (utc) on Mar 15, 2010]
[edit reason] examplified [/edit]



WebmasterWorld Administrator goodroi us a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month

Msg#: 4094329 posted 2:28 pm on Mar 15, 2010 (gmt 0)

i'm not an expert with joomla since i prefer to use other cms. it seems to me that you should double check your settings & maybe also check out the htaccess file.

another tip is to check out your log files. it can help you if you know more about where these urls are coming from. is another website linking to you with a typo in the url? maybe one of your internal pages is broken and linking to these urls. these bad urls could be formed by many different ways. if i were you i would peek into the log files to see if there are any clues there.

Global Options:
 top home search open messages active posts  

Home / Forums Index / Search Engines / Sitemaps, Meta Data, and robots.txt
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved