homepage Welcome to WebmasterWorld Guest from 54.145.243.51
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member
Home / Forums Index / Code, Content, and Presentation / HTML
Forum Library, Charter, Moderators: incrediBILL

HTML Forum

    
Facing crawl errors with showing missing category. How to Fix?
shobee



 
Msg#: 4656489 posted 6:42 pm on Mar 23, 2014 (gmt 0)

Hello All,

I am working on a WordPress site and facing crawl errors at GWT which is showing URLs of my site without category and went on not found 404 pages.

Example is

GWT crawl error URL

www.xyz.com/world-market-center-social-media-information-sheet/

Actual URL is

www.xyz.com/socialmedia/world-market-center-social-media-information-sheet/

So most of my URLs at GWT showing with missing category that cause of crawl error.

How can i fix this please help.

 

not2easy

WebmasterWorld Administrator 5+ Year Member Top Contributors Of The Month



 
Msg#: 4656489 posted 6:53 pm on Mar 23, 2014 (gmt 0)

Look at your sitemaps and see that you aren't listing the same content different ways. You don't want G to crawl the same content as a tag, archive, category and page or post. Pick one format and noindex the others. It can help if you block crawling (Disallow in robots.txt) for the URLs you don't want indexed. If you need a plugin for it, I suggest YoustSEO but there are others.

lucy24

WebmasterWorld Senior Member lucy24 us a WebmasterWorld Top Contributor of All Time Top Contributors Of The Month



 
Msg#: 4656489 posted 8:27 pm on Mar 23, 2014 (gmt 0)

Does google give any hints about how it learned about the nonexistent URL? Sometimes it just makes things up. But other times you'll find it listed as either "in sitemap" (meaning some sitemap within the present geological era, not necessarily your current one) or some referring page.

If there are lots of requests and you can't pinpoint the source, it may be necessary to edit your htaccess file to put in some manual redirects. (Or, if nobody but google is making the requests, serve a 410 to make them go away.) This is Not Fun when you're working with a cms that has its own ideas about what belongs in the htaccess file.

Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Code, Content, and Presentation / HTML
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved