Google Webmaster 404 errors due to ".."

     
3:11 pm on Jan 5, 2012 (gmt 0)

New User

5+ Year Member

joined:Dec 16, 2009
posts: 36
votes: 0


First it was just one site. Then another. Now, this has become a real nuisance!

Search sites abbreviate URLs with a trailing "..". (They often seem to be out of AU or NZ, but that may just be coincidence.) Google finds the abbreviated links, crawls them, and reports 404 errors in Webmaster Tools.

Is there any workaround for this other than adding 301 redirects? I've got about 600 to add right now. :-(

I wrote one guy in NZ using his contact page, and he said mine was the 3rd email he'd received about the topic, and that he'd "try to fix it." Guess how that turned out? More 404 errors for me to add to my .htaccess file.

Here is an example of a link going to my site, to illustrate:

http://www.example.com/product/a/Tripzonic-F..

where the real URL is about 20 characters longer than what appears before the ".."

Any thoughts on how to handle this?

[edited by: tedster at 5:34 pm (utc) on Jan 5, 2012]

5:37 pm on Jan 5, 2012 (gmt 0)

Senior Member

WebmasterWorld Senior Member tedster is a WebmasterWorld Top Contributor of All Time 10+ Year Member

joined:May 26, 2000
posts:37301
votes: 0


Yes - you can ignore it unless the backlink is valuable.

You don't need to "fix" 404 crawl errors coming from external sites. This has been confirmed by several Google spokespeople on their own Webmaster Forums. So there's no need to generate massive .htaccess files and play whack-a-mole with this nonsense.
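For the occasional backlink that is worth rescuing, one way to avoid hand-writing redirects is to match the truncated path against the site's real URLs and redirect only when exactly one candidate fits. The following is a minimal sketch, not anything from this thread: `resolve_truncated` and the `KNOWN` list are hypothetical names, and it assumes the truncation is always a trailing ".." and that you can produce a list of your valid paths.

```python
from typing import Iterable, Optional


def resolve_truncated(path: str, known_paths: Iterable[str]) -> Optional[str]:
    """Return the single known path that a '..'-truncated path abbreviates.

    Returns None when the path is not truncated, or when zero or several
    known paths share the prefix (an ambiguous guess is worse than a 404).
    """
    if not path.endswith(".."):
        return None
    prefix = path.rstrip(".")  # drop the trailing dots added by the linking site
    if not prefix:
        return None
    matches = [p for p in known_paths if p.startswith(prefix)]
    return matches[0] if len(matches) == 1 else None


# Hypothetical site inventory, for illustration only.
KNOWN = [
    "/product/a/alpha-widget-deluxe",
    "/product/a/alpha-gadget",
    "/product/b/beta-widget",
]

print(resolve_truncated("/product/b/beta-w..", KNOWN))  # unique match
print(resolve_truncated("/product/a/alpha-..", KNOWN))  # ambiguous, so None
```

Anything that comes back as None can simply keep returning 404.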
7:43 pm on Jan 5, 2012 (gmt 0)

Senior Member

WebmasterWorld Senior Member g1smd is a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month

joined:July 3, 2002
posts:18903
votes: 0


To be clear, is it the URL in the href that is being truncated, or just the anchor text?

If it is the anchor text, then simply ignore it.
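A quick way to tell the two cases apart is to parse the linking page and compare each link's href with its visible text. This is only a sketch using Python's standard `html.parser`; the class name, the `classify` helper, and the sample markup are made up for illustration, and it only looks at `<a>` elements, so URLs that appear as plain text outside any link are not reported.

```python
from html.parser import HTMLParser


class TruncationAudit(HTMLParser):
    """Collect (href, anchor_text) pairs for every <a> element."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None  # None means "not inside an <a> right now"
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href") or ""
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def classify(html):
    """Label each link whose href or visible text ends in '..'."""
    parser = TruncationAudit()
    parser.feed(html)
    parser.close()
    report = []
    for href, text in parser.links:
        if href.endswith(".."):
            report.append((href, "href truncated"))
        elif text.endswith(".."):
            report.append((href, "anchor text only"))
    return report


# Made-up markup showing both cases.
SAMPLE = (
    '<a href="http://www.example.com/page-one">http://www.example.com/pa..</a> '
    '<a href="http://www.example.com/pa..">page one</a>'
)

for href, verdict in classify(SAMPLE):
    print(verdict, href)
```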
8:03 pm on Jan 5, 2012 (gmt 0)

New User

5+ Year Member

joined:Dec 16, 2009
posts: 36
votes: 0


@g1smd great question - it's actually just plain text on the page; Google is picking it up and pulling URLs out of it. See:

[justsearchit.com.au...]

as an example.
8:09 pm on Jan 5, 2012 (gmt 0)

Senior Member

WebmasterWorld Senior Member g1smd is a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month

joined:July 3, 2002
posts:18903
votes: 0


Google started pulling anything that looks like a URL from text on pages and testing the server response for their "guess" only a few weeks ago.

The correct response for junk URLs is to return 404. If Google gets tens of millions of 404 responses for these, hopefully they will stop this silliness.
10:06 pm on Jan 5, 2012 (gmt 0)

Moderator This Forum from US 

WebmasterWorld Administrator robert_charlton is a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month

joined:Nov 11, 2000
posts:11317
votes: 167


Google started pulling anything that looks like a URL from text on pages and testing the server response for their "guess" only a few weeks ago.

Seems like it's just occurring to Google that this is creating some unnecessary concern, and they are "looking into ways of making that a bit clearer."

This was just discussed at length in another thread here....

Google Following URLs Without Hyperlinks
http://www.webmasterworld.com/google/4389424.htm

There's also a link in the thread to some additional comments by Google's John Mueller on the subject.
11:42 pm on Jan 5, 2012 (gmt 0)

New User

5+ Year Member

joined:Dec 16, 2009
posts: 36
votes: 0


@g1smd Trust me, it wasn't a few weeks ago - I started cataloging these bizarre searches (oh, the racy sex ones, you wouldn't believe!) last summer when I first took notice of it.

@Robert Thanks for the link, will read there now. I sure hope Google figures this out. My Webmaster Tools reports since last summer are just splattered with ".." 404 URLs.

By the way, one thing I didn't mention and that hasn't been brought up: it's not easy to just dismiss these searches as a nuisance, since search is one of the most resource-intensive things for most servers. In my case, I have two fallbacks when a search finds 0 results: parsing the search phrase to find near matches, and if that fails, looking for "did you mean" matches. So 2,000 of these bogus Googlebot hits in a day add up.
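One way to keep that cost down would be to skip both fallbacks whenever the query looks truncated and just return the empty result (and a 404). A rough sketch, assuming the truncation always shows up as a trailing ".." or ellipsis character; `looks_truncated`, `search`, and the three stub functions are hypothetical stand-ins, not the actual site code.

```python
def looks_truncated(query):
    """True when the query ends the way abbreviated URLs do."""
    stripped = query.strip()
    return stripped.endswith("..") or stripped.endswith("\u2026")


def search(query, exact, near_match, did_you_mean):
    """Run the cheap exact lookup first; only real queries reach the fallbacks."""
    results = exact(query)
    if results or looks_truncated(query):
        return results
    return near_match(query) or did_you_mean(query)


# Stub lookups that record which stages ran (illustration only).
calls = []


def exact(query):
    calls.append("exact")
    return []


def near_match(query):
    calls.append("near_match")
    return []


def did_you_mean(query):
    calls.append("did_you_mean")
    return ["suggestion"]


print(search("Tripzonic-F..", exact, near_match, did_you_mean), calls)
```

With the stubs above, the truncated query only ever touches the exact lookup, while a normal query that finds nothing still falls through to both fallbacks.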
12:05 am on Jan 6, 2012 (gmt 0)

Senior Member

WebmasterWorld Senior Member g1smd is a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month

joined:July 3, 2002
posts:18903
votes: 0


I meant "months", not "weeks".

I didn't actually see any of this until more recently than some other people.