Does Google dislike page links containing question mark (?)

Forum Moderators: Robert Charlton & goodroi

Message Too Old, No Replies

Does Google dislike page links containing question mark (?)

born2run

3:37 am on Oct 9, 2014 (gmt 0)

Hi, we have a dynamic site and some of the urls contain the character question mark like example:

http://example.com/q=1
http://example.com/node?testing

Links like the above.

Does google not recommend having such links? Can anyone explain? Thanks!

JD_Toims

11:50 am on Oct 9, 2014 (gmt 0)

The most direct answer I know of:

Which can Googlebot read better, static or dynamic URLs?
We've come across many webmasters who, like our friend, believed that static or static-looking URLs were an advantage for indexing and ranking their sites. This is based on the presumption that search engines have issues with crawling and analyzing URLs that include session IDs or source trackers. However, as a matter of fact, we at Google have made some progress in both areas. While static URLs might have a slight advantage in terms of clickthrough rates because users can easily read the urls, the decision to use database-driven websites does not imply a significant disadvantage in terms of indexing and ranking. Providing search engines with dynamic URLs should be favored over hiding parameters to make them look static.

[googlewebmastercentral.blogspot.com...]
This is actually from a faceted navigation post, but still addresses the question you have via best/worst practices:

Worst practice #2: Using directories or file paths rather than parameters to list values that don’t change page content.

Worst practice:

example.com/c123/s789/product?swedish-fish
(where /c123/ is a category, /s789/ is a sessionID that doesn’t change page content)

Good practice:

example.com/gummy-candy/product?item=swedish-fish&sid=789 (the directory, /gummy-candy/,changes the page content in a meaningful way)

Best practice:

example.com/product?item=swedish-fish&category=gummy-candy&sid=789 (URL parameters allow more flexibility for search engines to determine how to crawl efficiently)

It’s difficult for automated programs, like search engine crawlers, to differentiate useful values (e.g., “gummy-candy”) from the useless ones (e.g., “sessionID”) when values are placed directly in the path. On the other hand, URL parameters provide flexibility for search engines to quickly test and determine when a given value doesn’t require the crawler access all variations.

[googlewebmastercentral.blogspot.com...]
And, since "what about duplicate content" usually makes it's way into these type of threads:

Emphasis Added
When Google detects duplicate content, such as the pages in the example above, a Google algorithm groups the duplicate URLs into one cluster and selects what the algorithm thinks is the best URL to represent the cluster in search results (for example, Google might select the URL with the most content). Google then tries to consolidate what we know about the URLs in the cluster, such as link popularity, to the one representative URL to ultimately improve the accuracy of its page ranking and results in Google Search.

[support.google.com...]
Note the page/quote linked above not only indirectly indicates there is no such thing as a duplicate content penalty, it also indicates ranking signals from all duplicates in a grouping are applied to the selected URL, which means the SEO promoted idea of "link weight splitting" due to content being available via more than one URL is debunked too.

* Wishes SEO Snake Oil sales that do nothing but cause people looking for a solution to an issue to chase their tails could somehow be snap-banned...

anim8tr

1:54 pm on Oct 9, 2014 (gmt 0)

born2run, Google does NOT have a problem with links containing a "?", as long as they are used in a traditional sense (i.e. a querystring). The two examples you cite are not typical querystring links and may be confusing Google when it comes to indexing your content.

For example, the following link is incorrect:
http://example.com/q=1

It should be:
http://example.com/?q=1 OR http://example.com?q=1

The second one is also incorrect:
http://example.com/node?testing

It should be:
http://example.com/node?testing=[some value]

I would do some research on "querystrings" and see if this is what you are trying to do.

JD_Toims

2:28 pm on Oct 9, 2014 (gmt 0)

The first one looks like a simple typo to me, but according to RFC 2616 3.2.2 the second correction you gave [ http://www.example.com?q=1 ] is incorrect as a URI the user-agent does not need to adjust or edit when making the request.

If the abs_path is not present in the URL, it MUST be given as "/" when used as a Request-URI for a resource (section 5.1.2).

[ietf.org...]
[w3.org...]
Also, http://www.example.com/node?testing is *not* incorrect, even though it's not a best practice for SE understandability.

Emphasis Added
However, as query components are often used to carry identifying information in the form of "key=value" pairs...

http://tools.ietf.org/html/rfc3986#section-3.4

Note the use of often, rather than always or must.

born2run

4:56 am on Oct 10, 2014 (gmt 0)

Thanks guys for the help!

phranque

5:57 am on Oct 13, 2014 (gmt 0)

http://tools.ietf.org/html/rfc3986#section-3.0

The query component is indicated by the first question mark ("?") character and terminated by a number sign ("#") character or by the end of the URI.

it's been >10 years since query strings per se were an issue for google.