Forum Library, Charter, Moderators: Robert Charlton & aakk9999 & brotherhood of lan & goodroi

Google SEO News and Discussion Forum

This 38 message thread spans 2 pages.
URL Parameter function within Google Webmaster Tools
member22




msg:4604638
 10:05 am on Aug 24, 2013 (gmt 0)

Hello,

I have been searching through the forum and can't find what I am looking for regarding the URL parameter function in the webmaster tool.

I use Joomla for my website, and Google found a list of parameters (option, id, Itemid). However, instead of using the parameters that Google is giving me, I would like to use numbers (for example, I would like to add the number 1 as a parameter).

For example in this address index.php?option=com_content&view=article&id=129&Itemid=32

Itemid is listed, but if I decide to create a new URL parameter with the number 1, is Google going to recognize it?

Does the URL parameter function recognize numbers?

How long will it take Google to show the URLs monitored once I set a new parameter like 1 or 2?

Thank you

 

aakk9999




msg:4605819
 4:36 pm on Aug 29, 2013 (gmt 0)

It took a few days to index those 600 pages (due to a bug when I upgraded Joomla), but you are telling me it can take years to remove them. That is terrible...

Perhaps not years, but certainly much longer than it takes to index them.

In other words, what is best to use: the disallow in robots.txt, the URL removal tool, or URL Parameters? Or should I use all 3 of them at once to give myself the best chance of getting the penalty I have removed as quickly as possible?

If you want to remove these pages from Google's index, you should either noindex these pages (see above from JD_Toims) or block them in robots.txt AND use the URL Removal tool.

There have been several threads on the problem you had with your Joomla upgrade. Have you considered becoming a supporter and putting your site up for review in the Review my site [webmasterworld.com] forum? In that forum you can post the URL of your site, and you would probably get more focused responses.

Robert Charlton




msg:4605861
 7:48 pm on Aug 29, 2013 (gmt 0)

To comment on some questions not related to the URL Parameter function, but to other questions I see you're asking....

I currently have pages indexed in google with the following description : " A description for this result is not available because of this site's robots.txt learn more. " is it because of the disallow I have

These generally result when a url is disallowed by robots.txt, but (as JD_Toims mentioned) there are existing links to the url from pages that are accessible to Googlebot.

The urls could be urls generated by earlier versions of your CMS which had gotten indexed. I have no idea if this is the case. It would require someone familiar with your CMS to identify the patterns. If this is the case, though, I'm not sure how you would use meta robots noindex to remove them, as these are variants of "pages" that no longer exist, and there would be nowhere to place the meta robots tags. It seems to me that you should have the requests for such urls 301 redirected by the server to your current preferred "canonical" versions.

Note also, that you can't combine meta robots noindex and robots.txt. A robots.txt disallow would prevent Googlebot from spidering the url, either to discover the meta robots noindex on the page... or else to discover that the page is gone and the request returns a 404.

Again, this suggests to me that using 301 redirects on the server to a single canonical form might be the most efficient way of handling this... assuming that you can identify all of the patterns and likely url variants. If this is a problem that occurred widely during a Joomla upgrade, it's very likely that the patterns have been catalogued somewhere in the Joomla community.

PS: I'd be very wary about using the url removal tool.
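
As an illustration of the 301 suggestion above, here is a minimal .htaccess sketch using mod_rewrite. It assumes, purely for the sake of example, that the duplicates are copies of the article URL quoted earlier in this thread that differ only in their Itemid value, and that Itemid=32 is the preferred form. Neither assumption is confirmed by the thread; the real patterns would have to be identified from the site's own CMS and logs.

```apache
RewriteEngine On

# Hypothetical pattern: the same article (id=129) requested with any
# Itemid other than the assumed-preferred value 32.
# The negative lookahead (?!32$) keeps the preferred URL from matching,
# so the redirect cannot loop on itself.
# Note: this only matches when the parameters appear in exactly this order.
RewriteCond %{QUERY_STRING} ^option=com_content&view=article&id=129&Itemid=(?!32$)\d+$
RewriteRule ^index\.php$ /index.php?option=com_content&view=article&id=129&Itemid=32 [R=301,L]
```

A real site would need one rule, or one more general rule, per identified pattern; this only shows the shape of such a redirect.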

lucy24




msg:4605880
 8:26 pm on Aug 29, 2013 (gmt 0)

There have been several threads on the problem you had with your Joomla upgrade. Have you considered becoming a supporter and putting your site up for review

At a minimum there's the Content Management [webmasterworld.com] forum on the free side. It's littered with joomla-related questions.

I get the impression-- based partly on information from outside this thread-- that the underlying problem has to do with the CMS returning valid pages when given invalid values for legitimate parameters. This can't be fixed in gwt; it's a combination of htaccess (for existing problems) and fixing the upgrade.

member22




msg:4606033
 7:05 am on Aug 30, 2013 (gmt 0)

JD_Toims

Thank you for your answer about the line of code to add, but the issue is that I don't know which directory the problem is coming from, because Googlebot has surfed our FTP in a certain way and created pages that I think are random (I am sure they are not, but it is impossible to figure out which way it surfed and why). Is it still possible to use your method?

Can you give an example of what you would replace "the-path" and "to-the-directory" with, for example on www.cnn.com, so that I understand?

Thank you,

<LocationMatch "^/the-path/to-the-directory/to-noindex">
Header set X-Robots-Tag: "noindex"
</LocationMatch>

member22




msg:4606038
 7:13 am on Aug 30, 2013 (gmt 0)

This morning my URL Parameters report is showing more URLs monitored than a week ago, when I decided to set all my Itemid parameters to "No URLs". Does it mean Google is finding new duplicate content pages?

JD_Toims




msg:4606040
 7:46 am on Aug 30, 2013 (gmt 0)

Thank you for your answer about the line of code to add, but the issue is that I don't know which directory the problem is coming from, because Googlebot has surfed our FTP in a certain way and created pages that I think are random

Personally, if it was for files/directories I was not using I would likely go with a negative match along the lines of !^/something/i-use/ where ! is *not* a match to the pattern I normally use. Hope that makes sense.
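
One way the negative match described above might look in .htaccess is sketched below, using mod_setenvif and mod_headers rather than LocationMatch (which is not available in .htaccess). The path /something/i-use/ is the placeholder from the post, and the variable name keep_indexed is made up for this example.

```apache
# Mark every request under the path that is actually in use.
# "keep_indexed" is a hypothetical variable name.
SetEnvIf Request_URI "^/something/i-use/" keep_indexed

# Send the noindex header on every response that was NOT marked above.
Header set X-Robots-Tag "noindex" env=!keep_indexed
```

As written, this would noindex everything outside that one path, including the home page, so the pattern would need to cover every URL that should stay in the index. The affected URLs also must not be disallowed in robots.txt, or Googlebot will never fetch them and see the header.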

lucy24




msg:4606048
 8:09 am on Aug 30, 2013 (gmt 0)

googlebot has surfed our FTP in a certain way

Can someone translate, please? :(

<LocationMatch "^/the-path/to-the-directory/to-noindex">
Header set X-Robots-Tag: "noindex"
</LocationMatch>

What the bleep? I thought all this was happening in htaccess on shared hosting.

dougwilson




msg:4608322
 3:18 pm on Sep 8, 2013 (gmt 0)

The answer to this question:

"if selecting the NO URL in the URL Parameter will remove those"

is no

Using the parameter tool doesn't affect what's already indexed.

Using robots.txt to Disallow: /folder

We can then use removal tool to remove the directory from index...

As in "domain/folder/"

Same process for "domain/folder/pages", "domain/folder/pages/2" and so on

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
© Webmaster World 1996-2014 all rights reserved