Forum Moderators: Robert Charlton & goodroi

Message Too Old, No Replies

How to find URLs with an underscore symbol for a site

         

sviba

4:15 am on Jan 30, 2009 (gmt 0)

10+ Year Member



Please let me know how can we find the total number of page index with the page URL contains "_" in the site.

I have tried the following query in the Google SE
but google doesn't show any results

inurl:_ site:www.mysite.com

It shows the results, If we use inurl:_ without site:

Thanks,
Vijay.

tedster

6:54 pm on Jan 30, 2009 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Same issue here, and I'm stumped on this one. I know that the underscore character gets very special treatment at Google - partly because it has been so commonly used in technical names, e.g FrontPage extensions.

I noticed some information about the similar allinurl: operator and it may be related to this limitation with inurl:

Note that [allinurl:] works on words, not url components. In particular, it ignores punctuation. Thus, [allinurl: foo/bar] will restrict the results to page with the words "foo" and "bar" in the url, but won't require that they be separated by a slash within that url, that they be adjacent, or that they be in that particular word order. There is currently no way to enforce these constraints.

[google.com...]

If you need to track down all your urls that include an underscore "_", you might use Xenu to spider your site and then work from that report.

sviba

11:44 am on Feb 2, 2009 (gmt 0)

10+ Year Member



My link structure like
http://www.example.com/AAA_YYY_ZZZ/id.html
http://www.example.com/AAA1_YYY1_ZZZ1/id.html

So i want to know how many pages with the symbol "_" are indexed by Google.

The xenu is used to list the URLs from the site.

[edited by: tedster at 6:21 pm (utc) on Feb. 2, 2009]
[edit reason] switch to example.com - it can never be owned [/edit]