Forum Moderators: Robert Charlton & goodroi
When searching with inurl:domain.com/dir/ results returned are 10 times larger than number of pages in that dir ever hosted.
Same is the case if I search with inurl:domaine.com/dir/ site:www.domain.com
For a directory that never had more than lets say 10.000 pages it would return 100.000 results!
I did try to manually review the result set but I can not get passed 100 pages and everything before that looks fine.
If anyone have any experience at all with the issue above it would be really helpful if shared what can be causing inurl to return much higher than expected and possible number of results.
Thanks!
We always used this syntax
<input . . . value="inurl:www.example.com/Departments/Psychology/" />
which would return only pages containing the search term in and beyond the url. For example, 13 hits on the search term "moore" . . .
Some time ago, a search began returning all pages in and beyond www.example.com/ For example, 34,000+ hits on the search term "moore" . . .
Changing "inurl:" to "+inurl:" fixed the problem. And so did changing "inurl:" to "site:" But we have no real idea why the search broke in the first place. Nor why this should fix it.
Something feels very fuzzy here.
<Sorry, no specifics - use example.com.
See Forum Charter [webmasterworld.com]>
[edited by: tedster at 4:56 pm (utc) on July 21, 2006]
In our case, we are searching beyond the domain on a Public Service Search at an educational institution.
Sorry this is so long-winded, but what we have is syntax that used to work and is now broken. So I want to be very clear about what the problem is.
At the domain level we search using:
<http://www.google.com/u/school>
embedded in a form:
<form action="http://www.google.com/u/school" method="get" name="f2">
<input type="hidden" value="school.edu;schoolalumnimagazine.com;schoolbears.collegesports.com" name="domains">
<input type="hidden" value="school.edu" name="sitesearch">
<input type="text" class="txt" name="query" value="Search the Web"
<input type="submit" class="but" name="sa" value="Search" />
</form>
and this works very well.
To search beyond this, for example, to search in a university department, Google suggested using inurl: as an interim solution.
I've only found this documented on the blog:
<http://tenderlover.blogspot.com/2005/11/google-site-search.html>
where the suggested syntax is
<type="hidden" name="hq" value="inurl:www.example.com/yourdirectory">
embedded in a form:
<form method="get" action="http://www.google.com/u/school">
<input type="hidden" name="hq" value="inurl:www.school.edu/Departments/departmentname/" />
<label for="query">Search the Department Web </label>
<input size="30" name="query" id="query" value="Enter search term..." onfocus="clearDefault(this)">
<input type="submit" name="submit" value="Go" />
</form>
which used to work but stopped working some time ago. We don't really know when. Searching a
department with a search term that should return 20 hits returns 30,000 hits instead! Various
departments have various versions of this form and none of them work. So it seems unlikely it is
a subtle syntax problem.
With a little experimentation on variations of inurl: or substituting site: for inurl: in the example above
I obtained these reurlts:
value="inurl:www.school.edu/Departments/departmentname/" 100,000,000 hits
value="+inurl:www.school.edu/Departments/departmentname/" 31 hits
value="site:www.school.edu/Departments/departmentname/" 23 hits
value="+site:www.school.edu/Departments/departmentname/" 23 hits
The difference between 31 and 23 in these results is that Google is finding pages outside of our
domain with both the department url and the search term in them. Obviously we get results
that make the most sense with:
value="site:www.school.edu/Departments/departmentname/"
but can anyone see any problem with this syntax? We have no idea why the original syntax
stopped working. And need to change this on dozens (and dozens) of department pages. (Don't you just hate going back to 80 people in your academic departments and saying "sorry, we got the syntax wrong again.")
thanks all.
site:www.domain.com -inurl:www returns a load of Supplemental results, but they are all www pages (shouldn't the -inurl:www exclude www pages?)
site:www.domain.com inurl:www returns www pages, but without any Supplemental results.
I see this for a large number of sites. I cannot yet confirm that it happens for all sites.