Forum Moderators: coopster

Message Too Old, No Replies

Automated searching via Google

...limits and conditions

         

dwhite

7:39 pm on Aug 7, 2008 (gmt 0)

10+ Year Member



Hope this is the right forum to post this.

I'm creating a web app, and hope to automatically extract information from Google's search results. For each search on my site, anything from 10-100 Google pages may be looked at. I would obviously credit Google.

Last time I heard, automated Google searching was frowned upon, but it was still possible as long as you didn't perform too many automated searches.

What I want to know is:

a: Is this still possible, and if so where is the link? It seems one can use AJAX with this page [code.google.com], but I just want to use PHP to gather the info, not AJAX.

b: What's the maximum number of megabytes or queries of search I can automatically extract a day without annoying Google? Last time I heard it was 1000 queries a day.

c: Can you pay for further automated querying?

d: For speed's sake, how many connections can I open at once to the Google servers? Obviously I'd love to open say 10 connections at once as this would speed up the search page downloads.

eelixduppy

1:53 am on Aug 8, 2008 (gmt 0)



The only way I know that PHP can do this is through the use of SOAP, however, Google is no longer giving away keys to the SOAP Search API [code.google.com] so you pretty much only have the AJAX API to use for your applications.

For your other questions, you are probably going to have to refer to the Terms of Use and the FAQ [code.google.com] for additional information.

dwhite

12:13 pm on Aug 19, 2008 (gmt 0)

10+ Year Member



I would just like to conclude this thread and put it out of its misery.

Okay never mind the language for now - after studying the FAQ [code.google.com], I couldn't even find out if it was possible to scrape results, let alone the limit on queries per day.

However, after studying the group for this API, it turns out that it's not posssible [groups.google.com]. That is, automated querying is not allowed to scrape search results. This is unlike their old deprecated SOAP API which was able to allow up to 1000 queries per day.

I am frustrated with the state of affairs because my ambitious project (which does not rival Google in any way) has been ground to a halt before it even started. :(

I might make a small fuss on their forum, but I doubt I'll have any hope of changing things.

eelixduppy

1:29 pm on Aug 19, 2008 (gmt 0)



Is Google the only search engine you'll use for your project? Perhaps you can use another? There are other big ones out there that have a SOAP API.

dwhite

3:21 pm on Aug 19, 2008 (gmt 0)

10+ Year Member



Google would be better really, but if you know of another search engine that allows preferably 50-1000 results at once, and 1000-10,000 automated queries per day, I would love to hear.