Forum Moderators: coopster
So last night I wrote a script that is supposed to submit the phrase, look through the results page by page, and finally show me the page that has me on it.
What happens instead is no matter what I supply for the starting point, the first page comes back and they've fooled with the links - the navigation links at the bottom contain the domain name which is doing the submission instead of 'google.com'.
It occurred to me after triple-checking my script for errors that I could see how google might take steps to keep people from doing such a thing - while my intentions are honorable, I could see how someone could use the methodology to display search results sans advertisements.
Do you think that this is the case - can they detect that the query is via curl, they don't like it, and it's hopeless? I think I can accomplish the same thing with javascript, but JS isn't my 'native tongue' so I'd prefer a php solution. This is my first foray into curl so I don't mind tweaking options etc. if there's hope, but I don't want to waste my time if they're going to keep shutting me down.
What they're talking about in the article is when a page is crafted for their bot to read that winds up influencing other searches.
Yes, I'm passing the agent along from $_SERVER.
Thanks for the replies, capulet_x and joelgreen.
But you have to hit them pretty hard to get this to show up so most people will never see it.
JAG