Massive Cuil Search Engine Launched

Forum Moderators: bakedjake

Message Too Old, No Replies

Massive Cuil Search Engine Launched

"World's Largest Search Engine" is the claim

kamikaze Optimizer

4:43 am on Jul 28, 2008 (gmt 0)

SAN FRANCISCO — In her two years at Google, Anna Patterson helped design and build some of the pillars of the company’s search engine, including its large index of Web pages and some of the formulas it uses for ranking search results.
Skip to next paragraph
The makers of the Cuil search engine say it should provide better results and show them in a more attractive manner.
Now, along with her husband, Tom Costello, and a few other Google alumni, she is trying to upstage her former employer.
On Monday, their company, Cuil, is unveiling a search engine that they promise will be more comprehensive than Google’s and that they hope will give its users more relevant results.

[nytimes.com...]

zett

2:48 pm on Jul 31, 2008 (gmt 0)

It looks as if Cuil has chosen about x images for a particular topic and is using those x images to display next to sites.

Exactly. They apparently determine the topic of a query, pull matching images for that query from their database, and splash these image files randomly across the result page next to whatever result there may be (and yes, it could also be your own site).

I found that the location of the images is random, and that they typically use the same set of images for a given query. Sometimes a new image or two comes up on page 2, but beyond page 3 there are rarely new images. They just keep displaying the same images.

It's interesting - you can find your stuff easily once you found one photo. You suddenly get an idea which parts of your sites have been scraped, and which keywords they use. A good starting point is to use the first three words on the title of a page.

To me all of this is a clear violation of copyright. I am currently running an inquiry with an IP lawyer specialized on photography. You see, Cuil's pockets still carry some VC money. Instead of spending it for muffins, they might as well hand over some of that cash to us?!

*evil grin*

piatkow

3:12 pm on Jul 31, 2008 (gmt 0)

Is it just my connection or have they pulled the images? All images currently appear to be broken when I search.

When images were appearing some of my pages had their own images (sometimes ads from the pages) while others had images from elsewhere. The external images seemed generally relevant(ish) but I was a bit annoyed to have an image of a book that I don't sell showing up.

zett

3:19 pm on Jul 31, 2008 (gmt 0)

Is it just my connection or have they pulled the images?

Must be your connection. I am still seeing the images, even after having cleared the cache. But I am seeing an increasing number of [X] images. Must be the NOIs flying in right now.

Frank_Rizzo

4:26 pm on Jul 31, 2008 (gmt 0)

The image selection is clearly based on the alt tag used for the image.

My logo has an alt tag of "Mysite - Widget Foods"

If I search for Widget Foods any site listed which mentions the text widget foods could show my logo next to it.

I counted 8 on the first 10 pages, and the logo is even displayed all the way down to the last page on 23.

---

About a year or two ago my site got hammered by cuil. Reading a few posts here it was suggested that the site had a bad bot. I banned that bot - blocked the IP ranges.

I remember trying to ask the site admins (cuil) to stop indexing but the reply was curt. I seem to remember that their site splash screen at the time showed a crusty old farmer ploughing the furrows - like he was saying "Get orrff moi land"

---

One other point: Is it Cuil or Cuill?

I always thought Cuill but the site redirects to Cuil. If you look at the wayback machine Anna, Russell and Tom describe it as Cuill.

If they can't make up their minds what the site is called how the heck can they decide what to show for images?

[edited by: Frank_Rizzo at 4:27 pm (utc) on July 31, 2008]

Quadrille

4:48 pm on Jul 31, 2008 (gmt 0)

First they made up their minds it was cuill
Then they decided it was Cuil.

Buy One, Get One Free!

mahlon

5:11 pm on Jul 31, 2008 (gmt 0)

Wow! They should pull the images! I see competitors logos posted for an e commerce site I used to work on. My product photos posted for competitor sites etc....

[edited by: mahlon at 5:19 pm (utc) on July 31, 2008]

Tourz

5:19 pm on Jul 31, 2008 (gmt 0)

The image thing has got to be on purpose. They are just messing around.

Reno

5:40 pm on Jul 31, 2008 (gmt 0)

Of all the various aspects of their so-called "advanced" algo, you would think that the monumental image meltdown would be about the easiest to fix. As ronin said in an earlier post:

If Image does not come from Site A, do not display it next to Site A listing.

The fact that they didn't figure that out LONG before their launch date is, to say the least, worrisome, and does not bode well for any kind of long term presence in the highly competitive world of search engine development...

...............................

pageoneresults

5:45 pm on Jul 31, 2008 (gmt 0)

Web Results 1 - 10 of about 1,530,000 for cuil.
Blog Results 1 - 10 of about 86,567 for cuil.

We keep feeding the frenzy too. I've just come to accept those things which I cannot change.

I "could" also prevent them (the things I cannot change) from happening. ;)

Brett_Tabke

6:22 pm on Jul 31, 2008 (gmt 0)

This is the first viable and brightest search engine launch since Teoma (6+years ago). Damn - it is nice eh? I forgot there could actually be some competition in this space.

Samizdata

6:24 pm on Jul 31, 2008 (gmt 0)

I "could" also prevent them (the things I cannot change) from happening

I have had Twiceler blocked on all my sites since the day it first turned up, ignored robots.txt and blundered it's aggressive way into a spider trap - I don't care that my sites are not in their index.

So presumably Cuil(l) will not be using MY images to promote other people's websites.

As I understand it they host the images they use - or stole - on their own servers, so if you block their bot now it wil be too late. It stinks, but you can't put the genie back in the bottle.

I suspect that the other funny smell is $33 million going up in smoke.

...

Murdoch

6:29 pm on Jul 31, 2008 (gmt 0)

If Image does not come from Site A, do not display it next to Site A listing.

That's the problem though, the images are not actually from the sites. Cuil ripped those images, placed them on their own server, and are using some kind of categorization to bring them up. That's why you see a lot of the same image over and over again for the wrong sites.

I'm sure they did all this just to save bandwidth, page load times, ineffective images due to hotlink protection, etc. But the resulting mess is not only bad because they aren't showing YOUR pictures next to another site, they are actually showing pictures they STOLE from your site on sites that come up for the search terms.

That seems to me to be even bigger of a crime. At least Google Images references the original URL in the thumbnail (yes I know along with their own).

pageoneresults

7:37 pm on Jul 31, 2008 (gmt 0)

This is the first viable and brightest search engine launch since Teoma (6+years ago). Damn - it is nice eh? I forgot there could actually be some competition in this space.

Brett? Since there was no ;) or :) after that, we'll assume that you find the results satisfactory. They must have cleaned up the area you were searching in. I've seen that happening over the past few days, lots of stuff seems to be moving about, particularly images.

I do know that a couple of days ago, you could search for specific brands and find their listing with a competitors product image totally not related to the search query.

I am willing to forget all the other bugs in the process with the exception of the images malfunction. That is one of those things that makes you overlook all the other cuil stuff that "may" be going on.

I'm still not backing off. Not until I see that "criminal mugshot" go away that has been eerily moving about the results for a particular name search. I may find myself thinking that someone is playing games with your results based on what I'm seeing.

gibbergibber

7:44 pm on Jul 31, 2008 (gmt 0)

--Cuil ripped those images, placed them on their own server, and are using some kind of categorization to bring them up. --

As a commercial site using those images in a commercial product, that's just asking for a lawsuit.

How do they think they can get away with it?

Look at all the fuss Google had to go through with keywords containing trademarks, and that was just the words. Imagine if it had been images.

makemetop

7:44 pm on Jul 31, 2008 (gmt 0)

Having been deeply involved in search markets where Google is considered second rate (at best) - I find the willingness to bash a newbie interesting - but not unexpected.

Fact is, they made a HUGE screw-up on their launch and it has to be down to VC and marketing department pressure.

I agree that their results are not great (but for Joe Public they are fine as far as they have commented to me).

I completely agree that the image association is the probably the greatest foul-up I've ever seen (and a greater copyright infringement than Google's cache).

I also agree that this has got to be in the "Edsel" arena of marketing screwups.

But if- and it is a BIG if - the average user got to this site - I think they would like it.

Crap results for us - are not always crap results for the surfing public.

Just my opinion (and I'm often wrong) - but then, I was a great supporter of Inktomi!

[edited by: makemetop at 7:46 pm (utc) on July 31, 2008]

pageoneresults

7:45 pm on Jul 31, 2008 (gmt 0)

How do they think they can get away with it?

They can't. That is the next phase of the marketing campaign. Remember, this is a "negative press campaign" and there are many levels that still need to be reached. Phase II: Litigation Proceedings < Just guessing...

zett

8:22 pm on Jul 31, 2008 (gmt 0)

Brett:

This is the first viable and brightest search engine launch since Teoma (6+years ago). Damn - it is nice eh?

I probably miss the part where this launch was "bright". The name? The availability? The layout? The relevance of results? The images?

I understand that anyone (including me) was excited about the appearance of a Google competitor, but the launch was, er, far from perfect. They could have done sooo much better! Sad.

quiet_man

8:31 pm on Jul 31, 2008 (gmt 0)

using those images in a commercial product

Commercial product? One of the remaining questions is exactly how they intend to make money. Right now there's no PPC, no banners, no paid inclusion and little chance of a buyer. I don't think you could call it a commercial product at the moment.

zett

8:32 pm on Jul 31, 2008 (gmt 0)

That is the next phase of the marketing campaign. Remember, this is a "negative press campaign"

Nah. Sure, it may drive traffic. But it is expensive traffic. The damages for infringing copyright protected works is up to $150,000 per infringement. I found 32 images so far (and I did not look closely), hmmmmm, that's up to $4,800,000 in damages. Now, what was that "negative press campaign" again...? ;-)

zett

8:36 pm on Jul 31, 2008 (gmt 0)

I don't think you could call it a commercial product at the moment.

The fact that a site does not monetize (but attract eyeballs) does not really matter in infringement cases. Cuil is clearly not covered by "fair use".

cmarshall

9:05 pm on Jul 31, 2008 (gmt 0)

Brett, by any chance, do you think anyone at Cuil might know the IP range that you use?

It's pretty obvious that your experience is vastly different from everyone else's.

BTW: Regular schleps hate it too. I know a number of non-techhies that gave it one try and canned it for good.

Brett_Tabke

9:19 pm on Jul 31, 2008 (gmt 0)

cmarshall. No - I banned the bot early on and never got around to undoing it. I will at some point.

I see what everyone is seeing, but it is a baby search engine learning to crawl. Obviously, they have some work to do. Bitching and moaning about it, is not going to change that. They have little traffic at this point and we shouldn't throw out the baby with the bath water. I don't care what they are showing right now. I haven't done more than a hand full of searches and probably won't give them another look for a few months - I'm going to give them some time to work it out.

robho

10:06 pm on Jul 31, 2008 (gmt 0)

I'd love to see a rival to Google, but Cuil isn't it.

Although some of my larger sites do quite well for some terms (and I've even received a little traffic from Cuil), the vast majority of searches for more obscure terms are simply full of spam. And the result count, even for one page, is always wrong.

One thing I haven't seen mentioned is that on some browsers Cuil is unusable. Take for example Opera on a Nokia 770 - the search button doesn't work for me (press enter instead) - and the results can't be scrolled (their javascript removes scroll bars) so only the first few results are shown with no way to see more.

Unless they plan to release a mobile version without all the javascript/ajax stuff they're missing out on the main growth area - mobile.

Even on a normal browser like Firefox wheel scrolling doesn't work if the cursor is in their fixed bars at top and bottom, making the user wonder why the wheel broke.

These interface problems though would be unimportant if the search actually worked, but the results are simply unusable for anything other than major keywords (which are maybe hand tuned). Plus, the snippets shown are months old.

Pity, a Google rival is really needed.

cmarshall

10:34 pm on Jul 31, 2008 (gmt 0)

Okay, I'll follow Brett's lead, and get off the bigpile.

I really do want to see an effective competitor to Google.

Reno

10:55 pm on Jul 31, 2008 (gmt 0)

... it is a baby search engine learning to crawl. Obiously, they have some work to do. Bitching and moaning about it, is not going to change that.

I certainly agree with the first part of that statement -- they need time to work out the considerable glitches. I think most people would be perfectly willing to wait months or even a year (or more) for them to perfect their algo so their SERPs have more quality consistency. [We're STILL waiting for MSN/LIVE!]

And as regards to their layout/format -- they have every right to present that any way they want. All responses here or anywhere else -- positive or negative -- would be subjective, not objective.

But as to the complaints, much of what I'm reading are simply observations/feedback to an over-the-top PR campaign. Someone should have told them:

"Be careful what you say -- you may be held to your own pronouncements."

.......................

IanKelley

11:02 pm on Jul 31, 2008 (gmt 0)

much of what I'm reading are simply observations/feedback to an over-the-top PR campaign

That's the part that got me. You expect some hype but the level of outright BS and unfounded arrogance was just too high.

Follow that with, possibily, the biggest collection of mistakes at launch in the history of search and it's no surprise that people don't want to cut them any slack.

Yeah we all want to see more competition but it seems unlikely that Cuil is going to fill that role any time soon.

docbird

2:17 am on Aug 1, 2008 (gmt 0)

it is a baby search engine learning to crawl.

more like a purportedly fully-fledged search engine, that hopped out of the nest, roared "Hello world, I'm huge and I'm gonna soar!", then plummetted to the ground, joining the remains of startups that never really flew.

Obviously, they have some work to do.

- same could be said of all alternative search engines, indeed any company that has less than prime or even lousy product/service. Doesn't mean the work'll get done.
Even google has some work to do.

A bird emerges from the nest, no one expects it to swoop n soar n do barrel rolls, but if you can't flap as far as the next tree, start being able to feed yourself, you won't be around long.

Not sure if folk overly patronising re Joe Public; if so incapable of telling when searches decent, so-so or pap, I don't think google would have become so dominant. Plus, cuil not in zone of being so-so, so need to look hard and be smarty pants web expert to know things could be better; it's terrible.

minnapple

2:23 am on Aug 1, 2008 (gmt 0)

Cuilyoo!
Cuil offers us fantastic fun.
A new algo to mess with is always welcomed!
I hope it becomes even somewhat popular.
It smells like money to me. : )

To Start, Cuil give us a quarter for each "search Cuil" button we put up!

cornwall

8:00 am on Aug 1, 2008 (gmt 0)

The is an amusing article on The Register

[theregister.co.uk...]

They take the premise that any new search engine will eventually have to reverse engineer Google and conclude

Web spammers are looking for an audience. As the most popular search engine, Google controls what its audience sees, so the junk jockeys generate their pages in ways that game the system. Everyone else legitimately seeking those same eyeballs for their content or customers for their business want the better search rankings too, so the SEO crowd works to make legit sites dance to Google's tune. Techniques such as adding needless internal links, creating PageRank-friendly URLs and distorting normal grammar are all widely deployed with varying degrees of dastardliness.
And so it goes. Gradually the structure and content of the web becomes at one with the Google data centre. Disrupting such a tight, interconnected mutualism seems impossible for would-be "Google Killers". The best others can hope for is to imitate Google results.

StoutFiles

10:53 am on Aug 1, 2008 (gmt 0)

Cuilyoo!
Cuil offers us fantastic fun.
A new algo to mess with is always welcomed!
I hope it becomes even somewhat popular.
It smells like money to me. : )
To Start, Cuil give us a quarter for each "search Cuil" button we put up!

I really don't think Cuil has much money left to spend.

This 491 message thread spans 17 pages: 491