Why Don’t Search Startups Share Data? Part 2

August 24th, 2007 by Charles S. Knight
Posted in Guest Authors | No Comments »

 

Thursdays on AltSearchEngines we welcome a Guest Author to our blog. Yesterday and today we are fortunate to have Bob Warfield of the SmoothSpan Blog.

Bob recently published a two-part series entitled, Why Don’t Search Startups Share Data? (aka Open Source Style Web Crawling) One of the basic tenets of AltSearchEngines is to encourage greater communication and cooperation amongst the Alternative Search Engines for their mutual benefit.

Here’s Part II:

I mentioned in an earlier post that search startups ought to look into a divide and conquer approach when crawling the web.  After all, one of the biggest complaints about a lot of interesting search services is they don’t find as much as Google does.  TechCrunch, for example, complains that Microsoft’s new Tafiti produces search results that are “not as relevant as Google or Yahoo“.  And yet, they also admit Tafiti is beautiful (as an aside, it is very cool and worth a look to see what Microsoft’s Flex killer, Silverlight, can do for a web site).  If the Alt search sites band together to do the basic crawling and crunching using Google’s MapReduce-style algorithms (possible based on the Open Sourced Hadoop Yahoo is pushing), they could share one of the bigger costs of being in business and ameliorate the huge advantage in reach that the biggest players have over them.

ZDNet bloggers Dan Farber and Larry Dignan ask whether Open Sourced Hadoop can give Yahoo the leverage it needs to close the gap with Google.  Their first words are that “Open source is always friend to the No. 2 player in a market and always the enemy of the top dog.”  I don’t think Hadoop by itself is enough, but if Yahoo were to create a collaborative search service, maybe it would be.  In fact, what if search was much more like Facebook only more open (Hey, if Scoble can do it with a hotel, I can do it with a search engine!)?  In a manner similar to my “Web Hosting Plan for World Domination“, Yahoo could undertake a plan for “Search Engine World Domination”.  Here’s how it would work:

-  Yahoo builds up the Hadoop Open Source infrastructure for Web Crawling.  Alt Search engines can tie back into that to get the raw data and avoid doing their own crawling.  Even GigaOm says “The biggest hindrance to any search start-up taking on Google (or Microsoft, Ask or Yahoo for that matter) is the high cost of infrastructure.“  Let’s share those costs and further defray them by having a big player like Yahoo help out.

-  Yahoo can also offer up the Hadoop scaffolding to do any massively parallel processing these Alt Search Engines need to compute their indices.  Think of it as being like Amazon’s EC2 and S3, but purpose-built to simplify search engines.  People are already asking Amazon for Search Engine AMI’s, so there is clearly interest.

-  Now here is there Facebook piece of the puzzle:  Yahoo needs to turn this whole infrastructure play into a Social Networking play.  That means they offer Search Widgits to any Social Network that wants them, and they let you personalize your own search experience by collecting the widgits you like.  Most importantly, Yahoo creates basic widgits that reflect their current search offering, but they allow the Alt Search Engines to make widgits that package their search functionality.  Take a look at Tafiti and see how it let’s you select different “views”.  Those views are widgits!

-  Yahoo gets a big new channel for its ads, and it gracioulsy shares the revenues with the Widgit builders because that’s what makes the world go round.  Perhaps they even have virtual dollars that can be used to pay for the infrastructure using ad revenue, although I personally think they should give away as much infrastructure as possible to attract the Alt Search crowd to their platform. 

Don Dodge, meanwhile, is wondering what the exit strategy is for the almost 1,000 startups out there trying to peddle alternative search engines.  It sure seems to me that creating this search widgit social network world solves a big problem for Yahoo and at the same time creates a lot of new opportunity for the exit strategy of these engines.  Suddenly, they have access to large volumes of data they couldn’t afford and a distribution channel in which to build an audience. 

Open Source Swarm Competition in the Search Engine Space is Born!

A Special Search Engine of the Day: Fooooo

August 24th, 2007 by Charles S. Knight
Posted in Alts, Newcomers | 2 Comments »

This week has been about Online Video over at Read/WriteWeb. A perfect example of the relationship between Read/WriteWeb and this blog, AltSearchEngines, is our contibution.

First, we featured our Top 10 Video Alternative Search Engines as our Monday Vertical.

Second, we hosted one of our “Great Debates” in the middle of the week with this topic, Video Search. Our participants were Mary Hodder of Dabble, and Gary Baker of ClipBlast!, both “Top 10″ alternative video search engines.

Third, since we feature a Search Engine of the Day, today we are taking a closer look at Fooooo, a newcomer to the Top 10 list. I hope that spotlighting Fooooo helps to show our commitment to all alternative search engines.

Regarding the Search Engine of the Day feature, one reader contacted me to say that they often lacked detail. The explanation of “What’s in a Name” might be in another post, and issues such as funding might be missing altogether. Put them all together, this reader advised, and make it a more robust feature.

An excellent idea!I hope that if you have a suggestion for AltSearchEngines that you will also send me an email: Charles@ReadWriteWeb.com.

So, what shall we say about Fooooo?

Fooooo is a Japanese Video Search Engine. It is run by the Bank of Innovation in Tokyo, Japan. In this building :-)

And while it boasts several international versions, it actually pulls from one common index (42 million videos from video sites such as YouTube and MySpace; it is not a true web-crawling search engine) with multi-lingual homepages.

David Vogelpohl of Marketing Pilgrim, in his review of Fooooo, gave it a score of 7/10. His primary recommendation was that they need to divide their index according to language, and not just the homepages, an excellent point.

The search results link you to the source of each video (it does not host them), or you can use the ‘Quick Start’ link and play the video right away. When I “tested” Fooooo by looking for one easy video (The Machine is Us/ing Us) and one difficult video (SNL’s “Tiny Elvis” skit), Fooooo returned Tiny Elvis as result #1.

Fooooo also has a Facebook application:

When I searched the “About Us” information, it was all in Japanese and looked like this on my screen!:



So I went to Google Translate (yep, Google is very good at some things), and it came out like this:







Regarding funding, it turns out that Fooooo received funding just two weeks ago, according to alarm:clock.

“Daisuke Tanaka, Founder of the Japanese Video Search Engine Fooooo, gets in touch to let us know that he has raised $100K in seed financing from CyberAgent, a Japanese IT venture company listed on the Tokyo Stock Exchange.”

But the hardest partof my review was trying to figure out what the heck “Fooooo” means!

One possibilty is that it belongs to a Japanse wrestler / TV star known as “Hard Gay” Razor Ramon. One blog post says, “In Japan, he’s known simply as “Hard Gay” or “HG.” His trademark move is hip-thrusting and ending every other sentence with a scream of “Fooooo!!!”


Another usage seems to be the trash American equivalent of “Fool,” the way that our TV character Mr. T’s signature expression was, “I pity the foooool.”

It may also be a variable used in quantum physics, similar to Google’s mathematical roots.

UPDATE FROM Fooooo:

Hello Charles,

Thanks for introducing “Fooooo”.
I hope I can answer your question.

First, we pronounce “Fooooo” as [ fu: ].

Second, Fooooo’s “F” means “find” and “five”,
and “ooooo” means “five continents”, like olympic symbols.

Thanks, Daisuke