Why Don’t Search Startups Share Data? Part I

August 23rd, 2007 by Charles S. Knight
Posted in Guest Authors | 3 Comments »




Thursdays on AltSearchEngines we welcome a Guest Author to our blog. Today we are fortunate to have Bob Warfield of the SmoothSpan Blog.

Bob recently published a two-part series entitled, Why Don’t Search Startups Share Data? (aka Open Source Style Web Crawling) One of the basic tenets of AltSearchEngines is to encourage greater communication and cooperation amongst the Alternative Search Engines for the mutual benefit of all

Here’s Part I (tomorrow is Part II):

New Jersey Search Engine startup Accoona is filing for IPO after just a few short years of operation. Whether you think they’re a good investment or not, there’s definitely some feeling there’s gold left in them thar hills and the Googleplex hasn’t taken it all yet.

At the same time, there is an interesting discussion on Skrentablog that asks if there are 100 alternate search engines how come only about 11 seem to be crawling the net? Accoona is one of the 11 actually detected, BTW. Richard MacManus is similarly perplexed, and his Read/WriteWeb blog post considers some possible explanations, but winds up baffled.

I appreciate the mystery. Are these niche search engines that just haven’t hit all the sites? Are they purchasing their index from somewhere? Do they know some other way besides crawling to create an index? Are they really smart about not crawling until a page changes somehow? And does this deafening silence from crawlers indicate that the Googles and Yahoo!s of the world really have sewn up the search world, that you have to be big to do it, and that others need not apply? Read the rest of this entry »

Search Engine of the Day: Trovit

August 23rd, 2007 by Charles S. Knight
Posted in Alts | 8 Comments »

There is nothing new about searching through classified ads online, for example oodle or the ubiquitous Craigslist. They are not web search engines, they don’t crawl the web. They just search through websites full of ads. People continuously post their ads to their local newspapers, etc., and that’s what keeps the content fresh on sites like these.

Trovit does the same thing, and in fewer categories, just homes, autos, and jobs. You cannot send your ad to trovit, you post it on a local website, and then the trovit spiders come along and pick it up.

The moral of the story is that there are search engines and then there are search engines. What matters is simply knowing what the universe of data is that is being searched. Yesterday the universe was 4,700 numismatic websites, with trovit it is a certain number of classified ad websites, which trovit separates by category and by country.

But the index, as we know, if only half of the story. The interface is always the other half. Trovit has a very clean interface that flows particularly well. You mouseover the country that you want, then you click on the category that you want, and then a homepage for exactly that country and category comes up; that’s when you conduct your search.

Here’s a quick tour:

.
First, select your country:

and then select the category: homes, cars, or jobs.

In this case, I have selected jobs in Italy. And here is a result:

Do you live in one of these countries?

Please give trovit a test drive and let us know what you think of it.

So, what does “trovit” mean?

“trovi” is “find” in Esperanto. Add “it,” so trovi+it, trovit means find it!