Great Debate: Peer-to-Peer (P2P) Search Part I

October 2nd, 2007 by Charles S. Knight
Posted in Debates | 10 Comments »

Every Tuesday night on AltSearchEngines, we invite two Alternative Search Engines to discuss a common topic for our benefit – and yours.  The topic tonight, Peer-to-Peer or P2P Search, is more complex than most other categories, and we are fortunate indeed to have two experts to explain this important area to us: Wolf Garbe of FAROO, and Jeremie Miller of Wikia.

ATLASFAROO

1) Architecture: How is your search engine different from today’s general search engines? Briefly, what does the architecture of your search engine looks like?

FAROO: FAROO is a web search engine based on peer-to-peer technology.

The users are connecting their computers, building a worldwide, distributed P2P web search engine. No centralized index and crawler are required anymore. Every web page visited is automatically included in the distributed index of the search engine.  Installing our software, you become immediately part of the distributed search engine.  FAROO’s distributed core architecture is fundamentally different from the centralized approach of today’s search engines.

ATLAS: Atlas breaks search into three distinct groupings, crawling (the Factory), index/ranking (the Collector), and query handling (the Broker). The primary difference is that there can be multiples of any of these groups all working together or competing.

2) Distribution/P2P: In what aspect is the architecture distributed?  What are the benefits of this?

FAROO:  FAROO is using a fully distributed architecture: distributed index, distributed crawler, distributed ranking, and distributed search.

Search, as the most frequently used Internet application, will be distributed, and thus follows a principle, which the whole Internet is based upon successfully. The distributed architecture provides cost advantages, better scaling, less intrusive crawling, democratic ranking and improved privacy protection.

* Each of the major search engines requires hundreds of thousands servers. We don’t need any hardware at all. This means huge saving of infrastructure costs, allowing us to share revenues with our users.
  * The Internet is increasing steadily, and so also is the amount of required hardware in order to index all these new web pages and to serve the new users. In FAROO’s distributed architecture the users become part of the solution of this problem. Therefore FAROO scales with the growth of the Internet.
  * FAROO indexes web pages without a dedicated crawler, therefore additional traffic for users and web servers is avoided.

ATLAS:  Each entity within Atlas, whether it be the Factory, Collector, or Broker, can be entirely distinct and independent, and will likely be different companies or groups altogether. There is nothing more distributed or beneficial on the Internet than many independent services connecting through an open protocol.

3) Crawler: How does your distributed crawler work?

FAROO:   We changed the way a crawler works. There is no traditional crawler at all.  Every web page visited by one of our users is automatically included into our distributed index, and instantly searchable for all other users.

ATLAS: Atlas supports any number of crawling projects, either web-wide or just specific segments. The Factory that exposes any crawled information must also add value by understanding the content it is crawling and providing contextual hints and meta-data to the Collector for indexing. A Collector should therefore have many relationships with Factories, and choose the ones doing the best job crawling and adding the most value.

4) Ranking: How does your ranking algorithm work?

FAROO:  FAROO is using an attention based ranking. If users spend a long time on a page, visit it often, put it to bookmarks or print it out, this page goes up in ranking.  For the first time the ranking of the web pages is automatically done by the target audience itself. This leads to a more democratic, user centric ranking, while resistant against rank manipulation.  Additional ranking parameters ensure a proper ranking also during the start with relatively few users and for freshly indexed pages.

ATLAS: Since Atlas is just a protocol that any Collector can support, everyone must compete to provide the best rankings for the data they are indexing. There isn’t any single best algorithm, but instead there will likely be many thousands, all working on either different types of content, media, or locality. An Atlas Broker must select the best Collectors (and it can get rankings from multiple ones and merge them) for the users and queries they represent.

5) Do you use the “wisdom of the crowds”? If so, how?

FAROO:  When it comes to understanding, valuating, and rating of content, the human mind is still unsurpassed. Therefore FAROO uses “wisdom of the crowds” in two ways, for ranking and for crawling.  An algorithm may distinguish between original content, trivial content and spam. But when it comes to more subtle distinctions, we are better off trusting our own species!  And, it’s no surprise that even the well known PageRank uses indirect human judgment, as it is based on the popularity of a page among webmasters.

FAROO’s user generated ranking goes a step further, as it is based on the popularity of a page amongst all users. And this is done automatically.  In this way many more people get involved then with current ranking methods, where either only webmasters are entitled to vote or a manual voting is required.

FAROO also uses user powered crawling. Pages which are changing often like, for eaxample, news, are visited frequently by users. And with FAROO they are therefore also re-indexed more often. So the FAROO users implicitly control the distributed crawler in a way that frequently changing pages are kept fresh in the distributed index, while preventing unnecessary traffic on rather static pages.

ATLAS:  Within Atlas there will likely be some general feedback mechanisms, particularly in handling spam. Every entity with Atlas is encouraged to become more intelligent by involving end users in some way, but there are no specific mandated features to do so.

If you have any questions about Peer-to-Peer (P2P) Search so far, please leave a comment now.

Next Tuesday night we will bring you Part II – the conclusion of this interesting debate!

You can help to build a musical search engine!

October 2nd, 2007 by Charles S. Knight
Posted in Reviews | 1 Comment »


What is Listen Game?

Listen Game is a game that lets you tell others what you think about music. It also lets you hear what other people think about music.  Music is a subjective form of art. We all have our own unique experience when listening to a song. Even though you might think that “Hangin’ Tough” by New Kids on the Block is “explosive and revolutionary,” your friend may think it’s “cheesy and annoying.”  
Play Listen Game and see if the world agrees with you!

Why should you play Listen Game?

It’s fun!  Also, while you play Listen Game, they listen to what you say about music. They use your opinions to build a musical search engine. Just a like an Internet search engine that finds web pages, their search engine uses your musical taste to help you find music that you want to listen to.

How do you play Listen Game?

1) You and players from around the world simultaneously listen to short music clips.
Each Listen Game consists of seven clips.

2) While you listen, they list 6 relevant words.
 
3) Pick one word that best describes the music and another word that also describes the music.

4) Score points by choosing words that other Listeners agree upon. It’s your opinion – there are no right or wrong answers.  The site also records players with the highest scores. Shenry is killing them!

Legally Blonde – The Alternative Search Engine

October 2nd, 2007 by Charles S. Knight
Posted in Reviews | 1 Comment »

Once in a while a new alternative search engine comes along that is so surprising, so dramatically different, so awesome, that it jolts you out of your chair and renews your confidence in the struggle of the “Alts” vs. the Majors.

This is not one of them.

I’m Amy, welcome to Pinkle – my little oasis of pinkness in an otherwise dull world.

I love pink. I have pink shoes, pink clothes and pink wallpaper but Google was still boring white. So I created Pinkle, same great Google search results but more cute pink! Help me make the world more pink by bookmarking Pinkle or making it your Homepage. You can also tell a friend about Pinkle and really start painting the Net pink.

Love, Amy
Always Pretty in Pink.