Search 552 bilingual combiniations with 2lingual

July 2nd, 2008 by Charles S. Knight
Posted in Global, Reviews | No Comments »

2lingual makes it possible for users to bilingually search the World Wide Web, Images, Videos, Wikipedia and the Blogosphere. It’s possible to search the Web in 552 unique bilingual combinations.

Media Wombat Flash Search Engine

July 2nd, 2008 by Charles S. Knight
Posted in Innovations, Newcomers, Reviews | 2 Comments »

Media Wombat is a search engine designed to find and display content embedded inside of Adobe flash (.swf) files.   They crawl the internet looking for SWF files, then grab them, rip them apart, index them and then allow for the contents of them to be searched.

Google and other search engines don’t index the insides of any files mainly because they have to spend resources (CPU/Disk) pulling the SWF file apart and actually host the contents of those files after unpacking the content. Google indexes the text of a web page and points back to the original URL (quick-n-dirty), but if google finds a ZIP file, they won’t index the files inside of that zip file for the same reasons.

Flash content is everywhere on the internet nowadays, from youtube.com to nbcnews.com to disney.com. It allows for webpages to engage the user with a much more dynamic experience, but the problem is that to a crawler (aka, Google, Yahoo, Ask, …) it’s just another file that they throw away. Our technology actually takes as many parts of the Flash contents as posible and indexes those parts.
Ads

“Adsense” doesn’t work with Flash because Google doesn’t see inside of the flash files, so it doesn’t have anything relevant to use for the context-senstive part of AdSense. Were working on an “Adsense/Adwords” ad distribution system that does know about the inside of flash content to bring context-sensitive ads to flash-heavy websites.

Media Wombat believes that there are several types of users who would be interested in searching for Flash content.

* Flash Developers – “I need a button image or a beep sound to use in my own flash animation.”
* Online Video Game Players – There are numerous Flash games online that can be found via our search.
* People in advertising/marketing industry could use the site to view/find Flash based ads.
* General Users – We (Media Wombat) could promote the idea that there is “hidden and exciting” content on the web that the other search engines don’t know about.
* Image Search
* MP3 (sound) search
* FLV (video) search

 

Scientifically Measure Search Precision and Recall

July 2nd, 2008 by Guest Author
Posted in Guest Authors, News | No Comments »


By Guest Author Kathleen Dahlgren, PhD

This post is to add to the dialog about precision and recall which are standard measures of Search engine performance. Precision is a measure of retrieval accuracy calculated by dividing the total number of relevant retrievals by the number of all retrievals generated by the Search.  Recall is a measure of the extent to which relevant material in the total document base is found.  It is calculated by dividing the number of relevant retrievals by the total number of potentially relevant retrievals in the document base.

Pattern-matching technologies perform with both low precision and low recall (typically under 20% for both).   The TREC (Text Retrieval Conference), sponsored by the  National Institute of Standards and Technology (NIST), is a recognized source of precision/recall testing for various technologies, including pattern-matching and statistical approaches.  In TREC’s legal track competition in 2007, there were 13 technologies participating.  Their precision performance ranged from under 1% to 23% and their recall performance ranged from under 1% to 22%.

While Cognition did not participate in the TREC competition in 2007 (but is participating in 2008), it did conduct its own internal precision/recall tests on a wide variety of document bases (similar to the TREC data) and Websites.  These included the National Library of Medicine’s MEDLINE™, the public domain Enron fraud case, the public domain Microsoft anti-trust case, the BBC World News Website (http://news.bbc.co.uk/),  and the Global Issues Website (http://www.globalissues.com), among others.  For each test, 50 queries that were considered likely to be asked by users of the data/Website were formulated and posed to a CognitionSearch Search function on the sites’ documents.  Relevancy was judged for a sample of 20 or fewer retrievals and extrapolated.  Cognition’s precision exceeded 90%.  Recall was measured relatively.  In other words, full recall was taken to be the total of all relevant retrievals returned by any of the Search engines used in the particular test.  Cognition’s relative recall in these tests exceeded 90% relative recall.

We and other semantic technologies believe that by employing Semantic NLP technology, Search results will achieve significantly better precision and recall than pattern-matching or statistical approaches.

Search for Country Flags with Flags of the World

July 2nd, 2008 by Charles S. Knight
Posted in Reviews | No Comments »

Searching for a specific country flag? Use their search feature to find it.

Cool World Flag Facts:

The American flag has 50 stars. Each star represents one state. It also has 13 red and white stripes which represent the original 13 colonies.

The tri-colors of the Mexican Flag were adopted after they won independence from Spain.

The large red circle on the Japanese Flag represents the rising sun.

The Nepalese flag is the only country flag in the world that is not rectangular. It is actually composed of two pennants (one sideways triangle atop of another).

The largest star on the Australian flag has 7 points and is known as the Commonwealth star. The smallest star on the flag only has 5 points.

The Lion on the original Ethiopian flag represents the Lion of Judah.