In Stealth: Yebol knowledge based semantic search engine

yebolYebol’s mission is to build a human-like world’s knowledge base and provide knowledge based search (semantics) and services. Yebol utilizes a combination of patented algorithms paired with human knowledge to build a Web directory for each query and each user.

Instead of the common “listing” of Web search queries, Yebol automatically clusters and categorizes search terms, Web sites, pages and contents. Yebol allows for a multi-dimensional search result instead of the normal one-dimensional search seen by most web search engines today.  This provides a more accurate summary of top sites and categories; a wider array of related search terms; a longer and richer expansion for query results; and a deeper base of links and keywords in search result pages.

Unlike current search platforms, Yebol provides hundreds of easily identified and accessibly categorized results in one easily navigable page.  Our results not only provide users with a rich search experience, but also help them quickly scan the results to find targeted information.  In essence, we generate a unique “homepage look” for each search query.

Yebol provides a productive experience for both main types of search – general investigation of a topic and specific target search.  For users who want to research and explore a topic, Yebol provides rich information from several dimensions.  For users who want a specific type of targeted results, they can quickly focus on their category and navigate to the target.

Yebol was founded by Hongfeng Yin, Ph.D., who has more than 20 years research and development experience on artificial intelligence, pattern recognition, data mining and search.  Most recently, Yin worked for more than four years at Yahoo! Data Mining Research team, where he built the core behavioral targeting technologies, products and more than a thousand models, which generate multi-hundred millions revenue.

In the 1980s, scientist Tsien Hsue-shen (Qian Xuesen), a co-founder of the Jet Propulsion Laboratory and father of the Chinese space program, advocated scientific investigation on human thinking – noetic sciences. Hongfeng Yin, then a graduate student at Chinese Academia of Science under professor Dai Ruwei, proposed a human think model with simulation methodologies based on Tsien’s proposal. Tsien regarded Hongfeng and Dai’s paper “On thinking and simulated intelligence” as a great epochal groundbreaking paper. At that time, scientists from Japan and other countries proposed intelligent machine plans. To build a truly intelligent machine, Tsien and Hongfeng believed that the research should focus on human intuitional thinking instead of logical computing. Hongfeng focused his investigation on associative memory models with artificial neural networks to simulate the intuitional thinking. In 1990s, Tsien and Dai proposed giant complex intelligent system theories and pointed out these systems should be constructed with human and machines together.

Since the beginning of this century, Hongfeng has explored the possibility of applying this research and the resulting theories to build a human-like world’s knowledge system for web search. While human-generated knowledge systems, such as Wikipedia, web page directories and so forth, are fairly easy to create for web search applications, these systems are limited in size to only millions of items even with massive human participation. On the other hand, machine-based knowledge systems can process trillions of items. Until now, the main difference has been that the results for machine-based knowledge systems are not as accurate as human generated results. To bridge the gap, our approach is to combine human intelligence with machine algorithms to build a world’s knowledge base. Actually, we believe that search services based on the knowledge base is the intelligent machines which were proposed in 1980s.

The invention of computer has brought about an information technology and information revolution. Information can now be generated at an explosive speed. To effectively use this information, it is necessary to have a means of converting it into knowledge. To fully harness the incredible advances in information computing, there must be a way to make this information easily accessible for people to utilize. In the last few years, new technological developments have finally made this conversion of information to knowledge a realistic possibility. Cloud computing, map/reduce/Hadoop infrastructure and data mining algorithms have created a knowledge revolution and brought us to a new era in knowledge technology.

What is the main difference between information and knowledge?

Information is usually unstructured, static and redundant and it is data centric. Knowledge is structured, connected, categorized, ranked with meaning and it is human centric.

At the beginning of the Internet, there were few web sites and pages. Pioneers like Jerry Yang and David Filo just listed the web sites. When more information was on line, Yahoo! developed a web directory to search the information. When the Internet grew to hundreds of millions web pages in the late 1990s, early search engines took the major role for finding information by listing all relevant web pages for a query. This has not essentially changed in the past decade. Currently, one query may generate a list of millions of results. From a data structure point of view, the linear list is not a good structure for finding information from a large data set. The optimal structure for human use is a tree or graph structure for efficiently finding targeted information. Until now, this optimal structure has not been possible to utilize because, on the one hand, it is not scalable to build web search directories for all queries with only human power while, on the other hand, pure algorithms cannot generate human desired results. Until now…

Our approach and what makes our efforts so groundbreaking is our integration of algorithms with human knowledge to build a web directory for each query and each user. We have developed a group of algorithms of association, clustering and categorization for automatically generating knowledge for search concepts, web sites, web pages and users. We also integrate human labeled information with our algorithms.

We expand the search in several dimensions: higher – summary of top sites and categories for queries; wider – related search terms; longer – results of expansion terms for the queries; deeper – inside links and keywords of search result pages;

Using the Amazon.com clouding computing service, we have built a knowledge base for about 10 million concepts over 1 billion web pages. Our technologies are scalable to build a knowledge base of 100 million concepts over 10 billion web pages in the next phase.

The knowledge based search can fundamentally change the search result presentation, finally expanding beyond the confines of the linear structure. Unlike 10 links in a traditional search page, we can present hundreds of links with categories in a similar space of a page. This can largely reduce users’ time to find desired information from the search results. It also provides rich channels for users to research and explore a topic.

Currently, to complement our knowledge based search, we provide traditional search results for the queries that are not in our knowledge base. In a few years, we will be able to provide non-linear knowledge results for the majority of potential searches.

We are just at the beginning of the knowledge revolution. In a few years, we believe that the majority of searches will return knowledge results. However, the impact of this technological development will expand beyond simply human web search applications. This revolution in knowledge technology will make possible the generation of truly intelligent machines which are going to greatly impact our industries and our life.

Yebol is looking for a:

Sr. Marketing Manager
Responsibilities:

* Define and execute the marketing strategy of Yebol’s new search engine.
* Develop and execute marketing initiatives to drive awareness, activation and usage of Yebol Search.
* Implement strategy of launching new products and services.
* Work with external marketing/advertising agencies to develop customer-centric communications.

Requirements:
* 5 – 7 years of business experience.
* 3 – 5 years of Internet product marketing and marketing programs experience.
* Direct experience launching consumer web-based products or services and managing broad-reach consumer marketing campaigns.
* Strong communications and presentation skills.
* A bachelor’s degree is required.

Software Engineers
Responsibilities:

* Build large-scale, clustered, distributed and high-performance data and computing platform for the next generation of web search.
* Crawl, maintain and analyze multi-billion documents for link analysis, knowledge discovery and data mining.

Requirements:
* BS/MS Computer Science (or equivalent), 4+ years experience (PhD a plus).
* Experienced in: C++/C, Java, Perl multi-threaded programming
* Experience with large-scale, complex distributed Linux/Unix applications, cluster management systems.
* Experience with search engine systems, crawling, indexing and ranking.
* Text analysis, information extraction, information retrieval, data mining, machine learning or related experience

If you are passionate about new search technology, please send your resume to: jobs at yebol dot com

Source: Yebol

Leave a Reply