The Obama administration is going to “listen to citizens.”

July 1st, 2009 by Guest Author
Posted in CEO Views, Guest Authors, Semantic | No Comments »

logo

By J. Brooke Aker, CEO, Expert System USA

The Office of Public Liaison in the new Obama administration is promising to listen to citizens as it considers policy direction, legislation and brings the people to Washington rather than bringing Washington to the people.  The most concrete of these proposals is to allow a five-day comment period by citizens via the Internet before the President signs any legislation.  Even now, anyone can offer an opinion directly to the President here.   You can contribute up to 5,000 characters.  That is roughly 400 words. 

The windows are open in the White House and a new breeze of inclusiveness is blowing right in.  This is certainly a change over the previous eight years when the White House was shut tight with the air inside growing staler by the day.  But I wonder if the administration is prepared for the hurricane force winds that could result?

If you ask for comments on pending legislation, how many comments will the White House get?  There are some hints from around the blogosphere.  In February, a search conducted on Technorati for a count of the word “bailout” over the previous 6 months resulted in the following chart: chart

The peak of more than 14,000 blog posts was around the passage of the first muti-billion bank bailout in the early Fall.  An estimate of the average around this spike looks to be roughly 6,000 posts per day.  As the debate and finalization of the ARRA (American Recovery and Reinvestment Act) and second half of the bank bailout money is finalized, you can be sure the number will spike again.  But let’s be conservative and assume 1/3 of the average would like to comment directly to the White House on the ARRA over the five-day period promised.  That would be 10,000 comments President Obama says he will consider before signing the legislation.  The current estimate of U.S. bloggers is 22.6 million so 10,000 comments may only be a drop in the bucket.    

Short of a small army of readers, how will Valerie Jarrett and her staff understand this “wisdom of the crowd” input?  We do know that President Obama has hired some tech vets to lead this kind of effort. 

Chief among these is a former Google product manager Katie Jacobs Stanton who is the new President’s “director of citizen participation.”  It is not just a coincidence that Ms. Stanton was in charge of Google Moderator. 

A quick look at this tool reveals the ability for anyone to post a question (or I suppose a comment) and then have others vote for its importance relative to all the other questions posted.  Looking through the questions posted around the Presidential debates is another estimate I can find that might look like what the White House will experience.  The breakdown of topics, questions asked, votes recorded and citizens participating look like the table below.

 

Votes

Questions

People

Education

6,926

96

1,183

Health Care

3,483

81

412

Iraq War

3,513

64

488

Economy

7,534

209

580

Environment

3,078

73

317

Foreign Policy

3,699

101

339

TOTAL

28,233

624

3,319

 

Ok here is the rub.  No matter how you measure citizens’ participation in the new administration, technology beyond posting and voting is going to be needed.  It’s not clear on Google Monitor if the categories were decided before the questions came in or after everyone posted.  In any case, I took the top vote getting comments from each of these categories and analyzed them again using our semantic technology to see what categories come out.  I could find 90 categories in total across all those who commented.  The top categories (more than 1% of the total) were the following;

2009-06-30_1828  

That’s easily more than twice what Google Moderator can bucket things into.  The point is that true participation means more than a simple tally.  It should mean listening, really listening to the context, the nuances, and the breadth of what citizen’s experience in their daily lives and what they expect from their government.  Volume is only the first problem for citizen participation.  The bigger issue is, as the intelligence community who is familiar with these problems puts it, finding dots, connecting dots and understanding dots. 

I believe semantics to be a core technology that can not only process the volume of what the White House is about to experience but can also understand the full picture of true citizen participation.  At Expert System we build technology that does exactly this.  We have deployed our solutions successfully within both governments and private industries alike who wish to be responsive to customers and citizens.  With several won awards from Prime Ministers and industry associations, Expert System is confident that we could—and would love the opportunity—to do the same for the Obama administration.  In the spirit of bi-partisanship, others do a good job at this as well, including GetSatisfaction and InQuira.  

We took a stab at semantically indexing and evaluation Obama’s web sites Recovery.gov and Data.gov, which were developed to drive accountability and openness between citizens and the Obama administration.  There is an abundance of information, indexed and tagged differently, with no real way to search for pertinent information without sifting through unwanted results or missing related information from a similar search.

These sites are a great first step, but without the right technology to enable users to search for and find the information they are looking for can really conclude that government transparency has improved? For instance, if a citizen is looking for information on money, a keyword search would turn up only those results with the word money in them.  However, by adding a semantic layer to the mix, the results returned would include words such as also amount, cash which are related to the word money, especially in the context of government use.

Our indexing semantic analysis and categorization allows citizen users to semantically search and categorize issues, topics or actions and thus discover a more complete understanding without having to sort through unwanted, inaccurate text.  This is a more complete step in the direction of true transparency of today’s government.

It will not do President Obama any good to promise to listen to his most important constituency and later be accused of lending a dull ear to the process.  There is great promise in having the breadth and range of American opinion directly influence the highest office in the land.  Equally as important is truly enabling U.S. citizens to find accurate and robust information on their own.  Everyone can see technology is the key to extending our democratic reach to every living room and kitchen table.  The peril is in not applying enough or the right technology resulting in enough citizens feeling as though they were not sufficiently heard.  That would do democracy harm indeed. 

J. Brooke Aker is the CEO of Expert System USA, a semantic technologies company.  Mr. Aker is a serial entrepreneur and economist by education.  Prior to being named CEO of Expert System USA, Mr. Aker was the CEO of Acuity Software and before that, Cipher Systems.  He is the technology designer of intelligence software for corporate use, and a long-time speaker and author in the areas of strategic planning, knowledge management, and business intelligence. His designs have been recognized by IBM, SCIP and The Journal of Strategic Planning.

 

 

5 Questions for Collecta CEO Gerry Campbell

June 30th, 2009 by Guest Author
Posted in CEO Views, Guest Authors | 1 Comment »

svlogo
By Guest Author Noah Mallin
5-questionsCollecta, a new real-time search engine, went live in beta on June 18th.They intro themselves on their homepage by saying “The web is alive with real-time information. So why search a stale archive? Collecta monitors the update streams of news sites, popular blogs and social media, and Flickr, so we can show you results as they happen.” This is ambitious stuff at a time when real-time is the buzzword in both search and social media circles.

To find out more about Collecta I asked their CEO, Gerry Campbell, to answer our patented 5 Questions. Campbell has a great blog, LuckyRobot.com where he admits to getting “…fired up about making new things.” He’s had a rich and varied background across information media companies like Reuters and AOL and has also been an active investor and advisor to a range of startups. So what does Gerry Campbell have to say about Collecta and real-time search? Read on…

1. Your experience encompasses traditional media at Reuters, search at AOL and Alta Vista and early-stage investing in several social media start-ups. Does real-time search embodied by Collecta bring these three together?

Every experience we have builds on the others. Absolutely. I have worked with some of the most brilliant people in media and technology. Old school, early stage and everything in between. My view of real-time search comes as the synthesis of all of that. I am also a student of mass media, which has had a huge impact on helping me to identify the trends that are driving us forward.

Just as important as my view of real-time search is the vision and strength that the founders of the company have. Jack Moffitt is one of the greatest problem solvers and technology visionaries I have ever worked with. He, Brian Zisk and an early crew of crazy-smart XMPP wizards arrived at the real-time search vision in parallel to me, even though we had never even met.

We came together late last year with a nearly identical vision – one from the technical side and one from the business/market side. We really work well together.

2. Real Time search is beginning to get crowded,  what makes Collecta different from some  of the other competitors in this space like Social Mention?

Because all of the air was sucked out of the market by the Google juggernaut there has not been much innovation in the search space as a whole for the past few years. Now, that there is such a gap between the traditional search leaders and the possibilities of what can be created for users, there’s been a rebirth of search innovation. We’re experiencing a renaissance of new ideas and approaches in search and it’s exciting.

How are we different?

a) Collecta is the fastest. Nobody has a similar approach, our unique architecture resembles a high performance financial data service more than a traditional web search.

b) Collecta is the broadest. We draw from the web at large – not just social networks. Or even more limited, one single social network (like some other real-time search tools.)

c) Collecta is revolutionary, not evolutionary. Because we are starting with the fundamental building blocks of search we are making leaps in time-based search innovation. Not incremental improvements in ranking, or simple mashups on a stream of data from a single social network.

3. Why do you think it’s taken so long for the big guys like Google to really get into the real time search game? How do you think Google’s doing?

Having run a top-tier site for a long time I completely understand the issue. When you’re operating a large property with a huge revenue base it’s not in your best interest to declare that the game has changed. In fact, your current frame of reference for how the business operates may be rooted in a certain view. In that case it’s hard to see past the established momentum and you might not know the game has changed. So you wait until you’re pressured into adopting/incorporating the new approach.

In general, I believe the people who create and run search businesses/technologies are some of the smartest people on the planet – so i assume the big guys have seen it coming and are maximizing the value to their shareholders in their approach – balancing the emerging trend with existing momentum.

It’s the innovator’s dilemma in its purest form.

4. The question that always comes up with a new web property is the business plan – do you envision some kind of search ad program being rolled out in the future or are you not concerned yet about how to monetize?

Collecta is a business and we have revenue plans. Users come first!

5. How many sources are you pulling from to generate results and do you plan to add to your index?

We are pulling from enough to prove the value proposition, and are adding new sources every day. It’s a big Internet out there.

The original post appeared here.

Aardvark CEO “We are now in public beta, check us out!”

June 29th, 2009 by Guest Author
Posted in CEO Views, News, Updates | No Comments »

aardvark_landing_logo

Hi Charles,

Since you’ve blogged about Aardvark, I thought you might be interested in some exciting developments.

First, we’ve completed our private beta, and just launched open sign-ups to the public!

Anyone can join now at http://vark.com using their Facebook account.

(We’re integrating with other social networks very soon.)

Second, Aardvark was in the Sunday New York Times!

28digi-600

You can check out the online version of it here.

When the author first contacted us, he said that he was inspired to write about Aardvark by our early supporters (that’s you!).  Thank you.

We’re really thrilled about the piece — we think it captures what Aardvark is all about.  

And third, we have some great product news coming out soon.  I’ll let you know when it’s ready so you can pass it on to your readers.

Let me know if you or your readers have any feedback about Aardvark, I’d love to hear about it.

Best,

Max

Max Ventilla, CEO

Aardvark discovers the perfect person to answer any question in minutes.

Comming Attractions – Twingly to launch Project Shinobi

June 11th, 2009 by Guest Author
Posted in CEO Views, Guest Authors, News, Updates | No Comments »

project-shinobi
We are very excited to announce today that on October 1st, 2009 Twingly is going to launch what will become the next great platform for social media.

The Realtime Web has claimed the throne as successor of Web 2.0 and the world is blogging and Tweeting and Facebooking to their hearts’ content. Interesting content deserving your attention might be only seconds or minutes old, and yesterday’s news is since long gone.

In social media, newly published content travels to you propelled by it’s own interestingness and quality. People around you find different ways to say “this is interesting!” and the choice of people you listen to decides what news will reach you and when. Often, you ignore what is coming through. But whenever you spend a few minutes on a new piece of information, you have made a decision based on which people you know already have spent time on it and their subsequent reactions. In the realtime web, this process is ever faster, increasing the freshness and quality of news ending up before your eyes.

On October 1st, 2009, Twingly will join the ranks of web services working together to improve your experience of social and traditional media. With Project Shinobi, we are aiming to provide a more social, more relevant and more realtime experience, integrating with the services you already use. Not only for people that are early adopters of social media, but accessible and immediately valuable for anyone. 

Project Shinobi is underway. Stay tuned (to AltSearchEngines – ed.)

This post was written by Martin / Twingly Blog

An interview with Kevin Chang, co-founder of Cazoodle

June 11th, 2009 by Hope Leman
Posted in CEO Views | 2 Comments »

cazAn interview by Hope Leman.

 

Hi, Kevin. Before we start, just a bit of background to put this interview into context for our readers. I recently attended the Web 2.0 Expo in San Francisco. I wandered around the Expo Hall and came across Cazoodle’s booth there. I was given a demonstration of both the apartment search and product search. I was quite impressed by the swiftness and incredible detail provided by both.

 

 

When I got home I watched the very useful screencast about Cazoodle here:

 

http://www.cazoodle.com/about.php

 

I recommend that anyone new to the term “vertical search” watch that screencast.

 

Now, on to our interview.

 

First of all, one reaction I had, is impressed as I was, I am not in the market for an apartment. But there is way more to Cazoodle than that, right? The apartment search engine is just the first of what will become a series of search products, correct?

 

Yes, apartment search is where we started; we plan to develop a series of vertical search products, with electronics shopping being the next.

 

Since Cazoodle came from our research work on “data-aware” search at the University of Illinois, we hope to bring these enabling technologies to build vertical search for the domains where deep search on structured data—such as finding apartments by addresses—is desired but is difficult to achieve.

 

How did you hit upon apartment search as your first product?

 

Cazoodle was started by a bunch of graduate students and me, so you could imagine what we got when I asked the students to brainstorm when we were looking for “driving applications.” They came up with two areas—apartments and used cars. And, you know why! We eventually picked apartments according to their ranking of pain. Even in our small town, it was a major pain to have to look everywhere online for finding apartments to check out. There were many useful sources, from Craiglist, Rent.com, to local apartment owners’ sites—just too many! In the old days, our mom and dad had to drive everywhere in town to look for rental signs; now we have the Internet but we had to surf everywhere online for property listings. We thought our technologies could do something useful for finding apartments.

 

As another major reason, I also had experience, in my faculty consulting work, with a major real estate service. They wanted to expand their good coverage from properties for sale to those for rent—which turned out to be so different and so difficult. In for-sale real estate, you get the MLS database from the National Association of Realtors. However, the for-rent market is rather fragmented and MLS does not cover very much. The company I did consulting work with was struggling to build an organic search for apartment properties, since there was no MLS-equivalent for rentals. So I knew since long ago that the industry did have this on their wish list.

 

How does apartment search showcase the potential for your products?

 

In hindsight, that was a very good choice, because apartment is a tough domain to deal with. For building structured search, the challenge really boils down to crawling, indexing, and integrating data from thousands of or even more sources. Apartment search very well represents these key barriers—the domain is very fragmented, with not only some large posting services but also numerous small landlord websites. To be both comprehensive and precise, we needed to tackle crawling and integration of data from numerous data sources at the order of hundreds of thousands. Large scale integration is what my research group has focused on, and we certainly have an edge here.

 

Furthermore, it was also good to pick a domain where the industry had encountered a bottleneck to breakthrough. While paid ads/posting services for apartments have be in existence since the dawn of the Web age, only until recently we started to see some organic crawl-based searches—but of rather small data coverage. As I said earlier, I witnessed stalled attempts of leading real estate companies. In such a domain, our potential would be clearly testified as we become the first and only comprehensive apartment search to cover data sources of all sizes.

 

Tell us about your plans for branding. Will you have, say, Cazoodle Drug Search, Cazoodle Car Search, and Cazoodle Book Search? You already have a Cazoodle for Electronics, for example.

 

Yes, that’s right. We hope to deploy the same technology in many new domains where vertical search is hard to build. Our strength in data harvesting can quite generally extends to various application areas.

 

However, to leverage our data harvesting capability, we also hope to serve as a data provider to collaborate with service providers in each domain. With our limited manpower, we could not develop all these many verticals so quickly. So we are always talking with partners to power up their vertical applications using our data harvesting and integration technology.

 

One of the things I liked about the screencast was the obvious enthusiasm of the speaker and the pride he takes in Cazoodle. Is that because he is a student and excited to be creating a real-world tool—something so few students get the opportunity to do?

 

The best part of being a faculty at UIUC is that I have the privilege of working with very strong students. They are so full of youth and, of course, enthusiasm as you noticed. Some of my previous students are now leading Cazoodle, and I am witnessing their “fire” from the great results they are producing, including this screencast. Govind Kabra, our CTO, led the production, and Jason Hertenstein contributed his professional voice.

 

ewbuildingCan you tell us about EnterpriseWorks, the incubator facility University of Illinois at Urbana-Champaign?

 

After the little fight with Netscape, the university seemed to have realized the potential of its strong engineering school, and has since then built up quite complete supports for faculty startups: the three arms of incubator, technology transfer, and venture capital investment.

 

The EnterpriseWorks is the incubator arm and is truly the best gift I could imagine from the University for doing startup! It provides not only hard-walled office space but also much soft-minded mentoring of entrepreneurship. Since we started in the building in January 2007, as the only “scale intensive” internet company there, we have added much new challenge to the administration, such as accommodating our data center. They have always been open-minded to learn new stuff with us!

 

Are you unique in enabling students to develop commercial search products? Do you know of any other programs like yours?

 

I am certainly not the only one doing this. Many know that the story of search, if search equals Google, has a lot to do with academic research— and I was particularly inspired because when I was a graduate student at Stanford, I had the pleasure to watch Google growing around us. We were in the same Digital Libraries project lead by Professor Hector Garcia-Molina, who was my advisor. When I was doing circuit interviewing for academic positions, one of the most asked interview question was—could you imagine—“why don’t you join Google?”

 

And I think I had a good answer. I was so inspired by how technologies—through startup—can change the way people live in a good way. I thought being a professor in academic research, if done right, could be the best way towards this goal. Why? Everyone doing startup would tell you recruiting good guts to join your dream is every bit as hard as coming up the dream itself. But no worry—As a professor, your research will develop not only technologies but also students! That’s exactly how I started Cazoodle, with the students and technologies developed at the university.

 

What are the majors of the students involved? Computer science? Electrical engineering? Do you recruit kids from non-techie department like marketing?

 

I am in the computer science department so, in research and teaching, I mostly work with CS students, with occasional undergraduates from electrical engineering. However, at Cazoodle, we have quite a few kids of various backgrounds in our data engineering process—our best performer was a music major specializing in trombone. We just wanted to make our data harvesting something kids can do for fun (or for an allowance). In addition, we also work with IBC, a group of part-time business student consultants at Illinois, for helping our marketing.

 

How do you recruit your students and how do you all work together to design marketable products?

 

Students actually like to work on product-able ideas that may have a market—you could imagine that, given we are in the department where Netscape, PayPal, YouTube, and Yelp came from. Even when writing academic papers, you always have to explain why the stuff is useful.

 

To this end, recruiting is not too hard. We simply demo what we are working on and what we have worked out. My research group holds “open house” to give demos. My research talks would always include demos. I recruited many graduate and undergraduate students with a “making-real-things” mindset this way.

 

What I found harder is to get students to work on search in particular. They would tell you Google has done it. I had to convince a new graduate student that her idea can go much beyond what Google is. But as soon as students are motivated, they can do wonder!

 

Who came up with the idea of Cazoodle? Do students get a royalty for their contributions?

 

The idea of Cazoodle came from my research focus: integrating and search structured data all over the Web. We started the MetaQuerier project exploring the “Deep Web,” where we developed techniques for integrating numerous online databases. Then, we continued with the WISDM project exploring structured data embedded in the vast amount of “Surface Web” text. In all these efforts, I worked with many students and some of them graduated at the right time and we co-founded Cazoodle together.

 

All inventions at a university belong to the university and the co-inventors. So students do share whatever rights associated with the inventions.

 

As a woman, I am eager for women to get more into technology. Are any of your students women?

 

Sure. Although Computer Science is not a department with many women, I do work with a few female students at any time. In fact, two of our co-founding students, Dr. Zhen Zhang and Huiting Yang, are women. Of course, quite some of our key developers are female.

 

As someone who works in a center for health research and quality, I immediately began to wonder how Cazoodle’s technologies could be used in healthcare. Do you envision it being used to search for consumer health products such as non-prescription drugs and medical supplies? I find, for instance, that there are many fascinating, worthwhile assistive technologies for the disabled but there don’t seem to be easy ways to find them. I often read about such technologies on message boards at social network sites like Patients Like Me. Could you assign a brilliant student to render such data as easily findable as you have managed to make apartment listings?

 

Yes, we are hoping to expand to other domains beyond current products. The healthcare domain could certainly use data aware search to make data more findable. At this point, we are focusing on apartments, shopping (electronics), and local events.

 

I try to list as many grants, awards and scholarships in the health sciences as I can find on the site I help maintain, ScanGrants. I find such data on the Web via hours of searching and enter them into ScanGrants one by one. When I saw Cazoodle demonstrated, I was envious and wished could that with grants and scholarships! What is your personal dream application of Cazoodle’s powerful technology?

 

First, as a professor I have to look for grants all the time—I think you are certainly right in how we really can make grant finding a much less laborious process. In fact, we are helping the IRIS (Illinois Researcher Information Service) service, a grant search service from our university, to automate their data indexing process.

 

Second, dream application? As Cazoodle has the technologies to understand structured data everywhere, in a large scale, I am hoping we can bring search to everywhere users need it, rather than requiring users to drop their task at hands and come to Cazoodle to search. Can we develop search technologies that can, when users are looking at something in the browser, understand what users are looking at and bring back relevant information automatically, without explicit search? I think, when search can go invisible and ubiquitous, it will open up a whole range of new applications.

 

Also, you mention Deep Web searching. Are there plans to apply your technologies to such Deep Web materials as chemical compounds and genes?

 

Yes, we are very interested in making science more findable, but at this point we need to stay focused on what we can do with a small group of us. We do look for partnership to co-develop applications for novel domains like science!

 

Tell us about the Deep Web. How is Cazoodle different from the search products of Deep Web Technologies, for instance?

 

As mentioned earlier, we started with the MetaQuerier project, one of the earliest efforts in exploring and integrating the numerous databases online, or the so-called “Deep Web” because their contents could not be easily reached by a typical crawler that only follows static links.

 

What we focused on, and may have excelled, are the set of technologies to automatically discover such databases for a particular purpose, to comprehensively crawl, extract, and index their data, and to precisely integrate and rank them for search.

 

What we differ from most other companies is the capability to do this in a large scale, with an automatic approach. For instance, as far as I know, most state of the art deep-Web technologies focus on federated “meta-search” with built-in interoperability protocols among participants, which has been effective in close-knit communities. In contrast, we hope to deploy to open, consumer-oriented domains like apartments and shopping.

 

I am in library school and am struck by how little discussion there is about search technology. By contrast, social networking is all the rage. How do you get kids interested in search? And how do you foresee search engines dealing with the fact that so much information is now being generated in member-only sites like Sermo, in microcontent fora like Twitter and via Open Science?

 

Since I am in computer science it is a little easier since kids are more into technologies to begin with. Many of my colleagues in CS agree that search is just a perfect playground for various CS fields—databases, artificial intelligence, text retrieval, high-performance computing, to name a few. As I said earlier, it is not hard to get students interested in search, it is harder to establish their confidence that they can do something Google has not covered well.

 

The new trends such as Twitter simply highlights that search will never be a solved problem. People not only consume information but of course also produce information. Search will have to tackle with new ways information is produced and the forms it exists. Twitter for example shows that information could be generated by everyone at real time. Search must always progress along with the course that our civilization evolves. 

 

Can you describe for us the design process for a search engine from inception to launch? How do you decide what data to search? Do you have to get permission from each database? Any headaches about copyright?

 

I will focus on vertical search as we have been doing. There appear to be a generic 3-step formula for the process: discover, crawl-and-index, integrate-and-rank, to finally deliver search to users.  Think apartments. First, we must discover, among the billions out there, which websites provide apartment listings. Second, we must crawl and index every listing from each site. Finally, we must integrate repeated listings, rank, and show them on a map.

 

Our crawler does follow the common robot crawling protocol and we always work hard to be a good Internet citizen with polite behavior. We always respond to crawling issues quickly.

 

We did run into some permission issues—mostly from site owners who did not want to see their data listed in the way we showed them, or who syndicated and did not actually own data on their sites to further disseminate. We would simply explain and honor their requests. On the other hand, we more often get requests from websites asking how their data could be included.

 

Where might readers see Cazoodle demonstrated this year?

 

We are now promoting our Apartment Search product in the rental industry tradeshows, like the NAA (National Apartment Association) conference in June. In addition, we will also be at various technology expos such as Web 2.0 conferences later this year.

 

Thank you for your time.

 

Thanks for the opportunity to speak up!