Spezify ricercare e navigare tra i risultati in modo più intuitivo

July 3rd, 2009 by Guest Author
Posted in Global, Guest Authors, Verticals, Visual | No Comments »

spezify

Spezify è un motore di ricerca visuale che propone un modo più intuitivo di ricercare e visualizzare le informazioni.

E’ capace di ricercare contenuti tipo immagini, testo e video all’interno di siti come: Yahoo, Twitter, Amazon, Ebay e Bing.

I risultati sono presentati similmente a degli articoli di giornale e nella parte alta della pagina sono suggerite delle parole correlate alla query inserita.

Attraverso l’utilizzo della tastiera è poi possibile navigare e visionare tutte le informazioni in una singola pagina.

Spezify è un motore di ricerca visuale, ideato per ottimizzare le ricerche online filtrando i risultati e fornendo solo i contenuti migliori, pronti per essere consultati in maniera veloce ed intuitiva.

caronteweb-g-y

From CaronteWeb here.

Doc Fetcher : outil de recherche de documents offline

July 2nd, 2009 by Charles S. Knight
Posted in Global, Guest Authors, Verticals | No Comments »

doc-fetcherDoc Fetcher est ce qu’on appelle un moteur de recherche de bureau. Il s’agit en fait d’un outil similaire à la recherche de fichiers Windows, sauf qu’ici, c’est la technologie Google qui est utilisé (et le vieux chien 3D disparaît aussi au passage).

Doc Fetcher est un programme gratuit et opensource pour Windows ou Linux (où est la version Mac OS ?). Il vous permet donc d’accéder rapidement aux documents stockés sur votre ordinateur en tapant quelques mots-clés.

Comment ça marche ?

Tout simplement comme un vrai moteur. Doc Fetcher va crawler l’ensemble de vos dossiers et récupérer les différents fichiers qui s’y trouvent afin de constituer un index. La première fois, cela risque de prendre du temps (apparemment, le site indique 1 min pour 200 fichiers environ). J’ai cependant noté quelques fonctions intéressantes comme la recherche de termes directement à l’intérieur du code source (pour les fichiers HTML par exemple). Après cette indexation, vous pourrez personnaliser les critères de prises en comptes lors de la recherche.

Ci-dessous la liste des formats de fichier supportés :

  • HTML and plain text (both customizable)
  • Portable Document Format (pdf)
  • Microsoft Office (doc, xls, ppt)
  • Microsoft Office 2007 (docx, xlsx, pptx)
  • OpenOffice.org Writer, Calc, Draw and Impress (odt, ods, odg, odp)
  • Rich Text Format (rtf)
  • AbiWord (abw, abw.gz, zabw)
  • Microsoft Compiled HTML Help (chm)
  • Microsoft Visio (vsd)
  • Scalable Vector Graphics (svg)

J’ai pu rapidement tester sous Parallel mais j’ai quelques soucis de ralentissement général avec le programme. Si vous avez l’occasion de tester Doc Fetcher, que ce soit avec l’interface complète ou simplifiée, n’hésitez pas à déposer votre avis dans les commentaires.

rbFrom the RamenosBlog here.

The Whuffie Powered Search Engine

July 1st, 2009 by Guest Author
Posted in Guest Authors | No Comments »

By guest author Carmel Hagen

Original post here.

Do a search for “The Whuffie Factor reviews” on a traditional search engine, and something interesting happens. In the area that SEO experts lust for – those glorified first five results – a blog post pops up. It’s a modest post, just a few flattering paragraphs covering Tara Hunt’s guide to social media for businesspeople, but there it is – smushed right between Amazon’s own review and the book’s official website. A quote from within the post unveils the significance of that blog’s page rank – and quite serendipitously proves the power of the “Whuffie” movement as a whole:

picture-1916“When Tara tweeted offering to send folks an advance copy of her book The Whuffie Factor to review I responded right away, though I didn’t expect to be chosen. Virtually nobody reads this site.”

No one reads this site. Right. As so many of us are exasperatingly aware of, things don’t just hit the front page of a traditional search engine by sheer luck. So how did one “readerless” blog become the go-to place for a review on a popular marketing book? Well, it was Whuffie.

Whuffie,” a term coined by the author Cory Doctorow, is a friendly word for a powerful concept: social capital. Social capital, also known as the return on investment that comes from gathering trustworthiness and approval online, is one of today’s most compelling reasons to be deeply entrenched in social media. But does the investment pay off? If that self-described small beans blogger turned Whuffie pro-evangelist* is any indicator, yes. However, that post didn’t SEO the heck out of itself just because Tara believed in investing in her community – it got there because her community had money of its own.

For a blog post to reach the top of a traditional search engine’s results, a rather huge, combined community investment must be made– one that is very rare to come across. Not only does a blogger need to write about the search topic, but they also must inspire a ton of other bloggers to write about it, linking their posts back to the original one so that it builds page rank. In terms of social capital, blogs are expensive, taking up tons of the time and effort of very nice, willing people. Thankfully, this mass investment is no longer needed in the area of realtime search, where significantly simpler shares are highly influential in the ranking of search results. Shares are effortless compared to blog posts, taking only seconds of user’s time (yet offering plenty of that same good Whuffie) – and everybody loves to share.

picture-2011Michael Arrington recently penned something like this: “Re-tweets are the currency of the web.” At OneRiot, where a realtime result’s PulseRank is influenced by those retweets, we agree – but we venture to take it one further. The currency of the web lies not in a simple re-tweet, but in sharing as a whole. Whether it’s via email, thru Twitter, across Facebook, over IM, or hyper-dispersed thru a tool like Yoono, a share indicates a web user’s assignment of value. When a person shares something of yours, they’re assigning that value to you. How awesome is that.

In business speak, shares are to the web what referrals are to the real world – they are the word of mouth marketing of the internet. Online friends listen to online friends because they have already built up the same complex rapport that forms the online backbone of Whuffie. They trust each other and they respect each other – so they click on each others’ links. A lot. And when people click on links, cool things happen. Articles get read. New products and tools are discovered. Links get re-shared. Sometimes, people even buy stuff. Sounds a bit like the kind of stuff you’d want a marketing campaign to achieve, doesn’t it? To us, it sounds like the building blocks for a new kind of SEO.

In business speak, shares are to the web what referrals are to the real world – they are the word of mouth marketing of the internet.

As the readers savvy to our ranking algorithms know, there is a heavy relationship between sharing and the ranking of our search results. You can read up on the details of that here, but the gist of it is that links tend to wind up as search results on OneRiot when someone (and frequently, lots of someones) shares them online. The way we see it, each new share represents a fresh link to realtime information – the stuff people care about, right now. Obviously, this amplifies our general sentiment that sharing is pretty freakin’ important, because it is the most organic way to determine something’s importance that we know of today.

So how do you encourage sharing, embrace your share-ers, and reap the benefits of Whuffie? You enable it. D’oh, right? You’d think so. But as Tobias noticed the other day, only a small number of Internet companies are actually making sharing as easy for their users as they should be. For instance, only a handful of publishers are offering their readers easy, impulse friendly ways to share their content (like sticking a “Tweet This” button on every post). Only a slightly bigger set are putting a fair effort into sharing their own content through social media channels. The smallest set – those that are engage in a healthy mix of both (Mashable is a great example) – are winning, amassing their own social capital and capitalizing on that which already exists in the hands of their users. That said, all the “Tweet this” buttons in the world won’t amount to anything if the content you’re giving your reader isn’t valuable, sharable stuff – but we’ll save that discussion for another post.

picture-2116The investment seems obvious – the more people share your links, the more valuable those links will become to you. Sharing creates visibility, visibility creates traffic, and traffic creates business. On the flipside, if people aren’t sharing your content, you’re missing out, bigtime, because today’s most effective web marketing is happening via the passing of social capital from computer screen to computer screen.

When Tara shipped her unreleased book off to a nice guy who happened to be very interested, she set off a series of events that led to a big increase in social capital for her – one that’s still paying off. Tara gave her online community the tools they needed to share information, and they essentially took over a chunk of the marketing campaign for her book. The difference between that story and the ones we now get to write is the ease with which we can create ours. Enable people to share your content, and they will. Share your own content, and people will embrace it. Make Whuffie, make it to the top of our search results. Every day.

Now if you don’t mind…Tweet this?

*The author of that great Whuffie Factor review was Chris Patterson, by the way.

You can find him here and here. Killer post, Chris :)

The Obama administration is going to “listen to citizens.”

July 1st, 2009 by Guest Author
Posted in CEO Views, Guest Authors, Semantic | No Comments »

logo

By J. Brooke Aker, CEO, Expert System USA

The Office of Public Liaison in the new Obama administration is promising to listen to citizens as it considers policy direction, legislation and brings the people to Washington rather than bringing Washington to the people.  The most concrete of these proposals is to allow a five-day comment period by citizens via the Internet before the President signs any legislation.  Even now, anyone can offer an opinion directly to the President here.   You can contribute up to 5,000 characters.  That is roughly 400 words. 

The windows are open in the White House and a new breeze of inclusiveness is blowing right in.  This is certainly a change over the previous eight years when the White House was shut tight with the air inside growing staler by the day.  But I wonder if the administration is prepared for the hurricane force winds that could result?

If you ask for comments on pending legislation, how many comments will the White House get?  There are some hints from around the blogosphere.  In February, a search conducted on Technorati for a count of the word “bailout” over the previous 6 months resulted in the following chart: chart

The peak of more than 14,000 blog posts was around the passage of the first muti-billion bank bailout in the early Fall.  An estimate of the average around this spike looks to be roughly 6,000 posts per day.  As the debate and finalization of the ARRA (American Recovery and Reinvestment Act) and second half of the bank bailout money is finalized, you can be sure the number will spike again.  But let’s be conservative and assume 1/3 of the average would like to comment directly to the White House on the ARRA over the five-day period promised.  That would be 10,000 comments President Obama says he will consider before signing the legislation.  The current estimate of U.S. bloggers is 22.6 million so 10,000 comments may only be a drop in the bucket.    

Short of a small army of readers, how will Valerie Jarrett and her staff understand this “wisdom of the crowd” input?  We do know that President Obama has hired some tech vets to lead this kind of effort. 

Chief among these is a former Google product manager Katie Jacobs Stanton who is the new President’s “director of citizen participation.”  It is not just a coincidence that Ms. Stanton was in charge of Google Moderator. 

A quick look at this tool reveals the ability for anyone to post a question (or I suppose a comment) and then have others vote for its importance relative to all the other questions posted.  Looking through the questions posted around the Presidential debates is another estimate I can find that might look like what the White House will experience.  The breakdown of topics, questions asked, votes recorded and citizens participating look like the table below.

 

Votes

Questions

People

Education

6,926

96

1,183

Health Care

3,483

81

412

Iraq War

3,513

64

488

Economy

7,534

209

580

Environment

3,078

73

317

Foreign Policy

3,699

101

339

TOTAL

28,233

624

3,319

 

Ok here is the rub.  No matter how you measure citizens’ participation in the new administration, technology beyond posting and voting is going to be needed.  It’s not clear on Google Monitor if the categories were decided before the questions came in or after everyone posted.  In any case, I took the top vote getting comments from each of these categories and analyzed them again using our semantic technology to see what categories come out.  I could find 90 categories in total across all those who commented.  The top categories (more than 1% of the total) were the following;

2009-06-30_1828  

That’s easily more than twice what Google Moderator can bucket things into.  The point is that true participation means more than a simple tally.  It should mean listening, really listening to the context, the nuances, and the breadth of what citizen’s experience in their daily lives and what they expect from their government.  Volume is only the first problem for citizen participation.  The bigger issue is, as the intelligence community who is familiar with these problems puts it, finding dots, connecting dots and understanding dots. 

I believe semantics to be a core technology that can not only process the volume of what the White House is about to experience but can also understand the full picture of true citizen participation.  At Expert System we build technology that does exactly this.  We have deployed our solutions successfully within both governments and private industries alike who wish to be responsive to customers and citizens.  With several won awards from Prime Ministers and industry associations, Expert System is confident that we could—and would love the opportunity—to do the same for the Obama administration.  In the spirit of bi-partisanship, others do a good job at this as well, including GetSatisfaction and InQuira.  

We took a stab at semantically indexing and evaluation Obama’s web sites Recovery.gov and Data.gov, which were developed to drive accountability and openness between citizens and the Obama administration.  There is an abundance of information, indexed and tagged differently, with no real way to search for pertinent information without sifting through unwanted results or missing related information from a similar search.

These sites are a great first step, but without the right technology to enable users to search for and find the information they are looking for can really conclude that government transparency has improved? For instance, if a citizen is looking for information on money, a keyword search would turn up only those results with the word money in them.  However, by adding a semantic layer to the mix, the results returned would include words such as also amount, cash which are related to the word money, especially in the context of government use.

Our indexing semantic analysis and categorization allows citizen users to semantically search and categorize issues, topics or actions and thus discover a more complete understanding without having to sort through unwanted, inaccurate text.  This is a more complete step in the direction of true transparency of today’s government.

It will not do President Obama any good to promise to listen to his most important constituency and later be accused of lending a dull ear to the process.  There is great promise in having the breadth and range of American opinion directly influence the highest office in the land.  Equally as important is truly enabling U.S. citizens to find accurate and robust information on their own.  Everyone can see technology is the key to extending our democratic reach to every living room and kitchen table.  The peril is in not applying enough or the right technology resulting in enough citizens feeling as though they were not sufficiently heard.  That would do democracy harm indeed. 

J. Brooke Aker is the CEO of Expert System USA, a semantic technologies company.  Mr. Aker is a serial entrepreneur and economist by education.  Prior to being named CEO of Expert System USA, Mr. Aker was the CEO of Acuity Software and before that, Cipher Systems.  He is the technology designer of intelligence software for corporate use, and a long-time speaker and author in the areas of strategic planning, knowledge management, and business intelligence. His designs have been recognized by IBM, SCIP and The Journal of Strategic Planning.

 

 

5 Questions for Collecta CEO Gerry Campbell

June 30th, 2009 by Guest Author
Posted in CEO Views, Guest Authors | 1 Comment »

svlogo
By Guest Author Noah Mallin
5-questionsCollecta, a new real-time search engine, went live in beta on June 18th.They intro themselves on their homepage by saying “The web is alive with real-time information. So why search a stale archive? Collecta monitors the update streams of news sites, popular blogs and social media, and Flickr, so we can show you results as they happen.” This is ambitious stuff at a time when real-time is the buzzword in both search and social media circles.

To find out more about Collecta I asked their CEO, Gerry Campbell, to answer our patented 5 Questions. Campbell has a great blog, LuckyRobot.com where he admits to getting “…fired up about making new things.” He’s had a rich and varied background across information media companies like Reuters and AOL and has also been an active investor and advisor to a range of startups. So what does Gerry Campbell have to say about Collecta and real-time search? Read on…

1. Your experience encompasses traditional media at Reuters, search at AOL and Alta Vista and early-stage investing in several social media start-ups. Does real-time search embodied by Collecta bring these three together?

Every experience we have builds on the others. Absolutely. I have worked with some of the most brilliant people in media and technology. Old school, early stage and everything in between. My view of real-time search comes as the synthesis of all of that. I am also a student of mass media, which has had a huge impact on helping me to identify the trends that are driving us forward.

Just as important as my view of real-time search is the vision and strength that the founders of the company have. Jack Moffitt is one of the greatest problem solvers and technology visionaries I have ever worked with. He, Brian Zisk and an early crew of crazy-smart XMPP wizards arrived at the real-time search vision in parallel to me, even though we had never even met.

We came together late last year with a nearly identical vision – one from the technical side and one from the business/market side. We really work well together.

2. Real Time search is beginning to get crowded,  what makes Collecta different from some  of the other competitors in this space like Social Mention?

Because all of the air was sucked out of the market by the Google juggernaut there has not been much innovation in the search space as a whole for the past few years. Now, that there is such a gap between the traditional search leaders and the possibilities of what can be created for users, there’s been a rebirth of search innovation. We’re experiencing a renaissance of new ideas and approaches in search and it’s exciting.

How are we different?

a) Collecta is the fastest. Nobody has a similar approach, our unique architecture resembles a high performance financial data service more than a traditional web search.

b) Collecta is the broadest. We draw from the web at large – not just social networks. Or even more limited, one single social network (like some other real-time search tools.)

c) Collecta is revolutionary, not evolutionary. Because we are starting with the fundamental building blocks of search we are making leaps in time-based search innovation. Not incremental improvements in ranking, or simple mashups on a stream of data from a single social network.

3. Why do you think it’s taken so long for the big guys like Google to really get into the real time search game? How do you think Google’s doing?

Having run a top-tier site for a long time I completely understand the issue. When you’re operating a large property with a huge revenue base it’s not in your best interest to declare that the game has changed. In fact, your current frame of reference for how the business operates may be rooted in a certain view. In that case it’s hard to see past the established momentum and you might not know the game has changed. So you wait until you’re pressured into adopting/incorporating the new approach.

In general, I believe the people who create and run search businesses/technologies are some of the smartest people on the planet – so i assume the big guys have seen it coming and are maximizing the value to their shareholders in their approach – balancing the emerging trend with existing momentum.

It’s the innovator’s dilemma in its purest form.

4. The question that always comes up with a new web property is the business plan – do you envision some kind of search ad program being rolled out in the future or are you not concerned yet about how to monetize?

Collecta is a business and we have revenue plans. Users come first!

5. How many sources are you pulling from to generate results and do you plan to add to your index?

We are pulling from enough to prove the value proposition, and are adding new sources every day. It’s a big Internet out there.

The original post appeared here.