<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Top-Down and Bottom-Up Semantics</title>
	<atom:link href="http://www.altsearchengines.com/2008/07/08/top-down-and-bottom-up-semantics/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.altsearchengines.com/2008/07/08/top-down-and-bottom-up-semantics/</link>
	<description>The most wonderful search engines you've never seen!</description>
	<lastBuildDate>Sat, 13 Mar 2010 14:10:09 -0500</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.6</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Vipin Jain</title>
		<link>http://www.altsearchengines.com/2008/07/08/top-down-and-bottom-up-semantics/comment-page-1/#comment-78225</link>
		<dc:creator>Vipin Jain</dc:creator>
		<pubDate>Sat, 12 Jul 2008 21:04:52 +0000</pubDate>
		<guid isPermaLink="false">http://altsearchengines.com/2008/07/08/top-down-and-bottom-up-semantics/#comment-78225</guid>
		<description>D,

No doubt. I still have to see a commercial AI, NLP, semantic system that works across seemingly infinite number of contexts and semantics from both users&#039; and information perspective. Leave alone that is not subjective :-).

Anyways, it was a fun discussion. And it was good to chat!</description>
		<content:encoded><![CDATA[<p>D,</p>
<p>No doubt. I still have to see a commercial AI, NLP, semantic system that works across seemingly infinite number of contexts and semantics from both users&#8217; and information perspective. Leave alone that is not subjective <img src='http://www.altsearchengines.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> .</p>
<p>Anyways, it was a fun discussion. And it was good to chat!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: D Ashcart</title>
		<link>http://www.altsearchengines.com/2008/07/08/top-down-and-bottom-up-semantics/comment-page-1/#comment-77803</link>
		<dc:creator>D Ashcart</dc:creator>
		<pubDate>Fri, 11 Jul 2008 18:08:51 +0000</pubDate>
		<guid isPermaLink="false">http://altsearchengines.com/2008/07/08/top-down-and-bottom-up-semantics/#comment-77803</guid>
		<description>@Vipin - what you are saying makes sense. If you dig deeply enough into the origins of any learning system, you will hit the subjectivity barrier. Which is OK, since all forms of expression are ultimately subjective. This is the reason that the Holy Grail you describe is unreachable - you will need an infinite bitspace. You know you are headed into semantic swampland when you start seeing terms like &#039;learning&#039;, &#039;domain expertise&#039;, &#039;heuristic&#039;, &#039;practical solution&#039;, &#039;intelligent&#039;, etc. creep into the conversation. But there&#039;s no denying that these solutions often offer utility.

My initial reaction to the article was to question the implied leap of faith from the untrustworthiness of a tagger to the trustworthiness of a corporation.

Nothing specific against Cognition - the semantic/NLP space is rife with marketing-speak-absolutism that tramples over nuance and accuracy, which occasionally tweaks my purist heart. This is my personal curse.</description>
		<content:encoded><![CDATA[<p>@Vipin &#8211; what you are saying makes sense. If you dig deeply enough into the origins of any learning system, you will hit the subjectivity barrier. Which is OK, since all forms of expression are ultimately subjective. This is the reason that the Holy Grail you describe is unreachable &#8211; you will need an infinite bitspace. You know you are headed into semantic swampland when you start seeing terms like &#8216;learning&#8217;, &#8216;domain expertise&#8217;, &#8216;heuristic&#8217;, &#8216;practical solution&#8217;, &#8216;intelligent&#8217;, etc. creep into the conversation. But there&#8217;s no denying that these solutions often offer utility.</p>
<p>My initial reaction to the article was to question the implied leap of faith from the untrustworthiness of a tagger to the trustworthiness of a corporation.</p>
<p>Nothing specific against Cognition &#8211; the semantic/NLP space is rife with marketing-speak-absolutism that tramples over nuance and accuracy, which occasionally tweaks my purist heart. This is my personal curse.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Vipin Jain</title>
		<link>http://www.altsearchengines.com/2008/07/08/top-down-and-bottom-up-semantics/comment-page-1/#comment-77780</link>
		<dc:creator>Vipin Jain</dc:creator>
		<pubDate>Fri, 11 Jul 2008 16:29:50 +0000</pubDate>
		<guid isPermaLink="false">http://altsearchengines.com/2008/07/08/top-down-and-bottom-up-semantics/#comment-77780</guid>
		<description>We have seen researchers positioned along the entire spectrum of AI and semantic web. Holy grail being a completely automated bottoms-up approach that requires no structured feed to the learning system – the system is supposed to learn the underlying structure on its own. This becomes extremely challenging when the learning system doesn’t have an identity, intelligence, or motivation of its own. Actually, it is extremely challenging when the learning system does have an identify, intelligence or motivation – just look at how good we are at training and retraining our children or other peoples :-). 

Anyway back to computers, all learning systems have to be guided and differ only in the amount and means of guidance provided by the “algorithms specialist” or the “programmer”. Practical and successful approaches constrain the learning system and structure the problem such that simple mathematical rules can learn the classification boundaries from available data. Even then you can’t satisfy every user request in a given vertical. You optimize for the most common tasks that make a solution commercially viable from cost and ROI perspective. Our researchers at Retrevo have employed practical approaches of constraining problems by using intelligent crawling, statistics-based feature extraction and selection, and Bayesian learning of classification boundaries. We don’t intend to satisfy every corner case in our vertical equally well but we satisfy common tasks extremely well. Just 2 cents from a company that has applied research to solve practical problems with commercial viability. And we know we still have ways to go!</description>
		<content:encoded><![CDATA[<p>We have seen researchers positioned along the entire spectrum of AI and semantic web. Holy grail being a completely automated bottoms-up approach that requires no structured feed to the learning system – the system is supposed to learn the underlying structure on its own. This becomes extremely challenging when the learning system doesn’t have an identity, intelligence, or motivation of its own. Actually, it is extremely challenging when the learning system does have an identify, intelligence or motivation – just look at how good we are at training and retraining our children or other peoples <img src='http://www.altsearchengines.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> . </p>
<p>Anyway back to computers, all learning systems have to be guided and differ only in the amount and means of guidance provided by the “algorithms specialist” or the “programmer”. Practical and successful approaches constrain the learning system and structure the problem such that simple mathematical rules can learn the classification boundaries from available data. Even then you can’t satisfy every user request in a given vertical. You optimize for the most common tasks that make a solution commercially viable from cost and ROI perspective. Our researchers at Retrevo have employed practical approaches of constraining problems by using intelligent crawling, statistics-based feature extraction and selection, and Bayesian learning of classification boundaries. We don’t intend to satisfy every corner case in our vertical equally well but we satisfy common tasks extremely well. Just 2 cents from a company that has applied research to solve practical problems with commercial viability. And we know we still have ways to go!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: D Ashcart</title>
		<link>http://www.altsearchengines.com/2008/07/08/top-down-and-bottom-up-semantics/comment-page-1/#comment-77534</link>
		<dc:creator>D Ashcart</dc:creator>
		<pubDate>Thu, 10 Jul 2008 23:57:19 +0000</pubDate>
		<guid isPermaLink="false">http://altsearchengines.com/2008/07/08/top-down-and-bottom-up-semantics/#comment-77534</guid>
		<description>&quot;An alternative method to “bottom-up” and “top-down” semantics is Semantic Natural Language Processing, such as that employed by Cognition Technologies, in which the computer has been taught the meanings and relationships of all of the words and phrases, and also recovers the meanings of the words and phrases in searched document set.&quot;

I&#039;m curious how this is an alternative to top-down. If top-down tagging is susceptible to the subjectivity and knowledge of the tagger, isn&#039;t SNLP susceptible to the subjectivity of your programmers or librarians or ontologists?

In other words, who &quot;taught the computer&quot;?</description>
		<content:encoded><![CDATA[<p>&#8220;An alternative method to “bottom-up” and “top-down” semantics is Semantic Natural Language Processing, such as that employed by Cognition Technologies, in which the computer has been taught the meanings and relationships of all of the words and phrases, and also recovers the meanings of the words and phrases in searched document set.&#8221;</p>
<p>I&#8217;m curious how this is an alternative to top-down. If top-down tagging is susceptible to the subjectivity and knowledge of the tagger, isn&#8217;t SNLP susceptible to the subjectivity of your programmers or librarians or ontologists?</p>
<p>In other words, who &#8220;taught the computer&#8221;?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Hope Leman</title>
		<link>http://www.altsearchengines.com/2008/07/08/top-down-and-bottom-up-semantics/comment-page-1/#comment-76984</link>
		<dc:creator>Hope Leman</dc:creator>
		<pubDate>Wed, 09 Jul 2008 13:29:54 +0000</pubDate>
		<guid isPermaLink="false">http://altsearchengines.com/2008/07/08/top-down-and-bottom-up-semantics/#comment-76984</guid>
		<description>This is a really fascinating article. I was interested that the emphasis was on coders as taggers. But isn’t a huge amount of tagging done by average people with no coding now-how at all? For instance, I help run a site that requires me to determine what categories each grant goes into it. Those are sort of quasi-tags that don’t require any coding know-how on my part and some of the grant listings do seem to get into Google (not as much as I would like, of course!).

And Blogger has the feature, “Labels for this post,” which is a sort of tagging system (I guess!). And then there is the whole world of del.icio.us and social bookmarking. None of that requires knowledge of tagging languages. And do websites necessarily need to tag their content if enthusiastic laypeople are doing it for them a la Digg?

Anyway, this was one of the best, clearest overviews of the very abstruse topic of the semantic web I have seen. Thank you. I am in library school and need to grasp all this stuff.</description>
		<content:encoded><![CDATA[<p>This is a really fascinating article. I was interested that the emphasis was on coders as taggers. But isn’t a huge amount of tagging done by average people with no coding now-how at all? For instance, I help run a site that requires me to determine what categories each grant goes into it. Those are sort of quasi-tags that don’t require any coding know-how on my part and some of the grant listings do seem to get into Google (not as much as I would like, of course!).</p>
<p>And Blogger has the feature, “Labels for this post,” which is a sort of tagging system (I guess!). And then there is the whole world of del.icio.us and social bookmarking. None of that requires knowledge of tagging languages. And do websites necessarily need to tag their content if enthusiastic laypeople are doing it for them a la Digg?</p>
<p>Anyway, this was one of the best, clearest overviews of the very abstruse topic of the semantic web I have seen. Thank you. I am in library school and need to grasp all this stuff.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Andreas Harth</title>
		<link>http://www.altsearchengines.com/2008/07/08/top-down-and-bottom-up-semantics/comment-page-1/#comment-76959</link>
		<dc:creator>Andreas Harth</dc:creator>
		<pubDate>Wed, 09 Jul 2008 11:32:27 +0000</pubDate>
		<guid isPermaLink="false">http://altsearchengines.com/2008/07/08/top-down-and-bottom-up-semantics/#comment-76959</guid>
		<description>I don&#039;t completely agree with the notion of top-down vs. bottom-up semantics.  Even tagging individual pages with topics requires a formal specification of the topics, even if the formal spec is very basic, e.g. in the case of taxonomies such as DMOZ.  The article also fails to explain that a shared use of URIs forms the basis for deriving meaning on the Semantic Web.

I&#039;m skeptic of the promise that NLP methods will automatically derive meaning from text. While information extraction with NLP may work in narrowly specified domains (which, however involves a great deal of manual labour), automatically extracting entities, attributes and relationships from the open-domain, multi-lingual Web using NLP techniques has failed, despite millions of dollars of investment (Powerset).  Rather, starting with structured datasets and extending and linking them iteratively seems to work (DBpedia, Linked Data).</description>
		<content:encoded><![CDATA[<p>I don&#8217;t completely agree with the notion of top-down vs. bottom-up semantics.  Even tagging individual pages with topics requires a formal specification of the topics, even if the formal spec is very basic, e.g. in the case of taxonomies such as DMOZ.  The article also fails to explain that a shared use of URIs forms the basis for deriving meaning on the Semantic Web.</p>
<p>I&#8217;m skeptic of the promise that NLP methods will automatically derive meaning from text. While information extraction with NLP may work in narrowly specified domains (which, however involves a great deal of manual labour), automatically extracting entities, attributes and relationships from the open-domain, multi-lingual Web using NLP techniques has failed, despite millions of dollars of investment (Powerset).  Rather, starting with structured datasets and extending and linking them iteratively seems to work (DBpedia, Linked Data).</p>
]]></content:encoded>
	</item>
</channel>
</rss>
