<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Kommentare zu: Nutch &#8211; meta description in search results</title>
	<atom:link href="http://www.adick.at/2008/09/17/nutch-display-meta-description-in-search-results/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.adick.at/2008/09/17/nutch-display-meta-description-in-search-results/</link>
	<description>TYPO3, Web-Entwicklung und mehr</description>
	<lastBuildDate>Thu, 10 May 2012 13:12:59 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>Von: Volli</title>
		<link>http://www.adick.at/2008/09/17/nutch-display-meta-description-in-search-results/comment-page-1/#comment-4082</link>
		<dc:creator>Volli</dc:creator>
		<pubDate>Thu, 26 Aug 2010 00:38:27 +0000</pubDate>
		<guid isPermaLink="false">http://www.dick.at/alex/wordpress/?p=36#comment-4082</guid>
		<description>No chance with Nutch 1.1 (for me!).

#x: bin/nutch index crawl/NEWindexes crawl/crawldb crawl/linkdb crawl/segments/*

throws errors:
Indexer: starting
Indexer: java.io.IOException: Job failed!
at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1252)
at org.apache.nutch.indexer.Indexer.index(Indexer.java:72)
at org.apache.nutch.indexer.Indexer.run(Indexer.java:92)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
at org.apache.nutch.indexer.Indexer.main(Indexer.java:101)

Afterwards I tried a built (from final 1.1 src) without description-plugin and it worked without any errors.

But I learned a lot &#039;bout nutch structure and ant compiling. So, thanks for your efforts!</description>
		<content:encoded><![CDATA[<p>No chance with Nutch 1.1 (for me!).</p>
<p>#x: bin/nutch index crawl/NEWindexes crawl/crawldb crawl/linkdb crawl/segments/*</p>
<p>throws errors:<br />
Indexer: starting<br />
Indexer: java.io.IOException: Job failed!<br />
at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1252)<br />
at org.apache.nutch.indexer.Indexer.index(Indexer.java:72)<br />
at org.apache.nutch.indexer.Indexer.run(Indexer.java:92)<br />
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)<br />
at org.apache.nutch.indexer.Indexer.main(Indexer.java:101)</p>
<p>Afterwards I tried a built (from final 1.1 src) without description-plugin and it worked without any errors.</p>
<p>But I learned a lot &#8217;bout nutch structure and ant compiling. So, thanks for your efforts!</p>
]]></content:encoded>
	</item>
	<item>
		<title>Von: Volli</title>
		<link>http://www.adick.at/2008/09/17/nutch-display-meta-description-in-search-results/comment-page-1/#comment-4076</link>
		<dc:creator>Volli</dc:creator>
		<pubDate>Wed, 25 Aug 2010 22:44:37 +0000</pubDate>
		<guid isPermaLink="false">http://www.dick.at/alex/wordpress/?p=36#comment-4076</guid>
		<description>-Nutch 1.1; Compiling procedure-
Just a intermediate result (I stopped at point successful building with ant; no further testing yet!!! Maybe completely senseless!? They changed a lot with Nutch 1.1.):

DescriptionIndexer.java:
1) Hellkeeper&#039;s advice.
2) FIND:
Field.Store.YES, Field.Index.UN_TOKENIZED
3) CHANGE:
Field.Store.YES, Field.Index.NOT_ANALYZED</description>
		<content:encoded><![CDATA[<p>-Nutch 1.1; Compiling procedure-<br />
Just a intermediate result (I stopped at point successful building with ant; no further testing yet!!! Maybe completely senseless!? They changed a lot with Nutch 1.1.):</p>
<p>DescriptionIndexer.java:<br />
1) Hellkeeper&#8217;s advice.<br />
2) FIND:<br />
Field.Store.YES, Field.Index.UN_TOKENIZED<br />
3) CHANGE:<br />
Field.Store.YES, Field.Index.NOT_ANALYZED</p>
]]></content:encoded>
	</item>
	<item>
		<title>Von: Chris</title>
		<link>http://www.adick.at/2008/09/17/nutch-display-meta-description-in-search-results/comment-page-1/#comment-2883</link>
		<dc:creator>Chris</dc:creator>
		<pubDate>Wed, 27 Jan 2010 15:38:30 +0000</pubDate>
		<guid isPermaLink="false">http://www.dick.at/alex/wordpress/?p=36#comment-2883</guid>
		<description>Thanks for this tip.  It was a big help.</description>
		<content:encoded><![CDATA[<p>Thanks for this tip.  It was a big help.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Von: Hellkeeper</title>
		<link>http://www.adick.at/2008/09/17/nutch-display-meta-description-in-search-results/comment-page-1/#comment-2181</link>
		<dc:creator>Hellkeeper</dc:creator>
		<pubDate>Tue, 29 Sep 2009 16:36:35 +0000</pubDate>
		<guid isPermaLink="false">http://www.dick.at/alex/wordpress/?p=36#comment-2181</guid>
		<description>To get it running in final nutch 1.0 version you have to change line 26 in file DescriptionIndexer.java to this: public abstract class DescriptionIndexer implements IndexingFilter

Thx to Alex for this help! ;-)</description>
		<content:encoded><![CDATA[<p>To get it running in final nutch 1.0 version you have to change line 26 in file DescriptionIndexer.java to this: public abstract class DescriptionIndexer implements IndexingFilter</p>
<p>Thx to Alex for this help! <img src='http://www.adick.at/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
	<item>
		<title>Von: Alex</title>
		<link>http://www.adick.at/2008/09/17/nutch-display-meta-description-in-search-results/comment-page-1/#comment-984</link>
		<dc:creator>Alex</dc:creator>
		<pubDate>Thu, 09 Apr 2009 20:20:13 +0000</pubDate>
		<guid isPermaLink="false">http://www.dick.at/alex/wordpress/?p=36#comment-984</guid>
		<description>Jacob,
did it work in the meantime? Maybe the content of your description tag is not taken to the index.</description>
		<content:encoded><![CDATA[<p>Jacob,<br />
did it work in the meantime? Maybe the content of your description tag is not taken to the index.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Von: Jacob</title>
		<link>http://www.adick.at/2008/09/17/nutch-display-meta-description-in-search-results/comment-page-1/#comment-804</link>
		<dc:creator>Jacob</dc:creator>
		<pubDate>Thu, 02 Apr 2009 16:21:53 +0000</pubDate>
		<guid isPermaLink="false">http://www.dick.at/alex/wordpress/?p=36#comment-804</guid>
		<description>Hi Alex,

Firs of all thanks for your post,

I got a problem because nutch doesn&#039;t find anything when I want to search by description, everything works perfect but when I search some content stored on the description doesn&#039;t find anything, any idea?</description>
		<content:encoded><![CDATA[<p>Hi Alex,</p>
<p>Firs of all thanks for your post,</p>
<p>I got a problem because nutch doesn&#8217;t find anything when I want to search by description, everything works perfect but when I search some content stored on the description doesn&#8217;t find anything, any idea?</p>
]]></content:encoded>
	</item>
	<item>
		<title>Von: Alex</title>
		<link>http://www.adick.at/2008/09/17/nutch-display-meta-description-in-search-results/comment-page-1/#comment-526</link>
		<dc:creator>Alex</dc:creator>
		<pubDate>Sat, 28 Feb 2009 18:58:24 +0000</pubDate>
		<guid isPermaLink="false">http://www.dick.at/alex/wordpress/?p=36#comment-526</guid>
		<description>Hi Mille,

Yes you can pass it with parse.getData().getContentMeta().set(&quot;your_description&quot;, your_data);

Then simply get it using

String desc = parse.getData().getMeta(&quot;your_description&quot;);

and you&#039;re done.

Cheers</description>
		<content:encoded><![CDATA[<p>Hi Mille,</p>
<p>Yes you can pass it with parse.getData().getContentMeta().set(&#8220;your_description&#8221;, your_data);</p>
<p>Then simply get it using</p>
<p>String desc = parse.getData().getMeta(&#8220;your_description&#8221;);</p>
<p>and you&#8217;re done.</p>
<p>Cheers</p>
]]></content:encoded>
	</item>
	<item>
		<title>Von: MilleB</title>
		<link>http://www.adick.at/2008/09/17/nutch-display-meta-description-in-search-results/comment-page-1/#comment-519</link>
		<dc:creator>MilleB</dc:creator>
		<pubDate>Sat, 28 Feb 2009 16:53:47 +0000</pubDate>
		<guid isPermaLink="false">http://www.dick.at/alex/wordpress/?p=36#comment-519</guid>
		<description>Interesting example, my problem is slightly different. 
Instead of looking to metatag, i need to run a regexp on the text to find the tags I&#039;m looking for.

I suppose I can modify the parser for that... so I get a list of tags
do you think I can use :
parse.getData().getContentMeta().set(META_DESCRIPTION_NAME, desc);       to pass them on to the indexer ?</description>
		<content:encoded><![CDATA[<p>Interesting example, my problem is slightly different.<br />
Instead of looking to metatag, i need to run a regexp on the text to find the tags I&#8217;m looking for.</p>
<p>I suppose I can modify the parser for that&#8230; so I get a list of tags<br />
do you think I can use :<br />
parse.getData().getContentMeta().set(META_DESCRIPTION_NAME, desc);       to pass them on to the indexer ?</p>
]]></content:encoded>
	</item>
	<item>
		<title>Von: Alex</title>
		<link>http://www.adick.at/2008/09/17/nutch-display-meta-description-in-search-results/comment-page-1/#comment-435</link>
		<dc:creator>Alex</dc:creator>
		<pubDate>Fri, 27 Feb 2009 07:58:54 +0000</pubDate>
		<guid isPermaLink="false">http://www.dick.at/alex/wordpress/?p=36#comment-435</guid>
		<description>Didn&#039;t check if it had really an impact in the result-listing, but thought it would look good :D</description>
		<content:encoded><![CDATA[<p>Didn&#8217;t check if it had really an impact in the result-listing, but thought it would look good <img src='http://www.adick.at/wp-includes/images/smilies/icon_biggrin.gif' alt=':D' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
	<item>
		<title>Von: Alex</title>
		<link>http://www.adick.at/2008/09/17/nutch-display-meta-description-in-search-results/comment-page-1/#comment-434</link>
		<dc:creator>Alex</dc:creator>
		<pubDate>Fri, 27 Feb 2009 07:57:03 +0000</pubDate>
		<guid isPermaLink="false">http://www.dick.at/alex/wordpress/?p=36#comment-434</guid>
		<description>Because I wanted search results _with_ a description being listed before those without a description.</description>
		<content:encoded><![CDATA[<p>Because I wanted search results _with_ a description being listed before those without a description.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

