[jira] Created: (NUTCH-947) text.jsp does not compile on Apache Tomcat, and charset is not specified

2010-12-21 Thread Enrique Berlanga (JIRA)
text.jsp does not compile on Apache Tomcat, and charset is not specified


 Key: NUTCH-947
 URL: https://issues.apache.org/jira/browse/NUTCH-947
 Project: Nutch
  Issue Type: Bug
  Components: web gui
Affects Versions: 1.2
 Environment: Server version: Apache Tomcat/6.0.29
Server built:   July 19 2010 1458
Server number:  6.0.0.29
OS Name:Linux
OS Version: 2.6.18-128.7.1.el5
Architecture:   i386
JVM Version:1.6.0_22-b04
JVM Vendor: Sun Microsystems Inc.
Reporter: Enrique Berlanga
 Attachments: text-947.patch

If you add index-more plugin, view as plain text option is shown in search 
result, but if you access the jsp a compilation error is shown:
org.apache.jasper.JasperException: /text.jsp(60,29) Attribute value
details.getValue(url) is quoted with  which must be escaped when used
within the value

To solve this and the missing character encofing, i've generated the atached 
patch file.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Resolved: (NUTCH-939) Added -dir command line option to Indexer and SolrIndexer, allowing to specify directory containing segments

2010-12-21 Thread Andrzej Bialecki (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrzej Bialecki  resolved NUTCH-939.
-

Resolution: Fixed
  Assignee: Andrzej Bialecki 

I modified the patch slightly to allow more flexibility (you can mix individual 
segment names and the -dir options) as well as allowing segments placed on 
different filesystems. Committed in rev. 1051505. Thank you!

 Added -dir command line option to Indexer and SolrIndexer,  allowing to 
 specify directory containing segments
 -

 Key: NUTCH-939
 URL: https://issues.apache.org/jira/browse/NUTCH-939
 Project: Nutch
  Issue Type: Improvement
  Components: indexer
Affects Versions: 1.3
Reporter: Claudio Martella
Assignee: Andrzej Bialecki 
Priority: Minor
 Fix For: 1.3

 Attachments: Indexer.patch, SolrIndexer.patch


 The patches add -dir option, so the user can specify the directory in which 
 the segments are to be found. The actual mode is to specify the list of 
 segments, which is not very easy with hdfs. Also, the -dir option is already 
 implemented in LinkDB and SegmentMerger, for example.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Created: (NUTCH-948) Remove Lucene dependencies

2010-12-21 Thread Andrzej Bialecki (JIRA)
Remove Lucene dependencies
--

 Key: NUTCH-948
 URL: https://issues.apache.org/jira/browse/NUTCH-948
 Project: Nutch
  Issue Type: Improvement
Affects Versions: 1.3
Reporter: Andrzej Bialecki 
Assignee: Andrzej Bialecki 
 Fix For: 1.3


Branch-1.3 still has Lucene libs, but uses Lucene only in one place, namely it 
uses DateTools in index-basic. DateTools should be replaced with Solr's 
DateUtil, as we did in trunk, and then we can remove Lucene libs as a 
dependency.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Resolved: (NUTCH-948) Remove Lucene dependencies

2010-12-21 Thread Andrzej Bialecki (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrzej Bialecki  resolved NUTCH-948.
-

Resolution: Fixed

Committed in rev. 1051509.

 Remove Lucene dependencies
 --

 Key: NUTCH-948
 URL: https://issues.apache.org/jira/browse/NUTCH-948
 Project: Nutch
  Issue Type: Improvement
Affects Versions: 1.3
Reporter: Andrzej Bialecki 
Assignee: Andrzej Bialecki 
 Fix For: 1.3


 Branch-1.3 still has Lucene libs, but uses Lucene only in one place, namely 
 it uses DateTools in index-basic. DateTools should be replaced with Solr's 
 DateUtil, as we did in trunk, and then we can remove Lucene libs as a 
 dependency.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (NUTCH-939) Added -dir command line option to Indexer and SolrIndexer, allowing to specify directory containing segments

2010-12-21 Thread Claudio Martella (JIRA)

[ 
https://issues.apache.org/jira/browse/NUTCH-939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12973706#action_12973706
 ] 

Claudio Martella commented on NUTCH-939:


Great. what about the Indexer patch?

 Added -dir command line option to Indexer and SolrIndexer,  allowing to 
 specify directory containing segments
 -

 Key: NUTCH-939
 URL: https://issues.apache.org/jira/browse/NUTCH-939
 Project: Nutch
  Issue Type: Improvement
  Components: indexer
Affects Versions: 1.3
Reporter: Claudio Martella
Assignee: Andrzej Bialecki 
Priority: Minor
 Fix For: 1.3

 Attachments: Indexer.patch, SolrIndexer.patch


 The patches add -dir option, so the user can specify the directory in which 
 the segments are to be found. The actual mode is to specify the list of 
 segments, which is not very easy with hdfs. Also, the -dir option is already 
 implemented in LinkDB and SegmentMerger, for example.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (NUTCH-940) static field plugin

2010-12-21 Thread Claudio Martella (JIRA)

[ 
https://issues.apache.org/jira/browse/NUTCH-940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12973709#action_12973709
 ] 

Claudio Martella commented on NUTCH-940:


you mean nutchbase or branch-1.3?

 static field plugin
 ---

 Key: NUTCH-940
 URL: https://issues.apache.org/jira/browse/NUTCH-940
 Project: Nutch
  Issue Type: New Feature
  Components: indexer
Affects Versions: 1.3
Reporter: Claudio Martella
Priority: Minor
 Fix For: 1.3

 Attachments: static-field.tar.gz


 A simple plugin called at indexing that adds fields with static data. You can 
 specify a list of fieldname:fieldcontent per nutch job.
 It can be useful when collections can't be created by urlpatterns, like in 
 subcollection, but on a job-basis.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (NUTCH-940) static field plugin

2010-12-21 Thread Julien Nioche (JIRA)

[ 
https://issues.apache.org/jira/browse/NUTCH-940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12973754#action_12973754
 ] 

Julien Nioche commented on NUTCH-940:
-

nutchbase is not an active branch.  please diff against branch-1.3 and trunk 
thanks

 static field plugin
 ---

 Key: NUTCH-940
 URL: https://issues.apache.org/jira/browse/NUTCH-940
 Project: Nutch
  Issue Type: New Feature
  Components: indexer
Affects Versions: 1.3
Reporter: Claudio Martella
Priority: Minor
 Fix For: 1.3

 Attachments: static-field.tar.gz


 A simple plugin called at indexing that adds fields with static data. You can 
 specify a list of fieldname:fieldcontent per nutch job.
 It can be useful when collections can't be created by urlpatterns, like in 
 subcollection, but on a job-basis.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (NUTCH-939) Added -dir command line option to Indexer and SolrIndexer, allowing to specify directory containing segments

2010-12-21 Thread Andrzej Bialecki (JIRA)

[ 
https://issues.apache.org/jira/browse/NUTCH-939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12973915#action_12973915
 ] 

Andrzej Bialecki  commented on NUTCH-939:
-

1.2 release is out, and branch-1.2 is unlikely to result in a subsequent 
release - most users seem to be interested either in 1.3 or trunk.

 Added -dir command line option to Indexer and SolrIndexer,  allowing to 
 specify directory containing segments
 -

 Key: NUTCH-939
 URL: https://issues.apache.org/jira/browse/NUTCH-939
 Project: Nutch
  Issue Type: Improvement
  Components: indexer
Affects Versions: 1.3
Reporter: Claudio Martella
Assignee: Andrzej Bialecki 
Priority: Minor
 Fix For: 1.3

 Attachments: Indexer.patch, SolrIndexer.patch


 The patches add -dir option, so the user can specify the directory in which 
 the segments are to be found. The actual mode is to specify the list of 
 segments, which is not very easy with hdfs. Also, the -dir option is already 
 implemented in LinkDB and SegmentMerger, for example.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



Build failed in Hudson: Nutch-trunk #1344

2010-12-21 Thread Apache Hudson Server
See https://hudson.apache.org/hudson/job/Nutch-trunk/1344/

--
[...truncated 1007 lines...]
A src/plugin/subcollection/src/java/org/apache/nutch/collection
A 
src/plugin/subcollection/src/java/org/apache/nutch/collection/Subcollection.java
A 
src/plugin/subcollection/src/java/org/apache/nutch/collection/CollectionManager.java
A 
src/plugin/subcollection/src/java/org/apache/nutch/collection/package.html
A src/plugin/subcollection/src/java/org/apache/nutch/indexer
A 
src/plugin/subcollection/src/java/org/apache/nutch/indexer/subcollection
A 
src/plugin/subcollection/src/java/org/apache/nutch/indexer/subcollection/SubcollectionIndexingFilter.java
A src/plugin/subcollection/README.txt
A src/plugin/subcollection/plugin.xml
A src/plugin/subcollection/build.xml
A src/plugin/index-more
A src/plugin/index-more/ivy.xml
A src/plugin/index-more/src
A src/plugin/index-more/src/test
A src/plugin/index-more/src/test/org
A src/plugin/index-more/src/test/org/apache
A src/plugin/index-more/src/test/org/apache/nutch
A src/plugin/index-more/src/test/org/apache/nutch/indexer
A src/plugin/index-more/src/test/org/apache/nutch/indexer/more
A 
src/plugin/index-more/src/test/org/apache/nutch/indexer/more/TestMoreIndexingFilter.java
A src/plugin/index-more/src/java
A src/plugin/index-more/src/java/org
A src/plugin/index-more/src/java/org/apache
A src/plugin/index-more/src/java/org/apache/nutch
A src/plugin/index-more/src/java/org/apache/nutch/indexer
A src/plugin/index-more/src/java/org/apache/nutch/indexer/more
A 
src/plugin/index-more/src/java/org/apache/nutch/indexer/more/MoreIndexingFilter.java
A 
src/plugin/index-more/src/java/org/apache/nutch/indexer/more/package.html
A src/plugin/index-more/plugin.xml
A src/plugin/index-more/build.xml
AUsrc/plugin/plugin.dtd
A src/plugin/parse-ext
A src/plugin/parse-ext/ivy.xml
A src/plugin/parse-ext/src
A src/plugin/parse-ext/src/test
A src/plugin/parse-ext/src/test/org
A src/plugin/parse-ext/src/test/org/apache
A src/plugin/parse-ext/src/test/org/apache/nutch
A src/plugin/parse-ext/src/test/org/apache/nutch/parse
A src/plugin/parse-ext/src/test/org/apache/nutch/parse/ext
A 
src/plugin/parse-ext/src/test/org/apache/nutch/parse/ext/TestExtParser.java
A src/plugin/parse-ext/src/java
A src/plugin/parse-ext/src/java/org
A src/plugin/parse-ext/src/java/org/apache
A src/plugin/parse-ext/src/java/org/apache/nutch
A src/plugin/parse-ext/src/java/org/apache/nutch/parse
A src/plugin/parse-ext/src/java/org/apache/nutch/parse/ext
A 
src/plugin/parse-ext/src/java/org/apache/nutch/parse/ext/ExtParser.java
A src/plugin/parse-ext/plugin.xml
A src/plugin/parse-ext/build.xml
A src/plugin/parse-ext/command
A src/plugin/urlnormalizer-pass
A src/plugin/urlnormalizer-pass/ivy.xml
A src/plugin/urlnormalizer-pass/src
A src/plugin/urlnormalizer-pass/src/test
A src/plugin/urlnormalizer-pass/src/test/org
A src/plugin/urlnormalizer-pass/src/test/org/apache
A src/plugin/urlnormalizer-pass/src/test/org/apache/nutch
A src/plugin/urlnormalizer-pass/src/test/org/apache/nutch/net
A 
src/plugin/urlnormalizer-pass/src/test/org/apache/nutch/net/urlnormalizer
A 
src/plugin/urlnormalizer-pass/src/test/org/apache/nutch/net/urlnormalizer/pass
AU
src/plugin/urlnormalizer-pass/src/test/org/apache/nutch/net/urlnormalizer/pass/TestPassURLNormalizer.java
A src/plugin/urlnormalizer-pass/src/java
A src/plugin/urlnormalizer-pass/src/java/org
A src/plugin/urlnormalizer-pass/src/java/org/apache
A src/plugin/urlnormalizer-pass/src/java/org/apache/nutch
A src/plugin/urlnormalizer-pass/src/java/org/apache/nutch/net
A 
src/plugin/urlnormalizer-pass/src/java/org/apache/nutch/net/urlnormalizer
A 
src/plugin/urlnormalizer-pass/src/java/org/apache/nutch/net/urlnormalizer/pass
AU
src/plugin/urlnormalizer-pass/src/java/org/apache/nutch/net/urlnormalizer/pass/PassURLNormalizer.java
AUsrc/plugin/urlnormalizer-pass/plugin.xml
AUsrc/plugin/urlnormalizer-pass/build.xml
A src/plugin/parse-html
A src/plugin/parse-html/ivy.xml
A src/plugin/parse-html/lib
A src/plugin/parse-html/lib/tagsoup.LICENSE.txt
A src/plugin/parse-html/src
A src/plugin/parse-html/src/test
A src/plugin/parse-html/src/test/org
A src/plugin/parse-html/src/test/org/apache
A src/plugin/parse-html/src/test/org/apache/nutch
A src/plugin/parse-html/src/test/org/apache/nutch/parse
A