Hi,
Check out my game
http://gotiti.com/greedyjump/
regards,
VK
On 12/14/2011 07:41 AM, Avni, Itamar wrote:
> Regarding (1) I'd suggest plugin-in your own additional implementation for HtmlParseFilter, where
> you can manipulate the content as you like, and set it back on the returned ParseResult.ParseText.
On 12/14/2011 02:11 AM, Hartl, Florian wrote:
> 2.
No. You would have to create and implementation for that or mount it over NFS
so you can access it as if it is an normal filesystem.
On Wednesday 14 December 2011 11:04:57 shashwat shriparv wrote:
> i have configure nutch and able to crawl the http addresses and now i need
> to know whether it is
i have configure nutch and able to crawl the http addresses and now i need
to know whether it is possible to crawl contents of hdfs file syste using
nutch.
Thanks and regards
--
Shashwat Shriparv
It's something else
NUTCH-1016 Strip UTF-8 non-character codepoints and add logging for SolrWriter
On Wednesday 14 December 2011 15:12:23 Lewis John Mcgibbney wrote:
> Hi Remi,
>
> This is a compatibility issue with conflicting versions of Solrj [1]
>
> [1]
> http://lucene.472066.n3.nabble.com/
Hi Remi,
This is a compatibility issue with conflicting versions of Solrj [1]
[1]
http://lucene.472066.n3.nabble.com/Invalid-version-or-the-data-in-not-in-javabin-format-td1460495.html
On Wed, Dec 14, 2011 at 1:57 PM, remi tassing wrote:
> Hello guys,
>
> After crawling with Nutch I tried push
Yes, 1.4 has some fixes for bad content. It strips away bad UTF-8 sequences.
On Wednesday 14 December 2011 14:57:40 remi tassing wrote:
> I'm using Nutch-1.2. Solr-3.4 & 3.5 don't work but 1.4 works well!
--
Markus Jelsma - CTO - Openindex
Hello guys,
After crawling with Nutch I tried pushing the index to Solr but it doesn't
work.
I'm using Nutch-1.2. Solr-3.4 & 3.5 don't work but 1.4 works well!
$ bin/nutch solrindex http://127.0.0.1:8983/solr/ crawl/crawldb
crawl/linkdb crawl/segments/*
SolrIndexer: starting at 2011-12-14 15:36:
what nutch version are you using?
On Wednesday 14 December 2011 12:29:48 Rafael Pappert wrote:
> Hey Markus,
>
> Nutch' log contains loads of errors like this:
>
> org.apache.solr.common.SolrException: Bad Request
>
> Bad Request
>
> request: http://hadoop0:8080/apache-solr-3.5.0/update?wt=jav
Hey Markus,
Nutch' log contains loads of errors like this:
org.apache.solr.common.SolrException: Bad Request
Bad Request
request: http://hadoop0:8080/apache-solr-3.5.0/update?wt=javabin&version=2
at
org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.j
We also need Nutch' log
On Wednesday 14 December 2011 12:14:15 Rafael Pappert wrote:
> Hello List,
>
> at indexing got errors like this:
>
> Dec 14, 2011 5:00:11 AM org.apache.solr.common.SolrException log
> SEVERE: java.lang.RuntimeException: [was class java.io.IOException] Invalid
> CRLF at
>
Hello List,
at indexing got errors like this:
Dec 14, 2011 5:00:11 AM org.apache.solr.common.SolrException log
SEVERE: java.lang.RuntimeException: [was class java.io.IOException] Invalid CRLF
at
com.ctc.wstx.util.ExceptionUtil.throwRuntimeException(ExceptionUtil.java:18)
at com.c
12 matches
Mail list logo