On Apr 6, 2009, at 10:16 AM, Fergus McMenemie wrote:
Hmmm,
Not sure how this all hangs together. But editing my solrconfig.xml
as follows
sorted the problem:-
<requestParsers enableRemoteStreaming="false"
multipartUploadLimitInKB="2048" />
to
<requestParsers enableRemoteStreaming="false"
multipartUploadLimitInKB="20048" />
We should document this on the wiki or in the config, if it isn't
already.
Also, my initial report of the issue was misled by the log messages.
The mention
of "oceania.pdf" refers to a previous successful tika extract. There
no mention
of the filename that was rejected in the logs or any information
that would help
me identify it!
We should fix this so it at least spits out a meaningful message. Can
you open a JIRA?
Regards Fergus.
Sorry if this is a FAQ; I suspect it could be. But how do I work
around the following:-
INFO: [] webapp=/apache-solr-1.4-dev path=/update/extract
params={ext.def.fl=text&ext.literal.id=factbook/reference_maps/pdf/
oceania.pdf} status=0 QTime=318
Apr 2, 2009 11:17:46 AM org.apache.solr.common.SolrException log
SEVERE: org.apache.commons.fileupload.FileUploadBase
$SizeLimitExceededException: the request was rejected because its
size (4585774) exceeds the configured maximum (2097152)
at org.apache.commons.fileupload.FileUploadBase
$FileItemIteratorImpl.<init>(FileUploadBase.java:914)
at
org
.apache
.commons
.fileupload.FileUploadBase.getItemIterator(FileUploadBase.java:331)
at
org
.apache
.commons.fileupload.FileUploadBase.parseRequest(FileUploadBase.java:
349)
at
org
.apache
.commons
.fileupload
.servlet.ServletFileUpload.parseRequest(ServletFileUpload.java:126)
at
org
.apache
.solr
.servlet
.MultipartRequestParser
.parseParamsAndFillStreams(SolrRequestParsers.java:343)
at
org
.apache
.solr
.servlet
.StandardRequestParser
.parseParamsAndFillStreams(SolrRequestParsers.java:396)
at
org
.apache
.solr.servlet.SolrRequestParsers.parse(SolrRequestParsers.java:114)
at
org
.apache
.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:
217)
at
org
.apache
.catalina
.core
.ApplicationFilterChain
.internalDoFilter(ApplicationFilterChain.java:202)
at
org
.apache
.catalina
.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:
173)
at
org
.apache
.catalina
.core.StandardWrapperValve.invoke(StandardWrapperValve.java:213)
at
org
.apache
.catalina
.core.StandardContextValve.invoke(StandardContextValve.java:178)
at
org
.apache
.catalina.core.StandardHostValve.invoke(StandardHostValve.java:126)
at
org
.apache
.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:105)
Although the PDF is big, it contains very little text; it is a map.
"java -jar solr/lib/tika-0.3.jar -g" appears to have no bother
with it.
Fergus...
--
===============================================================
Fergus McMenemie Email:fer...@twig.me.uk
Techmore Ltd Phone:(UK) 07721 376021
Unix/Mac/Intranets Analyst Programmer
===============================================================
--
===============================================================
Fergus McMenemie Email:fer...@twig.me.uk
Techmore Ltd Phone:(UK) 07721 376021
Unix/Mac/Intranets Analyst Programmer
===============================================================
--------------------------
Grant Ingersoll
http://www.lucidimagination.com/
Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids)
using Solr/Lucene:
http://www.lucidimagination.com/search