This thread got me thinking a bit...
Does SOLR support the concept of partial updates to documents? By this I
mean updating a subset of fields in a document that already exists in the
index, and without having to resubmit the entire document.
An example would be storing/indexing user tags
Hi Chris, thanks for looking at this.
I'm using Solr 1.4.0 including the Tika that's in the tgz file which
means Tika 0.4.
I've now discovered that only two letters are required. A single line
with XE will crash it.
This fails:
r...@gamma:/home/ross# hexdump -C test.txt
58 45 0a
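For anyone wanting to reproduce this, a minimal sketch that recreates the three-byte file from the hexdump above. The file name test.txt comes from the message; the helper names are hypothetical and only for illustration.

```python
from pathlib import Path

# Bytes shown in the hexdump above: 'X' (0x58), 'E' (0x45), newline (0x0a).
TEST_BYTES = bytes([0x58, 0x45, 0x0A])


def write_test_file(path: Path) -> None:
    """Write the three-byte test case to `path` (hypothetical helper)."""
    path.write_bytes(TEST_BYTES)


def hex_line(data: bytes) -> str:
    """Format bytes as space-separated hex, like the data column of hexdump -C."""
    return data.hex(" ")


if __name__ == "__main__":
    write_test_file(Path("test.txt"))
    print(hex_line(Path("test.txt").read_bytes()))
```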
a bit unfortunate because a similar word (a person's name ??BLE )
with the same problem appears frequently in upper case near the top of
my files.
Cheers
Ross
On Sun, Mar 21, 2010 at 12:58 PM, Ross tetr...@gmail.com wrote:
Hi all
I'm trying to import some text files. I'm mostly following Avi
than plain text. It just adds an html header and wraps everything in a
pre tag. That seems to keep Solr happy.
Ross
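A rough sketch of the workaround described above (add an html header and wrap everything in a pre tag), assuming the input files are UTF-8 plain text. The function names and the default title are made up for illustration; this is not the script actually used in the thread.

```python
import html
from pathlib import Path


def wrap_text_as_html(text: str, title: str = "document") -> str:
    """Wrap plain text in a minimal HTML page with the body inside <pre>."""
    # Escape &, < and > so the text cannot be mistaken for markup.
    escaped = html.escape(text, quote=False)
    return (
        "<html><head><title>" + html.escape(title, quote=False) + "</title></head>\n"
        "<body><pre>\n" + escaped + "</pre></body></html>\n"
    )


def convert_file(src: Path, dst: Path) -> None:
    """Read a plain-text file and write the HTML-wrapped version."""
    text = src.read_text(encoding="utf-8")
    dst.write_text(wrap_text_as_html(text), encoding="utf-8")
```

The wrapped file can then be submitted in place of the plain-text original.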
On Mon, Mar 22, 2010 at 9:08 AM, György Frivolt
gyorgy.friv...@gmail.com wrote:
Hi,
I had problem with indexing documents some months ago as well. I found
that there were XML
have anything to do
with the scanning. I'm guessing but it seems that something in the
Tomcat / Solr / Tika implementation tries to load it all into memory
at once.
pdftotext (part of http://www.foolabs.com/xpdf/download.html ) seems
to do it nicely and processes small chunks at a time.
Ross
On Mon
then it works. In my real
files, the XX is something else but the result is the same. It's a
common word in the files. I guess for this quick and dirty job I'm
doing I could do a bulk replace in the files to make it lower case.
Is there any workaround for this?
Thanks
Ross
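The bulk replace mentioned above could look something like this. It is only a sketch: it assumes UTF-8 text files with a .txt extension, rewrites them in place, and uses hypothetical function names. It lower-cases whole-word, upper-case occurrences of one word and leaves everything else alone.

```python
import re
from pathlib import Path


def lowercase_word(text: str, word: str) -> str:
    """Replace whole-word, upper-case occurrences of `word` with lower case."""
    pattern = re.compile(r"\b" + re.escape(word.upper()) + r"\b")
    return pattern.sub(word.lower(), text)


def lowercase_word_in_files(directory: Path, word: str, glob: str = "*.txt") -> int:
    """Rewrite matching files in place; return how many files changed."""
    changed = 0
    for path in sorted(directory.glob(glob)):
        original = path.read_text(encoding="utf-8")
        updated = lowercase_word(original, word)
        if updated != original:
            path.write_text(updated, encoding="utf-8")
            changed += 1
    return changed
```

Since the files are modified in place, it is safer to run this on a copy of the data first.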
<html><head><title>Apache Tomcat
editing and moving my unique word
around, it seems to stop working if that word is beyond the 100 KB
point in the file. I think another file earlier gave a different
result.
Hopefully I'm missing something obvious.
Thanks for any help.
Ross
Thanks Erick.
That was it. All looking good now.
Cheers
Ross
On Sat, Mar 20, 2010 at 9:29 PM, Erick Erickson erickerick...@gmail.com wrote:
Does your solrconfig file have a line like...
<maxFieldLength>1</maxFieldLength>
?
Try upping the 1...
HTH
Erick
On Sat, Mar 20, 2010 at 8
display correctly on my terminal set to UTF-8 but
they get screwed up when indexed.
I gave up in the end and went back to putting it urlencoded in the url.
Ross
-Original Message-
From: Shalin Shekhar Mangar [mailto:shalinman...@gmail.com]
Sent: Monday, January 04, 2010 4:28 AM
To: solr
for production use but
this is initially to get some data indexed for test and demo.
Thanks
Ross
is commented out but they all still
work. wt=php etc returns the php format. How is it working if they're
not defined? Are they defined elsewhere?
Thanks
Ross
. :-)
Ross
From: Chris Hostetter [mailto:hossman_luc...@fucit.org]
Sent: Tue 2/3/2009 2:18 PM
To: solr-user@lucene.apache.org
Subject: Re: Unsubscribing
: Subject: Unsubscribing
:
: I've tried multiple times to unsubscribe from this list using the proper
I've tried multiple times to unsubscribe from this list using the proper method
(mailto:solr-user-unsubscr...@lucene.apache.org), but it's not working! Can
anyone help with that?
Fantastic, thanks for your efforts Chris, that was in fact the
problem, I removed all trace of those files, and it works just fine!!!
Now I just need to figure out why my system was still displaying
these symptoms despite updates being done...
once again thanks for your help,
Ross.
On 15