This is really great!
I agree with Hoss, keep the commit=false by default and let he client
control how commits work.
On a tab-delimited file I just got from a client, I got this error:
SEVERE: java.io.IOException: (line 119986) invalid char between
encapsualted token end delimiter
at org.apache.commons.csv.CSVParser.encapsulatedTokenLexer
(CSVParser.java:499)
This may just be a problem with the file, or perhaps I need to
specify an encoding (not quite sure what it is on that file, but it
doesn't appear to be UTF8 as TextEdit complained about it). The file
is brand new to me, and fairly large (~150MB). The command I'm using
to import is:
curl "http://localhost:8983/solr/update/csv?stream.file=/Users/erik/
Desktop/data.txt&separator=%
09&fieldnames=id,name_text,title_text,qty_display,price_display,config_d
isplay,category_facet"
I have another tab-delimited file to bring in, but only some of the
columns should be imported. Is it possible with this loader to skip
over columns in the data file not desired in Solr? Certainly I can
transform the file before loading, so its not a problem, just curious.
Thanks again for another great piece of capability in Solr. You all
are amazing.
Erik
On Mar 30, 2007, at 5:41 PM, Yonik Seeley wrote:
Any comments on the CSV parameters, while the paint is still fresh?
Specifically, what about the default of commit=true? Seems to make
sense for large CSV uploads, but not for small ones. Should it be
"false" for consistency with the XML update handler???
The docs also reference a currently non-existent page about different
ways to upload data (POST binary, stream.url, stream,file, etc...)
-Yonik
On 3/30/07, Apache Wiki <[EMAIL PROTECTED]> wrote:
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Solr Wiki"
for change notification.
The following page has been changed by YonikSeeley:
http://wiki.apache.org/solr/UpdateCSV