I filed a Jira on this as well back in May:
https://issues.apache.org/jira/browse/SOLR-4864
-- Jack Krupansky
-Original Message-
From: Ken Krugler
Sent: Tuesday, October 15, 2013 10:34 PM
To: solr-user@lucene.apache.org
Cc: Chris Hostetter
Subject: RegexReplaceProcessorFactory
if used in moderation. Millions would not be
moderation.
-- Jack Krupansky
-Original Message-
From: Lee Carroll
Sent: Sunday, October 13, 2013 8:35 AM
To: solr-user@lucene.apache.org
Subject: Re: Storing 2 dimension array in Solr
I think he means a doc for each element. so you have
enough, depending on the rest of your requirements.
-- Jack Krupansky
-Original Message-
From: Mysurf Mail
Sent: Sunday, October 13, 2013 5:32 AM
To: solr-user@lucene.apache.org
Subject: Adding field to solr dynamically
My database model is designed using dynamic attributes (Entity
- no document for a disease if
it is not present for that group.
-- Jack Krupansky
-Original Message-
From: David Philip
Sent: Saturday, October 12, 2013 9:56 PM
To: solr-user@lucene.apache.org
Subject: Re: Storing 2 dimension array in Solr
Hi Erick, Yes it is. But the columns here
There is this note for escape: If an escape is specified, the encapsulator
is not used unless also explicitly specified since most formats use either
encapsulation or escaping, not both.
-- Jack Krupansky
-Original Message-
From: Utkarsh Sengar
Sent: Friday, October 11, 2013 4:35 PM
analytics features, but is still somewhat light
in that area.
It does sound as if you are seeking to match two documents as opposed to
matching a query against documents.
-- Jack Krupansky
-Original Message-
From: shahzad73
Sent: Thursday, October 10, 2013 3:03 PM
To: solr-user
,features
doc-1,doc1,feat1:feat2'
You may need to add stream.contentType=text/csv to you command.
-- Jack Krupansky
-Original Message-
From: Utkarsh Sengar
Sent: Thursday, October 10, 2013 4:51 PM
To: solr-user@lucene.apache.org
Subject: Using split in updateCSV for SolrCloud 4.4
Hello,
I
,junk_s
doc-1,doc1,feat1:feat2,junk
Can you confirm whether this exact case works for you?
-- Jack Krupansky
-Original Message-
From: Utkarsh Sengar
Sent: Thursday, October 10, 2013 8:48 PM
To: solr-user@lucene.apache.org
Subject: Re: Using split in updateCSV for SolrCloud 4.4
@Jack I just
Why use regular expressions at all?
Try:
published OR deprecated
-- Jack Krupansky
-Original Message-
From: Dinusha Dilrukshi
Sent: Tuesday, October 08, 2013 3:32 AM
To: solr-user@lucene.apache.org
Subject: Regex to match one of two words
I have an input that can have only 2 values
I'd suggest that each of your source document sections would be a distinct
solr document. All of the sections could have a source document ID field
to tie them together.
Dynamic fields work best when used in moderation. Your use case seems like
an excessive use of dynamic fields.
-- Jack
The default query operator applies only within a single query parameter. If
you want to OR two filter queries, you must combine them into one filter
query parameter.
-- Jack Krupansky
-Original Message-
From: PeterKerk
Sent: Monday, October 07, 2013 1:08 PM
To: solr-user
Combine the two filter queries with an explicit OR operator.
-- Jack Krupansky
-Original Message-
From: PeterKerk
Sent: Monday, October 07, 2013 1:50 PM
To: solr-user@lucene.apache.org
Subject: Re: Adding OR operator in querystring and grouping fields?
Ok thanks.
you must combine them
field.
-- Jack Krupansky
-Original Message-
From: Ertio Lew
Sent: Sunday, October 06, 2013 4:46 AM
To: solr-user@lucene.apache.org
Subject: Can I use app specific document id as the document id that Solr
uses for internal purposes?
Could I just use my application specific document id
require it to be an integer? I mean, isn't a numeric
string good enough for your app?
-- Jack Krupansky
-Original Message-
From: Ertio Lew
Sent: Sunday, October 06, 2013 1:36 PM
To: solr-user@lucene.apache.org
Subject: Re: Can I use app specific document id as the document id that Solr
uses
in Java as a plugin that decoded object
parameters.
-- Jack Krupansky
-Original Message-
From: Alok Bhandari
Sent: Friday, October 04, 2013 6:39 AM
To: solr-user@lucene.apache.org
Subject: Can I pass some Object as request parameter to solr server
Hello ,
I am using solr 4.0 , I want
in
Solr which only support string IDs.
-- Jack Krupansky
-Original Message-
From: Zahoor Mohamed
Sent: Friday, October 04, 2013 4:51 AM
To: solr-user@lucene.apache.org
Subject: Size of ID field
Hi
Does the size of ID field matter .. in terms of memory usage...and query
performance...
i.e
, but there is no such feature in Lucene
at this time. FunctionQuery modifies the document score, but doesn't affect
which documents are selected.
Function queries can be used to modify document scores and to return values,
but not in the query itself to select documents.
-- Jack Krupansky
-Original Message
to keep only some token types.
Besides my book, the best reference is going to be... the source code.
-- Jack Krupansky
-Original Message-
From: Ken Krugler
Sent: Thursday, October 03, 2013 9:03 PM
To: solr-user@lucene.apache.org
Subject: WikipediaTokenizer documentation
Hi all
No plan that I know of, but there is a new Lucene expression module, so
maybe it is not so farfetched. Its performance might not be so great, but if
you need the flexibility it might be worth it.
-- Jack Krupansky
-Original Message-
From: Sandro Zbinden
Sent: Friday, October 04
What are your specific requirements?
-- Jack Krupansky
-Original Message-
From: Ankit Kumar
Sent: Thursday, October 03, 2013 3:36 AM
To: solr-user@lucene.apache.org
Subject: Adding Custom Score
In Lucene i need to add custom score using function query classes . How to
do it?
Use square brackets (array) around a list of documents:
[{id:change.me,title:change.me},{id:change2.me,title:change.me2}]
-- Jack Krupansky
-Original Message-
From: Dennis Brundage
Sent: Wednesday, October 02, 2013 11:46 AM
To: solr-user@lucene.apache.org
Subject: Submitting Multiple
not support any
correspondence between multivalued fields.
You must flatten your data your data to achieve any correspondence.
Multivalued field are a powerful feature of Solr, but you must be extremely
careful to use them only in moderation.
-- Jack Krupansky
-Original Message-
From
name=defaults
bool name=tvtrue/bool
/lst
arr name=last-components
strtvComponent/str
/arr
/requestHandler
You can add that last-components list to your default handler, if you
wish.
I have more detailed examples in my e-book.
-- Jack Krupansky
indicated which fl fields did not
have term vectors.
As a general proposition it didn't work is an extremely unhelpful
response - it gives us no clues as to what you are actually seeing.
-- Jack Krupansky
-Original Message-
From: alibozorgkhan
Sent: Friday, September 27, 2013 3:41
You are using components instead of last-components, so you have to all
search components, including the QueryComponent. Better to use
last-components.
-- Jack Krupansky
-Original Message-
From: Shawn Heisey
Sent: Friday, September 27, 2013 4:02 PM
To: solr-user@lucene.apache.org
Escape any special characters in field names with backslash.
-- Jack Krupansky
-Original Message-
From: Tanu Garg
Sent: Wednesday, September 25, 2013 8:45 AM
To: solr-user@lucene.apache.org
Subject: Re: Sum function causing error in solr
Thanks erick
I know we should name variable
that second approach by simply using the edismax query
parser with the pf parameter to boost the phrase of all terms.
-- Jack Krupansky
-Original Message-
From: PAVAN
Sent: Wednesday, September 25, 2013 9:12 AM
To: solr-user@lucene.apache.org
Subject: AND/OR logic
Hi,
I have situation
Use the char filter instead:
http://lucene.apache.org/core/4_4_0/analyzers-common/org/apache/lucene/analysis/pattern/PatternReplaceCharFilterFactory.html
-- Jack Krupansky
-Original Message-
From: Greg Preston
Sent: Wednesday, September 25, 2013 5:43 PM
To: solr-user@lucene.apache.org
as to your precise requirement.
-- Jack Krupansky
-Original Message-
From: asuka
Sent: Tuesday, September 24, 2013 11:08 AM
To: solr-user@lucene.apache.org
Subject: Re: Get only those documents that are fully satisfied.
Hi Andre,
I don't want to get documents that fit my whole query, I
It all depends on your query parameters and schema field type analyzers, of
which you have told us nothing.
-- Jack Krupansky
-Original Message-
From: asuka
Sent: Monday, September 23, 2013 7:57 AM
To: solr-user@lucene.apache.org
Subject: Get only those documents that are fully
of
Solr to use and precisely how to use them.
The word delimiter filter and edge n-gram filter are possible tools to use
in such cases.
-- Jack Krupansky
-Original Message-
From: Mysurf Mail
Sent: Monday, September 23, 2013 3:34 AM
To: solr-user@lucene.apache.org
Subject: solr
(by examining the log) whether Solr was able to
successfully load your schema changes without errors?
-- Jack Krupansky
-Original Message-
From: Kishan Parmar
Sent: Sunday, September 22, 2013 9:56 AM
To: solr-user@lucene.apache.org
Subject: Xml file is not inserting from code java -jar post.jar
Set:
termVectors=true
termPositions=true
termOffsets=true
And use the fast vector highlighter.
-- Jack Krupansky
-Original Message-
From: Nalini Kartha
Sent: Friday, September 20, 2013 7:34 PM
To: solr-user@lucene.apache.org
Subject: Getting term offsets from Solr
Hi,
We're
, but having better integration with the Solr Admin UI
is a good idea.
-- Jack Krupansky
-Original Message-
From: Gareth Poulton
Sent: Thursday, September 19, 2013 7:50 AM
To: solr-user@lucene.apache.org
Subject: Migrating from Endeca
Hi,
A customer wants us to move their entire enterprise
There is no such fieldType attribute as subSuffix. Solr is just
complaining about extraneous, junk attributes. Delete the crap.
-- Jack Krupansky
-Original Message-
From: jimmy nguyen
Sent: Thursday, September 19, 2013 12:43 PM
To: solr-user@lucene.apache.org
Subject: Indexing
Is /solr/update working?
Check solrconfig to see that /update/extract is configured as in the
standard Solr example.
Does /solr/update/extract work for you using the standard Solr example?
-- Jack Krupansky
-Original Message-
From: Nutan
Sent: Sunday, September 15, 2013 2:37 AM
database, but
instead is a batch-oriented search engine with eventual consistency. Focus
on exploiting Solr's strengths, not trying to treat Solr as something that
it is not.
-- Jack Krupansky
-Original Message-
From: Phani Chaitanya
Sent: Saturday, September 14, 2013 7:36 PM
To: solr
filter should work fine.
-- Jack Krupansky
-Original Message-
From: Prasi S
Sent: Friday, September 13, 2013 3:56 AM
To: solr-user@lucene.apache.org
Subject: Escaping *, ? in Solr
Hi,
I want to do regex search in solr.
E.g: Googl* . In my query api, i have used
Wildcard applies only to a single term. The escaped space suggests that you
are trying to match a wildcard on multiple terms.
Try the contrib complex phrase query parser.
-- Jack Krupansky
-Original Message-
From: Prasi S
Sent: Friday, September 13, 2013 6:37 AM
To: solr-user
I don't have any additional questions, and won't, until you are able to
supply the information requested in my previous response.
-- Jack Krupansky
-Original Message-
From: Kuchekar
Sent: Friday, September 13, 2013 1:46 PM
To: solr-user@lucene.apache.org
Subject: Re: Different
Okay, it's hot off the e-presses: my updated book Solr 4.x Deep Dive, Early
Access Release #7 is now available for purchase and download as an e-book
for $9.99 on Lulu.com at:
http://www.lulu.com/shop/jack-krupansky/solr-4x-deep-dive-early-access-release-1/ebook/product-21120181.html
Question mark and asterisk are wildcard characters, so if you want them to
be treated as punctuation, either enclose the terms in quotes or escape the
characters.
Wildcard characters suppress the execution of some token filters if they are
not able to cope with wildcards.
-- Jack Krupansky
Slow down, back up, and now tell us what problem (if any!) you are really
trying to solve. Don't leap to a proposed solution before you clearly state
the problem to be solved.
First, why do you think there is any problem at all?
Or, what are you really trying to achieve?
-- Jack Krupansky
? You still haven't
articulated it.
It sounds as if you are trying to solve a non-problem. But, we can't be sure
since you haven't articulated what the actual problem (if any) really is.
-- Jack Krupansky
-Original Message-
From: phanichaitanya
Sent: Thursday, September 12, 2013 1:42
Yes, the document will be searchable after it is committed.
Although you can also do auto commits and commitWithin which do not
guarantee immediate visibility of index changes, you can do a hard commit
any time you want to make a document searchable.
-- Jack Krupansky
-Original Message
e-book has a lot better examples, especially for the field
redirection aspect.
-- Jack Krupansky
-Original Message-
From: maephisto
Sent: Wednesday, September 11, 2013 8:33 AM
To: solr-user@lucene.apache.org
Subject: Re: Dynamic analizer settings change
Thanks, Erik!
I might have
Do a copyField to another field, with a limit of 8 characters, and then use
that other field.
-- Jack Krupansky
-Original Message-
From: Ken Krugler
Sent: Wednesday, September 11, 2013 8:24 PM
To: solr-user@lucene.apache.org
Subject: Grouping by field substring?
Hi all,
Assuming I
.
-- Jack Krupansky
-Original Message-
From: Andreas Owen
Sent: Tuesday, September 10, 2013 7:07 AM
To: solr-user@lucene.apache.org
Subject: Re: charfilter doesn't do anything
ok i am getting there now but if there are newlines involved the regex stops
as soon as it reaches a \r\n
mean
exclude documents that contain the following term.
Or, just escape any leading hyphen with a backslash.
-- Jack Krupansky
-Original Message-
From: Prathik Puthran
Sent: Tuesday, September 10, 2013 11:47 AM
To: d...@lucene.apache.org ; solr-user@lucene.apache.org
Subject: Re: Help
Is your df parameter also set to PackageName? The fl parameter just sets the
fields to return, not which fields are queried.
-- Jack Krupansky
-Original Message-
From: Mysurf Mail
Sent: Tuesday, September 10, 2013 9:25 AM
To: solr-user@lucene.apache.org
Subject: Solr doesnt return
Removing stray hyphens (embedded hyphens, like CD-ROM, are okay) or
escaping them with backslash looks like your best bests. There's no query
parser option to disable the hyphen as an exlusion operator, although an
upgrade to a modern Solr should fix the problem.
-- Jack Krupansky
Did you in fact try my suggested example? If not, please do so.
-- Jack Krupansky
-Original Message-
From: Andreas Owen
Sent: Monday, September 09, 2013 4:42 PM
To: solr-user@lucene.apache.org
Subject: Re: charfilter doesn't do anything
i index html pages with a lot of lines
Did you at least try the pattern I gave you?
The point of the curl was the data, not how you send the data. You can just
use the standard Solr simple post tool.
-- Jack Krupansky
-Original Message-
From: Andreas Owen
Sent: Monday, September 09, 2013 6:40 PM
To: solr-user
Use XML then. Although you will need to escape the XML special characters as
I did in the pattern.
The point is simply: Quickly and simply try to find the simple test scenario
that illustrates the problem.
-- Jack Krupansky
-Original Message-
From: Andreas Owen
Sent: Monday
;
Shows nothing, HTML tag stripped
In your original query, you didn't show us what your default field, df
parameter, was.
-- Jack Krupansky
-Original Message-
From: Andreas Owen
Sent: Sunday, September 08, 2013 5:21 AM
To: solr-user@lucene.apache.org
Subject: Re: charfilter doesn't do
2. Flatten your data.
3. Use dynamic and multivalued fields only in moderation.
1. First, tell us how your application intends to use and query your data.
That will be a guide to how your data should be stored.
-- Jack Krupansky
-Original Message-
From: anurag.jain
Sent: Sunday
For the second question, there is no multiline mode - the ends of lines are
just white space characters. IOW, it is implicitly multi-line.
-- Jack Krupansky
-Original Message-
From: Andreas Owen
Sent: Thursday, September 05, 2013 12:03 PM
To: solr-user@lucene.apache.org
Subject
a field contain or match a scalar value.
-- Jack Krupansky
-Original Message-
From: A Geek
Sent: Friday, September 06, 2013 7:10 AM
To: solr user
Subject: Store 2 dimensional array( of int values) in solr 4.0
hi All, I'm trying to store a 2 dimensional array in SOLR [version 4.0
Explain what you mean by restring duplicate file indexing. Solr doesn't work
at the file level - only documents (rows or records) and fields and
values.
-- Jack Krupansky
-Original Message-
From: shabbir
Sent: Friday, September 06, 2013 12:24 AM
To: solr-user@lucene.apache.org
and a sample of your actual Solr
input (Solr XML or JSON?) so that we can see what fields are being
populated.
-- Jack Krupansky
-Original Message-
From: Andreas Owen
Sent: Friday, September 06, 2013 4:01 AM
To: solr-user@lucene.apache.org
Subject: Re: charfilter doesn't do anything
Facet counts are per field - your counts are scattered across different
fields.
There are additional capabilities in the facet component, but first you
should describe exactly what your requirements are.
-- Jack Krupansky
-Original Message-
From: Raheel Hasan
Sent: Friday
You still haven't supplied any queries.
If all you really need is the JSON as a blob, simply store it as a string
and parse the JSON in your application layer.
-- Jack Krupansky
-Original Message-
From: A Geek
Sent: Friday, September 06, 2013 10:30 AM
To: solr user
Subject: RE
You're not being clear here - are the commas delimiting fields or do you
have one value per row?
Yes, you can tokenize a comma-delimited value in Solr.
-- Jack Krupansky
-Original Message-
From: Raheel Hasan
Sent: Friday, September 06, 2013 11:54 AM
To: solr-user@lucene.apache.org
://cwiki.apache.org/confluence/display/solr/Result+Grouping
-- Jack Krupansky
-Original Message-
From: Sai Gadde
Sent: Thursday, September 05, 2013 2:27 AM
To: solr-user@lucene.apache.org
Subject: Tweaking boosts for more search results variety
Our index is aggregated content from various
I just tried commitWithin with the standard Solr example in Solr 4.4 and it
works fine.
Can you reproduce your problem using the standard Solr example in Solr 4.4?
-- Jack Krupansky
From: Ryan, Brent
Sent: Thursday, September 05, 2013 10:39 AM
To: solr-user@lucene.apache.org
Subject: JSON
And show us an input string and a query that fail.
-- Jack Krupansky
-Original Message-
From: Shawn Heisey
Sent: Thursday, September 05, 2013 2:41 PM
To: solr-user@lucene.apache.org
Subject: Re: charfilter doesn't do anything
On 9/5/2013 10:03 AM, Andreas Owen wrote:
i would like
Do you have stop word filtering enabled? What does your field type look
like?
If stop words are ignored, you will get exactly the behavior you described.
-- Jack Krupansky
-Original Message-
From: Alistair Young
Sent: Wednesday, September 04, 2013 6:57 AM
To: solr-user
Did you restart Solr after editing config and schema?
-- Jack Krupansky
-Original Message-
From: Nutan
Sent: Wednesday, September 04, 2013 3:07 AM
To: solr-user@lucene.apache.org
Subject: unknown _stream_source_info while indexing rich doc in solr
i am using solr4.2 on windows7
my
Sounds like it would be better for you to preprocess the query in your
application layer. Your requirements seem too open-ended to wire into
Solr.
But, to be sure, please elaborate exactly what sort of variations you need
in query analysis.
-- Jack Krupansky
-Original Message
and how
dev' gets analyzed at query time.
-- Jack Krupansky
-Original Message-
From: devendra W
Sent: Tuesday, September 03, 2013 5:59 AM
To: solr-user@lucene.apache.org
Subject: Re: Apostrophes in fields
in my case - the fields with apostrophe not returned in results
When I search
then get comparable structure to the
generated queries.
-- Jack Krupansky
-Original Message-
From: Naomi Dushay
Sent: Tuesday, September 03, 2013 8:54 PM
To: solr-user@lucene.apache.org
Subject: mm, tie, qs, ps and CJKBigramFilter and edismax and dismax
When I have a field using
from one
round-robin iteration will necessarily be the same on a repeat of the same
distributed query.
The bottom line is: What exactly are you after, simply an explanation for
what you are seeing, or a guarantee that you will always see that behavior?
-- Jack Krupansky
-Original Message
Okay, it's hot off the e-presses: my updated book Solr 4.x Deep Dive, Early
Access Release #6 is now available for purchase and download as an e-book
for $9.99 on Lulu.com at:
http://www.lulu.com/shop/jack-krupansky/solr-4x-deep-dive-early-access-release-1/ebook/product-21120181.html
See:
https://wiki.apache.org/solr/Unsubscribing%20from%20mailing%20lists
-- Jack Krupansky
-Original Message-
From: veena rani
Sent: Thursday, August 29, 2013 12:18 AM
To: solr-user@lucene.apache.org
Subject: Re: why does a node switch state ?
Kindly stop me from solr mail chain
init failure usually means you had a bad configuration parameter. You need
to look for the last caused by in the stack trace and that should tell you
what the parameter problem was.
-- Jack Krupansky
-Original Message-
From: Brian Robinson
Sent: Thursday, August 29, 2013 10:43 AM
You probably just need to escape the backslashes with a backslash -
otherwise the query parser will treat your backslashes as escapes and remove
them. This is not unlike placing a regex in a Java string literal.
-- Jack Krupansky
-Original Message-
From: Hugh Cayless
Sent: Thursday
require a THIRD
level of escaping (four backslashes for each backslash to be used in a
regex), such as if you were using SolrJ.
-- Jack Krupansky
-Original Message-
From: Hugh Cayless
Sent: Thursday, August 29, 2013 12:49 PM
To: solr-user@lucene.apache.org
Subject: Re: regex constructs
I have additional examples in the two most recent early access releases of
my book - variations on using the existing update processors.
-- Jack Krupansky
-Original Message-
From: Federico Chiacchiaretta
Sent: Tuesday, August 27, 2013 8:39 AM
To: solr-user@lucene.apache.org
Subject
clean data into Solr.
I suspect this field is a dynamic field (a dynamicField element with the
pattern *_datetime). Nothing wrong with that - just make sure you only
populate the field with valid date data.
-- Jack Krupansky
-Original Message-
From: Nikesh12
Sent: Tuesday, August 27
throughput.
+ str name=syncLevelflush|fsync|none/str
+ --
/updateLog
-- Jack Krupansky
-Original Message-
From: Erick Erickson
Sent: Tuesday, August 27, 2013 11:54 AM
To: solr-user@lucene.apache.org
Subject: Re: Transaction log on-disk guarantees
Here's a blog I
And here I was just about to give Mark credit for updating the wiki!
-- Jack Krupansky
-Original Message-
From: Erick Erickson
Sent: Tuesday, August 27, 2013 4:24 PM
To: solr-user@lucene.apache.org
Subject: Re: Transaction log on-disk guarantees
Well, when you originally googled
No, it is part of the core admin API.
-- Jack Krupansky
-Original Message-
From: smanad
Sent: Monday, August 26, 2013 10:02 AM
To: solr-user@lucene.apache.org
Subject: Re: custom names for replicas in solrcloud
Is coreNodeName exposed via collections api?
--
View this message
Unfortunately, there is no -Dcore property, so you have to due -Durl -
java -Durl=http://localhost:8983/solr/collection2/update ... -jar post.jar
...
You have the proper /select syntax.
-- Jack Krupansky
-Original Message-
From: Bruno Mannina
Sent: Monday, August 26, 2013 9:36 AM
imprecisely.
-- Jack Krupansky
-Original Message-
From: smanad
Sent: Monday, August 26, 2013 8:50 PM
To: solr-user@lucene.apache.org
Subject: Re: Default query operator OR wont work in some cases
here is keywords field for 3 docs,
Simply Asia products,Simply Asia,Sesame Chicken Egg Drop Soup
Yeah, sorry, I read the parsed query too quickly - the phrase is the
optional relevancy boost due to the pf2 parameter.
-- Jack Krupansky
-Original Message-
From: smanad
Sent: Monday, August 26, 2013 10:08 PM
To: solr-user@lucene.apache.org
Subject: Re: Default query operator OR wont
You can get the term frequency (per document) for a term using the
termfreq() function query in the fl parameter:
fl=*,termfreq(field,'term')
-- Jack Krupansky
-Original Message-
From: JZ
Sent: Friday, August 23, 2013 7:43 AM
To: solr-user@lucene.apache.org
Subject: Query term count
Your data file appears to use spaces rather than tabs.
-- Jack Krupansky
From: Rob Koeling Ai
Sent: Friday, August 23, 2013 6:38 AM
To: solr-user@lucene.apache.org
Subject: Problem with importing tab-delimited csv file
I'm having trouble importing a tab-delimited file with the csv update
Exactly - Solr does not define the punctuation, UAX#29 defines it, and I
have deciphered the UAX#29 rules and included them in my book. Some
punctuation is always punctuation and always removed, and some is
conditional on context - I tried to lay out all the implied rules.
-- Jack Krupansky
You need the CSV content type header and --data-binary.
I tried this with Solr 4.4:
curl 'http://localhost:8983/solr/update?commit=trueseparator=%09' -H
'Content-type:application/csv' --data-binary @sample.tmp
Otherwise, Solr just ignores the request.
-- Jack Krupansky
-Original
optimize is an explicit request to perform a merge. Merges occur in the
background, automatically, as needed or indicated by the parameters of the
merge policy. An optimize is requested from outside of Solr.
-- Jack Krupansky
-Original Message-
From: YouPeng Yang
Sent: Thursday
You must have upgraded from a very old release of Solr. There is no
DisMaxRequestHandler.
Just use the standard request handler for /select in the Solr example
config and then add a boolean for the defType parameter to set it to
dismax to enable the dismax query parser.
-- Jack Krupansky
rather difficult.
All of that said, please describe your use case. First, let's make sure that
it is an appropriate use case for Solr.
-- Jack Krupansky
-Original Message-
From: Prasi S
Sent: Wednesday, August 21, 2013 1:12 AM
To: solr-user@lucene.apache.org
Subject: Solr Filter Query
.
-- Jack Krupansky
-Original Message-
From: danielitos85
Sent: Wednesday, August 21, 2013 4:41 AM
To: solr-user@lucene.apache.org
Subject: Re: get term frequency, just only keywords search
Thanks a lot guys,
@Jack in my search I use dismax (how defType) and I search either term or
phrase
Yes, post.jar supports csv files.
-- Jack Krupansky
-Original Message-
From: bharat
Sent: Wednesday, August 21, 2013 1:57 AM
To: solr-user@lucene.apache.org
Subject: Re: convert text file to solr document where delimiter fields are
fields of document
Thanks all of you for quick
attribute causes best and buy to be concatenated to form bestbuy.
-- Jack Krupansky
-Original Message-
From: Erick Erickson
Sent: Wednesday, August 21, 2013 11:12 AM
To: solr-user@lucene.apache.org
Subject: Re: What filter to use to search with spaces omitted/included
between words
I'd like to see a screen shot of a search results web page that has 2,000
facets.
-- Jack Krupansky
-Original Message-
From: Erick Erickson
Sent: Wednesday, August 21, 2013 11:24 AM
To: solr-user@lucene.apache.org
Subject: Re: Facing Solr performance during query search
~2,000
I thought that the StandardTokenizer always split on punctuation,
Proving that you haven't read my book! The section on the standard tokenizer
details the rules that the tokenizer uses (in addition to extensive
examples.) That's what I mean by deep dive.
-- Jack Krupansky
-Original
Solr does not have a function query value source which is the raw document
relevance score for the current query.
That would be a nice Jira request.
Or, a parameter to outright replace the score for each document rather than
merely multiply or add to it.
-- Jack Krupansky
-Original
You can directly update Solr using a CSV file and specify an alternate
separator character:
http://wiki.apache.org/solr/UpdateCSV#separator
-- Jack Krupansky
-Original Message-
From: bharat
Sent: Tuesday, August 20, 2013 5:01 AM
To: solr-user@lucene.apache.org
Subject: convert text
examples in my book, but they are for German compound words
since that was the original primary intent for this filter. But it should
work for any words since it is a simple dictionary.
-- Jack Krupansky
-Original Message-
From: Erick Erickson
Sent: Tuesday, August 20, 2013 7:21 AM
901 - 1000 of 2475 matches
Mail list logo