from:"dhaivat"

Search speed issue on new core creation

2015-04-08 Thread dhaivat dave

Hello All,

I am using Master - Slave architecture setup with hundreds of cores getting
replicated between master and slave servers. I am facing very weird issue
while creating a new core.

Whenever there is a new call for a new core creation (using
CoreAdminRequest.createCore(coreName,instanceDir,serverObj)) all the
searches issued to other cores are getting blocked.

Any help or thoughts would highly appreciated.

Regards,
Dhaivat

wildcard queries with custom analyzer

2013-12-28 Thread dhaivat dave

Hello everyone,

I have written custom analyzer for indexing and querying data from solr
indexes.

Now i would like to enable wildcard search with this custom analyzer only.

Please guide me on how to enable this feature?

Many Thanks,
Dhaivat

Re: developing custom tokenizer

2013-08-13 Thread dhaivat dave

Hi Alex,

Thanks for your reply and i looked into core analyser and also created
custom tokeniser using that.I have shared code below. when i tried to look
into analysis of solr, the analyser is working fine but when i tried to
submit 100 docs together i found in logs (with custom message printing)
 that for some of the document it's not calling create method from
SampleTokeniserFactory (please see code below).

can you please help me out what's wrong in following code. am i missing
something?

here is the class which extends TokeniserFactory class

=== SampleTokeniserFactory.java

public class SampleTokeniserFactory extends TokenizerFactory {

public SampleTokeniserFactory(MapString, String args) {
super(args);
}

public SampleTokeniser create(AttributeFactory factory, Reader reader) {
return new SampleTokeniser(factory, reader);
}

}

here is the class which extends Tokenizer class


package ns.solr.analyser;

import java.io.IOException;
import java.io.Reader;
import java.util.ArrayList;
import java.util.List;

import org.apache.lucene.analysis.Tokenizer;
import org.apache.lucene.analysis.Token;
import org.apache.lucene.analysis.tokenattributes.CharTermAttribute;
import org.apache.lucene.analysis.tokenattributes.OffsetAttribute;
import
org.apache.lucene.analysis.tokenattributes.PositionIncrementAttribute;

public class SampleTokeniser extends Tokenizer {

private ListToken tokenList = new ArrayListToken();

int tokenCounter = -1;

private final CharTermAttribute termAtt =
addAttribute(CharTermAttribute.class);

/**
 * Object that defines the offset attribute
 */
private final OffsetAttribute offsetAttribute = (OffsetAttribute)
addAttribute(OffsetAttribute.class);

/**
 * Object that defines the position attribute
 */
private final PositionIncrementAttribute position =
(PositionIncrementAttribute) addAttribute(PositionIncrementAttribute.class);

public SampleTokeniser(AttributeFactory factory, Reader reader) {
super(factory, reader);
String textToProcess = null;
try {
textToProcess = readFully(reader);
processText(textToProcess);
} catch (IOException e) {
e.printStackTrace();
}

}

public String readFully(Reader reader) throws IOException {
char[] arr = new char[8 * 1024]; // 8K at a time
StringBuffer buf = new StringBuffer();
int numChars;
while ((numChars = reader.read(arr, 0, arr.length))  0) {
buf.append(arr, 0, numChars);
}
return buf.toString();
}

public void processText(String textToProcess) {

String wordsList[] = textToProcess.split( );

int startOffset = 0, endOffset = 0;

for (String word : wordsList) {

endOffset = word.length();

Token aToken = new Token(Token. + word, startOffset, endOffset);

aToken.setPositionIncrement(1);

tokenList.add(aToken);

startOffset = endOffset + 1;
}
}

@Override
public boolean incrementToken() throws IOException {

clearAttributes();
tokenCounter++;

if (tokenCounter  tokenList.size()) {
Token aToken = tokenList.get(tokenCounter);

termAtt.append(aToken);
termAtt.setLength(aToken.length());
offsetAttribute.setOffset(correctOffset(aToken.startOffset()),
correctOffset(aToken.endOffset()));
position.setPositionIncrement(aToken.getPositionIncrement());
return true;
}

return false;
}

/**
 * close object
 *
 * @throws IOException
 */
public void close() throws IOException {
super.close();
System.out.println(Close method called);

}

/**
 * called when end method gets called
 *
 * @throws IOException
 */
public void end() throws IOException {
super.end();
// setting final offset
System.out.println(end called with final offset);
}

/**
 * method reset the record
 *
 * @throws IOException
 */
public void reset() throws IOException {
super.reset();
System.out.println(Reset Called);
tokenCounter = -1;

}
}


Many Thanks,
Dhaivat


On Mon, Aug 12, 2013 at 7:03 PM, Alexandre Rafalovitch
arafa...@gmail.comwrote:

 Have you tried looking at source code itself? Between simple organizer like
 keyword and complex language ones, you should be able to get an idea. Then
 ask specific follow up questions.

 Regards,
  Alex
 On 12 Aug 2013 09:29, dhaivat dave dhaiva...@gmail.com wrote:

  Hello All,
 
  I want to create custom tokeniser in solr 4.4.  it will be very helpful
 if
  some one share any tutorials or information on this.
 
 
  Many Thanks,
  Dhaivat Dave
 




-- 







Regards
Dhaivat

issue with custom tokenizer

2013-08-13 Thread dhaivat dave

Hello All,

I am trying to develop custom tokeniser (please find code below) and found
some issue while adding multiple document one after another.

it works fine when i add first document and when i add another document
it's not calling create method from SampleTokeniserFactory.java but it
calls directly reset method and then call incrementToken(). any one have an
idea on this what's wrong in the code below?  please share your thoughts on
this.

here is the class which extends TokeniserFactory class

=== SampleTokeniserFactory.java

public class SampleTokeniserFactory extends TokenizerFactory {

public SampleTokeniserFactory(MapString, String args) {
super(args);
}

public SampleTokeniser create(AttributeFactory factory, Reader reader) {
return new SampleTokeniser(factory, reader);
}

}

here is the class which extends Tokenizer class


package ns.solr.analyser;

import java.io.IOException;
import java.io.Reader;
import java.util.ArrayList;
import java.util.List;

import org.apache.lucene.analysis.Tokenizer;
import org.apache.lucene.analysis.Token;
import org.apache.lucene.analysis.tokenattributes.CharTermAttribute;
import org.apache.lucene.analysis.tokenattributes.OffsetAttribute;
import
org.apache.lucene.analysis.tokenattributes.PositionIncrementAttribute;

public class SampleTokeniser extends Tokenizer {

private ListToken tokenList = new ArrayListToken();

int tokenCounter = -1;

private final CharTermAttribute termAtt =
addAttribute(CharTermAttribute.class);

/**
 * Object that defines the offset attribute
 */
private final OffsetAttribute offsetAttribute = (OffsetAttribute)
addAttribute(OffsetAttribute.class);

/**
 * Object that defines the position attribute
 */
private final PositionIncrementAttribute position =
(PositionIncrementAttribute) addAttribute(PositionIncrementAttribute.class);

public SampleTokeniser(AttributeFactory factory, Reader reader) {
super(factory, reader);
String textToProcess = null;
try {
textToProcess = readFully(reader);
processText(textToProcess);
} catch (IOException e) {
e.printStackTrace();
}

}

public String readFully(Reader reader) throws IOException {
char[] arr = new char[8 * 1024]; // 8K at a time
StringBuffer buf = new StringBuffer();
int numChars;
while ((numChars = reader.read(arr, 0, arr.length))  0) {
buf.append(arr, 0, numChars);
}
return buf.toString();
}

public void processText(String textToProcess) {

String wordsList[] = textToProcess.split( );

int startOffset = 0, endOffset = 0;

for (String word : wordsList) {

endOffset = word.length();

Token aToken = new Token(Token. + word, startOffset, endOffset);

aToken.setPositionIncrement(1);

tokenList.add(aToken);

startOffset = endOffset + 1;
}
}

@Override
public boolean incrementToken() throws IOException {

clearAttributes();
tokenCounter++;

if (tokenCounter  tokenList.size()) {
Token aToken = tokenList.get(tokenCounter);

termAtt.append(aToken);
termAtt.setLength(aToken.length());
offsetAttribute.setOffset(correctOffset(aToken.startOffset()),
correctOffset(aToken.endOffset()));
position.setPositionIncrement(aToken.getPositionIncrement());
return true;
}

return false;
}

/**
 * close object
 *
 * @throws IOException
 */
public void close() throws IOException {
super.close();
System.out.println(Close method called);

}

/**
 * called when end method gets called
 *
 * @throws IOException
 */
public void end() throws IOException {
super.end();
// setting final offset
System.out.println(end called with final offset);
}

/**
 * method reset the record
 *
 * @throws IOException
 */
public void reset() throws IOException {
super.reset();
System.out.println(Reset Called);
tokenCounter = -1;

}
}

developing custom tokenizer

2013-08-12 Thread dhaivat dave

Hello All,

I want to create custom tokeniser in solr 4.4.  it will be very helpful if
some one share any tutorials or information on this.


Many Thanks,
Dhaivat Dave

Re: Indexing and Query time boosting together

2013-08-04 Thread dhaivat dave

Hey Jack,

Thank you so much for your reply. This is very useful.

Thanks again,
Dhaivat Dave

On Fri, Aug 2, 2013 at 8:04 PM, Jack Krupansky j...@basetechnology.comwrote:

product promotions = query elevation

See:
http://wiki.apache.org/solr/**QueryElevationComponenthttp://wiki.apache.org/solr/QueryElevationComponent
https://cwiki.apache.org/**confluence/display/solr/The+**
Query+Elevation+Componenthttps://cwiki.apache.org/confluence/display/solr/The+Query+Elevation+Component

Or, boost the query using a function query referencing an external file
field that gets updated for promotions.

-- Jack Krupansky

-Original Message- From: dhaivat dave
Sent: Friday, August 02, 2013 9:17 AM
To: solr-user@lucene.apache.org
Subject: Re: Indexing and Query time boosting together

Hi Erick

Many Thanks for your reply. I got your point. one question on this: is it
possible to give more priority to those docs which has higher indexing time
boosting against query time boosting. I am trying to achieve product
promotions using this implementation. can you please guide how should i
implement this feature ?

Many Thanks,
Dhaivat Dave

On Fri, Aug 2, 2013 at 5:34 PM, Erick Erickson erickerick...@gmail.com**
wrote:

Add debug=all to your query, that'll show you exactly how the scores
are calculated. But the most obvious thing is that you're boosting
on the titleName field in your query, which for doc 123 does NOT
contain phone so I suspect the fact that phone is in the titleName
field for 122 is overriding the index-time boost, especially since phone
appears in both title and description for 122.

Best
Erick

On Fri, Aug 2, 2013 at 7:53 AM, dhaivat dave dhaiva...@gmail.com wrote:

Hello All,

I want to boost certain products on particular keywords. for this i am
using solr's indexing time boosting feature. i have given index time
boosting with 1.0 value to all documents in my solr indices. now what
i
am doing is when user want to boost certain product i just increase
index
time boosting value to 10.0 of that particular product only. now the
problem is: i have also used query time boosting (for boosting documents
when searched term found directly in title field) and so even i have
increase the indexing time boosting value of the particular product it
appears after query time boosted product.

consider following example:

- I have indexed couple document related to mobile phone (nokia,samsung
and
so on)
- All the documents contains the title field which contains following
value
*Doc1:*
*==*
doc boost=1.0
id122/id
titleNameNokia Phone 2610 /titleName
descriptionSuprb phone/description
..
/otherField
/doc

*Doc2: *
* ==*
doc boost=1.0
id123
titleNameSamsung smwer233/titleName
descriptionSamsung phone/description
..
/otherfield
/doc

- now if some one searches for Phone it will display first Nokia
Phone
second Samsung Phone (by searching in titleName and description
field)
- to display Samsung before Nokia i have boost the index time value
,
some thing like below

doc boost=5.0
id123 /id
titleNameSamsung smwer233/titleName
descriptionSamsung phone/description
..
/otherfield
/doc

- i am also using boosting at query time to boost the document which has
found terms in titleName field
*titleName:phone^4*

now even though i have higher boosting in samsung mobile it displays
nokia
mobile first and then samsung mobile.

can any one please guide how can i boost particular document using index
time boosting(it should appear first even though i am applying query
time
boosting).

Many Thanks,
Dhaivat Dave

Regards
Dhaivat

Indexing and Query time boosting together

2013-08-02 Thread dhaivat dave

Hello All,

I want to boost certain products on particular keywords. for this i am
using solr's indexing time boosting feature. i have given index time
boosting with 1.0 value to all documents in my solr indices. now what i
am doing is when user want to boost certain product i just increase index
time boosting value to 10.0 of that particular product only. now the
problem is: i have also used query time boosting (for boosting documents
when searched term found directly in title field) and so even i have
increase the indexing time boosting value of the particular product it
appears after query time boosted product.

consider following example:

- I have indexed couple document related to mobile phone (nokia,samsung and
so on)
- All the documents contains the title field which contains following value
   *Doc1:*
   *==*
   doc boost=1.0
   id122/id
   titleNameNokia Phone 2610 /titleName
   descriptionSuprb phone/description
 ..
   /otherField
   /doc

   *Doc2: *
*   ==*
   doc boost=1.0
id123
titleNameSamsung smwer233/titleName
descriptionSamsung phone/description
 ..
/otherfield
 /doc

- now if some one searches for Phone it will display first Nokia Phone
second Samsung Phone (by searching in titleName and description field)
- to display Samsung before Nokia  i have boost the index time value ,
some thing like below

doc boost=5.0
id123  /id
titleNameSamsung smwer233/titleName
descriptionSamsung phone/description
 ..
/otherfield
 /doc

- i am also using boosting at query time to boost the document which has
found terms in titleName field
*titleName:phone^4*

now even though i have higher boosting in samsung mobile it displays nokia
mobile first and then samsung mobile.

can any one please guide how can i boost particular document using index
time boosting(it should appear first even though i am applying query time
boosting).

Many Thanks,
Dhaivat Dave

Re: Indexing and Query time boosting together

2013-08-02 Thread dhaivat dave

Hi Erick

Many Thanks for your reply. I got your point. one question on this: is it
possible to give more priority to those docs which has higher indexing time
boosting against query time boosting. I am trying to achieve product
promotions using this implementation. can you please guide how should i
implement this feature ?

Many Thanks,
Dhaivat Dave

On Fri, Aug 2, 2013 at 5:34 PM, Erick Erickson erickerick...@gmail.comwrote:

 Add debug=all to your query, that'll show you exactly how the scores
 are calculated. But the most obvious thing is that you're boosting
 on the titleName field in your query, which for doc 123 does NOT
 contain phone so I suspect the fact that phone is in the titleName
 field for 122 is overriding the index-time boost, especially since phone
 appears in both title and description for 122.

 Best
 Erick


 On Fri, Aug 2, 2013 at 7:53 AM, dhaivat dave dhaiva...@gmail.com wrote:

  Hello All,
 
  I want to boost certain products on particular keywords. for this i am
  using solr's indexing time boosting feature. i have given index time
  boosting with 1.0 value to all documents in my solr indices. now what i
  am doing is when user want to boost certain product i just increase index
  time boosting value to 10.0 of that particular product only. now the
  problem is: i have also used query time boosting (for boosting documents
  when searched term found directly in title field) and so even i have
  increase the indexing time boosting value of the particular product it
  appears after query time boosted product.
 
  consider following example:
 
  - I have indexed couple document related to mobile phone (nokia,samsung
 and
  so on)
  - All the documents contains the title field which contains following
 value
 *Doc1:*
 *==*
 doc boost=1.0
 id122/id
 titleNameNokia Phone 2610 /titleName
 descriptionSuprb phone/description
   ..
 /otherField
 /doc
 
 *Doc2: *
  *   ==*
 doc boost=1.0
  id123
  titleNameSamsung smwer233/titleName
  descriptionSamsung phone/description
   ..
  /otherfield
   /doc
 
  - now if some one searches for Phone it will display first Nokia
 Phone
  second Samsung Phone (by searching in titleName and description
  field)
  - to display Samsung before Nokia  i have boost the index time value
 ,
  some thing like below
 
  doc boost=5.0
  id123  /id
  titleNameSamsung smwer233/titleName
  descriptionSamsung phone/description
   ..
  /otherfield
   /doc
 
  - i am also using boosting at query time to boost the document which has
  found terms in titleName field
  *titleName:phone^4*
 
  now even though i have higher boosting in samsung mobile it displays
 nokia
  mobile first and then samsung mobile.
 
  can any one please guide how can i boost particular document using index
  time boosting(it should appear first even though i am applying query time
  boosting).
 
  Many Thanks,
  Dhaivat Dave
 




-- 







Regards
Dhaivat

Boosting Original Indexed Terms

2013-02-27 Thread dhaivat dave

Hello All,

I need help in boosting original indexed terms.

I am storing multiple terms at same position and i want to boost the
original term.

consider following scenario i am indexing document which contain the
following text:

*baby t-shirts* i am storing  terms as following


position12term textbabyt-shirtsbabet-shirtinfantchildkidstartOffset0505000
endOffset413413444

so now i want to boost results on original terms  i.e if user searches baby
it should returns that results which has original term baby in it. and then
others.

please let me know how to achieve this.

Thanks
Dhaivat

Boosting Original Indexed Terms

2013-02-27 Thread dhaivat

Hello All,

I need help in boosting original indexed terms.

I am storing multiple terms at same position and i want to boost the
original term.

consider following scenario i am indexing document which contain the
following text:  

baby t-shirts i am storing  terms as following

here is the indexing analysis.


position 1   2

term textbabyt-shirts
 babet-shirt
 infant
 child
 kid

startOffset  0   5
 0   5
 0
 0
 0
endOffset4   13
 4   13
 4 
 4
 4


so now i want to boost results on original terms  i.e if user searches baby
it should returns that results which has original term baby in it. and then
others.

please let me know how to achieve this.

Thanks



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Boosting-Original-Indexed-Terms-tp4043380.html
Sent from the Solr - User mailing list archive at Nabble.com.

Re: Load Testing in Solr

2012-08-30 Thread dhaivat dave

Thanks Pravedsh for your reply. i ll use the JMeter tool .

On Thu, Aug 30, 2012 at 11:10 PM, pravesh suyalprav...@yahoo.com wrote:

 Hi Dhaivat,
 JMeter is a nice tool. But it all depends what sort of load are you
 expecting, how complex queries are you expecting(sorting/filtering/textual
 searches).  You need to consider all these to benchmark.

 Thanx
 Pravedsh



 --
 View this message in context:
 http://lucene.472066.n3.nabble.com/Load-Testing-in-Solr-tp4004117p4004428.html
 Sent from the Solr - User mailing list archive at Nabble.com.




-- 







Regards
Dhaivat

Error while indexing data using Solr (Unexpected character 'F' (code 70) in prolog; expected '')

2012-08-27 Thread dhaivat dave

Hello Everyone ,


I am getting an error while indexing data to solr. i am using solrj apis to
index the document and using the xml request handler to index document. i
am getting an error *org.apache.solr.common.SolrException: Unexpected
character 'F' (code 70) in prolog; expected '' at [row,col
{unknown-source}]: [1,1] *. i have also escaped the content before sending
it to solr. can any please tell me the reason behind this error.





Regards
Dhaivat

Re: Fast Vector Highlighter Working for some records only

2012-02-23 Thread dhaivat

Hi Koji,

Thanks for your guidance. i have looked into anlysis page of solr and it's
working fine.but still it's not working fine for few documents. 

here is configuration for highlighter i am using,i have specefied this in
solrconfig.xml, please can you tell me what should i change to highlighter
to work for all documents. for your information i am not using any kind of
filter for custom field, i am just using my custom tokeniser.. 


  searchComponent class=solr.HighlightComponent name=highlight
highlighting
  
  
  fragmenter name=gap 
  default=true
  class=solr.highlight.GapFragmenter
lst name=defaults
  
int name=hl.snippets1000/int
int name=hl.fragsize7/int
  
int name=hl.maxAnalyzedChars7/int

/lst
  /fragmenter

  

fragmenter name=regex 
class=org.apache.solr.highlight.RegexFragmenter
lst name=defaults
  
  int name=hl.fragsize70/int
  
  float name=hl.regex.slop0.5/float
  
  str name=hl.regex.pattern[-\w ,/\n\']{20,200}/str
/lst
   /fragmenter

 

  
  formatter name=html 
 default=true
 class=solr.highlight.HtmlFormatter
lst name=defaults
  str name=hl.simple.pre/str
  str name=hl.simple.post/str
/lst
  /formatter

  
  encoder name=html 
   class=solr.highlight.HtmlEncoder /

  
  fragListBuilder name=simple 
   default=true
   class=solr.highlight.SimpleFragListBuilder/

  
  fragListBuilder name=single 
   class=solr.highlight.SingleFragListBuilder/

  
  fragmentsBuilder name=default 
default=true
class=solr.highlight.ScoreOrderFragmentsBuilder

  /fragmentsBuilder

  
  fragmentsBuilder name=colored 
class=solr.highlight.ScoreOrderFragmentsBuilder
lst name=defaults
  str name=hl.tag.pre/str
  str name=hl.tag.post/str
/lst
  /fragmentsBuilder
  
  boundaryScanner name=default 
   default=true
   class=solr.highlight.SimpleBoundaryScanner
lst name=defaults
  str name=hl.bs.maxScan10/str
  str name=hl.bs.chars.,!? #9;#10;#13;/str
/lst
  /boundaryScanner
  
  boundaryScanner name=breakIterator 
   class=solr.highlight.BreakIteratorBoundaryScanner
lst name=defaults
  
  str name=hl.bs.typeWORD/str
  
  
  str name=hl.bs.languageen/str
  str name=hl.bs.countryUS/str
/lst
  /boundaryScanner
/highlighting
  /searchComponent

 



Koji Sekiguchi wrote
 
 Hi dhaivat,
 
 I think you may want to use analysis.jsp:
 
 http://localhost:8983/solr/admin/analysis.jsp
 
 Go to the URL and look into how your custom tokenizer produces tokens,
 and compare with the output of Solr's inbuilt tokenizer.
 
 koji
 -- 
 Query Log Visualizer for Apache Solr
 http://soleami.com/
 
 
 (12/02/22 21:35), dhaivat wrote:

 Koji Sekiguchi wrote

 (12/02/22 11:58), dhaivat wrote:
 Thanks for reply,

 But can you please tell me why it's working for some documents and not
 for
 other.

 As Solr 1.4.1 cannot recognize hl.useFastVectorHighlighter flag, Solr
 just
 ignore it, but due to hl=true is there, Solr tries to create highlight
 snippets
 by using (existing; traditional; I mean not FVH) Highlighter.
 Highlighter (including FVH) cannot produce snippets sometime for some
 reasons,
 you can use hl.alternateField parameter.

 http://wiki.apache.org/solr/HighlightingParameters#hl.alternateField

 koji
 --
 Query Log Visualizer for Apache Solr
 http://soleami.com/


 Thank you so much explanation,

 I have updated my solr version and using 3.5, Could you please tell me
 when
 i am using custom Tokenizer on the field,so do i need to make any changes
 related Solr highlighter.

 here is my custom analyser

   fieldType name=custom_text class=solr.TextField
 positionIncrementGap=100
analyzer type=index
  tokenizer
 class=ns.solr.analyser.CustomIndexTokeniserFactory/
/analyzer
  analyzer type=query
  tokenizer class=ns.solr.analyser.CustomSearcherTokeniserFactory/
  
  /analyzer
  /fieldType

 here is the field info:

 field name=contents type=custom_text indexed=true stored=true
 multiValued=true termPositions=true  termVectors=true
 termOffsets=true/

 i am creating tokens using my custom analyser and when i am trying to use
 highlighter it's not working properly for contents field.. but when i
 tried
 to use Solr inbuilt tokeniser i am finding the word highlighted for
 particular query.. Please can you help me out with this ?


 Thanks

Re: Fast Vector Highlighter Working for some records only

2012-02-23 Thread dhaivat

Hi Koji

 i am using solr 3.5 and i want to highlight the multivalued field, when i
supply single value for the multi field value at that highlighter is working
fine. but when i am indexing multiple values for field and try to highlight
that field at that time i am getting following error with Fast Vector
Highlighter 

java.lang.StringIndexOutOfBoundsException: String index out of range: -1099

i have set following parameter using solrj


query.add(hl.q,term);
query.add(hl.fl,contents);
query.add(hl,true);


query.add(hl.useFastVectorHighlighter,true);
query.add(hl.snippets,100);
query.add(hl.fragsize,7);
query.add(hl.maxAnalyzedChars,7);

can you please tell me the cause of this error ?

Thanks in advance
Dhaivat





Koji Sekiguchi wrote
 
 Hi dhaivat,
 
 I think you may want to use analysis.jsp:
 
 http://localhost:8983/solr/admin/analysis.jsp
 
 Go to the URL and look into how your custom tokenizer produces tokens,
 and compare with the output of Solr's inbuilt tokenizer.
 
 koji
 -- 
 Query Log Visualizer for Apache Solr
 http://soleami.com/
 
 
 (12/02/22 21:35), dhaivat wrote:

 Koji Sekiguchi wrote

 (12/02/22 11:58), dhaivat wrote:
 Thanks for reply,

 But can you please tell me why it's working for some documents and not
 for
 other.

 As Solr 1.4.1 cannot recognize hl.useFastVectorHighlighter flag, Solr
 just
 ignore it, but due to hl=true is there, Solr tries to create highlight
 snippets
 by using (existing; traditional; I mean not FVH) Highlighter.
 Highlighter (including FVH) cannot produce snippets sometime for some
 reasons,
 you can use hl.alternateField parameter.

 http://wiki.apache.org/solr/HighlightingParameters#hl.alternateField

 koji
 --
 Query Log Visualizer for Apache Solr
 http://soleami.com/


 Thank you so much explanation,

 I have updated my solr version and using 3.5, Could you please tell me
 when
 i am using custom Tokenizer on the field,so do i need to make any changes
 related Solr highlighter.

 here is my custom analyser

   fieldType name=custom_text class=solr.TextField
 positionIncrementGap=100
analyzer type=index
  tokenizer
 class=ns.solr.analyser.CustomIndexTokeniserFactory/
/analyzer
  analyzer type=query
  tokenizer class=ns.solr.analyser.CustomSearcherTokeniserFactory/
  
  /analyzer
  /fieldType

 here is the field info:

 field name=contents type=custom_text indexed=true stored=true
 multiValued=true termPositions=true  termVectors=true
 termOffsets=true/

 i am creating tokens using my custom analyser and when i am trying to use
 highlighter it's not working properly for contents field.. but when i
 tried
 to use Solr inbuilt tokeniser i am finding the word highlighted for
 particular query.. Please can you help me out with this ?


 Thanks in advance
 Dhaivat





 --
 View this message in context:
 http://lucene.472066.n3.nabble.com/Fast-Vector-Highlighter-Working-for-some-records-only-tp3763286p3766335.html
 Sent from the Solr - User mailing list archive at Nabble.com.

 


--
View this message in context: 
http://lucene.472066.n3.nabble.com/Fast-Vector-Highlighter-Working-for-some-records-only-tp3763286p3771933.html
Sent from the Solr - User mailing list archive at Nabble.com.

Re: Fast Vector Highlighter Working for some records only

2012-02-22 Thread dhaivat


Koji Sekiguchi wrote
 
 (12/02/22 11:58), dhaivat wrote:
 Thanks for reply,

 But can you please tell me why it's working for some documents and not
 for
 other.
 
 As Solr 1.4.1 cannot recognize hl.useFastVectorHighlighter flag, Solr just
 ignore it, but due to hl=true is there, Solr tries to create highlight
 snippets
 by using (existing; traditional; I mean not FVH) Highlighter.
 Highlighter (including FVH) cannot produce snippets sometime for some
 reasons,
 you can use hl.alternateField parameter.
 
 http://wiki.apache.org/solr/HighlightingParameters#hl.alternateField
 
 koji
 -- 
 Query Log Visualizer for Apache Solr
 http://soleami.com/
 

Thank you so much explanation,
 
I have updated my solr version and using 3.5, Could you please tell me when
i am using custom Tokenizer on the field,so do i need to make any changes
related Solr highlighter. 

here is my custom analyser

 fieldType name=custom_text class=solr.TextField
positionIncrementGap=100
  analyzer type=index
tokenizer class=ns.solr.analyser.CustomIndexTokeniserFactory/
  /analyzer
  analyzer type=query
tokenizer class=ns.solr.analyser.CustomSearcherTokeniserFactory/
 
  /analyzer
/fieldType

here is the field info:

field name=contents type=custom_text indexed=true stored=true
multiValued=true termPositions=true  termVectors=true
termOffsets=true/

i am creating tokens using my custom analyser and when i am trying to use
highlighter it's not working properly for contents field.. but when i tried
to use Solr inbuilt tokeniser i am finding the word highlighted for
particular query.. Please can you help me out with this ?


Thanks in advance
Dhaivat





--
View this message in context: 
http://lucene.472066.n3.nabble.com/Fast-Vector-Highlighter-Working-for-some-records-only-tp3763286p3766335.html
Sent from the Solr - User mailing list archive at Nabble.com.

Fast Vector Highlighter Working for some records only

2012-02-21 Thread dhaivat


Hi

I am newbie to Solr and i am using Sorj Client to create index and query the
solr data.. When i am querying the data i want to use Highlight feature of
solr so i am using Fast Vector Highlighter to enable highlight on words.. I
found that it's working fine for some documents and for some document it's
returning any highlighted words even though the field of document contents
that word.. i am using the following parameters  using solrj client :

   query.add(hl,true);
query.add(hl.q,term);
query.add(hl.fl,contents);
query.add(hl.snippets,100);
query.add(hl.fragsize,10);
query.add(hl.maxAnalyzedChars,10);
query.add(hl.useFastVectorHighlighter,true);
query.add(hl.highlightMultiTerm,true);
query.add(hl.regex.slop,0.5);
query.add(hl.regex.pattern,[-\\w ,/\n\\\']*);

query.setHighlightSimplePre(*);
query.setHighlightSimplePost(*);

My solrconfig is pretty strait forward haven't specified anything related to
highlighter there.

this is how my solrConfig looks like :

config
  updateHandler class=solr.DirectUpdateHandler2 /

  requestDispatcher handleSelect=true 
requestParsers enableRemoteStreaming=false
multipartUploadLimitInKB=2048 /
  /requestDispatcher
  
  requestHandler name=standard class=solr.StandardRequestHandler
default=true /
  requestHandler name=/update class=solr.XmlUpdateRequestHandler /
  requestHandler name=/admin/
class=org.apache.solr.handler.admin.AdminHandlers /
  

   
 
   
  admin
defaultQuerysolr/defaultQuery
  /admin

/config

i have also enabled the TermVectors,TermOffsets,TermPostions on Field on
which i am indexing


can anyone tell me where i am going wrong ? 

thanks in advance

Dhaivat



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Fast-Vector-Highlighter-Working-for-some-records-only-tp3763286p3763286.html
Sent from the Solr - User mailing list archive at Nabble.com.

Re: Fast Vector Highlighter Working for some records only

2012-02-21 Thread dhaivat

Hi  Koji,

Thanks for quick reply, i am using solr 1.4.1 

i am querying *camera*



here is the example of documents :

which matches the 

doc
id70/id
  categoryElectronics/Cell Phones/category
  thumbnail/b/l/blackberry-8100-pearl-2.jpg/thumbnail
  price349.99/price
  meta_descriptionBlackBerry 8100 Pearl sports a large 240 x 260 screen
that supports over 65,000 colors-- plenty of real estate to view your
e-mails, Web browser content, messaging sessions, and
attachments./meta_description
  contentsSilver/contents
  url_pathblackberry-8100-pearl.html/url_path
  contentsLike the BlackBerry 7105t, the BlackBerry 8100 Pearl is  The
BlackBerry 8100 Pearl sports a large 240 x 260 screen that supports over
65,000 colors-- plenty of real estate to view your e-mails, Web browser
content, messaging sessions, and attachments. The venerable BlackBerry
trackwheel has been replaced on this model with an innovative four-way
trackball placed below the screen. On the rear of the handheld, youapos;ll
find a 1.3-megapixel camera and a self portrait mirror. The handheldapos;s
microSD memory card slot is located inside the device, behind the battery.
Thereapos;s also a standard 2.5mm headset jack that can be used with the
included headset, as well as a mini-USB port for data
connectivity./contents
  contentsBlackBerry 8100 Pearl/contents
  in_depthlt;ulgt; lt;ul class=quot;discquot;gt; lt;ligt; 1.3 mega
pixel camera to capture those special momentslt;brgt;lt;/ligt;
lt;ligt; MP3 player lets you listen to your favorite music on the
golt;brgt;lt;/ligt; lt;ligt;Menu and escape keys on the front of the
device for easier accesslt;brgt;lt;/ligt; lt;ligt; Bluetooth
technology lets you experience hands free and wire free
featureslt;brgt;lt;/ligt; lt;ligt;Package Contents: phone,AC
adapter,software CD,headset,USB cable,sim- card,get started poster,reference
guidelt;brgt;lt;/ligt; lt;/ulgt;/in_depth

/doc

doc
 id89/id
  categoryElectronics/Cameras/Accessories/category
  thumbnail/u/n/universal-camera-case-2.jpg/thumbnail
  price34.0/price
  meta_descriptionUniversal Camera Case/meta_description
  contentsGreen/contents
  url_pathuniversal-camera-case.html/url_path
  contentsA stylish digital camera demands stylish protection. This
leather carrying case will defend your camera from the dings and scratches
of travel and everyday use while looking smart all the time./contents
  contentsUniversal Camera Case/contents
  in_depth/in_depth

/doc

on above documents i am getting highlighting response on documentid =  89
and not for documentId = 70 even though there is word called camera in
document(id=70)..

I have field called 

field name=contents type=custom_type indexed=true stored=true
multiValued=true termPositions=true  termVectors=true
termOffsets=true/

for your information i am using custom analyser for indexing and querying.

Thanks
Dhaivat




Koji Sekiguchi wrote
 
 Dhaivat,
 
 Can you give us the concrete document that you are trying to search and
 make
 a highlight snippet? And what is your Solr version?
 
 koji
 -- 
 Query Log Visualizer for Apache Solr
 http://soleami.com/
 
 (12/02/21 20:29), dhaivat wrote:

 Hi

 I am newbie to Solr and i am using Sorj Client to create index and query
 the
 solr data.. When i am querying the data i want to use Highlight feature
 of
 solr so i am using Fast Vector Highlighter to enable highlight on words..
 I
 found that it's working fine for some documents and for some document
 it's
 returning any highlighted words even though the field of document
 contents
 that word.. i am using the following parameters  using solrj client :

 query.add(hl,true);
  query.add(hl.q,term);
  query.add(hl.fl,contents);
  query.add(hl.snippets,100);
  query.add(hl.fragsize,10);
  query.add(hl.maxAnalyzedChars,10);
  query.add(hl.useFastVectorHighlighter,true);
  query.add(hl.highlightMultiTerm,true);
  query.add(hl.regex.slop,0.5);
  query.add(hl.regex.pattern,[-\\w ,/\n\\\']*);
  
  query.setHighlightSimplePre(*);
  query.setHighlightSimplePost(*);

 My solrconfig is pretty strait forward haven't specified anything related
 to
 highlighter there.

 this is how my solrConfig looks like :

 config
updateHandler class=solr.DirectUpdateHandler2 /

requestDispatcher handleSelect=true
  requestParsers enableRemoteStreaming=false
 multipartUploadLimitInKB=2048 /
/requestDispatcher

requestHandler name=standard class=solr.StandardRequestHandler
 default=true /
requestHandler name=/update class=solr.XmlUpdateRequestHandler /
requestHandler name=/admin/
 class=org.apache.solr.handler.admin.AdminHandlers /





admin
  defaultQuerysolr/defaultQuery
/admin

 /config

 i have also enabled the TermVectors

Re: Fast Vector Highlighter Working for some records only

2012-02-21 Thread dhaivat


Koji Sekiguchi wrote
 
 (12/02/21 21:22), dhaivat wrote:
 Hi  Koji,

 Thanks for quick reply, i am using solr 1.4.1

 
 Uh, you cannot use FVH on Solr 1.4.1. FVH is available Solr 3.1 or later.
 So your hl.useFastVectorHighlighter=true flag is ignored.
 
 koji
 -- 
 Query Log Visualizer for Apache Solr
 http://soleami.com/
 

Thanks for reply,

But can you please tell me why it's working for some documents and not for
other.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Fast-Vector-Highlighter-Working-for-some-records-only-tp3763286p3765458.html
Sent from the Solr - User mailing list archive at Nabble.com.

Multi Core indexed using SolrJ

2011-11-10 Thread dhaivat

Hello all

I have gone through the tutorials of Solrj. now i want to create multi core
indexes through solrj but i am not getting clue , so can anybody post some
example code ? 


Regards

Dhaivat

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Multi-Core-indexed-using-SolrJ-tp3496830p3496830.html
Sent from the Solr - User mailing list archive at Nabble.com.

Re: Multi Core indexed using SolrJ

2011-11-10 Thread dhaivat


Thanks Ivan, 

Is there any specific method using which i can create core and add documents
in it ?


Regards
Dhaivat 

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Multi-Core-indexed-using-SolrJ-tp3496830p3496869.html
Sent from the Solr - User mailing list archive at Nabble.com.

Search speed issue on new core creation

wildcard queries with custom analyzer

Re: developing custom tokenizer

issue with custom tokenizer

developing custom tokenizer

Re: Indexing and Query time boosting together

Indexing and Query time boosting together

Re: Indexing and Query time boosting together

Boosting Original Indexed Terms

Boosting Original Indexed Terms

Re: Load Testing in Solr

Error while indexing data using Solr (Unexpected character 'F' (code 70) in prolog; expected '')

Re: Fast Vector Highlighter Working for some records only

Re: Fast Vector Highlighter Working for some records only

Re: Fast Vector Highlighter Working for some records only

Fast Vector Highlighter Working for some records only

Re: Fast Vector Highlighter Working for some records only

Re: Fast Vector Highlighter Working for some records only

Multi Core indexed using SolrJ

Re: Multi Core indexed using SolrJ

20 matches

Site Navigation

Mail list logo

Footer information