Hi
I am trying to index and search unicode (utf - 8) . the code i am using to index the
documents is as follows :
/**/
IndexWriter iw = new
I don't think you can retrieve the contents of Fields that have
been loaded by a Reader. From the javadoc for Field:
Text(String name, Reader value)
Constructs a Reader-valued Field that is tokenized and indexed, but is
not stored in the index verbatim.
--
Ian.
[EMAIL PROTECTED]
Hi,
That was the problem , Thanks :-) . still i am strugling to get lucene to
search non english unicode content . it works partially will simple analyser
but doesn't return any results with standard analyser . is there a way by
which i can output the exact contents that are going into the index
Hello,
That was the problem , Thanks :-) . still i am strugling to get
lucene to
search non english unicode content . it works partially will simple
analyser
but doesn't return any results with standard analyser . is there a
way by
which i can output the exact contents that are going into
Hello,
I'm sending this to lucene-user list, as that seems more appropriate.
I haven't used Lucene's slop feature, but it looks like both
QueryParser and PhraseQuery have support for slop. I am not sure what
the syntax for it is, but if nothign else you should be able to call
setSlop(int)
thanks, i saw the queryparser documentation and works fine.
now how can i make the query operator like 'AND', 'OR', etc, case insensitive? also
how can i change the '~' to 'w/' ?
I really don't know how use JavaCC, but may be for someone is easy,
someone can help me?
thank you.
--
On Mon, 10
This is a bit more complicated.
We have had this discussion a while ago about having a NEAR operator. The
queryParser.jj of how to do this in the developer mailling list.
The problem is that the solution is not generic. That is what if the term is
a wildcard or a more complicated sub query (a
Hi,
Are there any resources available which explain how the simple analyser processes the
data given to it .
what i want to know is that suppose i have a set of words , what exact rules are
applied to tokenize and index these words and how can i customize them.
My requirement is that the