Re: How to reserve ids?

2011-09-27 Thread Gabriele Kahlout
on the list. Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ From: Gabriele Kahlout gabri...@mysimpatico.com To: solr-user@lucene.apache.org; Otis Gospodnetic otis_gospodne...@yahoo.com Sent

How to reserve ids?

2011-09-26 Thread Gabriele Kahlout
Hello, While indexing there are certain urls/ids I'd never want to appear in the search results (so be indexed). Is there already a 'supported by design' mechanism to do that to point me too, or should I just create this blacklist as an processor in the update chain? -- Regards, K. Gabriele

Re: How to reserve ids?

2011-09-26 Thread Gabriele Kahlout
I'm interested in the stopwords solution as it sounds like less work but i'm not sure i understand how it works. By having msn.com as a stopword it doesnt mean i wont get msn.com as a result for say 'hotmail'. My understanding is that msn.com will never make it to the similarity function and

Re: How to make the url id case insensitive?

2011-09-05 Thread Gabriele Kahlout
On Mon, Sep 5, 2011 at 1:22 PM, Markus Jelsma markus.jel...@openindex.iowrote: Hi, URI paths are case-sensitive. If you really want to treat all URL's as case- insensitive i would suggest to modifiy the basic URL normalizer to lowercase all URL's so that it also ends up lowercased in the

How to make the url id case insensitive?

2011-09-04 Thread Gabriele Kahlout
Hi, I've just noticed that two search results of indexed data have the same url: http://www.atory.com/dupe_checker_pro/ http://www.atory.com/dupe_checker_PRO/ I thought the url/id was case-insentively unique. Is there how I can set it up to be so? For Solr it makes sense not to make it the

Re: How to get all the terms in a document as Luke does?

2011-08-30 Thread Gabriele Kahlout
@gmail.com wrote: you might want to check - http://wiki.apache.org/solr/TermVectorComponent Should provide you with the term vectors with a lot of additional info. Regards, Jayendra On Tue, Aug 30, 2011 at 3:34 AM, Gabriele Kahlout gabri...@mysimpatico.com wrote: Hello, This time I'm trying

How to get all the terms in a document as Luke does?

2011-08-29 Thread Gabriele Kahlout
I'll remove the JSP dependency and this may eventually making it into trunk. Thanks, -Trey Grainger Search Technology Development Team Lead, Careerbuilder.com Site Architect, Celiaccess.com On Tue, Jul 5, 2011 at 3:59 PM, Gabriele Kahlout gabri...@mysimpatico.comwrote: Hello

Re: Why are not query keywords treated as a set?

2011-08-20 Thread Gabriele Kahlout
tokens, why would it be a `set` ? this might help in your analysis chain http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.RemoveDupl icatesTokenFilterFactory On 20 June 2011 04:21, Gabriele Kahlout gabri...@mysimpatico.com wrote: str name=rawquerystringpast past/str

Re: How to add TrieIntField to a SolrInputDocument?

2011-07-14 Thread Gabriele Kahlout
this works: doc.remove(wc); SolrInputField wcField = new SolrInputField(wc); wcField.setValue(150, 1.0f); doc.put(wc,wcField); On Wed, Jul 13, 2011 at 4:19 PM, Gabriele Kahlout gabri...@mysimpatico.comwrote: SolrInputDocument doc = new SolrInputDocument

Why cannot I open a read-only IndexReader from TestHarness.getIndexDir() ?

2011-07-14 Thread Gabriele Kahlout
IndexReader getReader() throws CorruptIndexException, IOException { return IndexReader.open(FSDirectory.open(new File(h.getCore().getIndexDir())), true); } *org.apache.lucene.index.IndexNotFoundException: no segments* file found in

Re: Why cannot I open a read-only IndexReader from TestHarness.getIndexDir() ?

2011-07-14 Thread Gabriele Kahlout
(FSDirectory.open(new File(h.getCore().getIndexDir())), true); //for me it fails here. But since the document was added I suspect this is a bug On Thu, Jul 14, 2011 at 10:48 PM, Shalin Shekhar Mangar shalinman...@gmail.com wrote: On Thu, Jul 14, 2011 at 1:56 PM, Gabriele Kahlout gabri

Re: Can I still search documents once updated?

2011-07-13 Thread Gabriele Kahlout
from the index. Verify that the doc you retrieve from the index has values for content, I bet it doesn't Best Erick On Tue, Jul 12, 2011 at 9:38 AM, Gabriele Kahlout gabri...@mysimpatico.com wrote: @Test public void testUpdateLoseTermsSimplified() throws Exception

Re: Can I still search documents once updated?

2011-07-13 Thread Gabriele Kahlout
unstored but indexed fields). If the field is stored, then there's another problem, you might want to dump the document after reading it from the IR. Best Erick On Wed, Jul 13, 2011 at 2:25 AM, Gabriele Kahlout gabri...@mysimpatico.com wrote: It indeed is not stored, but this is still

Re: Can I still search documents once updated?

2011-07-13 Thread Gabriele Kahlout
unrelated to content (number of inbound links for an example) they would have to re-crawl the page again. This is at least !intuitive. On Wed, Jul 13, 2011 at 2:40 PM, Michael Kuhlmann s...@kuli.org wrote: Am 13.07.2011 14:05, schrieb Gabriele Kahlout: this is what i was expecting. Otherwise

Re: Can I still search documents once updated?

2011-07-13 Thread Gabriele Kahlout
On Wed, Jul 13, 2011 at 3:54 PM, Michael Kuhlmann s...@kuli.org wrote: Am 13.07.2011 15:37, schrieb Gabriele Kahlout: Well, I'm !sure how usual this scenario would be: 1. In general those using solr with nutch don't store the content field to avoid storing the whole web/intranet

How to add TrieIntField to a SolrInputDocument?

2011-07-13 Thread Gabriele Kahlout
)); assertU(commit()); assertNumFound(1); The above test fails until I change the following in schema.xml: - fieldType name=int class=solr.*TrieIntField* omitNorms=true/ + fieldType name=int class=solr.*IntField* omitNorms=true/ On Sun, Jul 10, 2011 at 10:36 PM, Gabriele Kahlout gabri

Re: How to create a solr core if no solr cores were created before?

2011-07-12 Thread Gabriele Kahlout
adminPath=/admin/cores defaultCoreName=live shareSchema=true core name=live instanceDir=. dataDir=live / core name=test instanceDir=. dataDir=test / /cores /solr 2011/7/11 Gabriele Kahlout gabri...@mysimpatico.com: have a look here [1]. [1] https://issues.apache.org/jira/browse

How to get doc # to use in reader.norms(content)[doc]?

2011-07-12 Thread Gabriele Kahlout
Hello, I'm trying to get the norm of an indexed document for a given field but beside reader.norms(fieldName) I'm not finding any API to retrieve it. Now reader.norms(..) returns an array with the norms for that field of all indexed documents. How do I know the index of my document in there?

How do I specify a different analyzer at search-time?

2011-07-11 Thread Gabriele Kahlout
With a lucene QueryParser instance it's possible to set the analyzer in use. I suspect Solr doesn't use the same analyzer it used at indexing, defined in schema.xml but I cannot verify that without the queryparser instance. From Jan's diagram it seems this is set in the SearchHandler's init. Is

Re: How to create a solr core if no solr cores were created before?

2011-07-11 Thread Gabriele Kahlout
have a look here [1]. [1] https://issues.apache.org/jira/browse/SOLR-2645?focusedCommentId=13062748page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13062748 On Mon, Jul 11, 2011 at 4:46 PM, Mark Schoy hei...@gmx.de wrote: Hi, I tried to create a solr core but I

Can I write to the index from within RequestHandler.handleRequestBody(..)?

2011-07-10 Thread Gabriele Kahlout
Hello, IndexWriter writer = new IndexWriter(FSDirectory.open(new File(req.getCore().getDataDir(), index)), req.getSchema().getAnalyzer(), IndexWriter.MaxFieldLength.LIMITED); updateSolrIndex(writer); But this is what I get (I know that RequestHandler are not intended to

Re: Can I write to the index from within RequestHandler.handleRequestBody(..)?

2011-07-10 Thread Gabriele Kahlout
On Sun, Jul 10, 2011 at 6:21 PM, Koji Sekiguchi k...@r.email.ne.jp wrote: There are such RequestHandlers. Look at CSVRequestHandler, for example. IndexWriter writer = new IndexWriter(FSDirectory.open(**new File(req.getCore().getDataDir(**), index)),

Re: What's the fq= syntax for NumericRangeFilter?

2011-07-10 Thread Gabriele Kahlout
http://www.lucidimagination.com On Sat, Jul 9, 2011 at 10:50 AM, Gabriele Kahlout gabri...@mysimpatico.com wrote: http://localhost:8080/solr/select?indent=onversion=2.2q=*%3A** fq=wc%3A%5B255+TO+257%5D* start=0rows=10fl=*%2Cscoreqt=wt=xmlexplainOther=hl.fl= The toString of the request

Can I delete the stored value?

2011-07-09 Thread Gabriele Kahlout
I've stored the contents of some pages I no longer need. How can I now delete the stored content without re-crawling the pages (i.e. using updateDocument ). I cannot just remove the field, since I still want the field to be indexed, I just don't want to store something with it. My understanding is

What's the fq= syntax for NumericRangeFilter?

2011-07-09 Thread Gabriele Kahlout
I'm trying to filter a query by the value of a numeric field. I can do it in Java as follows, but I don't know how to do it with the query syntax, and I found no documentation of it. @Test public void testFqWc() throws Exception { IndexSearcher searcher = wc(); *Filter wc3 =

Re: What's the fq= syntax for NumericRangeFilter?

2011-07-09 Thread Gabriele Kahlout
I don't get it to work! If I specify no fq I get the first result with int name=wc256/int With wc:[255 TO 257] (fq=wc%3A%5B255+TO+257%5D) nothing comes out. On Sat, Jul 9, 2011 at 12:29 PM, Markus Jelsma markus.jel...@openindex.iowrote: Hu? It's describe in the link Ahmet's given you. I'm

Re: What's the fq= syntax for NumericRangeFilter?

2011-07-09 Thread Gabriele Kahlout
http://localhost:8080/solr/select?indent=onversion=2.2q=*%3A** fq=wc%3A%5B255+TO+257%5D* start=0rows=10fl=*%2Cscoreqt=wt=xmlexplainOther=hl.fl= The toString of the request: {explainOther=fl=*,scoreindent=onstart=0q=*:*hl.fl=qt=wt=xmlfq=wc:[255+TO+257]rows=1version=2.2} Even when the

Re: How do I add a custom field?

2011-07-07 Thread Gabriele Kahlout
no single-field update capability in Lucene (yet?). -Mike On 7/3/2011 1:09 PM, Gabriele Kahlout wrote: Is there how I can compute and add the field to all indexed documents without re-indexing? MyField counts the number of terms per document (unique word count). On Sun, Jul 3, 2011 at 12

Re: Can I invert the inverted index?

2011-07-06 Thread Gabriele Kahlout
eventually making it into trunk. Thanks, -Trey Grainger Search Technology Development Team Lead, Careerbuilder.com Site Architect, Celiaccess.com On Tue, Jul 5, 2011 at 3:59 PM, Gabriele Kahlout gabri...@mysimpatico.comwrote: Hello, With an inverted index the term is the key

Re: Does Nutch make any use of solr.WhitespaceTokenizerFactory defined in schema.xml?

2011-07-05 Thread Gabriele Kahlout
input docs from NutchDocument objects. Solr will do analysis. The integration is analogous to XML post of Solr documents. On Tuesday 05 July 2011 12:28:21 Gabriele Kahlout wrote: Hello, I'm trying to understand better Nutch and Solr integration. My understanding is that Documents are added

Re: Does Nutch make any use of solr.WhitespaceTokenizerFactory defined in schema.xml?

2011-07-05 Thread Gabriele Kahlout
new RemoveDuplicatesTokenFilter(englishPorterFilterFactory.create(new LowerCaseFilter(wordDelimiterFilterFactory.create(stopFilterFactory.create(new WhitespaceTokenizer(reader)); } } On Tue, Jul 5, 2011 at 1:00 PM, Gabriele Kahlout gabri...@mysimpatico.comwrote: nice...where

Re: Does Nutch make any use of solr.WhitespaceTokenizerFactory defined in schema.xml?

2011-07-05 Thread Gabriele Kahlout
CharFilterFactory[charFilters.size()]), tokenizers.get(0), filters.toArray(new TokenFilterFactory[filters.size()])); }; On Tue, Jul 5, 2011 at 2:26 PM, Gabriele Kahlout gabri...@mysimpatico.comwrote: I suspect the following should do (1). I'm just not sure about file references

Re: Does Nutch make any use of solr.WhitespaceTokenizerFactory defined in schema.xml?

2011-07-05 Thread Gabriele Kahlout
the answer to 2) is new IndexSchema(solrConf, schema).getAnalyzer(); On Tue, Jul 5, 2011 at 2:48 PM, Gabriele Kahlout gabri...@mysimpatico.comwrote: Not yet an answer to 2) but this is where and how Solr initializes the Analyzer defined in the schema.xml

Cannot I search documents added by IndexWriter after commit?

2011-07-05 Thread Gabriele Kahlout
@Test public void testUpdate() throws IOException, ParserConfigurationException, SAXException, ParseException { Analyzer analyzer = getAnalyzer(); QueryParser parser = new QueryParser(Version.LUCENE_32, content, analyzer); Query allQ = parser.parse(*:*);

Re: Cannot I search documents added by IndexWriter after commit?

2011-07-05 Thread Gabriele Kahlout
PM, Gabriele Kahlout gabri...@mysimpatico.com wrote: @Test public void testUpdate() throws IOException, ParserConfigurationException, SAXException, ParseException { Analyzer analyzer = getAnalyzer(); QueryParser parser = new QueryParser(Version.LUCENE_32, content

Re: Cannot I search documents added by IndexWriter after commit?

2011-07-05 Thread Gabriele Kahlout
://blog.mikemccandless.com On Tue, Jul 5, 2011 at 2:12 PM, Gabriele Kahlout gabri...@mysimpatico.com wrote: and how do you do that? There is no reopen method On Tue, Jul 5, 2011 at 8:09 PM, Michael McCandless luc...@mikemccandless.com wrote: After your writer.commit you need to reopen your searcher

Re: Cannot I search documents added by IndexWriter after commit?

2011-07-05 Thread Gabriele Kahlout
); } On Tue, Jul 5, 2011 at 8:23 PM, Gabriele Kahlout gabri...@mysimpatico.comwrote: Still won't work (same as before). @Test public void testUpdate() throws IOException, ParserConfigurationException, SAXException, ParseException { Analyzer analyzer = getAnalyzer(); QueryParser

Can I invert the inverted index?

2011-07-05 Thread Gabriele Kahlout
Hello, With an inverted index the term is the key, and the documents are the values. Is it still however possible that given a document id I get the terms indexed for that document? -- Regards, K. Gabriele --- unchanged since 20/9/10 --- P.S. If the subject contains [LON] or the addressee

Re: Can I invert the inverted index?

2011-07-05 Thread Gabriele Kahlout
I had looked an term vectors but don't understand them to solve my problem. Consider the following index entries: t0, doc0, doc1 t1, doc0 From the 2nd entry we know that t1 is only present in doc0. Now, my problem, given doc0 how can I know which terms occur in in (t0 and t1) (without storing

Re: How do I compute and store a field?

2011-07-04 Thread Gabriele Kahlout
Gee, I was about to post. I figured my issue is that of computing the unique terms per document. One approach to compute that value is running the analyzer on the document before before calling addDocument, and count the number of tokens. Then I can invoke addDocument with the value of the field

How do I add a custom field?

2011-07-03 Thread Gabriele Kahlout
Hello, I want to have an additional field that appears for every document in search results. I understand that I should do this by adding the field to the schema.xml, so I add: field name=myField default=0 type=integer stored=true indexed=false/ Then I restart Solr (so that I loads the new

Re: How do I add a custom field?

2011-07-03 Thread Gabriele Kahlout
field ? The results will just bring back docs and what fields they have. They won't bring back null fields just because they are in your schema. Lucene is schema-less. Solr adds the schema to make it nice to administer and very powerful to use. On 3 July 2011 11:01, Gabriele Kahlout gabri

How do I compute and store a field?

2011-07-03 Thread Gabriele Kahlout
Hello, I'm trying to add a field that counts the number of terms in a document to my schema. So far I've been computing this value at query-time. Is there how I could compute this once only and store the field? final SolrIndexSearcher searcher = request.getSearcher(); final

site: feature in Solr?

2011-06-19 Thread Gabriele Kahlout
Hello, Beside creating an index with just the site in question, is it possible like with Google to search for results only in a given domain? -- Regards, K. Gabriele --- unchanged since 20/9/10 --- P.S. If the subject contains [LON] or the addressee acknowledges the receipt within 48 hours

Re: Why are not query keywords treated as a set?

2011-06-19 Thread Gabriele Kahlout
...@googlemail.comwrote: do you mean a phrase query? past past can you give some more detail? On 18 June 2011 13:02, Gabriele Kahlout gabri...@mysimpatico.com wrote: q=past past 1.0 = (MATCH) sum of: * 0.5 = (MATCH) fieldWeight(content:past in 0), product of:* 1.0 = tf(termFreq(content:past)=1

Why does paste get parsed into past?

2011-06-18 Thread Gabriele Kahlout
Hello, Debugging query results I find that: str name=querystringpaste/str str name=parsedquerycontent:past/str Now paste and past are two different words. Why does Solr not consider that? How do I make it? -- Regards, K. Gabriele --- unchanged since 20/9/10 --- P.S. If the subject contains

Why are not query keywords treated as a set?

2011-06-18 Thread Gabriele Kahlout
q=past past 1.0 = (MATCH) sum of: * 0.5 = (MATCH) fieldWeight(content:past in 0), product of:* 1.0 = tf(termFreq(content:past)=1) 1.0 = idf(docFreq=1, maxDocs=2) 0.5 = fieldNorm(field=content, doc=0) * 0.5 = (MATCH) fieldWeight(content:past in 0), product of:* 1.0 =

Re: Why does paste get parsed into past?

2011-06-18 Thread Gabriele Kahlout
Schiettecatte fschietteca...@gmail.com What do you have set up for stemming? François On Jun 18, 2011, at 8:00 AM, Gabriele Kahlout wrote: Hello, Debugging query results I find that: str name=querystringpaste/str str name=parsedquerycontent:past/str Now paste and past are two different

Is it true that I cannot delete stored content from the index?

2011-06-18 Thread Gabriele Kahlout
Hello, I've indexing with the content field stored. Now I'd like to delete all stored content, is there how to do that without re-indexing? It seems not from lucene FAQhttp://wiki.apache.org/lucene-java/LuceneFAQ#How_do_I_update_a_document_or_a_set_of_documents_that_are_already_indexed.3F : How

It's not possible to decide at run-time which similarity class to use, right?

2011-06-16 Thread Gabriele Kahlout
Hello, I'm testing out different Similarity implementations, and to do that I restart Solr each time I want to try a different similarity class I change the class attributed of the similiary element in schema.xml. Beside running multiple-cores, each with its own schema, is there a way to tell the

Re: It's not possible to decide at run-time which similarity class to use, right?

2011-06-16 Thread Gabriele Kahlout
the run-time similarity? I think many would welcome such responsibility distinction. Erik On Jun 16, 2011, at 14:55 , Gabriele Kahlout wrote: Hello, I'm testing out different Similarity implementations, and to do that I restart Solr each time I want to try a different

Re: How do I make sure the resulting documents contain the query terms?

2011-06-07 Thread Gabriele Kahlout
examples? This *looks* like filter queries, but I think you already know about those... Best Erick On Mon, Jun 6, 2011 at 4:00 PM, Gabriele Kahlout gabri...@mysimpatico.com wrote: Hello, I've seen that through boosting it's possible to influence the scoring function, but what I would

Re: How do I make sure the resulting documents contain the query terms?

2011-06-07 Thread Gabriele Kahlout
On Tue, Jun 7, 2011 at 8:43 AM, pravesh suyalprav...@yahoo.com wrote: k0 -- A | C k1 -- A | B k2 -- A | B | C k3 -- B | C Now let q=k1, how do I make sure C doesn't appear as a result since it doesn't contain any occurence of k1? Do we bother to do that. Now that's what lucene does :)

Re: How do I make sure the resulting documents contain the query terms?

2011-06-07 Thread Gabriele Kahlout
no reason documentC should be in the result set at all in your case of q=k1, where docC is not indexed under k1. On 6/7/2011 2:35 AM, Gabriele Kahlout wrote: Sorry being unclear and thank you for answering. Consider the following documents A(k0,k1,k2), B(k1,k2,k3), and C(k0,k2,k3), where A,B,C

How do I make sure the resulting documents contain the query terms?

2011-06-06 Thread Gabriele Kahlout
Hello, I've seen that through boosting it's possible to influence the scoring function, but what I would like is sort of a boolean property. In some way it's to search only the indexed documents by that keyword (or the intersection/union) rather than the whole set. Is this supported in any way?

Re: How to know how many documents are indexed? Anything more elegant than parsing numFound?

2011-06-04 Thread Gabriele Kahlout
sorry, this was my bad.. should have used and ! (append) On Fri, Jun 3, 2011 at 9:45 PM, Gabriele Kahlout gabri...@mysimpatico.comwrote: $ curl --fail http://192.168.34.51:8080/solr/admin/stats.jsp; resp.xml $ xmlstarlet sel -t -v //@numDocs resp.xml *Extra content at the end

How to know how many documents are indexed? Anything more elegant than parsing numFound?

2011-06-03 Thread Gabriele Kahlout
$ curl http://192.168.34.51:8080/solr/select?q=*%3A*rows=0; resp.xml $ xmlstarlet sel -t -v //@numFound resp.xml -- Regards, K. Gabriele --- unchanged since 20/9/10 --- P.S. If the subject contains [LON] or the addressee acknowledges the receipt within 48 hours then I don't resend the email.

Re: How to know how many documents are indexed? Anything more elegant than parsing numFound?

2011-06-03 Thread Gabriele Kahlout
$ curl --fail http://192.168.34.51:8080/solr/admin/stats.jsp; resp.xml $ xmlstarlet sel -t -v //@numDocs resp.xml *Extra content at the end of the document* On Fri, Jun 3, 2011 at 8:56 PM, Ahmet Arslan iori...@yahoo.com wrote: : How to know how many documents are indexed? Anything more elegant

What's the need for a complicated SolrTestCaseJ4.getClassName() ?

2011-05-23 Thread Gabriele Kahlout
Hello, As long as I subclass SolrTestCaseJ4 I cannot do this.getClass().getSimpleName(), I don't understand why. I wonder if the following complicated methods in SolrTestCaseJ4 have anything to do with it? protected static String getClassName() { StackTraceElement[] stack = new

(How) can I use SolrTestCaseJ4.assertQ(..) to test an existing index?

2011-05-21 Thread Gabriele Kahlout
Hello, Examining Solr Core example it seems that a new index is created in a temp dataDir deleted after each test (Good practice - agreed). But before I start debugging adoc(..) I'm wondering if I can query the same index which I see to work through Solr Web Server interface. Also for large

Re: (How) can I use SolrTestCaseJ4.assertQ(..) to test an existing index?

2011-05-21 Thread Gabriele Kahlout
On Sat, May 21, 2011 at 3:29 PM, Gabriele Kahlout gabri...@mysimpatico.comwrote: Hello, Examining Solr Core example it seems that a new index is created in a temp dataDir deleted after each test (Good practice - agreed). errr..from a test to the other only dataDir is rm

Re: How to test Solr Integartion - how to get EmbeddedSolrServer?

2011-05-18 Thread Gabriele Kahlout
module for Tests which depend on Solr Core and on the Test Framework. The org burden of that extra module, versus the ease of building configuration, I believe, outweights. On Tue, May 17, 2011 at 7:11 PM, Gabriele Kahlout gabri...@mysimpatico.comwrote: http://stackoverflow.com/questions

Re: How to list/see all the indexed terms of a particular field in a document?

2011-05-18 Thread Gabriele Kahlout
ant luke? On Wed, May 18, 2011 at 11:47 AM, Gnanakumar gna...@zoniac.com wrote: Hi, I'm using Apache Solr v3.1. How do I list/get to see all the indexed terms of a particular field in a document (by passing Unique Key ID of the document)? For example, I've the following field definition

Does every Solr request-response require a running server?

2011-05-18 Thread Gabriele Kahlout
Hello, I'm wondering if Solr Test framework at the end of the day always runs an embedded/jetty server (which is the only way to interact with solr, i.e. no web server -- no solr) or in the tests they interact without one, calling directly the under line methods? The latter seems to be the case

Re: Does every Solr request-response require a running server?

2011-05-18 Thread Gabriele Kahlout
On Wed, May 18, 2011 at 5:09 PM, Yonik Seeley yo...@lucidimagination.comwrote: On Wed, May 18, 2011 at 10:50 AM, Gabriele Kahlout gabri...@mysimpatico.com wrote: Hello, I'm wondering if Solr Test framework at the end of the day always runs an embedded/jetty server (which is the only way

How to test Solr Integartion - how to get EmbeddedSolrServer?

2011-05-17 Thread Gabriele Kahlout
Hello, I'm starting to write tests of my Solr integration, and have unfortunately spent a lot of time chasing updated documentation. Follows a test I found herehttp://blog.synyx.de/2011/01/integration-tests-for-your-solr-config/which uses anEmbeddedSolrServerto communicate with the server and

Re: How to test Solr Integartion - how to get EmbeddedSolrServer?

2011-05-17 Thread Gabriele Kahlout
thank you. I'd like to stick to the same version (i.e. 3.2-SNAPSHOT). It seems things have changed there. To reproduce (should we file this and add my test as a test to avoid this bumping up again?) $ svn co -r 1104120 http://svn.apache.org/repos/asf/lucene/dev/branches/branch_3x/ solr cd solr;

Re: How to test Solr Integartion - how to get EmbeddedSolrServer?

2011-05-17 Thread Gabriele Kahlout
On Tue, May 17, 2011 at 3:44 PM, Steven A Rowe sar...@syr.edu wrote: Hi Gabriele, On 5/17/2011 at 9:34 AM, Gabriele Kahlout wrote: Solr Core should declare a test dependency on Solr Test Framework. I agree: - Solr Core should have a test-scope dependency on Solr Test Framework. - Solr

Re: How do i I modify XMLWriter to write foobar?

2011-05-15 Thread Gabriele Kahlout
Got this sorted checking out the branch revision. On Thu, May 5, 2011 at 9:44 PM, Gabriele Kahlout gabri...@mysimpatico.comwrote: I've now tried to write my own QueryResponseWriter plugin[1], as a maven project depending on Solr Core 3.1, which is the same version of Solr I've installed

How to get the filtered terms from a Query in the ResponseWriter?

2011-05-15 Thread Gabriele Kahlout
Hello, For a given q string I'm trying to extract the terms (identifiers of tokens) that the Query Parser identified at terms (and shows when explaining results). I manage to do it as follows, but *I hope there a better way (more direct) you will tell me about:* NamedList analysis = new

How to plugin the value of a Field? DocInverterPerField?

2011-05-14 Thread Gabriele Kahlout
Hello, I'm trying to add an extra field to the schema.xml that is only stored, but with nutch not knowing about it, I don't know how to tell Solr of its value for each document. I'd like to plugin the computation, something like is done with Similarity, but I'm not sure how to do that. From

Re: How to plugin the value of a Field? DocInverterPerField?

2011-05-14 Thread Gabriele Kahlout
It looks like I've to contact updateHandler class=solr.DirectUpdateHandler2 with an AddUpdateCommand . On Sat, May 14, 2011 at 12:36 PM, Gabriele Kahlout gabri...@mysimpatico.com wrote: Hello, I'm trying to add an extra field to the schema.xml that is only stored, but with nutch

Re: How to plugin the value of a Field? DocInverterPerField?

2011-05-14 Thread Gabriele Kahlout
I calculate it from search-time + index-time field values. For example, say I want to print the reciprocal of the content field norm (available at index-time) along every document in the results. What's the 'clean' way of doing that? On Sat, May 14, 2011 at 3:42 PM, Markus Jelsma

Re: Want to Delete Existing Index create fresh index

2011-05-14 Thread Gabriele Kahlout
/index and restart, the index directory will be automatically recreated. François On May 14, 2011, at 1:53 AM, Gabriele Kahlout wrote: curl --fail $solrIndex/update?commit=true -d 'deletequery*:*/query/delete' #empty index [1 http://wiki.apache.org/nutch

Re: How to plugin the value of a Field? DocInverterPerField?

2011-05-14 Thread Gabriele Kahlout
the value is computed and added. As acknowledged by others before me, there's room for refactoring ResponseWriters to at least make them more re-usable. Hope SOLR-1566 https://issues.apache.org/jira/browse/SOLR-1566 come up with a cleaner solution. On Sat, May 14, 2011 at 3:55 PM, Gabriele Kahlout

Editor loads wrong version of IndexSearcher while debugging - how to fix?

2011-05-13 Thread Gabriele Kahlout
Hello, I'm debugging Solr built as a maven project in NB, and when I enter the code of a Lucene dependency, namely org.apache.lucene.search.IndexSearcher.explain(..) the call stack expects this method to be at line 599 while in the editor the class ends at 304. from solr-core's pom.xml:

Re: Want to Delete Existing Index create fresh index

2011-05-13 Thread Gabriele Kahlout
curl --fail $solrIndex/update?commit=true -d 'deletequery*:*/query/delete' #empty index [1 http://wiki.apache.org/nutch/Whole-Web%20Crawling%20incremental%20script] did u try? On Sat, May 14, 2011 at 7:26 AM, Pawan Darira pawan.dar...@gmail.comwrote: Hi I had an existing index created

Re: Is it possible to build Solr as a maven project?

2011-05-12 Thread Gabriele Kahlout
On Tue, May 10, 2011 at 3:56 PM, Gabriele Kahlout gabri...@mysimpatico.comwrote: On Tue, May 10, 2011 at 3:50 PM, Steven A Rowe sar...@syr.edu wrote: Hi Gabriele, There are some Maven instructions here (not in Lucene/Solr 3.1 because I just wrote the file a couple of days ago): http

Coord in queryExplain

2011-05-12 Thread Gabriele Kahlout
Hello, I'm wondering why the results of coord() are not displayed when debugging query results, as described in the wiki[1http://wiki.apache.org/solr/SolrRelevancyFAQ#Why_does_id:archangel_come_before_id:hawkgirl_when_querying_for_.22wings.22]. I'd like to see it. Could someone point to how to

Re: Coord in queryExplain

2011-05-12 Thread Gabriele Kahlout
You are right! On Thu, May 12, 2011 at 2:54 PM, Ahmet Arslan iori...@yahoo.com wrote: I'm wondering why the results of coord() are not displayed when debugging query results, as described in the wiki[1

No more standard query type?

2011-05-11 Thread Gabriele Kahlout
Is the tagged release of solr 3.1 different from the one distributed in the downloads page? It looks like a reproducible bug. svn co -r 1101526 http://svn.apache.org/repos/asf/lucene/dev/tags/lucene_solr_3_1 solr This is the default query I get from http://localhost:8080/solr/admin/form.jsp:

Re: Is it possible to build Solr as a maven project?

2011-05-10 Thread Gabriele Kahlout
locations. I'm using netbeans and I'm using the plugin Automatic Projects to do everything inside the IDE. Which version of Solr are you using ? Ludovic. 2011/5/4 Gabriele Kahlout [via Lucene] ml-node+2898211-2124746009-383...@n3.nabble.com generate-maven-artifacts: [mkdir] Created

Re: Is it possible to build Solr as a maven project?

2011-05-10 Thread Gabriele Kahlout
. Please write back if you run into any problems. Steve From: Gabriele Kahlout [mailto:gabri...@mysimpatico.com] Sent: Tuesday, May 10, 2011 8:37 AM To: boutr...@gmail.com Cc: solr-user@lucene.apache.org; Steven A Rowe; ryan...@gmail.com Subject: Re: Is it possible to build Solr as a maven project

SolrHome ends with /./ - is this normal?

2011-05-10 Thread Gabriele Kahlout
Hello, I'm having trouble getting Solr 3.1 to work with nutch-1.3. I'm not sure where the problem is, but I'm wondering why does the solrHome path end with /./. cwd=/Applications/NetBeans/apache-tomcat-7.0.6/bin SolrHome=/Users/simpatico/apache-solr-3.1.0/solr/./ In the web.xml of solr:

Re: SolrHome ends with /./ - is this normal?

2011-05-10 Thread Gabriele Kahlout
:04 +0200 (Tue, 10 May 2011) Does this work for you? All I've done is svn co nutch 1.3 and execute my script which up to now worked. On Tue, May 10, 2011 at 4:11 PM, Gabriele Kahlout gabri...@mysimpatico.comwrote: Hello, I'm having trouble getting Solr 3.1 to work with nutch-1.3. I'm

Re: SolrHome ends with /./ - is this normal?

2011-05-10 Thread Gabriele Kahlout
, Gabriele Kahlout gabri...@mysimpatico.comwrote: It apparently is normal, and my issue is indeed with nutch. I've modified post.sh from the example docs to use the solr in http://localhost:8080/apache-solr-3.1-SNAPSHOT and now finally data made it to the index. $ post.sh solr.xml monitor.xml

Re: SolrHome ends with /./ - is this normal?

2011-05-10 Thread Gabriele Kahlout
. On Tuesday 10 May 2011 16:40:02 Gabriele Kahlout wrote: From solr logs: May 10, 2011 4:33:20 PM org.apache.solr.common.SolrException log *SEVERE: org.apache.solr.common.SolrException: ERROR:unknown field 'content' * at org.apache.solr.update.DocumentBuilder.toDocument

Re: SolrHome ends with /./ - is this normal?

2011-05-10 Thread Gabriele Kahlout
shipped with Nutch in Solr. It provides most fields that you need. On Tuesday 10 May 2011 17:31:33 Gabriele Kahlout wrote: I don't get you, are you talking about conf/schema.xml? That's what I'm referring to. Am i supposed to do something with the nutch's conf/schema.xml? On Tue, May

Re: SolrHome ends with /./ - is this normal?

2011-05-10 Thread Gabriele Kahlout
actually something changed, I managed to crawl and index some pages (the other must have to do with regex-urls). Thank you! Was this always necessary? Any pointer discussing why it's needed? On Tue, May 10, 2011 at 5:40 PM, Gabriele Kahlout gabri...@mysimpatico.comwrote: You mean that I should

Re: Solr 4.0

2011-05-09 Thread Gabriele Kahlout
REPOST as a more general question about ivy dependencies: http://stackoverflow.com/questions/5941789/do-ivy-dependency-revisions-have-anything-to-do-with-svns On Mon, May 9, 2011 at 11:31 AM, Gabriele Kahlout gabri...@mysimpatico.comwrote: I think you are talking about this dependency

Why is org.apache.solr.response.XMLWriter final?

2011-05-05 Thread Gabriele Kahlout
Hello, It's final in the trunk, and has always been since conception in 2006 at revision 372455. Why? -- Regards, K. Gabriele --- unchanged since 20/9/10 --- P.S. If the subject contains [LON] or the addressee acknowledges the receipt within 48 hours then I don't resend the email.

How do I debug Unable to evaluate expression using this context printed at start?

2011-05-05 Thread Gabriele Kahlout
I've tried to re-install solr on tomcat, and now when I launch tomcat in debug mode I see the following exception relating to solr. It's not enough to understand the problem (and fix it), but I don't know where to look for more (or what to do). Please help me. Following the tutorial and

Re: Is it possible to build Solr as a maven project?

2011-05-05 Thread Gabriele Kahlout
...@syr.edu wrote: Hi Gabriele, The sequence should be 1. svn update 2. ant get-maven-poms 3. mvn -N -Pbootstrap install I think you left out #2 - there was a very recent change to the POMs that affects the noggit jar name. Steve -Original Message- From: Gabriele Kahlout

Re: How do I debug Unable to evaluate expression using this context printed at start?

2011-05-05 Thread Gabriele Kahlout
://markmail.org/thread/3y4zqieyjqfi5vl3. Thank you Chris! On Thu, May 5, 2011 at 2:58 PM, Gabriele Kahlout gabri...@mysimpatico.comwrote: I've tried to re-install solr on tomcat, and now when I launch tomcat in debug mode I see the following exception relating to solr. It's not enough to understand

Re: Is it possible to build Solr as a maven project?

2011-05-05 Thread Gabriele Kahlout
May 05 20:39:09 CEST 2011 [INFO] Final Memory: 38M/90M [INFO] On Thu, May 5, 2011 at 6:53 PM, Steven A Rowe sar...@syr.edu wrote: Hi Gabriele, On 5/5/2011 at 9:57 AM, Gabriele Kahlout wrote: Okay, that sequence worked

Re: How do i I modify XMLWriter to write foobar?

2011-05-05 Thread Gabriele Kahlout
I've now tried to write my own QueryResponseWriter plugin[1], as a maven project depending on Solr Core 3.1, which is the same version of Solr I've installed. It seems I'm not able to get rid of some cache. $ xmlstarlet sel -t -c /config/queryResponseWriter conf/solrconfig.xml

Re: Is it possible to build Solr as a maven project?

2011-05-05 Thread Gabriele Kahlout
Just for the reference. $ svn update At revision 1099940. On Thu, May 5, 2011 at 9:14 PM, Steven A Rowe sar...@syr.edu wrote: You're welcome, I'm glad you got it to work. - Steve -Original Message- From: Gabriele Kahlout [mailto:gabri...@mysimpatico.com] Sent: Thursday, May 05

Is it possible to build Solr as a maven project?

2011-05-04 Thread Gabriele Kahlout
Hello, I'm trying to modify Solr and I think debugging will be very useful to understand what's going on. Hence I'd like to use an IDE (NetBeans) which automatically supports Maven projects. I see under src/maven that there are templates but I'm not sure how to use them to mavenize the

Re: Is it possible to build Solr as a maven project?

2011-05-04 Thread Gabriele Kahlout
the project as a standard maven project. Ludovic. 2011/5/4 Gabriele Kahlout [via Lucene] ml-node+2898068-621882422-383...@n3.nabble.com Hello, I'm trying to modify Solr and I think debugging will be very useful to understand what's going on. Hence I'd like to use an IDE (NetBeans) which

Re: Is it possible to build Solr as a maven project?

2011-05-04 Thread Gabriele Kahlout
target in my script. I'm using netbeans and I'm using the plugin Automatic Projects to do everything inside the IDE. Which version of Solr are you using ? the official latest: 3.1 Maybe I can copy-paste from the build script you are using? Ludovic. 2011/5/4 Gabriele Kahlout [via Lucene

  1   2   >