on the list.
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Lucene ecosystem search :: http://search-lucene.com/
From: Gabriele Kahlout gabri...@mysimpatico.com
To: solr-user@lucene.apache.org; Otis Gospodnetic
otis_gospodne...@yahoo.com
Sent
Hello,
While indexing there are certain urls/ids I'd never want to appear in the
search results (so be indexed). Is there already a 'supported by design'
mechanism to do that to point me too, or should I just create this blacklist
as an processor in the update chain?
--
Regards,
K. Gabriele
I'm interested in the stopwords solution as it sounds like less work but i'm
not sure i understand how it works. By having msn.com as a stopword it doesnt
mean i wont get msn.com as a result for say 'hotmail'. My understanding is that
msn.com will never make it to the similarity function and
On Mon, Sep 5, 2011 at 1:22 PM, Markus Jelsma markus.jel...@openindex.iowrote:
Hi,
URI paths are case-sensitive. If you really want to treat all URL's as
case-
insensitive i would suggest to modifiy the basic URL normalizer to
lowercase
all URL's so that it also ends up lowercased in the
Hi,
I've just noticed that two search results of indexed data have the same url:
http://www.atory.com/dupe_checker_pro/
http://www.atory.com/dupe_checker_PRO/
I thought the url/id was case-insentively unique. Is there how I can set it
up to be so?
For Solr it makes sense not to make it the
@gmail.com wrote:
you might want to check - http://wiki.apache.org/solr/TermVectorComponent
Should provide you with the term vectors with a lot of additional info.
Regards,
Jayendra
On Tue, Aug 30, 2011 at 3:34 AM, Gabriele Kahlout
gabri...@mysimpatico.com wrote:
Hello,
This time I'm trying
I'll
remove the JSP dependency and this may eventually making it into trunk.
Thanks,
-Trey Grainger
Search Technology Development Team Lead, Careerbuilder.com
Site Architect, Celiaccess.com
On Tue, Jul 5, 2011 at 3:59 PM, Gabriele Kahlout
gabri...@mysimpatico.comwrote:
Hello
tokens, why would it be a `set` ?
this might help in your analysis chain
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.RemoveDupl
icatesTokenFilterFactory
On 20 June 2011 04:21, Gabriele Kahlout gabri...@mysimpatico.com
wrote:
str name=rawquerystringpast past/str
this works:
doc.remove(wc);
SolrInputField wcField = new SolrInputField(wc);
wcField.setValue(150, 1.0f);
doc.put(wc,wcField);
On Wed, Jul 13, 2011 at 4:19 PM, Gabriele Kahlout
gabri...@mysimpatico.comwrote:
SolrInputDocument doc = new SolrInputDocument
IndexReader getReader() throws CorruptIndexException, IOException {
return IndexReader.open(FSDirectory.open(new
File(h.getCore().getIndexDir())), true);
}
*org.apache.lucene.index.IndexNotFoundException: no segments* file found in
(FSDirectory.open(new File(h.getCore().getIndexDir())),
true); //for me it fails here. But since the document was added I suspect
this is a bug
On Thu, Jul 14, 2011 at 10:48 PM, Shalin Shekhar Mangar
shalinman...@gmail.com wrote:
On Thu, Jul 14, 2011 at 1:56 PM, Gabriele Kahlout
gabri
from the index. Verify that the doc you retrieve from the index
has values for content, I bet it doesn't
Best
Erick
On Tue, Jul 12, 2011 at 9:38 AM, Gabriele Kahlout
gabri...@mysimpatico.com wrote:
@Test
public void testUpdateLoseTermsSimplified() throws Exception
unstored but indexed fields).
If the field is stored, then there's another problem, you might
want to dump the document after reading it from the IR.
Best
Erick
On Wed, Jul 13, 2011 at 2:25 AM, Gabriele Kahlout
gabri...@mysimpatico.com wrote:
It indeed is not stored, but this is still
unrelated to content (number of
inbound links for an example) they would have to re-crawl the page again.
This is at least !intuitive.
On Wed, Jul 13, 2011 at 2:40 PM, Michael Kuhlmann s...@kuli.org wrote:
Am 13.07.2011 14:05, schrieb Gabriele Kahlout:
this is what i was expecting. Otherwise
On Wed, Jul 13, 2011 at 3:54 PM, Michael Kuhlmann s...@kuli.org wrote:
Am 13.07.2011 15:37, schrieb Gabriele Kahlout:
Well, I'm !sure how usual this scenario would be:
1. In general those using solr with nutch don't store the content field
to
avoid storing the whole web/intranet
));
assertU(commit());
assertNumFound(1);
The above test fails until I change the following in schema.xml:
- fieldType name=int class=solr.*TrieIntField* omitNorms=true/
+ fieldType name=int class=solr.*IntField* omitNorms=true/
On Sun, Jul 10, 2011 at 10:36 PM, Gabriele Kahlout gabri
adminPath=/admin/cores defaultCoreName=live shareSchema=true
core name=live instanceDir=. dataDir=live /
core name=test instanceDir=. dataDir=test /
/cores
/solr
2011/7/11 Gabriele Kahlout gabri...@mysimpatico.com:
have a look here [1].
[1]
https://issues.apache.org/jira/browse
Hello,
I'm trying to get the norm of an indexed document for a given field but
beside reader.norms(fieldName) I'm not finding any API to retrieve it. Now
reader.norms(..) returns an array with the norms for that field of all
indexed documents. How do I know the index of my document in there?
With a lucene QueryParser instance it's possible to set the analyzer in use.
I suspect Solr doesn't use the same analyzer it used at indexing, defined in
schema.xml but I cannot verify that without the queryparser instance.
From Jan's diagram it seems this is set in the SearchHandler's init. Is
have a look here [1].
[1]
https://issues.apache.org/jira/browse/SOLR-2645?focusedCommentId=13062748page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13062748
On Mon, Jul 11, 2011 at 4:46 PM, Mark Schoy hei...@gmx.de wrote:
Hi,
I tried to create a solr core but I
Hello,
IndexWriter writer = new IndexWriter(FSDirectory.open(new
File(req.getCore().getDataDir(), index)), req.getSchema().getAnalyzer(),
IndexWriter.MaxFieldLength.LIMITED);
updateSolrIndex(writer);
But this is what I get (I know that RequestHandler are not intended to
On Sun, Jul 10, 2011 at 6:21 PM, Koji Sekiguchi k...@r.email.ne.jp wrote:
There are such RequestHandlers. Look at CSVRequestHandler, for example.
IndexWriter writer = new IndexWriter(FSDirectory.open(**new
File(req.getCore().getDataDir(**), index)),
http://www.lucidimagination.com
On Sat, Jul 9, 2011 at 10:50 AM, Gabriele Kahlout
gabri...@mysimpatico.com wrote:
http://localhost:8080/solr/select?indent=onversion=2.2q=*%3A**
fq=wc%3A%5B255+TO+257%5D*
start=0rows=10fl=*%2Cscoreqt=wt=xmlexplainOther=hl.fl=
The toString of the request
I've stored the contents of some pages I no longer need. How can I now
delete the stored content without re-crawling the pages (i.e. using
updateDocument ). I cannot just remove the field, since I still want the
field to be indexed, I just don't want to store something with it.
My understanding is
I'm trying to filter a query by the value of a numeric field. I can do it in
Java as follows, but I don't know how to do it with the query syntax, and I
found no documentation of it.
@Test
public void testFqWc() throws Exception {
IndexSearcher searcher = wc();
*Filter wc3 =
I don't get it to work!
If I specify no fq I get the first result with int name=wc256/int
With wc:[255 TO 257] (fq=wc%3A%5B255+TO+257%5D) nothing comes out.
On Sat, Jul 9, 2011 at 12:29 PM, Markus Jelsma
markus.jel...@openindex.iowrote:
Hu? It's describe in the link Ahmet's given you.
I'm
http://localhost:8080/solr/select?indent=onversion=2.2q=*%3A**
fq=wc%3A%5B255+TO+257%5D*
start=0rows=10fl=*%2Cscoreqt=wt=xmlexplainOther=hl.fl=
The toString of the request:
{explainOther=fl=*,scoreindent=onstart=0q=*:*hl.fl=qt=wt=xmlfq=wc:[255+TO+257]rows=1version=2.2}
Even when the
no single-field
update capability in Lucene (yet?).
-Mike
On 7/3/2011 1:09 PM, Gabriele Kahlout wrote:
Is there how I can compute and add the field to all indexed documents
without re-indexing? MyField counts the number of terms per document
(unique
word count).
On Sun, Jul 3, 2011 at 12
eventually making it into trunk.
Thanks,
-Trey Grainger
Search Technology Development Team Lead, Careerbuilder.com
Site Architect, Celiaccess.com
On Tue, Jul 5, 2011 at 3:59 PM, Gabriele Kahlout
gabri...@mysimpatico.comwrote:
Hello,
With an inverted index the term is the key
input docs from NutchDocument objects. Solr will do
analysis. The integration is analogous to XML post of Solr documents.
On Tuesday 05 July 2011 12:28:21 Gabriele Kahlout wrote:
Hello,
I'm trying to understand better Nutch and Solr integration. My
understanding is that Documents are added
new
RemoveDuplicatesTokenFilter(englishPorterFilterFactory.create(new
LowerCaseFilter(wordDelimiterFilterFactory.create(stopFilterFactory.create(new
WhitespaceTokenizer(reader));
}
}
On Tue, Jul 5, 2011 at 1:00 PM, Gabriele Kahlout
gabri...@mysimpatico.comwrote:
nice...where
CharFilterFactory[charFilters.size()]),
tokenizers.get(0), filters.toArray(new
TokenFilterFactory[filters.size()]));
};
On Tue, Jul 5, 2011 at 2:26 PM, Gabriele Kahlout
gabri...@mysimpatico.comwrote:
I suspect the following should do (1). I'm just not sure about file
references
the answer to 2) is new IndexSchema(solrConf, schema).getAnalyzer();
On Tue, Jul 5, 2011 at 2:48 PM, Gabriele Kahlout
gabri...@mysimpatico.comwrote:
Not yet an answer to 2) but this is where and how Solr initializes the
Analyzer defined in the schema.xml
@Test
public void testUpdate() throws IOException,
ParserConfigurationException, SAXException, ParseException {
Analyzer analyzer = getAnalyzer();
QueryParser parser = new QueryParser(Version.LUCENE_32, content,
analyzer);
Query allQ = parser.parse(*:*);
PM, Gabriele Kahlout
gabri...@mysimpatico.com wrote:
@Test
public void testUpdate() throws IOException,
ParserConfigurationException, SAXException, ParseException {
Analyzer analyzer = getAnalyzer();
QueryParser parser = new QueryParser(Version.LUCENE_32, content
://blog.mikemccandless.com
On Tue, Jul 5, 2011 at 2:12 PM, Gabriele Kahlout
gabri...@mysimpatico.com wrote:
and how do you do that? There is no reopen method
On Tue, Jul 5, 2011 at 8:09 PM, Michael McCandless
luc...@mikemccandless.com wrote:
After your writer.commit you need to reopen your searcher
);
}
On Tue, Jul 5, 2011 at 8:23 PM, Gabriele Kahlout
gabri...@mysimpatico.comwrote:
Still won't work (same as before).
@Test
public void testUpdate() throws IOException,
ParserConfigurationException, SAXException, ParseException {
Analyzer analyzer = getAnalyzer();
QueryParser
Hello,
With an inverted index the term is the key, and the documents are the
values. Is it still however possible that given a document id I get the
terms indexed for that document?
--
Regards,
K. Gabriele
--- unchanged since 20/9/10 ---
P.S. If the subject contains [LON] or the addressee
I had looked an term vectors but don't understand them to solve my problem.
Consider the following index entries:
t0, doc0, doc1
t1, doc0
From the 2nd entry we know that t1 is only present in doc0.
Now, my problem, given doc0 how can I know which terms occur in in (t0 and
t1) (without storing
Gee, I was about to post. I figured my issue is that of computing the unique
terms per document. One approach to compute that value is running the
analyzer on the document before before calling addDocument, and count the
number of tokens.
Then I can invoke addDocument with the value of the field
Hello,
I want to have an additional field that appears for every document in
search results. I understand that I should do this by adding the field to
the schema.xml, so I add:
field name=myField default=0 type=integer stored=true
indexed=false/
Then I restart Solr (so that I loads the new
field ?
The results will just bring back docs and what fields they have. They won't
bring back null fields just because they are in your schema. Lucene
is schema-less.
Solr adds the schema to make it nice to administer and very powerful to
use.
On 3 July 2011 11:01, Gabriele Kahlout gabri
Hello,
I'm trying to add a field that counts the number of terms in a document to
my schema. So far I've been computing this value at query-time. Is there how
I could compute this once only and store the field?
final SolrIndexSearcher searcher = request.getSearcher();
final
Hello,
Beside creating an index with just the site in question, is it possible like
with Google to search for results only in a given domain?
--
Regards,
K. Gabriele
--- unchanged since 20/9/10 ---
P.S. If the subject contains [LON] or the addressee acknowledges the
receipt within 48 hours
...@googlemail.comwrote:
do you mean a phrase query? past past
can you give some more detail?
On 18 June 2011 13:02, Gabriele Kahlout gabri...@mysimpatico.com wrote:
q=past past
1.0 = (MATCH) sum of:
* 0.5 = (MATCH) fieldWeight(content:past in 0), product of:*
1.0 = tf(termFreq(content:past)=1
Hello,
Debugging query results I find that:
str name=querystringpaste/str
str name=parsedquerycontent:past/str
Now paste and past are two different words. Why does Solr not consider
that? How do I make it?
--
Regards,
K. Gabriele
--- unchanged since 20/9/10 ---
P.S. If the subject contains
q=past past
1.0 = (MATCH) sum of:
* 0.5 = (MATCH) fieldWeight(content:past in 0), product of:*
1.0 = tf(termFreq(content:past)=1)
1.0 = idf(docFreq=1, maxDocs=2)
0.5 = fieldNorm(field=content, doc=0)
* 0.5 = (MATCH) fieldWeight(content:past in 0), product of:*
1.0 =
Schiettecatte fschietteca...@gmail.com
What do you have set up for stemming?
François
On Jun 18, 2011, at 8:00 AM, Gabriele Kahlout wrote:
Hello,
Debugging query results I find that:
str name=querystringpaste/str
str name=parsedquerycontent:past/str
Now paste and past are two different
Hello,
I've indexing with the content field stored. Now I'd like to delete all
stored content, is there how to do that without re-indexing?
It seems not from lucene
FAQhttp://wiki.apache.org/lucene-java/LuceneFAQ#How_do_I_update_a_document_or_a_set_of_documents_that_are_already_indexed.3F
:
How
Hello,
I'm testing out different Similarity implementations, and to do that I
restart Solr each time I want to try a different similarity class I change
the class attributed of the similiary element in schema.xml. Beside running
multiple-cores, each with its own schema, is there a way to tell the
the run-time similarity?
I think many would welcome such responsibility distinction.
Erik
On Jun 16, 2011, at 14:55 , Gabriele Kahlout wrote:
Hello,
I'm testing out different Similarity implementations, and to do that I
restart Solr each time I want to try a different
examples? This *looks* like filter queries,
but I think you already know about those...
Best
Erick
On Mon, Jun 6, 2011 at 4:00 PM, Gabriele Kahlout
gabri...@mysimpatico.com wrote:
Hello,
I've seen that through boosting it's possible to influence the scoring
function, but what I would
On Tue, Jun 7, 2011 at 8:43 AM, pravesh suyalprav...@yahoo.com wrote:
k0 -- A | C
k1 -- A | B
k2 -- A | B | C
k3 -- B | C
Now let q=k1, how do I make sure C doesn't appear as a result since it
doesn't contain any occurence of k1?
Do we bother to do that. Now that's what lucene does :)
no reason documentC should be in the result set at
all in your case of q=k1, where docC is not indexed under k1.
On 6/7/2011 2:35 AM, Gabriele Kahlout wrote:
Sorry being unclear and thank you for answering.
Consider the following documents A(k0,k1,k2), B(k1,k2,k3), and
C(k0,k2,k3),
where A,B,C
Hello,
I've seen that through boosting it's possible to influence the scoring
function, but what I would like is sort of a boolean property. In some way
it's to search only the indexed documents by that keyword (or the
intersection/union) rather than the whole set.
Is this supported in any way?
sorry, this was my bad.. should have used and ! (append)
On Fri, Jun 3, 2011 at 9:45 PM, Gabriele Kahlout
gabri...@mysimpatico.comwrote:
$ curl --fail http://192.168.34.51:8080/solr/admin/stats.jsp; resp.xml
$ xmlstarlet sel -t -v //@numDocs resp.xml
*Extra content at the end
$ curl http://192.168.34.51:8080/solr/select?q=*%3A*rows=0; resp.xml
$ xmlstarlet sel -t -v //@numFound resp.xml
--
Regards,
K. Gabriele
--- unchanged since 20/9/10 ---
P.S. If the subject contains [LON] or the addressee acknowledges the
receipt within 48 hours then I don't resend the email.
$ curl --fail http://192.168.34.51:8080/solr/admin/stats.jsp; resp.xml
$ xmlstarlet sel -t -v //@numDocs resp.xml
*Extra content at the end of the document*
On Fri, Jun 3, 2011 at 8:56 PM, Ahmet Arslan iori...@yahoo.com wrote:
: How to know how many documents are indexed? Anything more elegant
Hello,
As long as I subclass SolrTestCaseJ4 I cannot do
this.getClass().getSimpleName(), I don't understand why. I wonder if the
following complicated methods in SolrTestCaseJ4 have anything to do with
it?
protected static String getClassName() {
StackTraceElement[] stack = new
Hello,
Examining Solr Core example it seems that a new index is created in a temp
dataDir deleted after each test (Good practice - agreed). But before I start
debugging adoc(..) I'm wondering if I can query the same index which I see
to work through Solr Web Server interface. Also for large
On Sat, May 21, 2011 at 3:29 PM, Gabriele Kahlout
gabri...@mysimpatico.comwrote:
Hello,
Examining Solr Core example it seems that a new index is created in a temp
dataDir deleted after each test (Good practice - agreed).
errr..from a test to the other only dataDir is rm
module for Tests which depend on Solr Core and on the Test Framework.
The org burden of that extra module, versus the ease of building
configuration, I believe, outweights.
On Tue, May 17, 2011 at 7:11 PM, Gabriele Kahlout
gabri...@mysimpatico.comwrote:
http://stackoverflow.com/questions
ant luke?
On Wed, May 18, 2011 at 11:47 AM, Gnanakumar gna...@zoniac.com wrote:
Hi,
I'm using Apache Solr v3.1.
How do I list/get to see all the indexed terms of a particular field in a
document (by passing Unique Key ID of the document)?
For example, I've the following field definition
Hello,
I'm wondering if Solr Test framework at the end of the day always runs an
embedded/jetty server (which is the only way to interact with solr, i.e. no
web server -- no solr) or in the tests they interact without one, calling
directly the under line methods?
The latter seems to be the case
On Wed, May 18, 2011 at 5:09 PM, Yonik Seeley yo...@lucidimagination.comwrote:
On Wed, May 18, 2011 at 10:50 AM, Gabriele Kahlout
gabri...@mysimpatico.com wrote:
Hello,
I'm wondering if Solr Test framework at the end of the day always runs an
embedded/jetty server (which is the only way
Hello,
I'm starting to write tests of my Solr integration, and have unfortunately
spent a lot of time chasing updated documentation.
Follows a test I found
herehttp://blog.synyx.de/2011/01/integration-tests-for-your-solr-config/which
uses anEmbeddedSolrServerto communicate with the server and
thank you. I'd like to stick to the same version (i.e. 3.2-SNAPSHOT). It
seems things have changed there.
To reproduce (should we file this and add my test as a test to avoid this
bumping up again?)
$
svn co -r 1104120
http://svn.apache.org/repos/asf/lucene/dev/branches/branch_3x/ solr
cd solr;
On Tue, May 17, 2011 at 3:44 PM, Steven A Rowe sar...@syr.edu wrote:
Hi Gabriele,
On 5/17/2011 at 9:34 AM, Gabriele Kahlout wrote:
Solr Core should declare a test dependency on Solr Test Framework.
I agree:
- Solr Core should have a test-scope dependency on Solr Test Framework.
- Solr
Got this sorted checking out the branch revision.
On Thu, May 5, 2011 at 9:44 PM, Gabriele Kahlout
gabri...@mysimpatico.comwrote:
I've now tried to write my own QueryResponseWriter plugin[1], as a maven
project depending on Solr Core 3.1, which is the same version of Solr I've
installed
Hello,
For a given q string I'm trying to extract the terms (identifiers of tokens)
that the Query Parser identified at terms (and shows when explaining
results). I manage to do it as follows, but *I hope there a better way (more
direct) you will tell me about:*
NamedList analysis = new
Hello,
I'm trying to add an extra field to the schema.xml that is only stored, but
with nutch not knowing about it, I don't know how to tell Solr of its value
for each document. I'd like to plugin the computation, something like is
done with Similarity, but I'm not sure how to do that.
From
It looks like I've to contact
updateHandler class=solr.DirectUpdateHandler2 with an AddUpdateCommand
.
On Sat, May 14, 2011 at 12:36 PM, Gabriele Kahlout gabri...@mysimpatico.com
wrote:
Hello,
I'm trying to add an extra field to the schema.xml that is only stored, but
with nutch
I calculate it from search-time + index-time field values.
For example, say I want to print the reciprocal of the content field norm
(available at index-time) along every document in the results. What's the
'clean' way of doing that?
On Sat, May 14, 2011 at 3:42 PM, Markus Jelsma
/index
and restart, the index directory will be automatically recreated.
François
On May 14, 2011, at 1:53 AM, Gabriele Kahlout wrote:
curl --fail $solrIndex/update?commit=true -d
'deletequery*:*/query/delete' #empty index [1
http://wiki.apache.org/nutch
the value is computed
and added.
As acknowledged by others before me, there's room for refactoring
ResponseWriters to at least make them more re-usable.
Hope SOLR-1566 https://issues.apache.org/jira/browse/SOLR-1566 come up
with a cleaner solution.
On Sat, May 14, 2011 at 3:55 PM, Gabriele Kahlout
Hello,
I'm debugging Solr built as a maven project in NB, and when I enter the code
of a Lucene dependency, namely
org.apache.lucene.search.IndexSearcher.explain(..) the call stack expects
this method to be at line 599 while in the editor the class ends at 304.
from solr-core's pom.xml:
curl --fail $solrIndex/update?commit=true -d
'deletequery*:*/query/delete' #empty index [1
http://wiki.apache.org/nutch/Whole-Web%20Crawling%20incremental%20script]
did u try?
On Sat, May 14, 2011 at 7:26 AM, Pawan Darira pawan.dar...@gmail.comwrote:
Hi
I had an existing index created
On Tue, May 10, 2011 at 3:56 PM, Gabriele Kahlout
gabri...@mysimpatico.comwrote:
On Tue, May 10, 2011 at 3:50 PM, Steven A Rowe sar...@syr.edu wrote:
Hi Gabriele,
There are some Maven instructions here (not in Lucene/Solr 3.1 because I
just wrote the file a couple of days ago):
http
Hello,
I'm wondering why the results of coord() are not displayed when debugging
query results, as described in the
wiki[1http://wiki.apache.org/solr/SolrRelevancyFAQ#Why_does_id:archangel_come_before_id:hawkgirl_when_querying_for_.22wings.22].
I'd like to see it.
Could someone point to how to
You are right!
On Thu, May 12, 2011 at 2:54 PM, Ahmet Arslan iori...@yahoo.com wrote:
I'm wondering why the results of coord() are not displayed
when debugging
query results, as described in the
wiki[1
Is the tagged release of solr 3.1 different from the one distributed in the
downloads page? It looks like a reproducible bug.
svn co -r 1101526
http://svn.apache.org/repos/asf/lucene/dev/tags/lucene_solr_3_1 solr
This is the default query I get from
http://localhost:8080/solr/admin/form.jsp:
locations.
I'm using netbeans and I'm using the plugin Automatic Projects to do
everything inside the IDE.
Which version of Solr are you using ?
Ludovic.
2011/5/4 Gabriele Kahlout [via Lucene]
ml-node+2898211-2124746009-383...@n3.nabble.com
generate-maven-artifacts:
[mkdir] Created
.
Please write back if you run into any problems.
Steve
From: Gabriele Kahlout [mailto:gabri...@mysimpatico.com]
Sent: Tuesday, May 10, 2011 8:37 AM
To: boutr...@gmail.com
Cc: solr-user@lucene.apache.org; Steven A Rowe; ryan...@gmail.com
Subject: Re: Is it possible to build Solr as a maven project
Hello,
I'm having trouble getting Solr 3.1 to work with nutch-1.3. I'm not sure
where the problem is, but I'm wondering why does the solrHome path end with
/./.
cwd=/Applications/NetBeans/apache-tomcat-7.0.6/bin
SolrHome=/Users/simpatico/apache-solr-3.1.0/solr/./
In the web.xml of solr:
:04 +0200 (Tue, 10 May 2011)
Does this work for you? All I've done is svn co nutch 1.3 and execute my
script which up to now worked.
On Tue, May 10, 2011 at 4:11 PM, Gabriele Kahlout
gabri...@mysimpatico.comwrote:
Hello,
I'm having trouble getting Solr 3.1 to work with nutch-1.3. I'm
, Gabriele Kahlout
gabri...@mysimpatico.comwrote:
It apparently is normal, and my issue is indeed with nutch.
I've modified post.sh from the example docs to use the solr in
http://localhost:8080/apache-solr-3.1-SNAPSHOT and now finally data made
it to the index.
$ post.sh solr.xml monitor.xml
.
On Tuesday 10 May 2011 16:40:02 Gabriele Kahlout wrote:
From solr logs:
May 10, 2011 4:33:20 PM org.apache.solr.common.SolrException log
*SEVERE: org.apache.solr.common.SolrException: ERROR:unknown field
'content' *
at
org.apache.solr.update.DocumentBuilder.toDocument
shipped with Nutch in Solr. It provides most
fields that you need.
On Tuesday 10 May 2011 17:31:33 Gabriele Kahlout wrote:
I don't get you, are you talking about conf/schema.xml? That's what I'm
referring to. Am i supposed to do something with the nutch's
conf/schema.xml?
On Tue, May
actually something changed, I managed to crawl and index some pages (the
other must have to do with regex-urls). Thank you!
Was this always necessary? Any pointer discussing why it's needed?
On Tue, May 10, 2011 at 5:40 PM, Gabriele Kahlout
gabri...@mysimpatico.comwrote:
You mean that I should
REPOST as a more general question about ivy dependencies:
http://stackoverflow.com/questions/5941789/do-ivy-dependency-revisions-have-anything-to-do-with-svns
On Mon, May 9, 2011 at 11:31 AM, Gabriele Kahlout
gabri...@mysimpatico.comwrote:
I think you are talking about this dependency
Hello,
It's final in the trunk, and has always been since conception in 2006 at
revision 372455. Why?
--
Regards,
K. Gabriele
--- unchanged since 20/9/10 ---
P.S. If the subject contains [LON] or the addressee acknowledges the
receipt within 48 hours then I don't resend the email.
I've tried to re-install solr on tomcat, and now when I launch tomcat in
debug mode I see the following exception relating to solr. It's not enough
to understand the problem (and fix it), but I don't know where to look for
more (or what to do). Please help me.
Following the tutorial and
...@syr.edu wrote:
Hi Gabriele,
The sequence should be
1. svn update
2. ant get-maven-poms
3. mvn -N -Pbootstrap install
I think you left out #2 - there was a very recent change to the POMs that
affects the noggit jar name.
Steve
-Original Message-
From: Gabriele Kahlout
://markmail.org/thread/3y4zqieyjqfi5vl3. Thank you Chris!
On Thu, May 5, 2011 at 2:58 PM, Gabriele Kahlout
gabri...@mysimpatico.comwrote:
I've tried to re-install solr on tomcat, and now when I launch tomcat in
debug mode I see the following exception relating to solr. It's not enough
to understand
May 05 20:39:09 CEST 2011
[INFO] Final Memory: 38M/90M
[INFO]
On Thu, May 5, 2011 at 6:53 PM, Steven A Rowe sar...@syr.edu wrote:
Hi Gabriele,
On 5/5/2011 at 9:57 AM, Gabriele Kahlout wrote:
Okay, that sequence worked
I've now tried to write my own QueryResponseWriter plugin[1], as a maven
project depending on Solr Core 3.1, which is the same version of Solr I've
installed. It seems I'm not able to get rid of some cache.
$ xmlstarlet sel -t -c /config/queryResponseWriter conf/solrconfig.xml
Just for the reference.
$ svn update
At revision 1099940.
On Thu, May 5, 2011 at 9:14 PM, Steven A Rowe sar...@syr.edu wrote:
You're welcome, I'm glad you got it to work. - Steve
-Original Message-
From: Gabriele Kahlout [mailto:gabri...@mysimpatico.com]
Sent: Thursday, May 05
Hello,
I'm trying to modify Solr and I think debugging will be very useful to
understand what's going on. Hence I'd like to use an IDE (NetBeans)
which automatically supports Maven projects. I see under src/maven
that there are templates but I'm not sure how to use them to mavenize
the
the project as a standard maven
project.
Ludovic.
2011/5/4 Gabriele Kahlout [via Lucene]
ml-node+2898068-621882422-383...@n3.nabble.com
Hello,
I'm trying to modify Solr and I think debugging will be very useful to
understand what's going on. Hence I'd like to use an IDE (NetBeans)
which
target in my script.
I'm using netbeans and I'm using the plugin Automatic Projects to do
everything inside the IDE.
Which version of Solr are you using ?
the official latest: 3.1
Maybe I can copy-paste from the build script you are using?
Ludovic.
2011/5/4 Gabriele Kahlout [via Lucene
1 - 100 of 112 matches
Mail list logo