Re: Solr reports: "Can not read response from server" when running import

2012-10-16 Thread Romita Saha
Hi Dave, I followed your guidance and loaded my database in MySQL. Presently the url reads like this: url = "jdbc:mysql://localhost:8983/var/lib/mysql/camerasys" The bin address in my.cnf file is : bind-address = 127.0.0.1 However the issue still persists. Kindly help me find out the issue. T

Re: Flushing RAM to disk

2012-10-16 Thread Lance Norskog
There is no "backed by disk" RamDirectory feature. The MMapDirectory uses the operating system to do almost exactly the same thing, in a much better way. That is why it is the default. - Original Message - | From: "deniz" | To: solr-user@lucene.apache.org | Sent: Tuesday, October 16, 20

Flushing RAM to disk

2012-10-16 Thread deniz
Hi all, I have a question about solr directories... Basically I will be using RAM directory for the project, but I am curious if it is possible to flush (or copy from ) RAM to disk? via cronjob or a timer in java code? if yes, could anyone give me some details about it? thank you - Zeki a

Re: How many documents in each Lucene segment?

2012-10-16 Thread Lance Norskog
CheckIndex prints these stats. java -cp lucene-core-WHATEVER.jar org.apache.lucene.index.CheckIndex - Original Message - | From: "Shawn Heisey" | To: solr-user@lucene.apache.org | Sent: Monday, October 15, 2012 9:46:33 PM | Subject: Re: How many documents in each Lucene segment? | | On

results with no default sort order...

2012-10-16 Thread trilok2000
Hi there, I'm searching with the following query: /select?q=*:*&fq=fld: OR fld: OR fld: where, the field fld is a String type and uniqueKey. I'm getting results as: Looks like the results are sorted by fld. But I want the results to be "un-sorted"... me

Re: Can we retrieve deleted records before optimized

2012-10-16 Thread Zeng Lames
Thanks kan for your prompt help. It is really a great solution to recovery those deleted records. Another question is about Solr history data housekeep problem. the scenario is as below: we have a solr core to store biz records, which is large volume that the index files is more than 50GB in one

ScorerDocQueue.java's downHeap showing up as frequent hotspot in profiling - ideas why?

2012-10-16 Thread Aaron Daubman
Greetings, In a recent batch of solr 3.6.1 slow response time queries the profiler highlighted downHeap (line 212) in SoorerDocQueue.java as averaging more than 60ms across the 16 calls I was looking at and showing it spiking up over 100ms - which, after looking at the code (two int comparisons?!?

RE: multi-core sharing synonym map

2012-10-16 Thread Chris Hostetter
: Yes I was thinking the same thing, although I was hoping there was a : more elegant mechanism exposed by the solr infrastructure code to handle : the shared map, aside from just using a global that is. a) if you have a large data structure you wnat to share between SolrCores, you might be ab

Re: Datefaceting on multiple value in solr

2012-10-16 Thread Chris Hostetter
: In solr, I have two fields one is datetime [Admission_Date] and other : string[Name]. I want to do datefaceting on multiple values in one solr : query. : : Suppose I have 3 records. name="test" ,"test1", "test3". : : I want to do data-faceting on all three values in one solr query. I'm sorry,

Re: How does Solr know which relative paths to use?

2012-10-16 Thread Dotan Cohen
On Wed, Oct 17, 2012 at 12:16 AM, P Williams wrote: > Hi Dotan, > > It seems that the examples now use Multiple > Coresby default. If your test > server is based on the stock example, you should > see a solr.xml file in your CWD path which is how Solr knows

Re: How does Solr know which relative paths to use?

2012-10-16 Thread P Williams
Hi Dotan, It seems that the examples now use Multiple Coresby default. If your test server is based on the stock example, you should see a solr.xml file in your CWD path which is how Solr knows about the relative paths. There should also be a README.txt fil

How does Solr know which relative paths to use?

2012-10-16 Thread Dotan Cohen
I have just installed Solr 4.0 on a test server. I start it like so: $ pwd /some/dir $ java -jar start.jar The Solr Instance now looks like this: CWD /some/dir Instance /some/dir/solr/collection1 Data /some/dir/solr/collection1/data Index /some/dir/solr/collection1/data/index >Fro

Re: Using

2012-10-16 Thread P Williams
Hi, Just wanted to update with a workaround. Works for me to test my configs and project code with SolrTestCaseJ4 using IVY as a dependency manager. Does anyone else think it's odd that the directory structure solr.home/collection1 is hard coded into the test-framework? Regards, Tricia

Re: Issue using SpatialRecursivePrefixTreeFieldType

2012-10-16 Thread David Smiley (@MITRE.org)
Eric, Can you please elaborate on your workaround? I'm not sure I get your drift. ~ David On Oct 16, 2012, at 12:54 PM, Eric Khoury [via Lucene] wrote: > > Thanks for the help David, makes sense. I found a workaround, creating much > smaller rectangles and updating them more often.Glad to ha

RE: Issue using SpatialRecursivePrefixTreeFieldType

2012-10-16 Thread Eric Khoury
Thanks for the help David, makes sense. I found a workaround, creating much smaller rectangles and updating them more often.Glad to have this functionality, thanks again!Eric. > Date: Fri, 12 Oct 2012 21:06:52 -0700 > From: dsmi...@mitre.org > To: solr-user@lucene.apache.org > Subject: Re: Iss

Charfilter keep "dates" but skeep "number"

2012-10-16 Thread darul
Hello all,A long time I have not posted, but do not worry I am still using Solr everyday and enjoy it.Here the details of my requirement:According to a specific content with "dates", "number", (maybe number%), we would like to *skip number* and *keep dates (+number%)* in indexation process. Do you

Re: Having trouble getting boosting queries to work with multiple terms

2012-10-16 Thread Tomás Fernández Löbbe
I think sorting should work too, as I suggested before. In this case (because by coincidence you need alphabetic sort on the type) "sort=type asc, score desc" should work. If you need to add other types, maybe add an int field that represents how you would like those to be sorted. between types, t

Re: Can we retrieve deleted records before optimized

2012-10-16 Thread Dmitry Kan
Hello, One approach (not a solrish one, but still) would be to use Lucene API and set up an IndexReader onto the solr index in question. You can then do: [code] Directory indexDir = FSDirectory.open(new File(pathToDir)); IndexReader input = IndexReader.open(indexDir, true); FieldSelector fieldSe

Re: Testing Solr4 - first impressions and problems

2012-10-16 Thread Shawn Heisey
On 10/16/2012 8:48 AM, Tomás Fernández Löbbe wrote: Shawn, you should create a Jira for that. Maybe it could be programatically activated/deactivated. Alan, make sure you don't confuse "near real time" with "Realtime get". As Hoss said, you don't need the transaction log unless you need Realtime

Re: Having trouble getting boosting queries to work with multiple terms

2012-10-16 Thread Walter Underwood
Here is an approach that avoids the IDF problem. Add another field, perhaps named "priority". In that field, put a boost value, like 100 for allele docs, 10 for mi_attempt docs, and so on. In the boost part of the query, use the value of that field boost=priority. If you cannot change the index

Re: Multicore setup is ignored when deploying solr.war on Tomcat 5/6/7

2012-10-16 Thread Chris Hostetter
: To answer your question, I tried both -Dsolr.solr.home and solr/home JNDI : variable, in both cases I got the same result. : : I checked the logs several times, solr always only loads up the collection1, That doesn't really answer any of the questions i was asking you. *Before* solr logs anyth

Re: Having trouble getting boosting queries to work with multiple terms

2012-10-16 Thread Asfand Qazi
On 16/10/12 16:15, Walter Underwood wrote: Why do you want that ordering? That isn't what Solr is designed to do. It is designed for relevance. I expect that idf (the rarity of the terms) is being used in the ordering. "mi_attempt" is probably much more rare than "allele". If you want that str

Re: Having trouble getting boosting queries to work with multiple terms

2012-10-16 Thread Walter Underwood
Why do you want that ordering? That isn't what Solr is designed to do. It is designed for relevance. I expect that idf (the rarity of the terms) is being used in the ordering. "mi_attempt" is probably much more rare than "allele". If you want that strict ordering, I recommend doing three queries

Re: Having trouble getting boosting queries to work with multiple terms

2012-10-16 Thread Tomás Fernández Löbbe
I meant: sort=type asc,score desc On Tue, Oct 16, 2012 at 12:08 PM, Tomás Fernández Löbbe < tomasflo...@gmail.com> wrote: > Yes, I saw that, but you'll need to add even more boosting than that, > other factors of the scoring are giving the doc with type mi_attempt higher > score than the one with

Re: Having trouble getting boosting queries to work with multiple terms

2012-10-16 Thread Jack Krupansky
Sometimes things like term frequency can overwhelm even a seemingly large boost. Add &debugQuery=true to your query request and check the "explain" section for how the "allelle" document is scored and contrast that with the scoring for the "mi_attempt" document. Or documents - sometimes it is d

Re: Having trouble getting boosting queries to work with multiple terms

2012-10-16 Thread Tomás Fernández Löbbe
Yes, I saw that, but you'll need to add even more boosting than that, other factors of the scoring are giving the doc with type mi_attempt higher score than the one with type 'allele'. See what I say adding debugQuery=true. If you'll always want this behavior, you may want to use sorting instead o

Re: Having trouble getting boosting queries to work with multiple terms

2012-10-16 Thread Asfand Qazi
Hi, thanks for the reply. I tried that: http://ikmc.vm.bytemark.co.uk:8983/solr/allele/search?q=mgi_accession_id:"MGI:1315204"&bq=type:allele^100 OR type:mi_attempt^10 OR type:phenotype_attempt^1 (forgive the wrapping) and I got mi_attempt at the top, then the allele, then the phenotype_att

Datefaceting on multiple value in solr

2012-10-16 Thread Sagar Joshi1304
Hello All, In solr, I have two fields one is datetime [Admission_Date] and other string[Name]. I want to do datefaceting on multiple values in one solr query. Suppose I have 3 records. name="test" ,"test1", "test3". I want to do data-faceting on all three values in one solr query. Currently I a

Re: Testing Solr4 - first impressions and problems

2012-10-16 Thread Tomás Fernández Löbbe
Shawn, you should create a Jira for that. Maybe it could be programatically activated/deactivated. Alan, make sure you don't confuse "near real time" with "Realtime get". As Hoss said, you don't need the transaction log unless you need Realtime Get or recovery of uncommitted docs (or Solr Cloud, w

Re: Having trouble getting boosting queries to work with multiple terms

2012-10-16 Thread Tomás Fernández Löbbe
you are missing the "OR" between the clauses of the bq. Try with: bq=type:allele^100 OR type:mi_attempt^10 OR type:phenotype_attempt^1 or set OR as your default operator in the schema.xml Tomás On Tue, Oct 16, 2012 at 10:37 AM, Asfand Qazi wrote: > Hello, > > The Solr server I am driving is f

Re: FieldType works fine at ANALYSIS(debuging) but not at index time

2012-10-16 Thread meier_flo
I'm executing the post.jar from the example docs where I've have an .xml document in the required structure but I guess I figured it out myself ... had an error of reasoning here ... the terms after the analysis process are stored in the index correctly... at query time when I'm searching fo

Re: Config parameters to tweak for update performance

2012-10-16 Thread Shawn Heisey
On 10/16/2012 5:38 AM, Robert Krüger wrote: I use solr embedded in a desktop app and due to the consistency requirements of the application I have to commit rather often. Are there some best practices on how to optimize commit performance via the configuration? I could easily live with slower que

Re: Testing Solr4 - first impressions and problems

2012-10-16 Thread Shawn Heisey
On 10/15/2012 3:37 PM, Shawn Heisey wrote: On 10/15/2012 2:51 PM, Chris Hostetter wrote: For your usecase and upgrade: don't add the updateLog to your configs, and don't add autocommit to your configs, and things should work fine. if you decide you wnat to start using something that requires t

Re: FieldType works fine at ANALYSIS(debuging) but not at index time

2012-10-16 Thread Jack Krupansky
How are you indexing? Maybe the angle brackets are being escaped (or not being escaped) properly. -- Jack Krupansky -Original Message- From: meier_flo Sent: Tuesday, October 16, 2012 7:50 AM To: solr-user@lucene.apache.org Subject: FieldType works fine at ANALYSIS(debuging) but not at

Re: exception when starting single instance solr-4.0.0

2012-10-16 Thread Bernd Fehling
Hi Erik, I just found the problem, took me some time to locate it. I tried java with full debug, jetty with full debug, no success. Finally I moved step by step from solr/example to my setup. It turned out that you _must_ avoid to have lucene-test-framework-4.0-SNAPSHOT.jar loaded. After removing

Having trouble getting boosting queries to work with multiple terms

2012-10-16 Thread Asfand Qazi
Hello, The Solr server I am driving is found publicly at http://ikmc.vm.bytemark.co.uk:8983/solr/allele/search , it contains freely available information from science research establishments. It contains many documents, and I usually do is look up all documents where the 'mgi_accession_id' f

Re: Search in specific website

2012-10-16 Thread Otis Gospodnetic
Hi, Maybe there are matches for both :) Otis -- Performance Monitoring - http://sematext.com/spm On Oct 16, 2012 9:05 AM, "Tolga" wrote: > Hi again, > > In Nutch list, I was told to use "url:example\.net AND content:some > keyword" and so I did. However, I get results from both my URLs. Why thi

Re: Search in specific website

2012-10-16 Thread Erick Erickson
Please review: http://wiki.apache.org/solr/UsingMailingLists You're providing almost no information to allow us to help you; all we can do is waste time by guessing. Best Erick On Tue, Oct 16, 2012 at 9:05 AM, Tolga wrote: > Hi again, > > In Nutch list, I was told to use "url:example\.net AND

solr 4 tika config

2012-10-16 Thread cmd.ares
I want to index all pdf files in "d:\myfiles\*.*" file fullname as the field id file content as the field txt the index should be like this: -id---txt-- d:\myfiles\0.pdfa d:\myfiles\subfolder1\1.pdf bb

Re: Search in specific website

2012-10-16 Thread Tolga
Hi again, In Nutch list, I was told to use "url:example\.net AND content:some keyword" and so I did. However, I get results from both my URLs. Why this behaviour? Regards, PS: I've re(crawl|index)ed my data. On 10/12/2012 05:07 PM, Otis Gospodnetic wrote: Hi Tolga, You'll get more help on

Re: exception when starting single instance solr-4.0.0

2012-10-16 Thread Erick Erickson
Well, it's crude but it _might_ work. All the jars are logged when loaded on startup. You might search through the log for "3." or extract all the paths with a regex or something and see if something jumps out and backtrack. I really, really, _hate_ having to deal with this kind of thing Best

FieldType works fine at ANALYSIS(debuging) but not at index time

2012-10-16 Thread meier_flo
Dear solr-user community, I'd like to apologize upfront if this question has been asked in a similar way before, but while searching I didn't find anything that could help me with my question/problem So I got the following inconsistency with my schema.xml at Debugging (ANALYSIS) and index ti

Re: Multicore setup is ignored when deploying solr.war on Tomcat 5/6/7

2012-10-16 Thread Rogério Pereira Araújo
Hi Chris, To answer your question, I tried both -Dsolr.solr.home and solr/home JNDI variable, in both cases I got the same result. I checked the logs several times, solr always only loads up the collection1, if I rename the cores on solr.xml to anything else or add more cores, nothing happen

Config parameters to tweak for update performance

2012-10-16 Thread Robert Krüger
Hi, I use solr embedded in a desktop app and due to the consistency requirements of the application I have to commit rather often. Are there some best practices on how to optimize commit performance via the configuration? I could easily live with slower queries or more memory use as my index is ra

Re: Newbie - Setting up date and distance facets

2012-10-16 Thread Spadez
Thank you for the reply. The facet range gap loks good but it is too far down the line to be of use, I wish it was implemented though. What I want is really a more simple question http://wiki.apache.org/solr/SimpleFacetParameters#facet.range Is it correct that to add facets on date and distance

Re: Search multiple tags within multiple categories

2012-10-16 Thread Eneko chan
Hello, May be I explained myself wrong or I still don't know what OpenNLP is capable of. I don't need to automatically create the relations between tags and categories. That would be done (at first) manually. I need somehow to send a query with the tags [plane, airport, pilot, parachuting, base j

Re: Questions about query times

2012-10-16 Thread Dmitry Kan
couple of random questions: 1) in some of your example queries (like query 2) there trimTime:[2012-09-04T15:23:48Z TO *] mentioned in two diff. filter queries of the same request. Any reason to do this? 2) if you don't need the documents themselves, only the facets, you could set rows=0. 3) try tw

Re: Newbie - Setting up date and distance facets

2012-10-16 Thread Alexandre Rafalovitch
I haven't done this, but I think your use case is a range facets with variable bucket sizes. As described here: http://wiki.apache.org/solr/SimpleFacetParameters#facet.range.gap Unfortunately, the variable buckets are not implemented yet, and are just old patches in SOLR-2366 (as linked). I think

Newbie - Setting up date and distance facets

2012-10-16 Thread Spadez
I am using solr and I want to setup two different types of facets on my data. The first is date, so I would like it like this: Posted: Today 3 days 7 days All time Distance: 5 miles 10 miles 30 miles 100 miles How should I be setting up faceting? It looks like something I need to edit my solr.xm

Re: Open Source Social (London) - 23rd Oct

2012-10-16 Thread Richard Marr
Don't forget, The London Search Social is on Tuesday next week. Come and grab a beer with us and talk about Search, NLP, ML, Hadoop. All experience levels welcome. On 11 October 2012 21:59, Richard Marr wrote: > Hi all, > > The next Open Source Search Social is on the 23rd Oct at The Plough,

Re: Search multiple tags within multiple categories

2012-10-16 Thread Otis Gospodnetic
Hello, Sounds like you want to look into classification tools like OpenNLP. There is a wiki page on Solr wiki, too... Otis -- Performance Monitoring - http://sematext.com/spm On Oct 16, 2012 5:45 AM, "Eneko chan" wrote: > Hi, > > I'm new to Solr and I'm felling quite lost. I'm not sure if Sorl

Re: Solr4 without slf4j bindings -- apparent catch-22

2012-10-16 Thread Jan Høydahl
Hi, Classloading is made more complicated by Jetty in that it magically treats org.slf4j.* classes as "Server" classes and won't let the webapp see them. See http://docs.codehaus.org/display/JETTY/Classloading The original idea of letting slf4j-api stay in the war was for the webapp not to cra

Re: how solr4.0 and zookeeper run on weblogic

2012-10-16 Thread Vadim Kisselmann
Hi, these are JAVA_OPTS params, you can find and set this stuff in the startManagedWeblogic script. Best regards Vadim 2012/10/16 rayvicky : > who can help me ? > where to settings -DzkRun-Dbootstrap_conf=true > -DzkHost=localhost:9080 -DnumShards=2 > in weblogic > > > > -- > View this m