; if not exactly 75 and 25, something in that range should be
fine too.
Any ideas on implementing this feature?
Thanks much!
Satish
--
Lance Norskog
goks...@gmail.com
of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.
--
Lance Norskog
goks...@gmail.com
connection between the two. Is there
a better way to represent this?
Thanks.
--
Lance Norskog
goks...@gmail.com
- User mailing list archive at Nabble.com.
--
Lance Norskog
goks...@gmail.com
processors so that missing directories and
files are ignored and the indexing process does not stop immediately?
Best,
Sascha
--
Lance Norskog
goks...@gmail.com
the
fq parameter in this way, or is there some better approach since
filters are almost always used?
Thank you!
--
Lance Norskog
goks...@gmail.com
.
How might I do this?
Barring that, how can I inspect the index post-fact? I have tried to
use luke to see what's in the index, but I get an error: Unknown
format version -10. Is it possible to get luke to work?
My solr build is straight out of SVN.
thanks,
harry
--
Lance Norskog
] INFO [SolrDeletionPolicy] newest
commit = 1281030128385
--
View this message in context:
http://lucene.472066.n3.nabble.com/Deleting-with-the-DIH-sometimes-doesn-t-delete-tp1113098p1113098.html
Sent from the Solr - User mailing list archive at Nabble.com.
--
Lance Norskog
goks
.
Karthik
--
Lance Norskog
goks...@gmail.com
at 9:40 AM, Shawn Heisey s...@elyograg.org wrote:
On 4/9/2010 7:35 PM, Lance Norskog wrote:
Function queries are notoriously slow. Another way to boost by year is
with range queries:
[NOW-6MONTHS TO NOW]^5.0 ,
[NOW-1YEARS TO NOW-6MONTHS]^3.0
[NOW-2YEARS TO NOW-1YEARS]^2.0
[* TO NOW-2YEARS
?
Thanks,
Michael
--
Lance Norskog
goks...@gmail.com
the Solr trunk.)
--
Lance Norskog
goks...@gmail.com
double name=time0.0/double
/lst
−
lst name=org.apache.solr.handler.component.DebugComponent
double name=time501.0/double
/lst
/lst
/lst
/lst
--
Lance Norskog
goks...@gmail.com
mailing list archive at Nabble.com.
--
Lance Norskog
goks...@gmail.com
a cURL or LWP loop but
I'd prefer to use SolrJ if I can.
Thanks for all your help.
- Tod
--
Lance Norskog
goks...@gmail.com
.nabble.com/Solr-for-multiple-websites-tp1173220p1219
772.html
Sent from the Solr - User mailing list archive at Nabble.com.
--
Lance Norskog
goks...@gmail.com
.
It seems to work well on my development box, but I have not seen this
approach discussed elsewhere so I was wondering if I was missing
something here.
Feedback gratefully received!
Simon
--
Lance Norskog
goks...@gmail.com
of unit tests:
https://issues.apache.org/jira/browse/SOLR-2060
Please add your source code if you are confident it should work, but does not.
Lance
On Thu, Aug 19, 2010 at 7:45 AM, Tod listac...@gmail.com wrote:
On 8/19/2010 1:45 AM, Lance Norskog wrote:
'stream.url' is just a simple parameter
at Nabble.com.
--
Lance Norskog
goks...@gmail.com
, facet on
categoryId (1, 2, 3 etc. ) and get properties display name, image, etc?
Thanks,
Satish
--
Lance Norskog
goks...@gmail.com
/.
In schema.xml, I put field name=Author type=text indexed=true
stored=true multiValued=true/.
Please let me know if I did something wrong, or how I can display it in
jsp.
I really appreciate your help!
--
Lance Norskog
goks...@gmail.com
anyone know of one, or should I create
it?
Thanks,
Shawn
--
Lance Norskog
goks...@gmail.com
, Lance Norskog goks...@gmail.com wrote:
The Haversine formula in o.a.s.s.f.d.DistanceUtils.java gives these
results for a 0.1 degree difference in miles:
equator horizontal 0.1 deg: lat/lon 0.0/0.0 - 396.320504
equator vertical 0.1 deg: lat/lon 0.0/0.0 - 396.320504
NYC
addressed in Solr 1.4.0?
3) How can I stop this from happening?
Thanks!
Devin
--
Lance Norskog
goks...@gmail.com
for faceting with embedded metadata, and a
different one with the same values without embedded metadata for actual 'fq'
filtering?
Thanks for any tips,
Jonathan
--
Lance Norskog
goks...@gmail.com
and pass it to ie. xml file.
If you have some questions about uniqe id, this page should be a place
for You http://wiki.apache.org/solr/UniqueKey
--
Regards,
Rafał Kuć
--
Lance Norskog
goks...@gmail.com
:
Is it possible to define more than 1 uniqueKey fields per document in
schema.xml?
--
Lance Norskog
goks...@gmail.com
:
http://lucene.472066.n3.nabble.com/How-to-Debug-Sol-Code-in-Eclipse-tp1262050p1262050.html
Sent from the Solr - User mailing list archive at Nabble.com.
--
Lance Norskog
goks...@gmail.com
.
Obviously the 2 requirements aren't compatible. I'm trying to see if it's
possible to define 2 uniqueKeys so each app could have its own one.
--- On Sat, 8/21/10, Lance Norskog goks...@gmail.com wrote:
From: Lance Norskog goks...@gmail.com
Subject: Re: Possible to have more than 1 uniqueKey
--
Lance Norskog
goks...@gmail.com
--
-
--
Lance Norskog
goks...@gmail.com
.
I'm creating my SolrServer instance like this:
private SolrServer solrServer;
CoreContainer container = new
CoreContainer.Initializer().initialize();
solrServer = new EmbeddedSolrServer(container, );
this solrServer wont let me set a request writer.
--
Lance Norskog
server.add
and with 5MB of text it runs out of memory.
Is there a way around this?
Is there a plan to enhance SolrJ to allow a reader to be passed in instead
of a string?
Can you please open a Jira issue?
--
Regards,
Shalin Shekhar Mangar.
--
Lance Norskog
goks...@gmail.com
processing of
specific elements - you give it a content handler that gets fed a stream of
cleaned-up HTML elements.
-- Ken
Le 25-août-10 à 02:09, Lance Norskog a écrit :
I would do this with regular expressions. There is a Pattern Analyzer
and a Tokenizer which do regular expression-based text
is sometimes being truncated (or corrupted) when
it's sent to solr.
What client are you using?
-Yonik
http://lucenerevolution.org Lucene/Solr Conference, Boston Oct 7-8
--
Lance Norskog
goks...@gmail.com
+costumes/str
-
arr name=size
strL/str
strM/str
strS/str
/arr
str name=stockIn Stock/str
str name=subcgynam50's Costumes|14010/str
/doc
Thanks!
--
Lance Norskog
goks...@gmail.com
faceting (as with sorting) is slow... did you try
another query after hello (and without a commit happening
inbetween) to see if it was faster?
-Yonik
http://lucenerevolution.org Lucene/Solr Conference, Boston Oct 7-8
--
Lance Norskog
goks...@gmail.com
at http://www.jpmorgan.com/pages/disclosures/email.
--
Lance Norskog
goks...@gmail.com
it to generate the stream of SAX
events that it then consumes and turns into a normalized XHTML 1.0-compliant
data stream.
-- Ken
On Aug 25, 2010, at 7:22pm, Lance Norskog wrote:
This assumes that the HTML is good quality. I don't know exactly what
your use case is. If you're crawling the web
afterward.
Any ideas?
Thanks.
%b6G$J0T.'$$'d(l/f,r!C
Checked by AVG - www.avg.com
Version: 9.0.851 / Virus Database: 271.1.1/3093 - Release Date: 08/25/10
14:34:00
--
Lance Norskog
goks...@gmail.com
in context:
http://lucene.472066.n3.nabble.com/Private-data-within-SOLR-Schema-tp1376174p1376355.html
Sent from the Solr - User mailing list archive at Nabble.com.
--
Lance Norskog
goks...@gmail.com
Document to a SolrInputDocument.
Thanks.
--
Lance Norskog
goks...@gmail.com
it? Would branch_3x be enough, or would trunk be better?
Thanks,
Shawn
--
Lance Norskog
goks...@gmail.com
... Stump The Chump!
--
Lance Norskog
goks...@gmail.com
.472066.n3.nabble.com/Searching-for-words-with-accented-characters-tp486325p1375019.html
Sent from the Solr - User mailing list archive at Nabble.com.
--
Lance Norskog
goks...@gmail.com
In a multiValued field, I want to write a query that returns all the
documents that actually have more than 1 value in that field. (don't care
what the value is).
Maybe this should be obvious, but I just can't think of how to formulate
such a query.
Any help?
Thanks,
Brad
--
Lance Norskog
goks
version of Solr are you using?
Exactly what error message did you get?
--
Jan Høydahl, search solution architect
Cominvent AS - www.cominvent.com
Training in Europe - www.solrtraining.com
--
Lance Norskog
goks...@gmail.com
.nabble.com/Problem-related-to-Sorting-in-Solr1-4-tp1370622p1371739.html
Sent from the Solr - User mailing list archive at Nabble.com.
--
Lance Norskog
goks...@gmail.com
A problem with this as recently surfaced: spelling suggestions. A
spelling checker built from the index pulls all terms. You cannot give
it a filter query. But, you don't want to show people words from
documents they should not see.
On Fri, Aug 27, 2010 at 12:01 PM, Lance Norskog goks
to the file by multiple threads?
Would this method of using an external file scale?
Thanks.
--
Lance Norskog
goks...@gmail.com
popularity in a boosted query such as:
{!boost b=log(popularity)}foo
The doc says ExternalFileField can only be used in FunctionQuery. Does that
include a boost query like {!boost b=log(popularity)}?
--- On Sat, 8/28/10, Lance Norskog goks...@gmail.com wrote:
From: Lance Norskog goks
to
update the slave and then have the slave commit the changes to the master
(adding new fields in the process?)
Thanks.
--
Lance Norskog
goks...@gmail.com
, there are (or will be) function queries that allow you
to get this behavior without supplying a class.
Lance Norskog
On Sun, Aug 29, 2010 at 1:10 PM, Hasnain hasn...@hotmail.com wrote:
also my request handler looks like this
requestHandler name=mb_artists class=solr.SearchHandler
lst name=defaults
str name
.
Thanks!
Amit
--
Lance Norskog
goks...@gmail.com
follow?
Thanks
Amit
On Mon, Aug 30, 2010 at 5:00 PM, Lance Norskog goks...@gmail.com wrote:
The price-performance knee for small servers is 32G ram, 2-6 SATA
disks on a raid, 8/16 cores. You can buy these servers and half-fill
them, leaving room for expansion.
I have not done benchmarks
Improving search results using affinity Graph
http://research.microsoft.com/apps/pubs/default.aspx?id=67818
Any thoughts?
Cheers
Ukyo
--
Lance Norskog
goks...@gmail.com
@udngroup.comwrote:
I am also curious as Amit does. Can you make an example about the garbage
collection problem you mentioned?
- Original Message - From: Lance Norskog goks...@gmail.com
To: solr-user@lucene.apache.org
Sent: Tuesday, August 31, 2010 9:14 AM
Subject: Re: Hardware Specs
configuration on our project?
Thanks in advance.
Scott
- Original Message - From: Lance Norskog goks...@gmail.com
To: solr-user@lucene.apache.org
Sent: Tuesday, August 31, 2010 1:01 PM
Subject: Re: Hardware Specs Question
There are synchronization points, which become chokepoints at some
the
components of the TF-IDF score, for example. Can someone point me to an
example of doing something like this?
~Brad
--
Lance Norskog
goks...@gmail.com
/SpatialSearch
Are there any distance sorting functions that use this field, or do I
need to have three indexed fields, store_lat_lon, latitude, and
longitude, if I want both filtering and sorting by distance.
Thanks, Scott
--
Lance Norskog
goks...@gmail.com
.
--
Lance Norskog
goks...@gmail.com
?
2. Is there a way to do synonyms' highlight in search result?
3. Does anyone use WordNet to solr?
Thanks so much in advance,
--
Lance Norskog
goks...@gmail.com
,
Matteo
--
Lance Norskog
goks...@gmail.com
.
Best regards
Antonio
--
Lance Norskog
goks...@gmail.com
Kirsty
--
View this message in context:
http://lucene.472066.n3.nabble.com/Indexing-boolean-value-tp1023708p1405001.html
Sent from the Solr - User mailing list archive at Nabble.com.
--
Lance Norskog
goks...@gmail.com
to retrieve and download the pdf file is to use the
id=doc1 to access some repository (even a database table )
that can provide me the content starting from the id.
Does this look like a common practice?
Thank you
Il 02/09/10 08:47, Lance Norskog ha scritto:
Solr can return the list
the same in my big main
index, is it safe?
Also, is there any difference in behavior between autocommits and
explicit commits in this regard?
--
Lance Norskog
goks...@gmail.com
in 1.4.1? I may try that to
see if helps. Configuration of the switches may also be possible.
Also, would you mind explaining your second point... using dual NIC cards.
How can this be accomplished/configured. Thanks for you help
--
Lance Norskog
goks...@gmail.com
:
mp3 n
and that narrows it down to:
- mp3 nano
I had a quick look at the Terms Component.
I suppose it just returns term totals for the entire index and cannot
be
used with a filter or query?
Thanks
Johan
--
Lance Norskog
goks...@gmail.com
be:
field column=id name=id/
field column=name name=name/
field column=city name=city_t/
field column=skill name=skill_t/
--
View this message in context:
http://lucene.472066.n3.nabble.com/solr-user-tp1404814p1406343.html
Sent from the Solr - User mailing list archive at Nabble.com.
--
Lance
the issue?
Thanks,
Jeff
--
Lance Norskog
goks...@gmail.com
More directly: if the 'Artikel' field is a string, only the whole
string will match:
Artikel:Kerstman baardstel.
Or you can use a wildcard: Kerstmann* or just Kerst*
If it is a text field, it is chopped into words and
q=Artikel:Kerstmann would work.
Gora Mohanty wrote:
On Sat, 4 Sep
The RSS example does not do this. It declares only the source, and gives
all of the parameters in the entity.
You can have different entities with different uses of the datasource.
In general, the DIH is easier to use when starting with one of the
examples and slowing changing one thing at a
1) The XML file must include the UTF-8 encoding metadata in the first line.
2) If you are using Tomcat: Tomcat comes without UTF-8 as the default.
The Solr wiki gives the directions on how to fix this.
3) If you are using Windows: Windows does not use UTF-8 by default.
Tracking down UTF-8
These days the best practice for a 'drill-down' facet in a UI is to
encode both the unique value of the facet and the displayable string
into one facet value. In the UI, you unpack and show the display string,
and search with the full facet string.
If you want to also do date ranges, make a
Does JBoss still uses Tomcat? Tomcat has an external file to configure
war files in Catalina/localhost. If JBoss is not Tomcat any more, it
must have a directory and file format somewhere for an external
configuration of a servlet war.
Lance
Chris Hostetter wrote:
: 1-extract the solr.war
in clifornia= live show
= live show in DC = live show in USA
is synonym will effect my original search?
thanks
with regards
Jonty
--
Lance Norskog
goks...@gmail.com
BV
http://www.linkedin.com/in/markus17
050-8536620 / 06-50258350
--
Lance Norskog
goks...@gmail.com
-file-tp1438460p1438720.html
Sent from the Solr - User mailing list archive at Nabble.com.
--
Lance Norskog
goks...@gmail.com
-1794 England
--
Lance Norskog
goks...@gmail.com
https://issues.apache.org/jira/browse/SOLR-1499
This is a patch (not committed) that queries a Solr instance and returns
the values as a DIH document. This allows you to do a sort query to
Solr, ask for the first result, and continue indexing after that. Scary,
but it works.
Lance
David
both filtering and sorting by distance.
Thanks, Scott
--
Lance Norskog
goks...@gmail.com
Hello-
There were a few bugs in this area that are fixed in Solr 1.4. There are
many other bugs which were also fixed. We suggest everyone upgrade to 1.4.
There are different locking managers, and you may be able to use a
different one. Also, if this is over NFS that can cause further
The stream.file and stream.url parameters should do this.
Lance
Rico Lelina wrote:
Hi,
I'm using EmbeddedSolrServer for my unit tests. I just can't figure out how to
add my data (stored in xml files similar to those in the example application
example/exampleDocs) after instantiating the
In the general case, this would require a new method on compound queries
to sort themselves into a canonical order, or refuse to. Somehow, I
don't think this will happen. However, it could be done with boolean
queries only, which would make it somewhat easier to combinatorically
compose OR
Look at Deduplication:
http://wiki.apache.org/solr/Deduplication
It implements a unique hashcode (Lookup3Signature
http://wiki.apache.org/solr/Lookup3Signature ) as a tool that avoids
rewriting the same document over and over. It declares this in
solrconfig.xml instead of schema.xml.
Lance
that use this field, or do I
need to have three indexed fields, store_lat_lon, latitude, and
longitude, if I want both filtering and sorting by distance.
Thanks, Scott
--
Lance Norskog
goks...@gmail.com
Since no one has jumped in to give the right syntax- yeah, it's a bug.
Please file a JIRA.
On Thu, Sep 9, 2010 at 9:44 PM, Scott K s...@skister.com wrote:
On Thu, Sep 9, 2010 at 21:00, Lance Norskog goks...@gmail.com wrote:
I just checked out the trunk, and branch 3.x This query is accepted
Delbru
--
Lance Norskog
goks...@gmail.com
There is a trick: facets with only one occurrence tend to be mispellings
or dirt. You write a program to fetch the terms (Lucene's CheckIndex is
a great starting point) create a stopwords file.
Here's a data mining project: which languages are more vulnerable to
dirty OCR?
Burton-West, Tom
Bravo!
Other tricks: here is a policy for deciding when to merge segments that
attempts to balance merging with performance. It was contributed by
LinkedIn- they also run indexsearch in the same instance (not Solr, a
different Lucene app).
Also, the 'v' is capitalized: multiValued. (This is one reason why
posting your schema helps.)
Erick Erickson wrote:
Can we see your schema file? Because it sounds like you didn't
really declare your field multivalued=true on the face of things.
But if it is multivalued AND you changed
://lucene.472066.n3.nabble.com/what-differents-between-SolrCloud-and-Solr-Hadoop-tp1463809p1464106.html
Sent from the Solr - User mailing list archive at Nabble.com.
--
Lance Norskog
goks...@gmail.com
without the query term-- this means doing two
searches. Any suggestions
on
implementing this requirement using only one
search?
Thanks,
Satish
--
Lance Norskog
goks...@gmail.com
Yes, notice the flowers are all a medium-dark crimson red. There are a
bunch of these image-indexing search technologies, but there is no (to
my knowledge) finished technology- it's very much an area of research.
If you want to search the word 'flower' and index data that can find
blobs of
Which version of Solr? 1.4?, 1.4.1? 3.x branch? trunk? if the 3.x or the
trunk, when did you pull it?
andrewdps wrote:
What could be possible error for
14-Sep-10 4:28:47 PM org.apache.solr.common.SolrException log
SEVERE: java.util.concurrent.ExecutionException:
java.lang.NullPointerException
Database optimization is not like program optimization- it is wildly
unpredictable.
What bugs me about the delta approach is using the last time DIH ran,
rather than a timestamp from the DB. Oh well. Also, with SOLR-1499 you
can query Solr directly to see what it has.
Lukas Kahwe Smith
Good eye, Thomas! Yes, GCJ is a non-starter. You're best off downloading
Java 1.6 yourself, but I understand that it is easier to use the public
package repositories.
Thomas Joiner wrote:
My guess would be that Jetty has some configuration somewhere that is
telling it to use GCJ. Is it
Andrew, you should download Solr from the apache site. This packaging is
wrong-headed.
As to Java, a Linux person would know the system for picking which is
the standard Java.
andrewdps wrote:
Also,the solr Java properties looks like this using gcj,despite setting
java_home in /etc/profile
Rolling logfiles is configured in the servlet container, not Solr.
Indexing logfiles is a pain because of multiline log outputs like
Exceptions.
Vladimir Sutskever wrote:
Can SOLR be configured out of the box to handle rolling log files?
Kind regards,
Vladimir Sutskever
Investment Bank -
301 - 400 of 1360 matches
Mail list logo