from:"Markus Jelsma"

Re: Tag Cloud Generation Problem

2010-04-08 Thread Markus Jelsma

buzzWord, sum(frequency) from Verbatim where count(frequency) > thresholdValue group by buzzWord. Is there a similar way I can query SOLR? Even a workaround solution to this will do. Thanks. Regards, Ninad R Markus Jelsma - Technisch Architect - Buyways BV http://www.linkedin.com

Re: Tag Cloud Generation Problem

2010-04-08 Thread Markus Jelsma

, Ninad R On Thu, Apr 8, 2010 at 2:09 PM, Markus Jelsma mar...@buyways.nl wrote: The faceting engine can do this job. On Thursday 08 April 2010 10:16:09 Ninad Raut wrote: Hi, I have a business use case wherein I have to generate a tag cloud for words with frequency greater
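A minimal sketch of the faceting approach suggested above, assuming a Solr instance at a hypothetical local URL and a hypothetical field named buzzWord. It only builds the request URL; facet.mincount keeps terms that occur in at least that many documents, so for "greater than a threshold" pass threshold + 1. Note that facet counts are document counts, which may not match a sum over a separate frequency column.

```python
from urllib.parse import urlencode


def tag_cloud_query(field, min_count, base_url="http://localhost:8983/solr/select"):
    """Build a Solr URL that returns facet counts for `field`, keeping only
    terms found in at least `min_count` documents.

    `base_url` and `field` are assumptions for illustration."""
    params = [
        ("q", "*:*"),                   # match every document
        ("rows", 0),                    # only the facet counts are needed
        ("facet", "true"),
        ("facet.field", field),
        ("facet.mincount", min_count),  # drop terms below the threshold
        ("facet.limit", -1),            # do not cap the number of terms
    ]
    return base_url + "?" + urlencode(params)


# Hypothetical field name and threshold.
url = tag_cloud_query("buzzWord", 5)
print(url)
```

The term/count pairs in the facet section of the response can then be scaled into font sizes on the client side.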

RE: Re: Using Solr with CouchDB

2010-04-28 Thread Markus Jelsma

Hi, Setting up CouchDB-Lucene is quite easy, but you don't want that, I guess. You could construct a show function to convert input to Solr-accepted XML; that should be very straightforward. You just need some program to fetch from CouchDB and push it into Solr. Cheers, -Original

RE: Re: Using Solr with CouchDB

2010-04-28 Thread Markus Jelsma

Whether you need Solr depends on whether you require features such as highlighting, faceting, more-like-this, etc. They will not work with CouchDB-Lucene, nor can you, at this moment, use CouchDB-Lucene behind CouchDB-Lounge; although a separate shard can have a sharded Lucene index, you cannot

RE: schema.xml question

2010-05-07 Thread Markus Jelsma

You could write your own requestHandler in solrconfig.xml, it'll allow you to predefine parameters for your configured search components. -Original message- From: Antonello Mangone antonello.mang...@gmail.com Sent: Fri 07-05-2010 15:17 To: solr-user@lucene.apache.org; Subject:

RE: Re: schema.xml question

2010-05-07 Thread Markus Jelsma

don't know how to do it, but I'll follow your suggestion :) Thank you very much ... ps. I'm just a novice 2010/5/7 Markus Jelsma markus.jel...@buyways.nl You could write your own requestHandler in solrconfig.xml, it'll allow you to predefine parameters for your configured search components

RE: Re: schema.xml question

2010-05-07 Thread Markus Jelsma

For the moment I don't know how to do it, but I'll follow your suggestion :) Thank you very much ... ps. I'm just a novice 2010/5/7 Markus Jelsma markus.jel...@buyways.nl You could write your own requestHandler in solrconfig.xml, it'll allow you to predefine parameters for your configured search

RE: Help indexing PDF files

2010-05-07 Thread Markus Jelsma

Hi, The wiki page [1] on this subject will get you started. [1]: http://wiki.apache.org/solr/ExtractingRequestHandler Cheers -Original message- From: Leonardo Azize Martins laz...@gmail.com Sent: Fri 07-05-2010 15:37 To: solr-user@lucene.apache.org; Subject: Help

RE: Re: Help indexing PDF files

2010-05-07 Thread Markus Jelsma

. Thanks 2010/5/7 Markus Jelsma markus.jel...@buyways.nl Hi, The wiki page [1] on this subject will get you started. [1]: http://wiki.apache.org/solr/ExtractingRequestHandler Cheers -Original message- From: Leonardo Azize Martins laz...@gmail.com Sent: Fri 07-05-2010 15

RE: How to query for similar documents before indexing

2010-05-10 Thread Markus Jelsma

Hi, Deduplication [1] is what you're looking for. It can utilize different analyzers that will add one or more signatures or hashes to your document depending on exact or partial matches for configurable fields. Based on that, it should be able to prevent new documents from entering the

RE: How to query for similar documents before indexing

2010-05-10 Thread Markus Jelsma

to be a silent action? Or will the add method return that it failed indexing because it detected a duplicate? Is it committed to the 1.4 already? Cheers matt --- On Mon, 5/10/10, Markus Jelsma markus.jel...@buyways.nl wrote: From: Markus Jelsma markus.jel...@buyways.nl Subject: RE: How to query

Dedupe and overwriteDupes setting

2010-05-11 Thread Markus Jelsma

clearly see the correct signatures. Most documents have a distinct signature and some share the same because the content field's value is identical for those documents. Anyway, why does it delete all my documents? Any clues? The wiki is not very helpful on this subject. Cheers. Markus

Re: Dedupe and overwriteDupes setting

2010-05-11 Thread Markus Jelsma

It seems this e-mail did already leave the outbox yesterday. Apologies for the spam. On Tuesday 11 May 2010 10:13:18 Markus Jelsma wrote: List, I've stumbled upon an issue with the deduplication mechanism. It either deletes all documents or does nothing at all and it depends

Re: How to query for similar documents before indexing

2010-05-11 Thread Markus Jelsma

as long as such a feature isn't there. Can you explain more what you have in mind ? Thank you for your help! matt --- On Mon, 5/10/10, Markus Jelsma markus.jel...@buyways.nl wrote: From: Markus Jelsma markus.jel...@buyways.nl Subject: RE: How to query for similar documents before

RE: Re: Dedupe and overwriteDupes setting

2010-05-11 Thread Markus Jelsma

field set to indexed=false and overwriteDupes=true it should likely complain) -- - Mark http://www.lucidimagination.com On 5/11/10 4:13 AM, Markus Jelsma wrote: List, I've stumbled upon an issue with the deduplication mechanism. It either deletes all documents or does nothing at all

RE: Config issue for deduplication

2010-05-13 Thread Markus Jelsma

What's your solrconfig? No deduplication happens when overwriteDupes = false and the signature field is other than the (unique) doc ID field -Original message- From: Markus Fischer i...@flyingfischer.ch Sent: Thu 13-05-2010 17:01 To: solr-user@lucene.apache.org; Subject: Config issue for

RE: Solr read-only core

2010-05-25 Thread Markus Jelsma

Hi, I'd guess there are two ways of doing this, but I've never seen any solrconfig.xml file with directives that explicitly disallow updates. You'd either have a proxy in front that simply won't allow any HTTP method other than GET and HEAD, or you could remove the update

Re: Filtering near-duplicates using TextProfileSignature

2010-06-09 Thread Markus Jelsma

</processor> <processor class="solr.LogUpdateProcessorFactory" /> <processor class="solr.RunUpdateProcessorFactory" /> </updateRequestProcessorChain> -- Thanks in advance, -Ali Markus Jelsma - Technisch Architect - Buyways BV http://www.linkedin.com/in/markus17 050-8536620 / 06-50258350

Re: Filtering near-duplicates using TextProfileSignature

2010-06-09 Thread Markus Jelsma

addition that's not documented yet? Sorry about this -- stupid question -- I should have read back through the thread and refreshed my memory. Markus Jelsma - Technisch Architect - Buyways BV http://www.linkedin.com/in/markus17 050-8536620 / 06-50258350

Re: Issue with response header in SOLR running on Linux instance

2010-06-09 Thread Markus Jelsma

machine <response> <responseHeader> <status>0</status> <QTime>26</QTime> <lst name="params"> <str name="q">credit</str> </lst> </responseHeader> Any idea why this happens? Thanks, Barani Markus Jelsma - Technisch Architect - Buyways BV http://www.linkedin.com/in/markus17 050-8536620 / 06

RE: Re: Solr and Nutch/Droids - to use or not to use?

2010-06-16 Thread Markus Jelsma

Nutch does not, at this moment, support some form of consistent hashing to select an appropriate shard. It would be nice if someone could file an issue in Nutch' Jira to add sharding support to it, perhaps someone with a better understanding and more experience with Solr's distributed search

RE: Re: Re: Solr and Nutch/Droids - to use or not to use?

2010-06-16 Thread Markus Jelsma

which shard they want to send documents to. This may be a good case for a new wish issue in Solr JIRA? Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Original Message From: Markus Jelsma markus.jel...@buyways.nl

RE: federated / meta search

2010-06-17 Thread Markus Jelsma

Hi, Check out Solr's sharding [1] capabilities. I never tested it with different schemas, but if each node is queried with fields that it supports, it should return useful results. [1]: http://wiki.apache.org/solr/DistributedSearch Cheers. -Original message- From: Sascha

RE: remove from list

2010-06-23 Thread Markus Jelsma

If you want to unsubscribe, then you can do so [1] without trying to sell something ;) [1]: http://lucene.apache.org/solr/mailing_lists.html Cheers! -Original message- From: Susan Rust su...@achieveinternet.com Sent: Wed 23-06-2010 18:23 To: solr-user@lucene.apache.org; Erik

Re: Cache hits exposed by API

2010-06-29 Thread Markus Jelsma

to know if the cache hit rates of solr exposed via the API of solr? Markus Jelsma - Technisch Architect - Buyways BV http://www.linkedin.com/in/markus17 050-8536620 / 06-50258350

RE: Disabling Access to Solr Admin Panel

2010-06-29 Thread Markus Jelsma

Hi, Check out the wiki [1] on this subject. [1]: http://wiki.apache.org/solr/SolrSecurity Cheers, -Original message- From: Vladimir Sutskever vladimir.sutske...@jpmorgan.com Sent: Tue 29-06-2010 18:05 To: solr-user@lucene.apache.org; Subject: Disabling Access to Solr Admin

RE: Re: Faceted search outofmemory

2010-06-29 Thread Markus Jelsma

http://wiki.apache.org/solr/SimpleFacetParameters#facet.limit -Original message- From: olivier sallou olivier.sal...@gmail.com Sent: Tue 29-06-2010 20:11 To: solr-user@lucene.apache.org; Subject: Re: Faceted search outofmemory How do make paging over facets? 2010/6/29 Ankit Bhatnagar

RE: Re: Disable Solr Response Formatting

2010-06-30 Thread Markus Jelsma

Hi, My client makes a mess out of your example but if you mean formatting as in indenting, then send indent=false, but it's already false by default. Check your requestHandler settings. Cheers, -Original message- From: JohnRodey timothydd...@yahoo.com Sent: Wed 30-06-2010 18:39

RE: Solr results not updating

2010-07-06 Thread Markus Jelsma

Hi, If q=*:* doesn't show your insert, then you forgot the commit: http://wiki.apache.org/solr/UpdateXmlMessages#A.22commit.22_and_.22optimize.22 Cheers, -Original message- From: Moazzam Khan moazz...@gmail.com Sent: Tue 06-07-2010 22:09 To: solr-user@lucene.apache.org;
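As a rough illustration of the missing commit, the sketch below builds (but does not send) the XML commit message described on the linked wiki page. The update URL is an assumption for a default local install; adjust it for your own setup.

```python
import urllib.request


def build_commit_request(update_url="http://localhost:8983/solr/update"):
    """Build an HTTP POST carrying Solr's XML <commit/> message.

    `update_url` is an assumed default, not taken from the thread."""
    body = b"<commit/>"
    return urllib.request.Request(
        update_url,
        data=body,
        headers={"Content-Type": "text/xml; charset=utf-8"},
        method="POST",
    )


request = build_commit_request()
# To actually send it against a running Solr:
#     urllib.request.urlopen(request)
```

After the commit has been processed, a q=*:* query should include the newly added documents.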

RE: /select handler statistics

2010-07-12 Thread Markus Jelsma

Hi, I think you're looking for the statistics for the standard request handler. Cheers, -Original message- From: Vladimir Sutskever vladimir.sutske...@jpmorgan.com Sent: Mon 12-07-2010 19:44 To: solr-user@lucene.apache.org; Subject: /select handler statistics Hi All, I am

RE: Problem with Wildcard searches in Solr

2010-07-12 Thread Markus Jelsma

Hi, The DisMaxQParser does not support wildcards in its q parameter [1]. You must use the LuceneQParser instead. AFAIK, in DisMax, wildcards are part of the search query and may get filtered out in your query analyzer. [1]: http://wiki.apache.org/solr/DisMaxRequestHandler#q Cheers,

RE: Problem with Wildcard searches in Solr

2010-07-12 Thread Markus Jelsma

Hi, Check edismax' JIRA page and its unresolved related issues [1]. AFAIK, it hasn't been committed yet. [1]: https://issues.apache.org/jira/browse/SOLR-1553 Cheers, -Original message- From: imranak imranak...@gmail.com Sent: Mon 12-07-2010 23:55 To:

Re: Ranking position in solr

2010-07-13 Thread Markus Jelsma

/QueryElevationComponent - which is used to elevate results based on editorial decisions - may help. Markus Jelsma - Technisch Architect - Buyways BV http://www.linkedin.com/in/markus17 050-8536620 / 06-50258350

Re: indexing rich documents

2010-07-13 Thread Markus Jelsma

directory for solr is /opt/solr/example and catalina home is /opt/tomcat6. thanks regards, swaroop Markus Jelsma - Technisch Architect - Buyways BV http://www.linkedin.com/in/markus17 050-8536620 / 06-50258350

Re: Tag generation

2010-07-15 Thread Markus Jelsma

suited to associate with that content. Does anyone know if there is a contrib to Solr or Lucene that does something like this? Or a third party tool that can be given a solr index or solr query and it comes up with some good Tag suggestions? Markus Jelsma - Technisch Architect - Buyways BV

Wiki, login and password recovery

2010-07-19 Thread Markus Jelsma

Hi, This probably should be in INFRA (to which i'm not subscribed) or something like that. Anyway, for some reason, my user/pass won't let me login anymore and i'm quite sure my browser still `remembers` the correct combination. I'm unsure whether this is a bug: to get that answer, i need

RE: Re: Wiki, login and password recovery

2010-07-19 Thread Markus Jelsma

This happened just a few hours ago and the problem persists at this very moment. I filed an issue: https://issues.apache.org/jira/browse/INFRA-2884 Cheers! -Original message- From: Chris Hostetter hossman_luc...@fucit.org Sent: Mon 19-07-2010 20:23 To: solr-user@lucene.apache.org;

RE: boosting particular field values

2010-07-21 Thread Markus Jelsma

function queries match all documents http://wiki.apache.org/solr/FunctionQuery#Using_FunctionQuery -Original message- From: Justin Lolofie jta...@gmail.com Sent: Wed 21-07-2010 20:24 To: solr-user@lucene.apache.org; Subject: boosting particular field values I'm using dismax request

Re: SolrJ Response + JSON

2010-07-28 Thread Markus Jelsma

you. - Mitch Markus Jelsma - Technisch Architect - Buyways BV http://www.linkedin.com/in/markus17 050-8536620 / 06-50258350

RE: Stress Test Solr

2010-08-02 Thread Markus Jelsma

Very interesting. Could you add some information and a link to the relevant wiki page [1]? [1]: http://wiki.apache.org/solr/BenchmarkingSolr -Original message- From: Tomas tomasflo...@yahoo.com.ar Sent: Mon 02-08-2010 17:34 To: solr-user@lucene.apache.org; Subject: Stress Test Solr

RE: Phrase search

2010-08-02 Thread Markus Jelsma

Well, the WordDelimiterFilterFactory in your query analyzer clearly makes Apple 2 out of Apple2, that's what it's for. If you're looking for an exact match, use a string field. Check the output with the debugQuery=true parameter. Cheers, -Original message- From: johnmu...@aol.com

RE: Re: Phrase search

2010-08-02 Thread Markus Jelsma

Apple 2 to Apple2? How do I fix it? Also, I'm really confused about this. I was under the impression a phrase search is not impacted by the analyzer, no? -M -Original Message- From: Markus Jelsma markus.jel...@buyways.nl To: solr-user@lucene.apache.org Sent: Mon, Aug 2, 2010 2:27 pm

RE: Multi word synomyms

2010-08-03 Thread Markus Jelsma

Hi, This happens because your tokenizer will generate separate tokens for `exercise dvds`, so the SynonymFilter will try to find declared synonyms for `exercise` and `dvds` separately. Its behavior is documented [1] on the wiki. [1]:

RE: Indexing fieldvalues with dashes and spaces

2010-08-04 Thread Markus Jelsma

You shouldn't fetch faceting results from analyzed fields, it will mess with your results. Search on analyzed fields but don't retrieve values from them. -Original message- From: PeterKerk vettepa...@hotmail.com Sent: Wed 04-08-2010 22:15 To: solr-user@lucene.apache.org; Subject: RE:

RE: Indexing fieldvalues with dashes and spaces

2010-08-04 Thread Markus Jelsma

Hmm, you should first read a bit more on schema design on the wiki and learn about indexing and querying Solr. The copyField directive is what is commonly used in a faceted navigation system, search on analyzed fields, show faceting results using the primitive string field type. With

RE: Re: Load cores without restarting/reloading Solr

2010-08-05 Thread Markus Jelsma

http://wiki.apache.org/solr/CoreAdmin -Original message- From: Karthik K karthikkato...@gmail.com Sent: Thu 05-08-2010 12:00 To: solr-user@lucene.apache.org; Subject: Re: Load cores without restarting/reloading Solr Can some one please answer this. Is there a way of creating/adding a

RE: dismax debugging hyphens dashes

2010-08-07 Thread Markus Jelsma

Well, that smells like a WordDelimiterFilterFactory [1]. It splits, as your debug output shows, value into three separate tokens. This means that (at least) the strings 'abc', '12' and 'def' are in your index and can be found. The abc12 value is not present. If you want to query for

RE: Re: Facet Fields - ID vs. Display Value

2010-08-09 Thread Markus Jelsma

Well, you can do both, of course, but there's no need for additional code if you get it for free. I'd prefer - as most, I assume - to use the label as a facet field. -Original message- From: Frank A fsa...@gmail.com Sent: Tue 10-08-2010 01:11 To: solr-user@lucene.apache.org; Subject:

RE: Re: uniqueKey and custom fieldType

2010-08-15 Thread Markus Jelsma

copyField it to an analyzed field will do the trick. -Original message- From: j jta...@gmail.com Sent: Sun 15-08-2010 20:30 To: solr-user@lucene.apache.org; Subject: Re: uniqueKey and custom fieldType Hi Erick, thanks- your explanation makes sense. But how then, do I make my unique

RE: Newbie question about search behavior

2010-08-16 Thread Markus Jelsma

You can append it in your middleware, or try the EdgeNGramTokenizer [1]. If you're going for the latter, don't forget to reindex and expect a larger index. [1]: http://lucene.apache.org/java/2_9_0/api/all/org/apache/lucene/analysis/ngram/EdgeNGramTokenizer.html -Original message-

RE: help on facet range

2010-08-16 Thread Markus Jelsma

No http://wiki.apache.org/solr/SimpleFacetParameters#Facet_by_Range https://issues.apache.org/jira/browse/SOLR-1240 -Original message- From: Peng, Wei wei.p...@xerox.com Sent: Mon 16-08-2010 20:25 To: solr-user@lucene.apache.org; Subject: RE: help on facet range The solr version

RE: Re: Solr searching performance issues, using large documents

2010-08-16 Thread Markus Jelsma

I've no idea if it's possible, but I'd at least try to return an ArrayList of rows instead of just a single row. And if it doesn't work, which is probably the case, how about filing an issue in Jira? Reading the docs on the matter, I think it should be (made) possible to return multiple

RE: Faceting by fields that contain special characters

2010-08-19 Thread Markus Jelsma

A very common issue, you need to facet on a non-analyzed field. http://lucene.472066.n3.nabble.com/Indexing-fieldvalues-with-dashes-and-spaces-td1023699.html#a1222961 -Original message- From: Christos Constantinou ch...@simpleweb.co.uk Sent: Thu 19-08-2010 15:08 To:

RE: Showing results based on facet selection

2010-08-19 Thread Markus Jelsma

Hi, A facet query serves a different purpose [1]. You need to filter your result set [2]. And don't forget to follow the links on caching and such. [1]: http://wiki.apache.org/solr/SimpleFacetParameters#facet.query_:_Arbitrary_Query_Faceting [2]:

RE: Solr for multiple websites

2010-08-19 Thread Markus Jelsma

http://osdir.com/ml/solr-user.lucene.apache.org/2009-09/msg00630.html http://osdir.com/ml/solr-user.lucene.apache.org/2009-03/msg00309.html Load balancing is a bit out of scope here, but all you need is a simple HTTP load balancer and a replication mechanism, depending on your setup.

RE: Autosuggest on PART of cityname

2010-08-19 Thread Markus Jelsma

You need a new analyzed field with the EdgeNGramTokenizer or you can try facet.prefix for this to work. To retrieve the number of locations for that city, just use the results from the faceting engine as usual. I'm unsure which approach is actually faster but i'd guess using the

RE: Autosuggest on PART of cityname

2010-08-19 Thread Markus Jelsma

Hmm, you have only four documents in your index i guess? That would make sense because you query for *:*. This technique doesn't rely on the found documents but the faceting engine so you should include rows=0 in your query and the fl parameter is not required anymore. Also, add facet=true to
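The parameters mentioned in this thread (rows=0, facet=true, and the facet.prefix approach) can be combined roughly as follows. This is a sketch only: the field name city and the facet.mincount setting are assumptions, and the field is expected to be a non-analyzed string field.

```python
from urllib.parse import urlencode


def autosuggest_params(prefix, field="city"):
    """Build the query string for a facet-based autosuggest request.

    `field` is a hypothetical non-analyzed field holding city names."""
    return urlencode([
        ("q", "*:*"),              # the documents themselves are not used
        ("rows", 0),               # so no rows need to be returned
        ("facet", "true"),
        ("facet.field", field),
        ("facet.prefix", prefix),  # only terms starting with the typed text
        ("facet.mincount", 1),     # assumption: hide terms with no documents
    ])


query_string = autosuggest_params("ams")
print(query_string)
```

Each facet entry in the response carries the city name together with the number of matching documents, which is the count the original poster asked for.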

RE: Autosuggest on PART of cityname

2010-08-20 Thread Markus Jelsma

You can't, it's analyzed. And if you facet on a non-analyzed field, you cannot distinguish between upper- and lowercase tokens. If you want that, you must create a new field with an EdgeNGramTokenizer, search on it and then you can facet on a non-analyzed field. Your query will be a bit

Re: Document Section in Solr

2010-08-27 Thread Markus Jelsma

: If the document being indexed is a big file, is there a provision for dividing the document into sections? E.g., a 20MB file divided into 10 sections, so that a search shows the right section. Markus Jelsma - Technisch Architect - Buyways BV http://www.linkedin.com/in/markus17

Re: Status of Solr in the cloud?

2010-08-27 Thread Markus Jelsma

goal, correct? I'm interested in evaluating one of these solutions for my company; which is the most stable or most likely to eventually be part of the Solr distribution? Thanks, Charlie Markus Jelsma - Technisch Architect - Buyways BV http://www.linkedin.com/in/markus17 050-8536620

Re: Auto ID for Documents indexed

2010-08-27 Thread Markus Jelsma

August 2010 11:41:55 maheshkumar wrote: Is there a feature to provide an auto-increment id to the document which is getting indexed? This is the schema file: <field name="reference" type="string" indexed="true" stored="true" required="true"/> <field name="id" type="string" indexed="true" stored="true"/> Markus Jelsma

Re: Multiple passes with WordDelimiterFilterFactory

2010-08-27 Thread Markus Jelsma

, Shawn Markus Jelsma - Technisch Architect - Buyways BV http://www.linkedin.com/in/markus17 050-8536620 / 06-50258350

Re: A few query issues with solr

2010-08-27 Thread Markus Jelsma

terFilterFactory Markus Jelsma - Technisch Architect - Buyways BV http://www.linkedin.com/in/markus17 050-8536620 / 06-50258350

Re: Problem related to Sorting in Solr1.4

2010-08-27 Thread Markus Jelsma

="text" indexed="true" stored="true" /> <field name="UPDBY" type="text" indexed="true" stored="true" /> I have these two columns and I want to sort on them. Can anyone please suggest what I need to do for that? I am currently using Solr 1.4. Markus Jelsma - Technisch Architect

RE: Spellcheck in multilanguage search

2010-08-31 Thread Markus Jelsma

Configure language specific fields and spellcheckers just as you would for a single language index, so multiple content_LANG fields and spell_LANG field. This will, of course, only work if you know in what language the search operates. -Original message- From: Grijesh.singh

RE: Memcache for Solr

2010-08-31 Thread Markus Jelsma

Hi, In a restaurant index website, we have used Memcache only for storing the generated HTML facet list when q=*. This cached object was only used when no additional search parameters were specified. It was quite useful because the facet list was always present and only changed if real

Re: Proximity search + Highlighting

2010-09-01 Thread Markus Jelsma

: hl=on&hl.fl=qFR,iFR,mFR,vlFR&hl.usePhraseHighlighter=false&hl.highlightMultiTerm=true&hl.simple.pre=b&hl.simple.post=%2Fb&hl.mergeContiguous=false Markus Jelsma - Technisch Architect - Buyways BV http://www.linkedin.com/in/markus17 050-8536620 / 06-50258350

Re: shingles work in analyzer but not real data

2010-09-01 Thread Markus Jelsma

the keywords, but somehow the shingling doesn't appear to work on this live data. Does anyone have experience with shingling that might have some tips for us, or otherwise advice for debugging the issue? Thanks, Jeff Markus Jelsma - Technisch Architect - Buyways BV http://www.linkedin.com

Re: morelikethis - stored=true is necessary?

2010-09-02 Thread Markus Jelsma

shows similarity scores of 0 for all documents. However, this is not the case for fields that define stored=true. Does it mean I must set stored=true for MLT to work? Also, does multivalued have an effect on the result? Thanks! Markus Jelsma - Technisch Architect - Buyways BV http

Re: How to retrieve the full corpus

2010-09-06 Thread Markus Jelsma

prohibited. If you have received this message in error, please notify the sender immediately by telephone, or e-mail and delete all copies of this message and any attachments from your system. Thank you. Markus Jelsma - Technisch Architect - Buyways BV http://www.linkedin.com/in/markus17 050

RE: getting started - books/in dept material

2010-09-06 Thread Markus Jelsma

Did you miss the wiki? http://wiki.apache.org/solr/SolrResources -Original message- From: Dennis Gearon gear...@sbcglobal.net Sent: Mon 06-09-2010 22:05 To: solr-user@lucene.apache.org; Subject: getting started - books/in dept material I really don't want to understand the code that

RE: Re: SolrCloud distributed indexing (Re: anyone use hadoop+solr?)

2010-09-06 Thread Markus Jelsma

The remainder of an arithmetic division http://en.wikipedia.org/wiki/Modulo_operation -Original message- From: Dennis Gearon gear...@sbcglobal.net Sent: Mon 06-09-2010 22:04 To: solr-user@lucene.apache.org; Subject: Re: SolrCloud distributed indexing (Re: anyone use hadoop+solr?) What
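A small sketch of how the modulo operation could be used to pick a shard for a document, which appears to be the context of this thread. The function name, the choice of CRC32 as the hash, and the document id are all assumptions for illustration, not something Solr provides.

```python
import zlib


def select_shard(doc_id, num_shards):
    """Map a document id to a shard number in the range 0..num_shards-1.

    Hypothetical helper: hash the id, then take the remainder of dividing
    the hash by the number of shards."""
    if num_shards < 1:
        raise ValueError("num_shards must be at least 1")
    # crc32 is stable across runs, unlike Python's built-in hash() for strings
    return zlib.crc32(doc_id.encode("utf-8")) % num_shards


remainder = 17 % 5                  # 17 = 3 * 5 + 2, so the remainder is 2
shard = select_shard("doc-42", 4)   # always the same shard for the same id
```

One known limitation of plain modulo is that changing the number of shards moves most documents to a different shard.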

Re: Nutch/Solr

2010-09-07 Thread Markus Jelsma

fields such as content, tstamp, title. How can I map the content field after crawling? Do I have to change the Lucene code (such as adding an extra field)? Or handle it in the Solr stage? Any suggestions? Thx. -- Yavuz Selim YILMAZ Markus Jelsma - Technisch Architect - Buyways BV http

Re: Nutch/Solr

2010-09-07 Thread Markus Jelsma

, then comes me that, content indexed twice for every new field. By the way, any details about how to get new fields from the content will be helpful. -- Yavuz Selim YILMAZ 2010/9/7 Markus Jelsma markus.jel...@buyways.nl Depends on your version of Nutch. At least trunk and 1.1 obey

RE: Is there a way to fetch the complete list of data from a particular column in SOLR document?

2010-09-07 Thread Markus Jelsma

q=*:*&fl=id_FIELD&rows=NUM_DOCS ? -Original message- From: bbarani bbar...@gmail.com Sent: Tue 07-09-2010 23:09 To: solr-user@lucene.apache.org; Subject: Is there a way to fetch the complete list of data from a particular column in SOLR document? Hi, I am trying to get complete list

RE: Re: MoreLikethis and fq not giving exact results ?

2010-09-07 Thread Markus Jelsma

I can think of two useful cases for an optional mlt.fq parameter that limits the MLT results for each document, based on that fq: 1. prevent irrelevant docs when in a deep faceted navigation 2. general search results with MLT where you need

RE: Re: MoreLikethis and fq not giving exact results ?

2010-09-07 Thread Markus Jelsma

I know =) I was just polling votes for a feature request - there is no such issue filed for this component. Perhaps there should be? -Original message- From: Chris Hostetter hossman_luc...@fucit.org Sent: Wed 08-09-2010 00:13 To: solr-user@lucene.apache.org; Subject: RE: Re:

Invariants on a specific fq value

2010-09-08 Thread Markus Jelsma

through the fq parameter. It will allow the outside world to specify a value on another field through the fq parameter such as : fq:anotherField:someValue. Any ideas? Cheers, Markus Jelsma - Technisch Architect - Buyways BV http://www.linkedin.com/in/markus17 050-8536620 / 06-50258350

RE: How to import data with a different date format

2010-09-08 Thread Markus Jelsma

No. The DateField [1] will not accept it any other way. You could, however, fool your boss and dump your dates in an ordinary string field. But then you cannot use some of the nice date features. [1]: http://lucene.apache.org/solr/api/org/apache/solr/schema/DateField.html -Original

RE: Re: How to import data with a different date format

2010-09-08 Thread Markus Jelsma

to do date queries. I guess when I export the data I can just add 00:00:00Z. Thanks. - Original Message From: Markus Jelsma markus.jel...@buyways.nl To: solr-user@lucene.apache.org Sent: Wed, September 8, 2010 11:34:32 AM Subject: RE: How to import data with a different date format
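The export step mentioned above could look roughly like this. It is a sketch under assumptions: the source dates are taken to be plain YYYY-MM-DD strings (the thread does not show the actual format), and each date is treated as midnight UTC. Note the T separator between date and time that DateField's ISO 8601 style expects.

```python
from datetime import datetime


def to_solr_date(day, input_format="%Y-%m-%d"):
    """Convert a date-only string to the UTC timestamp form used by DateField.

    `input_format` is an assumption; change it to match the exported data."""
    parsed = datetime.strptime(day, input_format)
    # A date without a time parses as midnight, giving ...T00:00:00Z
    return parsed.strftime("%Y-%m-%dT%H:%M:%SZ")


solr_value = to_solr_date("2010-09-08")
print(solr_value)  # 2010-09-08T00:00:00Z
```

For another source layout, pass a matching format, for example to_solr_date("08/09/2010", "%d/%m/%Y").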

RE: Re: How to import data with a different date format

2010-09-08 Thread Markus Jelsma

of ID So it might be worth it to do some searching earlier rather than later G Best Erick On Wed, Sep 8, 2010 at 12:34 PM, Markus Jelsma markus.jel...@buyways.nl wrote: No. The DateField [1] will not accept it any other way. You could, however, fool your boss and dump your dates

RE: Re: Invariants on a specific fq value

2010-09-08 Thread Markus Jelsma

exist in 1.4. Jonathan Markus Jelsma wrote: Hi, I have an index with several collections. Every document has a collection field that specifies the collection it belongs to. To make querying easier (and restrict exposed parameters) i have a request handler for each collection. The request

RE: Re: Re: Invariants on a specific fq value

2010-09-08 Thread Markus Jelsma

; Subject: Re: Re: Invariants on a specific fq value 2010 at 1:32 PM, Markus Jelsma markus.jel...@buyways.nl wrote: Interesting! I haven't met the appends method before and i'll be sure to give it a try tomorrow. Try, the wiki [1] is not very clear on what it really does. Here's a comment from

RE: Re: Re: Invariants on a specific fq value

2010-09-08 Thread Markus Jelsma

Excellent! You already made my day for tomorrow! I'll check its behavior with fq parameters specifying a filter for the same field! -Original message- From: Chris Hostetter hossman_luc...@fucit.org Sent: Wed 08-09-2010 21:04 To: solr-user@lucene.apache.org; Subject: RE: Re: Re:

RE: svn branch issues

2010-09-09 Thread Markus Jelsma

http://svn.apache.org/repos/asf/lucene/dev/branches/ -Original message- From: Mark Allan mark.al...@ed.ac.uk Sent: Thu 09-09-2010 10:44 To: solr-user@lucene.apache.org; Subject: svn branch issues Hi all, As I've mentioned in the past, I've created some custom field types which make

Re: svn branch issues

2010-09-09 Thread Markus Jelsma

On 9 Sep 2010, at 10:47 am, Markus Jelsma wrote: http://svn.apache.org/repos/asf/lucene/dev/branches/ -Original message- From: Mark Allan mark.al...@ed.ac.uk Sent: Thu 09-09-2010 10:44 To: solr-user@lucene.apache.org; Subject: svn branch issues Hi all, As I've

Re: Indexing checksum of field value

2010-09-09 Thread Markus Jelsma

=text_sha1 I haven't found anything like that in the docs or on google. Did I miss something? If not, would a custom tokenizer be a good way to implement it? /Staffan Markus Jelsma - Technisch Architect - Buyways BV http://www.linkedin.com/in/markus17 050-8536620 / 06-50258350

Re: Inconsistent search results with multiple keywords

2010-09-09 Thread Markus Jelsma

=im_cck_field_authorfacet.field=typefacet.field=im_vid_1=indent= onstart=0version=2.2rows=10} hits=10704 status=0 QTime=1 any hint on whether this is expected or not appreciated. Steph. Markus Jelsma - Technisch Architect - Buyways BV http://www.linkedin.com/in/markus17 050-8536620 / 06-50258350

Re: Garbled facets even in a zero hit search

2010-09-09 Thread Markus Jelsma

: Markus Jelsma - Technisch Architect - Buyways BV http://www.linkedin.com/in/markus17 050-8536620 / 06-50258350

RE: Re: Inconsistent search results with multiple keywords

2010-09-09 Thread Markus Jelsma

Indeed, it's the dismax, i missed it! My bad.. -Original message- From: Ahmet Arslan iori...@yahoo.com Sent: Thu 09-09-2010 20:37 To: solr-user@lucene.apache.org; Subject: Re: Inconsistent search results with multiple keywords yes, my schema.xml file have solrQueryParser

RE: roadmap

2010-09-09 Thread Markus Jelsma

You should check Jira's roadmap [1] instead. It shows a clear picture of what has been done since the 1.4.1 release and pending issues for the 3.x branch and others. [1]: https://issues.apache.org/jira/browse/SOLR?report=com.atlassian.jira.plugin.system.project:roadmap-panel

RE: Re: Re: Invariants on a specific fq value

2010-09-09 Thread Markus Jelsma

- From: Yonik Seeley yo...@lucidimagination.com Sent: Wed 08-09-2010 19:38 To: solr-user@lucene.apache.org; Subject: Re: Re: Invariants on a specific fq value 2010 at 1:32 PM, Markus Jelsma markus.jel...@buyways.nl wrote: Interesting! I haven't met the appends method before and i'll be sure

RE: Help on spelling.

2010-09-09 Thread Markus Jelsma

I don't see you passing spellcheck parameters in the query string. Are they configured as default in your search handler? -Original message- From: Gregg Hoshovsky hosho...@ohsu.edu Sent: Thu 09-09-2010 22:40 To: solr-user@lucene.apache.org; Subject: Help on spelling. I am trying to

RE: How to Update Value of One Field of a Document in Index?

2010-09-10 Thread Markus Jelsma

The MoreLikeThis component actually can accept external input: http://wiki.apache.org/solr/MoreLikeThisHandler#Using_ContentStreams -Original message- From: Jonathan Rochkind rochk...@jhu.edu Sent: Fri 10-09-2010 18:59 To: solr-user@lucene.apache.org; Subject: RE: How to Update Value

RE: multivalued fields in result

2010-09-11 Thread Markus Jelsma

Yes, you'll get what is stored and asked for. -Original message- From: Jason Chaffee jchaf...@ebates.com Sent: Sat 11-09-2010 05:27 To: solr-user@lucene.apache.org; Subject: multivalued fields in result Is it possible to return multivalued fields in the result? I would like to have

RE: Re: solr.DateField: org.apache.solr.common.SolrException: Error while creating field

2010-09-14 Thread Markus Jelsma

It would be a nice feature if Solr supports queries with time zone support on an index where all times are UTC. There is some chatter about this in SOLR-750 but i haven't found an issue that would add support for time zone queries. Did i do a lousy search or is the issue missing as of yet?

Re: Handling Aggregate Records/Roll-up in Solr

2010-09-16 Thread Markus Jelsma

flatten the representation of the shirt in the data model. Markus Jelsma - Technisch Architect - Buyways BV http://www.linkedin.com/in/markus17 050-8536620 / 06-50258350

RE: Re: Get all results from a solr query

2010-09-16 Thread Markus Jelsma

Not according to the wiki; http://wiki.apache.org/solr/CommonQueryParameters#rows But you could always create an issue for this one. -Original message- From: Christopher Gross cogr...@gmail.com Sent: Thu 16-09-2010 22:50 To: solr-user@lucene.apache.org; Subject: Re: Get all

Re: Search the mailinglist?

2010-09-17 Thread Markus Jelsma

any convenient way to search through the archive. Thanks for your help Markus Jelsma - Technisch Architect - Buyways BV http://www.linkedin.com/in/markus17 050-8536620 / 06-50258350

RE: Grouping in solr ?

2010-09-23 Thread Markus Jelsma

http://wiki.apache.org/solr/FieldCollapsing https://issues.apache.org/jira/browse/SOLR-236 -Original message- From: Papp Richard ccode...@gmail.com Sent: Thu 23-09-2010 21:29 To: solr-user@lucene.apache.org; Subject: Grouping in solr ? Hi all, is it possible somehow to group
