Re: Retrieving a field from all result docuemnts couple of more queries

2009-09-16 Thread abhay kumar
Hi, 1)Solr has various type of caches . We can specify how many documents cache can have at a time. e.g. if windowsize=50 50 results will be cached in queryResult Cache. if user makes a new request to server for results after 50 documents a new request will be sent

Re: How to create a new index file automatically

2009-09-16 Thread busbus
It can import documents in certain other formats using the http://wiki.apache.org/solr/ExtractingRequestHandler 1) According to my inference.Solr uses Apache Tikka to convert other rich document format files to Text Files, so that the Class ExtractRequestHandler use the output text file

Re: Solr exception with missing required field (meta_guid_s)

2009-09-16 Thread Shalin Shekhar Mangar
On Wed, Sep 16, 2009 at 1:13 AM, kedardes kedar.w...@gmail.com wrote: Hi, I have a data-config file where I map the fields of a very simple table using dynamic field definitions : document name=names entity name=names query=select * from test

Re: Questions on copyField

2009-09-16 Thread Rahul R
Would appreciate any help on this. Thanks Rahul On Mon, Sep 14, 2009 at 5:12 PM, Rahul R rahul.s...@gmail.com wrote: Hello, I have a few questions regarding the copyField directive in schema.xml 1. Does the destination field store a reference or the actual data ? If I have soemthing like

Re: Solr results filtered on MoreLikeThis

2009-09-16 Thread Marcelk
Hi All, Should I create plugin for this or is there some functionality in solr that can help me. I basically already have part of what I want. The search response gives me a result list with (in my situation) 20 results and the attached morelikethis NamedList. Filtering based on the

Re: Questions on copyField

2009-09-16 Thread Shalin Shekhar Mangar
On Mon, Sep 14, 2009 at 5:12 PM, Rahul R rahul.s...@gmail.com wrote: Hello, I have a few questions regarding the copyField directive in schema.xml 1. Does the destination field store a reference or the actual data ? It makes a copy. Storing or indexing of the field depends on the field

Need help to finalize my autocomplete

2009-09-16 Thread Vincent Pérès
Hello, I'm using the following code for my autocomplete feature : The field type : fieldType name=autoComplete class=solr.TextField omitNorms=true analyzer tokenizer class=solr.WhitespaceTokenizerFactory / filter class=solr.LowerCaseFilterFactory / filter

Re: Need help to finalize my autocomplete

2009-09-16 Thread Avlesh Singh
Instead of tokenizer class=solr.WhitespaceTokenizerFactory / use tokenizer class=solr.KeywordTokenizerFactory/ Cheers Avlesh 2009/9/16 Vincent Pérès vincent.pe...@gmail.com Hello, I'm using the following code for my autocomplete feature : The field type : fieldType name=autoComplete

Re: Need help to finalize my autocomplete

2009-09-16 Thread Vincent Pérès
Hello, I tried to replace the class as you suggested, but I still get the same result (and not results where the query start only with the giving query). fieldType name=autoComplete class=solr.TextField omitNorms=true analyzer tokenizer class=solr.KeywordTokenizerFactory /

Mapping SolrDoc to SolrInputDoc

2009-09-16 Thread Licinio Fernández Maurelo
Hi there, currently i'm working on a small app which creates an Embedded Solr Server, reads all documents from one core and puts these docs into another one. The purpose of this app is to apply (small) changes on schema.xml to indexed data (offline) resulting a new index with documents updated

Re: Need help to finalize my autocomplete

2009-09-16 Thread Shalin Shekhar Mangar
2009/9/16 Vincent Pérès vincent.pe...@gmail.com Hello, I tried to replace the class as you suggested, but I still get the same result (and not results where the query start only with the giving query). Make sure you re-index your documents after change the schema. -- Regards, Shalin

Re: Need help to finalize my autocomplete

2009-09-16 Thread Vincent Pérès
After re-indexing it works very well ! Thanks a lot ! Vincent -- View this message in context: http://www.nabble.com/Need-help-to-finalize-my-autocomplete-tp25468885p25469931.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Mapping SolrDoc to SolrInputDoc

2009-09-16 Thread Martijn v Groningen
Hi Licinio, You can use ClientUtils.toSolrInputDocument(...), that converts a SolrDocument to a SolrInputDocument. Martijn 2009/9/16 Licinio Fernández Maurelo licinio.fernan...@gmail.com: Hi there, currently i'm working on a small app which creates an Embedded Solr Server, reads all

Re: Mapping SolrDoc to SolrInputDoc

2009-09-16 Thread Licinio Fernández Maurelo
I'll try, thanks Martijn 2009/9/16 Martijn v Groningen martijn.is.h...@gmail.com Hi Licinio, You can use ClientUtils.toSolrInputDocument(...), that converts a SolrDocument to a SolrInputDocument. Martijn 2009/9/16 Licinio Fernández Maurelo licinio.fernan...@gmail.com: Hi there,

Re: Solr results filtered on MoreLikeThis

2009-09-16 Thread Chantal Ackermann
Have you had a look at the facet query? Not sure but it might just do what you are looking for. http://wiki.apache.org/solr/SolrFacetingOverview http://wiki.apache.org/solr/SimpleFacetParameters Hi All, Should I create plugin for this or is there some functionality in solr that can help

Re: Solr results filtered on MoreLikeThis

2009-09-16 Thread Marcelk
Hi Chantal, Chantal Ackermann wrote: Have you had a look at the facet query? Not sure but it might just do what you are looking for. http://wiki.apache.org/solr/SolrFacetingOverview http://wiki.apache.org/solr/SimpleFacetParameters I still don't really understand facetting? But It

Re: Retrieving a field from all result docuemnts couple of more queries

2009-09-16 Thread Shashikant Kore
Thanks, Abhay. Can someone please throw light on how to disable scoring? --shashi On Wed, Sep 16, 2009 at 11:55 AM, abhay kumar abhay...@gmail.com wrote: Hi, 1)Solr has various type of caches . We can specify how many documents cache can have at a time.       e.g. if windowsize=50        

Re: Extract info from parent node during data import (redirect:)

2009-09-16 Thread Paul, Noble
Fergus, Implementing wildcard (//tagname) is definitely possible. I would love to see it working. But if you wish to take a dig at it I shall do whatever I can to help. What is the use case that makes flow though so useful? We do not know to which forEach xpath a given field is associated

Re: multicore shards and relevancy score

2009-09-16 Thread Shalin Shekhar Mangar
On Tue, Sep 15, 2009 at 8:11 PM, Paul Rosen p...@performantsoftware.comwrote: The second issue was detailed in an email last week shards and facet count. The facet information is lost when doing a search over two shards, so if I use multicore, I can no longer have facets. If both cores'

DeltaImport problem

2009-09-16 Thread KirstyS
I hope this is the correct place to post this issue and if so, that someone can help. I am using the DIH with Solr 1.3 My data-config.xml file looks like this: dataSource driver=net.sourceforge.jtds.jdbc.Driver url=jdbc:jtds:sqlserver:{taken out for posting}

Re: DeltaImport problem

2009-09-16 Thread Noble Paul നോബിള്‍ नोब्ळ्
I vaguely remember there was an issue with delta-import in 1.3. could you try it out with Solr1.4 On Wed, Sep 16, 2009 at 6:14 PM, KirstyS kirst...@gmail.com wrote: I hope this is the correct place to post this issue and if so, that someone can help. I am using the DIH with Solr 1.3 My

Re: DeltaImport problem

2009-09-16 Thread KirstyS
I thought 1.4 was not released yet? Noble Paul നോബിള്‍ नोब्ळ्-2 wrote: I vaguely remember there was an issue with delta-import in 1.3. could you try it out with Solr1.4 On Wed, Sep 16, 2009 at 6:14 PM, KirstyS kirst...@gmail.com wrote: I hope this is the correct place to post this

Re: Retrieving a field from all result docuemnts couple of more queries

2009-09-16 Thread rajan chandi
You might be talking about modifying the similarity object to modify scoring formula in Lucene! $searcher-setSimilarity($similarity); $writer-setSimilarity($similarity); This can very well be done in Solr as SolrIndexWriter inherits from Lucene IndexWriter class. You might want to download

Re: DeltaImport problem

2009-09-16 Thread Noble Paul നോബിള്‍ नोब्ळ्
yeah, not yet released but going to be released pretty soon On Wed, Sep 16, 2009 at 6:32 PM, KirstyS kirst...@gmail.com wrote: I thought 1.4 was not released yet? Noble Paul നോബിള്‍  नोब्ळ्-2 wrote: I vaguely remember there was an issue with delta-import in 1.3. could you try it out with

Re: DeltaImport problem

2009-09-16 Thread KirstyS
mmm..can't seem to find the link..could you help? Noble Paul നോബിള്‍ नोब्ळ्-2 wrote: yeah, not yet released but going to be released pretty soon On Wed, Sep 16, 2009 at 6:32 PM, KirstyS kirst...@gmail.com wrote: I thought 1.4 was not released yet? Noble Paul നോബിള്‍  नोब्ळ्-2 wrote:

When to use Solr over Lucene

2009-09-16 Thread balaji.a
Hi All, I am aware that Solr internally uses Lucene for search and indexing. But it would be helpful if anybody explains about Solr features that is not provided by Lucene. Thanks, Balaji. -- View this message in context:

Re: Retrieving a field from all result docuemnts couple of more queries

2009-09-16 Thread Shashikant Kore
No, I don't wish to put a custom Similarity. Rather, I want an equivalent of HitCollector where I can bypass the scoring altogether. And I prefer to do it by changing the configuration. --shashi On Wed, Sep 16, 2009 at 6:36 PM, rajan chandi chandi.ra...@gmail.com wrote: You might be talking

Re: DeltaImport problem

2009-09-16 Thread Noble Paul നോബിള്‍ नोब्ळ्
http://people.apache.org/builds/lucene/solr/nightly/ On Wed, Sep 16, 2009 at 6:42 PM, KirstyS kirst...@gmail.com wrote: mmm..can't seem to find the link..could you help? Noble Paul നോബിള്‍  नोब्ळ्-2 wrote: yeah, not yet released but going to be released pretty soon On Wed, Sep 16, 2009

Re: When to use Solr over Lucene

2009-09-16 Thread Grant Ingersoll
On Sep 16, 2009, at 9:26 AM, balaji.a wrote: Hi All, I am aware that Solr internally uses Lucene for search and indexing. But it would be helpful if anybody explains about Solr features that is not provided by Lucene. Solr is a server, Lucene is an API Faceting Distributed search

Re: When to use Solr over Lucene

2009-09-16 Thread Israel Ekpo
Comparing Solr to Lucene is not exactly an apples-to-apples comparison. Solr is a superset of Lucene. It uses the Lucene engine to index and process requests for data retrieval. Start here first : * http://lucene.apache.org/solr/features.html#Solr+Uses+the+Lucene+Search+Library+and+Extends+it !*

Re: When to use Solr over Lucene

2009-09-16 Thread Mark Miller
balaji.a wrote: Hi All, I am aware that Solr internally uses Lucene for search and indexing. But it would be helpful if anybody explains about Solr features that is not provided by Lucene. Thanks, Balaji. Any advanced Lucene application generally goes down the same path: Build a

Re: Retrieving a field from all result docuemnts couple of more queries

2009-09-16 Thread rajan chandi
You will need to get SolrIndexSearcher.java and modify following:- public static final int GET_SCORES = 0x01; --Rajan On Wed, Sep 16, 2009 at 6:58 PM, Shashikant Kore shashik...@gmail.comwrote: No, I don't wish to put a custom Similarity. Rather, I want an equivalent of

Re: When to use Solr over Lucene

2009-09-16 Thread Israel Ekpo
Also Solr simplifies the process of implementing the client side interface. You can use the same indices with clients written in any programming language. The client side could be in virtually any programming language of your choosing. If you were to work directly with Lucene, that would not be

Re: Disabling tf (term frequency) during indexing and/or scoring

2009-09-16 Thread Alexey Serba
Hi Aaron, You can overwrite default Lucene Similarity and disable tf and lengthNorm factors in scoring formula ( see http://lucene.apache.org/java/2_4_1/api/org/apache/lucene/search/Similarity.html and http://lucene.apache.org/java/2_4_1/api/index.html ) You need to 1) compile the following

Re: Disabling tf (term frequency) during indexing and/or scoring

2009-09-16 Thread Erik Hatcher
Just FYI - you can put Solr plugins in solr-home/lib as JAR files rather than messing with solr.war Erik On Sep 16, 2009, at 10:15 AM, Alexey Serba wrote: Hi Aaron, You can overwrite default Lucene Similarity and disable tf and lengthNorm factors in scoring formula ( see

Any way to encrypt/decrypt stored fields?

2009-09-16 Thread Jay Hill
For security reasons (say I'm indexing very sensitive data, medical records for example) is there a way to encrypt data that is stored in Solr? Some businesses I've encountered have such needs and this is a barrier to them adopting Solr to replace other legacy systems. Would it require a

Re: CSV Update - Need help mapping csv field to schema's ID

2009-09-16 Thread Insight 49, LLC
Thanks guys... Yonik and Grant commented on this thread in the dev group. Dan Chris Hostetter wrote: : I would like to add an additional name:value pair for every line, mapping the : sku field to my schema's id field: : : .map={sku.field}:{id} the map param is for replacing a *value* with

Re: Any way to encrypt/decrypt stored fields?

2009-09-16 Thread Bill Au
That's certainly something that is doable with a filter. I am not aware of any available. Bill On Wed, Sep 16, 2009 at 10:39 AM, Jay Hill jayallenh...@gmail.com wrote: For security reasons (say I'm indexing very sensitive data, medical records for example) is there a way to encrypt data that

Re: Any way to encrypt/decrypt stored fields?

2009-09-16 Thread Erik Hatcher
This could be achieved purely client-side if all you're talking about is a stored field (not indexed/searchable). The client-side could encrypt and encode the encrypted bits as text that Solr/Lucene can store. Then decrypt client-side. Erik On Sep 16, 2009, at 10:39 AM, Jay Hill

Re: do NOT want to stem plurals for a particular field, or words

2009-09-16 Thread Alexey Serba
 You can enable/disable stemming per field type in the schema.xml, by removing the stemming filters from the type definition. Basically, copy your prefered type, rename it to something like 'text_nostem', remove the stemming filter from the type and use your 'text_nostem' type for your field

Re: faceted query not working as i expected

2009-09-16 Thread Jonathan Vanasco
Thank you Ahmet. I forgot to encapuslate the searched string in quotations. On Sep 15, 2009, at 5:19 PM, AHMET ARSLAN wrote: --- On Tue, 9/15/09, Jonathan Vanasco jvana...@2xlp.com wrote: From: Jonathan Vanasco jvana...@2xlp.com Subject: faceted query not working as i expected To:

Highlighting in stemmed or n-grammed fields possible?

2009-09-16 Thread David Espinosa
Hi, Anybody knows how to get the highlighted field, when q term matches in a stemmed or n-grammed filtered field? Matching in a normal field (not stemmed or n-grammed) highlighting works perfectly as expected. But in stemmed matching cases, no highlighting fields are recovered, and in

Re: FileListEntityProcessor and LineEntityProcessor

2009-09-16 Thread Fergus McMenemie
Hi, I'm trying to import data from a list of files using the FileListEntityProcessor. Here is my import configuration: dataSource type=FileDataSource name=fileDataSource/ document name=dict-entries entity name=f processor=FileListEntityProcessor baseDir=d:\my\directory\ fileName=.*WRK

Re: FileListEntityProcessor and LineEntityProcessor

2009-09-16 Thread wojtekpia
Fergus McMenemie-2 wrote: Can you provide more detail on what you are trying to do? ... You seem to listing all files d:\my\directory\.*WRK. Do these WRK files contain lists of files to be indexed? That is my complete data config file. I have a directory containing a bunch of files

Effect of SynonymFilter on Solr document fields

2009-09-16 Thread Prasanna Ranganathan
Hi, I am a newbie to Solr and request you all to kindly excuse any rookie mistakes. I have the following questions: We use the Synonym Filter on one of the indexed fields. It is specified in the schema.xml as one of the filters (for the analyzer type index) for that field. I believe that this

Re: FileListEntityProcessor and LineEntityProcessor

2009-09-16 Thread wojtekpia
Note that if I change my import file to explicitly list all my files (instead of using the FileListEntityProcessor) as below then everything works as I expect. dataSource type=FileDataSource name=fileDataSource basePath=d:\my\directory\/ document name=dict-entries entity name=jc

Latest trunk locks execution thread in SolrCore.getSearcher()

2009-09-16 Thread Dadasheva, Olga
Hi, I am testing EmbeddedSolrServer vs StreamingUpdateSolrServer for my crawlers using more or less recent Solr code and everything was fine till today when I took the latest trunk code. When I start my crawler I see a number of INFO outputs 2009-09-16 21:08:29,399 INFO Adding

Re: Latest trunk locks execution thread in SolrCore.getSearcher()

2009-09-16 Thread Yonik Seeley
On a quick look, it looks like this was caused (or at least triggered by) https://issues.apache.org/jira/browse/SOLR-1427 Registering the bean in the SolrCore constructor causes it to immediately turn around and ask for the stats which asks for a searcher, which blocks. -Yonik

Re: FileListEntityProcessor and LineEntityProcessor

2009-09-16 Thread Noble Paul നോബിള്‍ नोब्ळ्
I have opened an issue SOLR-1440 On Thu, Sep 17, 2009 at 2:46 AM, wojtekpia wojte...@hotmail.com wrote: Note that if I change my import file to explicitly list all my files (instead of using the FileListEntityProcessor) as below then everything works as I expect.  dataSource

Re: [DIH] URLDataSource and fetching a link

2009-09-16 Thread Noble Paul നോബിള്‍ नोब्ळ्
2009/9/17 Noble Paul നോബിള്‍ नोब्ळ् noble.p...@corp.aol.com: it is possible to have a sub entity which has XPathEntityProcessor which can use the link ar the url This may not be a good solution. But you can use the $hasMore and $nextUrl options of XPathEntityProcessor to recursively loop if

Re: Questions on copyField

2009-09-16 Thread Rahul R
Shalin, Can you please elaborate a little more on the third response *You can send the location's value directly as the value of the text field.* I dont follow. I am adding 'name' and 'age' to the 'text' field through the schema. If I add the 'location' from the program, will either one copy