Boost, weight, proximity, ranking which one?

2010-09-03 Thread javaxmlsoapdev
I am using solr 1.4 version. I have a requirement where need to show up all documents first which matched most words from the free text search string. e.g. If user was searching for two words with no quotes connectivity breakup my search results should display all documents where both words

can't find solr.xml

2009-10-30 Thread javaxmlsoapdev
I have downloaded apache-solr-1.3.0.tgz for Linux and don't see solr.xml. can someone assist. -- View this message in context: http://old.nabble.com/can%27t-find-solr.xml-tp26136630p26136630.html Sent from the Solr - User mailing list archive at Nabble.com.

Index documents with Solr

2009-11-04 Thread javaxmlsoapdev
Wanted to find out how people are using Solr’s ExtractingRequestHandler to index different types of documents from a configuration file in an import fashion. I want to use this handler in a similar way how DataImportHandler works where you can issue “import” command from the URL to create an

how to search against multiple attributes in the index

2009-11-13 Thread javaxmlsoapdev
I want to build AND search query against field1 AND field2 etc. Both these fields are stored in an index. I am migrating lucene code to Solr. Following is my existing lucene code BooleanQuery currentSearchingQuery = new BooleanQuery(); currentSearchingQuery.add(titleDescQuery,Occur.MUST);

Re: how to search against multiple attributes in the index

2009-11-13 Thread javaxmlsoapdev
/f4d91628ced293bf/lucene_query_to_solr_query Cheers Avlesh On Fri, Nov 13, 2009 at 10:02 PM, javaxmlsoapdev vika...@yahoo.com wrote: I already did dive in before. I am using solrj API and SolrQuery object to build query. but its not clear/written how to build booleanQuery ANDing bunch

Re: how to search against multiple attributes in the index

2009-11-13 Thread javaxmlsoapdev
- #and between two fields solrQuery.setQuery(+field1:foo +field2:bar); #or between two fields solrQuery.setQuery(field1:foo field2:bar); Cheers Avlesh On Fri, Nov 13, 2009 at 10:35 PM, javaxmlsoapdev vika...@yahoo.com wrote: I think I found the answer. needed to read more API

Re: How to use DataImportHandler with ExtractingRequestHandler?

2009-11-20 Thread javaxmlsoapdev
did you extend DIH to do this work? can you share code samples. I have similar requirement where I need tp index database records and each record has a column with document path so need to create another index for documents (we allow users to search both index separately) in parallel with reading

RE: Index documents with Solr

2009-11-20 Thread javaxmlsoapdev
but not clear yet how to get that id to ExtractingRequestHandler. Would be good to see different examples on the Wiki. Have not yet had a first attempt - hoping to in a day or so. -Original Message- From: javaxmlsoapdev [mailto:vika...@yahoo.com] Sent: Wed 04-Nov-2009 5:42 PM

Very busy search screen

2009-11-23 Thread javaxmlsoapdev
I have a client who wants to search on almost every attribute of an object (nearly 15 attributes) on the search screen. Search sreen looks very crazy/busy. I was wondering if there are better ways to address these requirements and build intelligent categorized/configurable searchs? including

Re: How to use DataImportHandler with ExtractingRequestHandler?

2009-11-23 Thread javaxmlsoapdev
Anyone any idea? javaxmlsoapdev wrote: did you extend DIH to do this work? can you share code samples. I have similar requirement where I need tp index database records and each record has a column with document path so need to create another index for documents (we allow users to search

Re: ExternalRequestHandler and ContentStreamUpdateRequest usage

2009-11-23 Thread javaxmlsoapdev
? which maps document content to wrong attribute? thanks, Grant Ingersoll-6 wrote: On Nov 23, 2009, at 5:04 PM, javaxmlsoapdev wrote: Following code is from my test case where it tries to index a file (of type .txt) ContentStreamUpdateRequest up = new ContentStreamUpdateRequest(/update

Re: ExternalRequestHandler and ContentStreamUpdateRequest usage

2009-11-23 Thread javaxmlsoapdev
? Thanks, javaxmlsoapdev wrote: *:* returns me 1 count but when I search for specific word (which was part of .txt file I indexed before) it doesn't return me anything. I don't have luke setup on my end. let me see if I can set that up quickly but otherwise do you see anything I am missing

Re: ExternalRequestHandler and ContentStreamUpdateRequest usage

2009-11-24 Thread javaxmlsoapdev
this. thanks, Grant Ingersoll-6 wrote: On Nov 23, 2009, at 5:33 PM, javaxmlsoapdev wrote: *:* returns me 1 count but when I search for specific word (which was part of .txt file I indexed before) it doesn't return me anything. I don't have luke setup on my end. http://localhost:8983

Re: ExternalRequestHandler and ContentStreamUpdateRequest usage

2009-11-24 Thread javaxmlsoapdev
name=response numFound=1 start=0 - doc - arr name=content str702/str strtext/plain/str strdoc123.txt/str str / /arr long name=key8978/long /doc /result Any idea? Thanks, javaxmlsoapdev wrote: http://machinename:port/solr/admin/luke gives me 404 error so seems like its

Re: ExternalRequestHandler and ContentStreamUpdateRequest usage

2009-11-24 Thread javaxmlsoapdev
I was able to configure /docs index separately from my db data index. still I am seeing same behavior where it only puts .docName its size in the content field (I have renamed field to content in this new schema) below are the only two fields I have in schema.xml field name=key type=slong

Re: ExternalRequestHandler and ContentStreamUpdateRequest usage

2009-11-24 Thread javaxmlsoapdev
when a document is marked for deletion. df values include deleted documents./str /lst /response javaxmlsoapdev wrote: I was able to configure /docs index separately from my db data index. still I am seeing same behavior where it only puts .docName its size in the content field (I have

Re: ExternalRequestHandler and ContentStreamUpdateRequest usage

2009-11-25 Thread javaxmlsoapdev
know if you need anything else. Appreciate your help. Thanks, javaxmlsoapdev wrote: Following is luke response. lst name=fields / is empty. can someone assist to find out why file content isn't being index? ?xml version=1.0 encoding=UTF-8 ? response lst name=responseHeader int name

Where to put ExternalRequestHandler and Tika jars

2009-11-25 Thread javaxmlsoapdev
My SOLR_HOME =/home/solr_1_4_0/apache-solr-1.4.0/example/solr/conf in tomcat.sh POI, PDFBox, Tika and related jars are under /home/solr_1_4_0/apache-solr-1.4.0/lib When I try to index files using SolrJ API as follow, I don't see content of the file being indexed. It only indexes file size

Re: Where to put ExternalRequestHandler and Tika jars

2009-11-25 Thread javaxmlsoapdev
g. I had to include tika and related parsing jars into tomcat/webapps/solr/WEB-INF/lib.. this was an embarrassing mistake. apologies for all the noise. Thanks, -- View this message in context:

Batch file upload using solrJ API

2009-11-25 Thread javaxmlsoapdev
Is there an API to upload files over one connection versus looping through all the files and creating new ContentStreamUpdateRequest for each file. This, as expected, doesn't work if there are large number of files and quickly run into memory problems. Please advise. Thanks, -- View this

Re: Where to put ExternalRequestHandler and Tika jars

2009-11-30 Thread javaxmlsoapdev
with solrj and I have the same problem and could not retrieve file data. On Wed, Nov 25, 2009 at 3:41 PM, javaxmlsoapdev vika...@yahoo.com wrote: g. I had to include tika and related parsing jars into tomcat/webapps/solr/WEB-INF/lib.. this was an embarrassing mistake. apologies

Re: Batch file upload using solrJ API

2009-11-30 Thread javaxmlsoapdev
Any suggestion/pointers on this? javaxmlsoapdev wrote: Is there an API to upload files over one connection versus looping through all the files and creating new ContentStreamUpdateRequest for each file. This, as expected, doesn't work if there are large number of files and quickly run

Solr plugin or something else for custom work?

2009-11-30 Thread javaxmlsoapdev
I have a requirement where I am indexing attachements. Attachements hang off of a database entity(table). I also need to include some meta-data info from the database table as part of the index. Trying to find best way to implement using custom handler or something? where custom handler gets all

Re: dismax query syntax to replace standard query

2009-12-04 Thread javaxmlsoapdev
(not to be confused with qf) to add a filter query in addition to the q parameter. So your text search value goes in q parameter (which searches on the fields you configure) and the rest of the query goes in the fq. Would that work? On Thu, Dec 3, 2009 at 7:28 PM, javaxmlsoapdev vika...@yahoo.com wrote

how to set multiple fq while building a query in solrj

2009-12-04 Thread javaxmlsoapdev
how do I create a query string witih multiple fq params using solrj SolrQuery API. e.g. I want to build a query as follow http://servername:port/solr/issues/select/?q=testingfq=statusName:(Female OR Male)fq=name=Joe I am using solrj client APIs to build query and using SolrQuery as follow

store content only of documents

2009-12-15 Thread javaxmlsoapdev
I store document in a field content field defiend as follow in schema.xml field name=content type=text indexed=true stored=true multiValued=true/ and following in solrconfig.xml requestHandler name=/update/extract class=org.apache.solr.handler.extraction.ExtractingRequestHandler lst

Re: store content only of documents

2009-12-17 Thread javaxmlsoapdev
Anyone? javaxmlsoapdev wrote: I store document in a field content field defiend as follow in schema.xml field name=content type=text indexed=true stored=true multiValued=true/ and following in solrconfig.xml requestHandler name=/update/extract class

Re: Searching .msg files

2009-12-17 Thread javaxmlsoapdev
1)use tika to index .msg files (Tika does support Microsoft outlook format and I am already using Tika: http://lucene.apache.org/tika/formats.html). 2)while indexing you'll have to write handler to extract To, CC, Bcc values and store it in a separate field in index. 3)when user searches on .msg

Build index by consuming web service

2009-12-30 Thread javaxmlsoapdev
I am in a need of a handler which consumes web serivce and builds index from return results of the service. Until now I was building index by reading data directly from database query using DataImportHandler. There are new functional requirements to index calculated fields in the index and

weird text stripping issue

2010-01-28 Thread javaxmlsoapdev
I am observing very weird text stripping issue. when I search for word Search I get following doc str name=descriptionIssue 18 Search String/str int name=key4688/int str name=titleIssue 18 Search String2/str /doc And highliting node lst name=4688 arr name=title strIssue 18

RE: weird text stripping issue

2010-01-28 Thread javaxmlsoapdev
Analyzers are default. anything in particular to look for? ANKITBHATNAGAR wrote: Check you analyzers Ankit -Original Message- From: javaxmlsoapdev [mailto:vika...@yahoo.com] Sent: Thursday, January 28, 2010 4:46 PM To: solr-user@lucene.apache.org Subject: weird text

Any idea what could be wrong with this fq value?

2010-02-03 Thread javaxmlsoapdev
Following is my solr URL. http://hostname:port/solr/entities/select/?version=2.2start=0indent=onqt=dismaxrows=60fq=statusName:(Open OR Cancelled)debugQuery=trueq=devfq=groupName:Infrastructure“ “groupName” is one of the attributes I create fq (filterQuery) on. This field(groupName) is being

Re: Any idea what could be wrong with this fq value?

2010-02-03 Thread javaxmlsoapdev
, at 10:05 AM, javaxmlsoapdev wrote: Following is my solr URL. http://hostname:port/solr/entities/select/? version=2.2start=0indent=onqt=dismaxrows=60fq=statusName:(Open OR Cancelled)debugQuery=trueq=devfq=groupName:Infrastructure“ “groupName” is one of the attributes I create fq (filterQuery

DataImportHandler can't understand query

2010-02-08 Thread javaxmlsoapdev
I have a complex query (runs fine in database), which I am trying to include in DataImportHandler query. Query has case statements with in it e.g. case when (ASSIGNED_TO '' and TRANSLATE(ASSIGNED_TO, '', '0123456789')='') DataImportHandler failes to understand query with following

Re: DataImportHandler can't understand query

2010-02-08 Thread javaxmlsoapdev
Note I already tried to escape character with \ but still it throws same error. Any idea? Thanks, javaxmlsoapdev wrote: I have a complex query (runs fine in database), which I am trying to include in DataImportHandler query. Query has case statements with in it e.g. case

Good literature on search basics

2010-02-12 Thread javaxmlsoapdev
Does anyone know good literature(web resources, books etc) on basics of search? I do have Solr 1.4 and Lucene books but wanted to go in more details on basics. Thanks, -- View this message in context: http://old.nabble.com/Good-literature-on-search-basics-tp27562021p27562021.html Sent from

HttpDataSource consume REST API with Authentication required

2010-03-04 Thread javaxmlsoapdev
I have to use http://wiki.apache.org/solr/DataImportHandler#Usage_with_XML.2BAC8-HTTP_Datasource HttpDataSource to ask Solr consume my REST service and index data returned from that service. My application/service has authentication/authorization. When Solr invokes this service it MUST have

Numeric search in text field

2010-10-05 Thread javaxmlsoapdev
Hello, I have a string Marsh 1 (no quotes while searching). If I put Marsh 1 in the search box with no quotes I get expected results back but when I search for just 1 (again no quotes) I don't get any results back. I use WorldDelimiterFactory as follow. Any idea? analyzer type=query

Partial search extremly slow

2011-02-25 Thread javaxmlsoapdev
Since my users wanted to have a partial search functionality I had to introduce following. I have declared two EdgeNGram filters with both side back and front since they wanted to have partial search working from any side. fieldType name=edgytext class=solr.TextField analyzer tokenizer

Smart Pagination queries

2011-03-08 Thread javaxmlsoapdev
e.g. There are 4,000 solr documents that were found for a particular word search. My app has entitlement rules applied to those 4,000 documents and it's quite possible that user is only eligible to view 3,000 results out of 4K. This is achieved through post filtering application logic. My

Custom request handler/plugin

2011-03-13 Thread javaxmlsoapdev
I have a requirement to filter solr results, based on my application entitlement rules,before they are returned from Solr to the application. In order to do this, I have to implement a custom plugin, which gets entire solr results and apply my application rules (which will be exposed via web

Re: Custom request handler/plugin

2011-03-14 Thread javaxmlsoapdev
Below are the reasons why I thought it wouldn't be feasible to have pre-filtered results with filter queries. please comment. Since can't pen down direct business reqs due to confidentially contact with the client, I'll mock out scenario using an example. - There is a parent entity called Quiz,

Re: Custom request handler/plugin

2011-03-18 Thread javaxmlsoapdev
Thanks for the response. Finally I have decided to build access intelligence into the Solr to pre filter the results by storing required attributes in the index to determine the access. -- View this message in context:

Solr performance

2011-05-11 Thread javaxmlsoapdev
I have some 25 odd fields with stored=true in schema.xml. Retrieving back 5,000 records back takes a few secs. I also tried passing fl and only include one field in the response but still response time is same. What are the things to look to tune the performance. Thanks, -- View this message in

Re: Solr performance

2011-05-13 Thread javaxmlsoapdev
Alright. It turned out that defaultSearchField=title where title field is of a custom fieldType=edgyText where fieldType name=edtyText class=solr.TextField filter class=solr.EdgeNGramFilterFactory minGramSize=1 maxGramSize=25 side=back/ filter class=solr.EdgeNGramFilterFactory