Re: Solr substring search

2013-09-06 Thread Alvaro Cabrerizo
Hi: I would start looking: http://docs.lucidworks.com/display/solr/The+Standard+Query+Parser And the org.apache.lucene.queryparser.flexible.standard.StandardQueryParser.java Hope it helps. On Thu, Sep 5, 2013 at 11:30 PM, Scott Schneider scott_schnei...@symantec.com wrote: Hello, I'm

Re: Dynamic Field

2013-09-09 Thread Alvaro Cabrerizo
Hi: As you posted, a possibility could be, to define the fields jobs and batch as multivalued and use the partial updatehttp://solr.pl/en/2012/07/09/solr-4-0-partial-documents-update/to add new values to those fields. Hope it helps. On Sun, Sep 8, 2013 at 9:49 PM, anurag.jain

synonyms and term position

2013-10-09 Thread Alvaro Cabrerizo
Hi: I'm involved in a process o upgrade solr from 1.4 to 4.4 and I'm having a problem using SynonymFilterFactory within the process chain SynonymFilterFactory, StopFilterFactory . I have configured synonyms.txt to expand the word AIO as: all-in-one. Well, when using solr 1.4 I get the following

Re: synonyms and term position

2013-10-09 Thread Alvaro Cabrerizo
that words? 2013/10/9 Alvaro Cabrerizo topor...@gmail.com Hi: I'm involved in a process o upgrade solr from 1.4 to 4.4 and I'm having a problem using SynonymFilterFactory within the process chain SynonymFilterFactory, StopFilterFactory . I have configured synonyms.txt to expand the word AIO

Re: synonyms and term position

2013-10-09 Thread Alvaro Cabrerizo
No, it has no synonyms. On Wed, Oct 9, 2013 at 10:48 AM, Furkan KAMACI furkankam...@gmail.comwrote: Does two has a synonym of in and one? 2013/10/9 Furkan KAMACI furkankam...@gmail.com Does two has a synonym of in and one? 2013/10/9 Alvaro Cabrerizo topor...@gmail.com Sure, Find

Re: synonyms and term position

2013-10-09 Thread Alvaro Cabrerizo
The synonyms.txt has defined the next associations defined. AIO=All in one aio=all-in-one Regards. On Wed, Oct 9, 2013 at 11:05 AM, Alvaro Cabrerizo topor...@gmail.comwrote: No, it has no synonyms. On Wed, Oct 9, 2013 at 10:48 AM, Furkan KAMACI furkankam...@gmail.comwrote: Does two has

measure result set quality

2013-10-17 Thread Alvaro Cabrerizo
Hi, Imagine the next situation. You have a corpus of documents and a list of queries extracted from production environment. The corpus haven't been manually annotated with relvant/non relevant tags for every query. Then you configure various solr instances changing the schema (adding synonyms,

Re: measure result set quality

2013-10-21 Thread Alvaro Cabrerizo
On Thu, Oct 17, 2013 at 11:28 AM, Alvaro Cabrerizo topor...@gmail.com wrote: Hi, Imagine the next situation. You have a corpus of documents and a list of queries extracted from production environment. The corpus haven't been manually annotated with relvant/non relevant tags

Re: Example of join using Solr/Lucene

2013-11-05 Thread Alvaro Cabrerizo
In my case, everytime I've used joins, the FROM field was a multivalued string and the TO was an univalued string. Regards. El 05/11/2013 18:37, Tech Id tech.login@gmail.com escribió: I think Solr has the ability to do joins in the latest version as verified on this issue:

Re: Example of join using Solr/Lucene

2013-11-06 Thread Alvaro Cabrerizo
, Alvaro Cabrerizo topor...@gmail.com wrote: In my case, everytime I've used joins, the FROM field was a multivalued string and the TO was an univalued string. Regards. El 05/11/2013 18:37, Tech Id tech.login@gmail.com escribió: I think Solr has the ability to do joins

Re: Example of join using Solr/Lucene

2013-11-06 Thread Alvaro Cabrerizo
q gives me all the documents and later I filter them saying just to return those which companyId has a company tha t satisfies id=23 or in the second example its name start by toyo and its id starts by 2. Regards. On Wed, Nov 6, 2013 at 7:55 PM, Alvaro Cabrerizo topor...@gmail.com wrote: You

Re: Example of join using Solr/Lucene

2013-11-06 Thread Alvaro Cabrerizo
with a name that starts with toyo (AND _query_:{!join to=companyId from=id}name:toyo*) - *{!join to=id from=ownerId}* Use the former list of cars to make a join with the documents pointed by ownedId (i.e. people). Regards. On Wed, Nov 6, 2013 at 9:43 PM, Alvaro Cabrerizo topor

Re: SOLR keyword search with fq queries

2013-11-08 Thread Alvaro Cabrerizo
Please, check if defaults, appends and invariants from http://wiki.apache.org/solr/SearchHandler can solve your problem. Regards. On Fri, Nov 8, 2013 at 6:05 AM, atuldj.jadhav atuldj.jad...@gmail.comwrote: Hi All,I need your help to find a solution to one of the issue I am facing with the

Re: Indexing a token to a different field in a custom filter

2013-11-12 Thread Alvaro Cabrerizo
Hi, Maybe the synonym filterhttp://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.SynonymFilterFactoryis the mirror you can look in. You can start creating a new field type in your schema that is stanbol enhanced. Let's follow with the parallelism, in the case of synonym we could have

Re: Why do people want to deploy to Tomcat?

2013-11-12 Thread Alvaro Cabrerizo
In my case, the selection of the servlet container has never been a hard requirement. I mean, some customers provide us a virtual machine configured with java/tomcat , others have a tomcat installed and want to share it with solr, others prefer jetty because their sysadmins are used to configure

Re: Modify the querySearch to q=*:*

2013-11-13 Thread Alvaro Cabrerizo
Hi: First of all I have to say that I had never heard about *\* as the query to get all the documents in a index but *:* (maybe I'm wrong) . Re-reading Apache Solr 4 cookbook, Solr 1.4 Enterprise Search Server and Apache Solr 3 Enterprise Search Server there is no trace for the query *\* as the

Re: Thought exercise: features for Solr client

2013-11-14 Thread Alvaro Cabrerizo
Here goes my wishlist: - Transaction management - Access control at document level Regards. On Thu, Nov 14, 2013 at 10:35 AM, Alexandre Rafalovitch arafa...@gmail.comwrote: Hello, I am trying to imagine what would a new, fresh, Solr client library look like. There has been a number

Expansion of fields in fl parameter

2013-11-28 Thread Alvaro Cabrerizo
The question: ¿Is there a way to avoid field expansion when adding flhttp://wiki.apache.org/solr/CommonQueryParameters#flparameter to the request? The environment: Solr 4.4 using the next schema.xml field name=id type=string indexed=true stored=true required=true / field name=01_text

Re: Expansion of fields in fl parameter

2013-11-28 Thread Alvaro Cabrerizo
definitions? What happens when you request all fields (fl=*)? On Thursday, November 28, 2013 3:59 PM, Alvaro Cabrerizo topor...@gmail.com wrote: The question: ¿Is there a way to avoid field expansion when adding flhttp://wiki.apache.org/solr/CommonQueryParameters#flparameter to the request

Re: Facetting by field then query

2014-03-27 Thread Alvaro Cabrerizo
I don't think you can do it, as pivot facetinghttp://wiki.apache.org/solr/SimpleFacetParameters#Pivot_.28ie_Decision_Tree.29_Faceting doesn't let you use facet queries. The closer query I can imagine is: - q=sentence:bar OR sentence:foo - facet=true - facet.pivot=media_id,sentence At

Re: Does sorting skip everything having to do with relevancy?

2014-04-04 Thread Alvaro Cabrerizo
Hi, By default solr is using the sort parameter over the score field. So if you overwrite it using other sort field, yes solr will use the parameter you've provided. Remember, you can use multiple fields for sortinghttp://wiki.apache.org/solr/CommonQueryParameters#sort so you can make something

Re: Boosing Basic

2014-04-04 Thread Alvaro Cabrerizo
Hi, If I were you, I would start reading the edismax documentationhttps://cwiki.apache.org/confluence/display/solr/The+Extended+DisMax+Query+Parser. Apart from the wiki, you can find in every distribution a full example with the configuration of the edismax query parser (check the xml node

Re: Solr join and lucene scoring

2014-04-04 Thread Alvaro Cabrerizo
Hi, The defect you are referencing is closed with a resolution of *Invalid*, so it seems the scoring is working fine with the join. I've made the next two tests on my own data and seems it is working: *TestA* - fl=id,score - q=notebook - fq={!join from=product_list to=id

Re: Does sorting skip everything having to do with relevancy?

2014-04-04 Thread Alvaro Cabrerizo
). The parameter boost will be overwritten if it is not defined as an invariant. Regards. On Fri, Apr 4, 2014 at 4:12 PM, Shawn Heisey s...@elyograg.org wrote: On 4/4/2014 12:48 AM, Alvaro Cabrerizo wrote: By default solr is using the sort parameter over the score field. So if you overwrite

Re: Solr join and lucene scoring

2014-04-09 Thread Alvaro Cabrerizo
Right, The examples posted before had a trick (both weren't pure join queries thats why the results had score). Having a pure join query (q={!join from=id to=id}printer) gives all the scores as 1 as Moritz said). Regards. On Wed, Apr 9, 2014 at 3:13 PM, Mikhail Khludnev

Re: Join solr multi core select field mixed other core

2014-04-22 Thread Alvaro Cabrerizo
Hi, Please, check using ...q={!join+to=cat_id+from=id+fromIndex=category}name:*... Regards. On Tue, Apr 22, 2014 at 3:37 AM, hungctk33 nokiatwo9...@gmail.com wrote: Hi . all. I has table Product (id, name, cat_id) Category (id, name); Select pro.id, pro.name, cat.name from Product as pro

Re: Join in solr to get data from two cores

2014-05-14 Thread Alvaro Cabrerizo
There are two previous threads in the list that i think can help you, http://mail-archives.apache.org/mod_mbox/lucene-solr-user/201405.mbox/%3c1398929537117-4134045.p...@n3.nabble.com%3E

commit persistence guarantee

2014-05-14 Thread Alvaro Cabrerizo
Hi, Is there any guarantee that every document is persisted on disk during a commit avalanche that produces the: ERROR org.apache.solr.core.SolrCore – org.apache.solr.common.SolrException: Error opening new searcher. *exceeded limit of maxWarmingSearchers*=1, try again later. I've made some

Re: search multiple cores

2014-05-15 Thread Alvaro Cabrerizo
As far as I know (and how i have been using it), the join can't do what you want. The structure of the query you could try (among others) is : 1. http://SOLR_ADDRESS/coreA/select?q=Afq={!join ... fromCore=coreB}B 2. http://SOLR_ADDRESS/coreA/select?q=A AND

Re: Indexing DateField timezone problem

2014-05-16 Thread Alvaro Cabrerizo
I guess, you will need to modify your extraction select in order to fix it, using some date functions provided by the database manufacturer. For example, in some projects when using oracle as a data source i've been using the next recipe to modify the oracle TIMESTAMP(6) datatype to fit the solr

Re: Boost documents having a field value

2014-06-02 Thread Alvaro Cabrerizo
Hi, One option (not tested by myself), could be the use of payloads ( http://wiki.apache.org/solr/Payloads). Regards. On Mon, Jun 2, 2014 at 7:58 PM, Hakim Benoudjit h.benoud...@gmail.com wrote: Hi guys, Is it possible in solr to boost documents having a field value (Ex. field:value)? I

query time join (stored or indexed value field?)

2015-01-26 Thread Alvaro Cabrerizo
Hi, Is the time join query http://wiki.apache.org/solr/Join using stored data or indexed data from the fields set in from and to? (For example, the facet feature makes the count based on the indexed data) I've made an small example (using tokenizers, stopwords...) and it seems that the join uses

Re: Detect ongoing Solr Import and its Completion

2015-02-05 Thread Alvaro Cabrerizo
Maybe you are asking for the status command. Currently this is the url I invoke for checking whether the import process is running (or has failed) From the cwiki: The URL is http://host:port/solr/collection_name/dataimport?command=status. It returns statistics on the number of documents

Re: Problem with faceting

2015-02-06 Thread Alvaro Cabrerizo
Hi David, Yes it sounds weird. Just for testing purpose, It would be nice to have the ID_bent fieldtype definition. Regards. On Fri, Feb 6, 2015 at 9:05 AM, david.dav...@correo.aeat.es wrote: Hello, we have been using faceting for a long time, but now I have discovered a problem that I

Re: Problem with faceting

2015-02-06 Thread Alvaro Cabrerizo
de Informática Tributaria Subdirección de Tecnologías de Análisis de la Información e Investigación del Fraude Teléfono: 915828763 Extensión: 36763 De: Alvaro Cabrerizo topor...@gmail.com Para: solr-user@lucene.apache.org solr-user@lucene.apache.org, Fecha: 06/02/2015 12:34 Asunto

Re: Newly observed Facets

2015-01-19 Thread Alvaro Cabrerizo
At first impression, everything seems ok. Anyway, is the startTimeISO single-value or multivalued field? In case it is single-value the clause startTimeISO:[2015-01-19T00: 00:00.000Z TO 2015-01-20T00:00:00.000Z] is sufficient to exclude other period of time. I also guess that the startTimeISO

Re: Newly observed Facets

2015-01-20 Thread Alvaro Cabrerizo
that were never seen in the last 30 days, but they were observed today. That is the main challenge I am having. How to identify which usernames from today were not seen in the last 30 days. On Tue, Jan 20, 2015, 1:02 AM Alvaro Cabrerizo topor...@gmail.com wrote: Ok, Thus as commented

Re: Newly observed Facets

2015-01-20 Thread Alvaro Cabrerizo
Ok, Thus as commented before, in case your starttimeISO is single-value you only need to add the range clause: startTimeISO:[2015-01-19T00: 00:00.000Z TO 2015-01-20T00:00:00.000Z]. There is no need to add both NOT A AND B as the documents that satisfy B will automatically satisfy A. If you

Re: Newly observed Facets

2015-01-20 Thread Alvaro Cabrerizo
and it was only observed today. So it is new Hope I could explain it to you well. Ask me for any more questions On Mon, Jan 19, 2015 at 11:45 PM, Alvaro Cabrerizo topor...@gmail.com wrote: At first impression, everything seems ok. Anyway, is the startTimeISO single-value or multivalued field

Re: Newly observed Facets

2015-01-20 Thread Alvaro Cabrerizo
Hi, In case your data looks like: id: 1, userName: one, startTimeISO: 2015-01-20T17:24:32.888Z id: 2, userName: one, startTimeISO: 2015-01-16T17:24:50.208Z id: 3, userName: two, startTimeISO: 2015-01-20T17:25:06.109Z You could use the next query combination q=*:* fq=startTimeISO:[NOW-1DAY

Re: Filter Solr multivalued fields to be able to add pagination

2015-01-20 Thread Alvaro Cabrerizo
Hi, Currently, there is no way to sort by a multi-value field within solr (first the system should sort the content of the field, then sort documents...). Anyway, if you have a clear idea on how the sort should be done try to accomodate your data to your needs (in case it is posible). One option

Re: Ignore whitesapce, underscore using KeywordTokenizer... EdgeNGramFilter

2015-01-21 Thread Alvaro Cabrerizo
Hi, Not sure, but I think that the PatternReplaceFilterFactory or the PatternReplaceCharFilterFactory could help you deleting those characters. Regards. On Jan 21, 2015 7:59 PM, Vishal Swaroop vishal@gmail.com wrote: I am trying to implement type-ahead suggestion for single field which

Re: Ignore whitesapce, underscore using KeywordTokenizer... EdgeNGramFilter

2015-01-21 Thread Alvaro Cabrerizo
:31 PM, Alvaro Cabrerizo topor...@gmail.com wrote: Hi, Not sure, but I think that the PatternReplaceFilterFactory or the PatternReplaceCharFilterFactory could help you deleting those characters. Regards. On Jan 21, 2015 7:59 PM, Vishal Swaroop vishal@gmail.com wrote: I am

Re: How to index data from multiple data source

2015-01-20 Thread Alvaro Cabrerizo
Hi, You can find several examples of configuring tika+dih to index pdf in internet (e.g. https://tuxdna.wordpress.com/2013/02/04/indexing-the-documents-stored-in-a-database-using-apache-solr-and-apache-tika/ ) Regards. On Jan 21, 2015 6:54 AM, Yusniel Hidalgo Delgado yhdelg...@uci.cu wrote:

Re: Reading data from another solr core

2015-01-28 Thread Alvaro Cabrerizo
Hi, I usually use the SolrEntityProcessor for moving/transform data between cores, it's a piece of cake! Regards. On Wed, Jan 28, 2015 at 8:13 AM, solrk koushikga...@gmail.com wrote: Hi Guys, I have multiple cores setup in my solr server. I would like read/import data from one

Re: Trending functionality in Solr

2015-02-09 Thread Alvaro Cabrerizo
Hi, In a project I'm working on, we adopted the same approach as the one commented by Jorge Luis (use a separate core and let the front-end send the query to solr to store it). Depending on the analysis you want to achieve it could be interesting to delete duplicates or not (have 1 solr document

Re: Facet on TopDocs

2015-02-25 Thread Alvaro Cabrerizo
Hi, The facet component works with the whole result set, so you can't get the facets for your topN documents. A naive way you can fulfill your requirement is two implement it in two steps: - Request your data and recover the doc ids. - Create a new query with the selected ids (id:id1 OR

Re: Problem with queries that includes NOT

2015-02-25 Thread Alvaro Cabrerizo
Hi, The edismax parser should be able to manage the query you want to ask. I've made a test and the next both queries give me the right result (see the parenthesis): - {!edismax}(NOT id:7 AND NOT id:8 AND id:9) (gives 1 hit the id:9) - {!edismax}((NOT id:7 AND NOT

Re: Display entire string containing query string

2016-02-18 Thread Alvaro Cabrerizo
Hi, To understand Binoy's answer, please check the: The fl (Field List) Parameter . If you want to include: "*fragments of documents that match the user's query to be included

Re: Hitting complex multilevel pivot queries in solr

2016-02-18 Thread Alvaro Cabrerizo
Hi, The idea of copying fields into a new one (or various) during indexing and then facet the new field (or fields) looks promising. More information about data will be helpful (for example if the fields:country, state.. are single or multivalued). For example if all of the fields are single

Re: Hitting complex multilevel pivot queries in solr

2016-02-19 Thread Alvaro Cabrerizo
concatenate at query time. I just need top 100 counts > from the leaf level of the pivot. > I'm also looking at facet.threads which could give responses to an extent. > But It does not solve my issue. > > Hovewer, the Endeca equivalent of this application seems to be working &

Re: Exact match works only for some of the strings

2017-03-16 Thread Alvaro Cabrerizo
Hello, I've tested on an old solr 4.3 instance and the schema and the field definition are fine. I've also checked that only the query nameExact:"Guardian EU-referendum" gives the result, the other one you have commented (nameExact:"Guardian US") gives 0 hits. Maybe, you forgot to re-index after