fetched but none was processed when set batchSize to -1

2015-11-08 Thread Yangrui Guo
*Hello* *Indexing since 23m 45s* Requests: 5 (0/s), Fetched: 352,993 (248/s), Skipped: 0, Processed: 0 (0/s) Started: less than a minute ago I tried to index a table with nested structure. I set the parent entity as director and put cacheImpl="SortedMapBackedCache" processor=

SqlEntityProcessor is too unstable

2015-11-09 Thread Yangrui Guo
Hello I've been trying to index IMDB data from MySQL with no success yet. The problem was with the data import handler. When I specify using of "SqlEntityProcessor", DIH either totally skipped the row, or didn't start importing at all, or the results are not searchable. I also tried setting

Re: highlighting on child document

2015-11-08 Thread Yangrui Guo
But how does highlighting work with block join query? Do I need to supply additional parameter? Yangrui On Sun, Nov 8, 2015 at 12:45 PM, Mikhail Khludnev < mkhlud...@griddynamics.com> wrote: > On Thu, Nov 5, 2015 at 12:12 AM, Mikhail Khludnev < > mkhlud...@griddynamics.com> wrote: > > > > >

child document faceting returning empty buckets

2015-11-09 Thread Yangrui Guo
{blockChildren:"content_type:children"} } }' But I got an empty list of buckets from the response. The count number was equivalent to number of parent docs. Is there anything wrong with my query? "facets":{ "count":2412762, "movies":{ "buckets":[]}}} Yangrui Guo

Re: child document faceting returning empty buckets

2015-11-09 Thread Yangrui Guo
Just solved the problem by changing blockChildren:"content_type:children" to blockParent:"content_type:children". Does Solrj support json faceting as well? Yangrui On Mon, Nov 9, 2015 at 2:39 PM, Yangrui Guo <guoyang...@gmail.com> wrote: > Hello > > I follo

Re: Kate Winslet vs Winslet Kate

2015-10-31 Thread Yangrui Guo
AND movie) will return nothing. Yangrui Guo On Saturday, October 31, 2015, Erick Erickson <erickerick...@gmail.com> wrote: > There are a couple of anomalies here. > > 1> kate AND winslet > What does the query look like if you add =true to the statement > and look at the "p

Kate Winslet vs Winslet Kate

2015-10-31 Thread Yangrui Guo
Hi today I found an interesting aspect of solr. I imported IMDB data into solr. The IMDB puts last name before first name for its person's name field eg. "Winslet, Kate". When I search "Winslet Kate" with quotation marks I could get the exact result. However if I search "Kate Winslet" or Kate AND

Re: Kate Winslet vs Winslet Kate

2015-11-01 Thread Yangrui Guo
> Or use edismax to do this kind of thing for you, that's > its purpose. > > Best, > Erick > > On Sun, Nov 1, 2015 at 7:06 AM, Yangrui Guo <guoyang...@gmail.com > <javascript:;>> wrote: > > I debugged the query and found the query has been translated into

Re: Kate Winslet vs Winslet Kate

2015-11-01 Thread Yangrui Guo
I debugged the query and found the query has been translated into _text_:Kate AND _text_:Winslet, which _text_ is the default search field. Because my documents use parent/child relation it appeared that if there's no exact match of Kate Winslet, solr will return all documents contains "Kate" and

Re: Kate Winslet vs Winslet Kate

2015-11-01 Thread Yangrui Guo
I've just read the post and it has addressed much of my issue. It is hard to detect phrases and disambiguate phrases but some existing approaches seem really promising. On Sunday, November 1, 2015, Paul Libbrecht wrote: > Alexandre, > > I guess you are talking about that

How to retrieve single child document with block join

2015-10-31 Thread Yangrui Guo
Hi I want to know if I can get the child document only if it contains the query term. Currently I could retrieve all child document at once with query expansion. Does solr support individual child retrieval? Thanks, Yangrui

Re: Kate Winslet vs Winslet Kate

2015-11-03 Thread Yangrui Guo
are treated as escaped characters except AND & OR. You can say > > Dismax gives some room for phrase querying. eDismax improve something > > of Dismax but it depends on if you need thorse improvement or not. You > can > > see it here: > > > https://cwiki.apache.org/con

Re: data import extremely slow

2015-11-06 Thread Yangrui Guo
Thanks for the reply. I just removed CacheKeyLookUp and CachedKey and used WHERE clause instead. Everything works fine now. Yangrui On Friday, November 6, 2015, Shawn Heisey <apa...@elyograg.org> wrote: > On 11/6/2015 10:32 AM, Yangrui Guo wrote: > > > There's a good chance t

Re: highlighting on child document

2015-11-05 Thread Yangrui Guo
ar you can call > highlighter with children query also passing fq={!child > ..}parent-id:. > > On Wed, Nov 4, 2015 at 7:57 PM, Yangrui Guo <guoyang...@gmail.com > <javascript:;>> wrote: > > > Hi > > > > I want to highlight matched terms on child documents becau

Re: data import extremely slow

2015-11-07 Thread Yangrui Guo
I just realized that not everything was ok. Three child entities were not imported. Had set batchSize to -1 but again solr was stuck :( On Fri, Nov 6, 2015 at 3:11 PM, Yangrui Guo <guoyang...@gmail.com> wrote: > Thanks for the reply. I just removed CacheKeyLookUp and CachedKey and use

Re: data import extremely slow

2015-11-07 Thread Yangrui Guo
, Yangrui Guo <guoyang...@gmail.com> wrote: > I just realized that not everything was ok. Three child entities were not > imported. Had set batchSize to -1 but again solr was stuck :( > > On Fri, Nov 6, 2015 at 3:11 PM, Yangrui Guo <guoyang...@gmail.com > <javascrip

Re: Data import handler not indexing all data

2015-11-07 Thread Yangrui Guo
f you do "select > distinct id from imdb.director" - how many items do you get? > > Regards, >Alex. > > Solr Analyzers, Tokenizers, Filters, URPs and even a newsletter: > http://www.solr-start.com/ > > > On 7 November 2015 at 18:21, Yangrui Guo

Data import handler not indexing all data

2015-11-07 Thread Yangrui Guo
xed 287041 rows. I didn't see any error in the log. Why was this happening? Here's my data-config.xml Yangrui Guo

Re: Data import handler not indexing all data

2015-11-07 Thread Yangrui Guo
at did make it into Solr. Try to push one of those > missing documents explicitly into Solr by either modifying SQL query > in DIH or as CSV or whatever. > > Good luck, >Alex. > > Solr Analyzers, Tokenizers, Filters, URPs and even a newsletter: > http://www.solr

Re: data import extremely slow

2015-11-07 Thread Yangrui Guo
> On Sat, Nov 7, 2015 at 2:22 PM, Alexandre Rafalovitch > > <arafa...@gmail.com <javascript:;>> wrote: > >> Have you thought of just using Solr. Might be faster than > troubleshooting > >> DIH for complex scenarios. > >> On 7 Nov 2015 3:39 pm, "Yang

highlighting on child document

2015-11-04 Thread Yangrui Guo
Hi I want to highlight matched terms on child documents because I need to determine which field matched the search terms. However when I use block join solr returned empty highlight fields. How can I use highlight with nested document? Or is there anyway to tell which field matched the query

data import extremely slow

2015-11-06 Thread Yangrui Guo
Hi I'm using Solr's data import handler and MySQL 5.5 to index imdb database. However the data-import takes a few minutes to process one document while there are over 3 million movies. This is going to take forever yet I can select the rows in MySQL in no time. Where am I doing wrong? My

Solr getting irrelevant results when use block join

2015-10-31 Thread Yangrui Guo
Hi I'm using solr to search imdb database. I set the parent entity to include the name for each actor/actress and child entity for his movies. Because user might either enter a movie or a person I did not specify which entity solr should return. When I just search q=Kate AND Winslet without block

Partial sentence match with block join

2015-12-14 Thread Yangrui Guo
Hello I've been using 5.3.1. I would like to enable this feature: when user enters a query, the results should include documents that also partially match the query. For example, the document is Apple Company and user query is "apple computer company". Though the document is missing the term

Re: Partial sentence match with block join

2015-12-15 Thread Yangrui Guo
, it is "Inc.", not "Company", > so a proper search would be for "Apple Inc." or the old "Apple Computer, > Inc." > > > -- Jack Krupansky > > On Tue, Dec 15, 2015 at 2:35 AM, Yangrui Guo <guoyang...@gmail.com > <javascript:;>> wr

Re: Partial sentence match with block join

2015-12-16 Thread Yangrui Guo
description below. > > Thx! > > On Wed, Dec 16, 2015, at 12:42 AM, Yangrui Guo wrote: > > This will be a very common situation. Amazon and Google now display > > keywords missing in the document. However it seems that Solr parent-child > > structure requires to use &quo

Multiple data-config.xml in one collection?

2016-04-05 Thread Yangrui Guo
to zookeeper for each node in a single collection? Best, Yangrui Guo

Re: Multiple data-config.xml in one collection?

2016-04-05 Thread Yangrui Guo
Hi thanks for the answer. Yes I will be using DIH to import data from different database connections. Do I have to create a collection for each connection? On Tuesday, April 5, 2016, Shawn Heisey <apa...@elyograg.org> wrote: > On 4/5/2016 8:12 AM, Yangrui Guo wrote: > > I'm u

Re: Multiple data-config.xml in one collection?

2016-04-05 Thread Yangrui Guo
;TemplateTransformer"> > >... > > Hope this helps, > > -Dan > > -Original Message- > From: Shawn Heisey [mailto:apa...@elyograg.org <javascript:;>] > Sent: Tuesday, April 05, 2016 10:50 AM > To: solr-user@lucene.apache.org <javascri

Re: Multiple data-config.xml in one collection?

2016-04-06 Thread Yangrui Guo
s: > http://www.solr-start.com/ > > > On 6 April 2016 at 00:12, Yangrui Guo <guoyang...@gmail.com <javascript:;>> > wrote: > > Hello > > > > I'm using Solr Cloud to index a number of databases. The problem is there > > is unknown number of databases

Re: Multiple data-config.xml in one collection?

2016-04-05 Thread Yangrui Guo
Even if > documents having different "source" documents are queried independently > (through filter queries, such as fq:source=health-topics, the frequency of > a word in the entire collection is a factor. > > However, you can query them together, even if you have to

how to retrieve json facet using solrj

2016-04-24 Thread Yangrui Guo
Hello I use json facet api to get facets. The response returned with facets and counts However, when I called the getFacetFields method in SolrJ client, I got null results. How can I get the facet results from solrj? I set my query as query.setParam("json.facet", "{entities : {type: terms,field:

Re: pivoting with json facet api

2016-04-21 Thread Yangrui Guo
me know if it were useful in comments. You could also specify > your problem a bit more if you don't find the answer. > > Cheers, > Alisa > > > > >Четверг, 21 апреля 2016, 1:01 -04:00 от Yangrui Guo <guoyang...@gmail.com > >: > > > >Hi > > > >I

Re: how to restrict phrase to appear in same child document

2016-04-20 Thread Yangrui Guo
t;type"/"category"/"path" fields into your data, so it would look like this: > > { > type:top > id: > { > type:car_color > car: > color: > } > { > type:driver_color > driver: > color: > } > } > > > >Wed,

Re: Child doc facet not getting terms, only counts

2016-04-26 Thread Yangrui Guo
numbers of attributes associated with the products, I used this structure to store the document. My intention is to show facets of the value when an attribute facet is chosen. For example, if you choose "brand" then it'll show "Chanel", "Dior", etc. Is this cu

Child doc facet not getting terms, only counts

2016-04-26 Thread Yangrui Guo
When I use subfaceting with Json API, the facet results only gave me counts, no terms. My query is like this: { apparels : { type: terms, field: brand, facet:{ values:{ type: query, q:\"brand:Chanel\", facet: {

Re: Child doc facet not getting terms, only counts

2016-04-26 Thread Yangrui Guo
I've finally solved this problem. It appears that I do not need to add the line domain: blockChildren: content_type:c in the subfacet. Now I've got my desired results On Tue, Apr 26, 2016 at 3:14 PM, Yangrui Guo <guoyang...@gmail.com> wrote: > The documents are organized in a key-v

how to restrict phrase to appear in same child document

2016-04-19 Thread Yangrui Guo
hello I have a nested document type in my index. Here's the structure of my document: { id: { car: color: } { driver: color: } } However, when I use the query q={!parent which="content_type:parent"}+(black AND driver)={!parent which="content_type:parent"}+(white AND mercedes),

pivoting with json facet api

2016-04-20 Thread Yangrui Guo
Hi I am trying to facet results on my nest documents. The solr document did not say much on how to pivot with json api with nest documents. Could someone show me some examples? Thanks very much. Yangrui

Re: Integrating Stanford NLP or any other NLP for Natural Language Query

2016-07-09 Thread Yangrui Guo
Hi Puneet, I only use Watson's text to speech as user interface, because a lot of people think NLP is the same as voice recognition. If you don't need voice recognition you could remove Watson from it. Stanford has better dependency parsing and can be used offline. However it seems you are using

Re: Integrating Stanford NLP or any other NLP for Natural Language Query

2016-07-10 Thread Yangrui Guo
; this is being done by another team and so I would not be able to comment > further on it. > I would like to know what kind of Solr field you are using for storing the > output from your classes. And also what function output you are putting > into that field. > Thanks > Punee

Re: Integrating Stanford NLP or any other NLP for Natural Language Query

2016-07-07 Thread Yangrui Guo
What is your NLP search like? I have a NLP solution for Solr and just open sourced it. Not sure if it fits your need Yangrui On Thursday, July 7, 2016, Puneet Pawaia wrote: > Hi > > I am currently using Solr 5.5.x to test but can upgrade to Solr 6.x if > required. > I

Re: Integrating Stanford NLP or any other NLP for Natural Language Query

2016-07-07 Thread Yangrui Guo
:;> > www.curvolabs.com > > 58 Adams Ave > Evansville, IN 47713 > > On Thu, Jul 7, 2016 at 4:32 PM, Yangrui Guo <guoyang...@gmail.com > <javascript:;>> wrote: > > > What is your NLP search like? I have a NLP solution for Solr and just > open > > sourc

Re: Integrating Stanford NLP or any other NLP for Natural Language Query

2016-07-07 Thread Yangrui Guo
d search. > A link to your solution would be worth looking at. > Regards > Puneet > > On 8 Jul 2016 03:02, "Yangrui Guo" <guoyang...@gmail.com <javascript:;>> > wrote: > > What is your NLP search like? I have a NLP solution for Solr and just open >

Re: Can Solr find related terms in a document

2016-10-17 Thread Yangrui Guo
ing, looking in a > collection for terms which occur together more often than by chance ? > > > On Sun, Oct 16, 2016 at 3:45 AM, Yangrui Guo <guoyang...@gmail.com > <javascript:;>> wrote: > > > Hello > > > > I'm curious to know if Solr can correlate the occurrences

Re: highlighting on child document

2016-11-17 Thread Yangrui Guo
Thanks. Does Solr plan to add highlighting on children in future? On Thursday, November 17, 2016, vstrugatsky wrote: > It appears that highlighting works for fields in the parent documents only. > https://issues.apache.org/jira/browse/LUCENE-5929 only fixed a bug when >

Can Solr find related terms in a document

2016-10-16 Thread Yangrui Guo
Hello I'm curious to know if Solr can correlate the occurrences of two terms. E.g. if "Bush administration" and "stupid mistake" often appear in the same article, then Solr will think that the two terms are related. Is there a way to achieve this? Yangrui

Re: solr-user-subscribe

2017-07-16 Thread Yangrui Guo
unsubscribe On Friday, July 14, 2017, Naohiko Uramoto wrote: > solr-user-subscribe > > > -- > Naohiko Uramoto >