[Job] Software Engineer at Polyvore

2010-01-09 Thread Guangwei Yuan
Polyvore (http://www.polyvore.com) is a well-funded startup company backed by Benchmark Capital and Matrix Partners. It is a fast-growing site with over 5 million unique visitors and 120 million pageviews per month. At Polyvore, we strive to build products that delight people. By design, we keep

Recommendation system

2009-11-19 Thread Guangwei Yuan
Hello, We have an interesting and, we guess, common machine learning problem. Polyvore (the company I work for) has a few million products in its index. For each product we have the following information: - keywords extracted from title, description, tags - other meta information like colors,

Re: Disable coord

2009-11-18 Thread Guangwei Yuan
Thanks for your reply. Nested boolean queries are a valid concern. I also realized that isCoordDisabled needs to be considered in BooleanQuery.hashCode so that a query with coord=false will have a different cache key in Solr. On Thu, Nov 12, 2009 at 12:12 PM, Chris Hostetter
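The cache-key concern above can be illustrated with a minimal sketch. This is Python rather than Lucene's Java, and BoolQuery, clauses, and coord_disabled are hypothetical names used only for illustration; it is not Lucene's BooleanQuery. The point it shows is general: if a flag changes scoring, it has to take part in equality and hashing, or a cache keyed by the query will return results computed under the other setting.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BoolQuery:
    """Hypothetical stand-in for a boolean query (not Lucene's class)."""
    clauses: tuple                # e.g. ("color:red", "type:shoe")
    coord_disabled: bool = False  # included in the generated __eq__ and __hash__


# A dict plays the role of the query result cache.
cache = {}
with_coord = BoolQuery(("color:red", "type:shoe"))
without_coord = BoolQuery(("color:red", "type:shoe"), coord_disabled=True)

cache[with_coord] = "results scored with coord"
cache[without_coord] = "results scored without coord"

# Because the flag is part of the key, the two queries get separate entries.
print(len(cache))
```

If coord_disabled were left out of the comparison, the second assignment would overwrite the first entry and both queries would share one cached result.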

Re: BoostedQuery Performance

2009-05-11 Thread Guangwei Yuan
, Guangwei On Sat, May 9, 2009 at 12:31 PM, Guangwei Yuan guy...@gmail.com wrote: Hi, I'm trying the BoostQParserPlugin and FunctionQuery to enable query time boosting. It works better than bf (boost function) because it multiplies the relevancy score by the boosts. However I noticed significant

Re: BoostedQuery Performance

2009-05-11 Thread Guangwei Yuan
Please ignore my posts. Log is quite an expensive operation... On Mon, May 11, 2009 at 11:45 AM, Guangwei Yuan guy...@gmail.com wrote: After spending more time on this, it seems more likely to be a problem with FunctionQuery. If using boost = log(100) takes 100ms, log(log(100)) adds another 100ms

BoostedQuery Performance

2009-05-09 Thread Guangwei Yuan
Hi, I'm trying the BoostQParserPlugin and FunctionQuery to enable query time boosting. It works better than bf (boost function) because it multiplies the relevancy score by the boosts. However I noticed significant performance issues with it. The more functions I use as boosts, the slower it
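The difference between adding a boost to the relevancy score and multiplying by it can be seen in a small sketch. The document ids, scores, popularity counts, and the log10 boost below are made-up values for illustration only; this is plain arithmetic, not Solr code.

```python
import math

# Hypothetical documents: id -> (relevancy score, popularity count).
docs = {"a": (0.2, 1000), "b": (2.0, 10)}


def boost(popularity):
    # Assumed boost function for this sketch: log10 of a popularity count.
    return math.log10(popularity)


def rank(combine):
    """Order document ids by combined score, best first."""
    scored = {doc_id: combine(score, boost(pop))
              for doc_id, (score, pop) in docs.items()}
    return sorted(scored, key=scored.get, reverse=True)


additive_order = rank(lambda score, b: score + b)        # bf-style: score + boost
multiplicative_order = rank(lambda score, b: score * b)  # boost-style: score * boost

print(additive_order, multiplicative_order)
```

With these numbers the additive form lets the popular but barely relevant document "a" overtake "b", while the multiplicative form keeps "b" first because the boost only scales the relevancy it already has.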

Slow performance using MatchAllDocsQuery with filter query

2008-07-01 Thread Guangwei Yuan
Hi, I've noticed some bad performance in faceted browsing, when the query is empty (so the MatchAllDocsQuery is used) and there are only filter queries. An example of the search url is: http://hostname:8080/solr/select/?q=&qt=dismax&fq=color:%2300 One idea is to switch to the StandardRequest

Timeout in remote streaming

2007-11-07 Thread Guangwei Yuan
Hi, I'm sending a local CSV file to Solr via remote streaming, and I constantly get the 500 read timeout message. The CSV file is about 200MB in size, and Solr is running on Tomcat 5.5. Which timeout-related Tomcat parameters can I adjust to fix this? Thanks in advance. - Guangwei

Handling empty query

2007-10-04 Thread Guangwei Yuan
Hi, Does Solr support empty queries? It would be nice if Solr could return all results when q is null. Otherwise, I guess I'll have to write a customized request handler. Any thoughts? Thanks in advance. - Guangwei
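One client-side way to get "all results" for an empty q, without a custom request handler, is to substitute Solr's match-all query syntax, *:*. The sketch below assumes the standard query parser is handling q; the base URL is a placeholder and build_select_url is a hypothetical helper, not part of any Solr client library.

```python
from urllib.parse import urlencode, urlsplit, parse_qs


def build_select_url(base, q, **params):
    """Build a select URL, falling back to the match-all query when q is empty."""
    query = q.strip() if q else ""
    if not query:
        query = "*:*"  # Solr/Lucene syntax for "match all documents"
    return base + "?" + urlencode({"q": query, **params})


# Placeholder host; an empty q is replaced by the match-all query.
url = build_select_url("http://hostname:8080/solr/select", "", rows=10)
print(parse_qs(urlsplit(url).query))
```

For the dismax handler, the q.alt parameter serves a similar purpose on the server side, so the substitution may not need to happen in the client at all.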

Re: Color search

2007-09-29 Thread Guangwei Yuan
can you explain exactly how you are indexing the data and what your query looks like? I used the same field name (color), not 10 different names (c0 - c9). So the index fields look like (50% #00, 20% #99): color: #00 color: #00 color: #00 color: #00 color: #00
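The indexing scheme described above, one repeated color value per 10% of coverage in a single multi-valued field, can be sketched as follows. The function name color_tokens and the choice to round down to whole 10% steps are assumptions made for this illustration; the color values are copied as they appear in the message.

```python
def color_tokens(percentages, step=10):
    """Repeat each color once per full `step` percent it covers.

    Rounding down is an assumption of this sketch; the totals do not
    have to add up to 100.
    """
    tokens = []
    for color, pct in percentages.items():
        tokens.extend([color] * (pct // step))
    return tokens


# 50% of one color and 20% of another, as in the example above.
values = color_tokens({"#00": 50, "#99": 20})
print(values)
```

Each returned value would be sent as one instance of the color field, so a dominant color contributes more occurrences and therefore a higher term frequency for that document.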

Color search

2007-09-28 Thread Guangwei Yuan
Hi, We're running an e-commerce site that provides product search. We've been able to extract colors from product images, and we think it'd be cool and useful to search products by color. A product image can have up to 5 colors (from a color space of about 100 colors), so we can implement it

Re: Color search

2007-09-28 Thread Guangwei Yuan
Thanks for all the replies. I think creating 10 fields and filling one field with a color's value for every 10% that color covers is a reasonable approach, and easy to implement too. One problem, though, is that not all products have a total of 100% colors (due to various reasons including our color