RE: DIH serialize
Dear Stefan, thank you for your help! Well, I wrote a small script, even if not json, but works: script![CDATA[ function my_serialize(row) { var st = ; st = row.get('stt_id') + || + row.get('stt_name') + || + row.get('stt_date_from') + || + row.get('stt_date_to') + || + row.get('stt_monday') + || + row.get('stt_tuesday') + || + row.get('stt_wednesday') + || + row.get('stt_thursday') + || + row.get('stt_friday') + || + row.get('stt_saturday') + || + row.get('stt_sunday') ; var ret = new java.util.HashMap(); ret.put('main_timetable', st); return ret; } ]]/script regards, Rich -Original Message- From: Stefan Matheis [mailto:matheis.ste...@googlemail.com] Sent: Tuesday, January 25, 2011 11:13 To: solr-user@lucene.apache.org Subject: Re: DIH serialize Rich, i played around for a few minutes with Script-Transformers, but i have not enough knowledge to get anything done right know :/ My Idea was: looping over the given row, which should be a Java HashMap or something like that? and do sth like this (pseudo-code): var row_data = [] for( var key in row ) { row_data.push( '' + key + ' : ' + row[key] + '' ); } row.put( 'whatever_field', '{' + row_data.join( ',' ) + '}' ); Which should result in a json-object like {'key1':'value1', 'key2':'value2'} - and that should be okay to work with? Regards Stefan On Mon, Jan 24, 2011 at 7:53 PM, Papp Richard ccode...@gmail.com wrote: Hi Stefan, yes, this is exactly what I intend - I don't want to search in this field - just quicly return me the result in a serialized form (the search criteria is on other fields). Well, if I could serialize the data exactly as like the PHP serialize() does I would be maximally satisfied, but any other form in which I could compact the data easily into one field I would be pleased. Can anyone help me? I guess the script is quite a good way, but I don't know which function should I use there to compact the data to be easily usable in PHP. Or any other method? thanks, Rich -Original Message- From: Stefan Matheis [mailto:matheis.ste...@googlemail.com] Sent: Monday, January 24, 2011 18:23 To: solr-user@lucene.apache.org Subject: Re: DIH serialize Hi Rich, i'm a bit confused after reading your post .. what exactly you wanna try to achieve? Serializing (like http://php.net/serialize) your complete row into one field? Don't wanna search in them, just store and deliver them in your results? Does that make sense? Sounds a bit strange :) Regards Stefan On Mon, Jan 24, 2011 at 10:03 AM, Papp Richard ccode...@gmail.com wrote: Hi Dennis, thank you for your answer, but didn't understand why you say it doesn't need serialization. I'm with the option C. but the main question is, how to put into one field a result of many fields: SELECT * FROM. thanks, Rich -Original Message- From: Dennis Gearon [mailto:gear...@sbcglobal.net] Sent: Monday, January 24, 2011 02:07 To: solr-user@lucene.apache.org Subject: Re: DIH serialize Depends on your process chain to the eventual viewer/consumer of the data. The questions to ask are: A/ Is the data IN Solr going to be viewed or processed in its original form: --set stored = 'true' ---no serialization needed. B/ If it's going to be anayzed and searched for separate from any other field, the analyzing will put it into an unreadable form. If you need to see it, then ---set indexed=true and stored=true ---no serializaton needed. C/ If it's NOT going to be viewed AS IS, and it's not going to be searched for AS IS, (i.e. other columns will be how the data is found), and you have another, serialzable format: --set indexed=false and stored=true --serialize AS PER THE INTENDED APPLICATION, not sure that Solr can do that at all. C/ If it's NOT going to be viewed AS IS, and it's not going to be searched for AS IS, (i.e. other columns will be how the data is found), and you have another, serialzable format: --set indexed=false and stored=true --serialize AS PER THE INTENDED APPLICATION, not sure that Solr can do that at all. D/ If it's NOT going to be viewed AS IS, BUT it's going to be searched for AS IS, (this column will be how the data is found), and you have another, serialzable format: --you need to put it into TWO columns --A SERIALIZED FIELD --set indexed=false and stored=true --AN UNSERIALIZED FIELD --set indexed=false and stored=true --serialize AS PER THE INTENDED APPLICATION, not sure that Solr can do that at all. Hope that helps! Dennis Gearon Signature Warning
RE: DIH serialize
Hi Dennis, thank you for your answer, but didn't understand why you say it doesn't need serialization. I'm with the option C. but the main question is, how to put into one field a result of many fields: SELECT * FROM. thanks, Rich -Original Message- From: Dennis Gearon [mailto:gear...@sbcglobal.net] Sent: Monday, January 24, 2011 02:07 To: solr-user@lucene.apache.org Subject: Re: DIH serialize Depends on your process chain to the eventual viewer/consumer of the data. The questions to ask are: A/ Is the data IN Solr going to be viewed or processed in its original form: --set stored = 'true' ---no serialization needed. B/ If it's going to be anayzed and searched for separate from any other field, the analyzing will put it into an unreadable form. If you need to see it, then ---set indexed=true and stored=true ---no serializaton needed. C/ If it's NOT going to be viewed AS IS, and it's not going to be searched for AS IS, (i.e. other columns will be how the data is found), and you have another, serialzable format: --set indexed=false and stored=true --serialize AS PER THE INTENDED APPLICATION, not sure that Solr can do that at all. C/ If it's NOT going to be viewed AS IS, and it's not going to be searched for AS IS, (i.e. other columns will be how the data is found), and you have another, serialzable format: --set indexed=false and stored=true --serialize AS PER THE INTENDED APPLICATION, not sure that Solr can do that at all. D/ If it's NOT going to be viewed AS IS, BUT it's going to be searched for AS IS, (this column will be how the data is found), and you have another, serialzable format: --you need to put it into TWO columns --A SERIALIZED FIELD --set indexed=false and stored=true --AN UNSERIALIZED FIELD --set indexed=false and stored=true --serialize AS PER THE INTENDED APPLICATION, not sure that Solr can do that at all. Hope that helps! Dennis Gearon Signature Warning It is always a good idea to learn from your own mistakes. It is usually a better idea to learn from others’ mistakes, so you do not have to make them yourself. from 'http://blogs.techrepublic.com.com/security/?p=4501tag=nl.e036' EARTH has a Right To Life, otherwise we all die. - Original Message From: Papp Richard ccode...@gmail.com To: solr-user@lucene.apache.org Sent: Sun, January 23, 2011 2:02:05 PM Subject: DIH serialize Hi all, I wasted the last few hours trying to serialize some column values (from mysql) into a Solr column, but I just can't find such a function. I'll use the value in PHP - I don't know if it is possible to serialize in PHP style at all. This is what I tried and works with a given factor: in schema.xml: field name=main_timetable type=text indexed=false stored=true multiValued=true / in DIH xml: dataConfig script![CDATA[ function my_serialize(row) { row.put('main_timetable', row.toString()); return row; } ]]/script . entity name=main_timetable query= SELECT * FROM shop_time_table stt WHERE stt.shop_id = '${shop.id}'; transformer=script:my_serialize . Can I use java directly in script (script language=Java) ? How could I achieve this? Or any other idea? I need these values together (from a row) and I need then in PHP to handle the result easily. thanks, Rich __ Information from ESET NOD32 Antivirus, version of virus signature database 5740 (20101228) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com __ Information from ESET NOD32 Antivirus, version of virus signature database 5740 (20101228) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com
RE: DIH serialize
Hi Stefan, yes, this is exactly what I intend - I don't want to search in this field - just quicly return me the result in a serialized form (the search criteria is on other fields). Well, if I could serialize the data exactly as like the PHP serialize() does I would be maximally satisfied, but any other form in which I could compact the data easily into one field I would be pleased. Can anyone help me? I guess the script is quite a good way, but I don't know which function should I use there to compact the data to be easily usable in PHP. Or any other method? thanks, Rich -Original Message- From: Stefan Matheis [mailto:matheis.ste...@googlemail.com] Sent: Monday, January 24, 2011 18:23 To: solr-user@lucene.apache.org Subject: Re: DIH serialize Hi Rich, i'm a bit confused after reading your post .. what exactly you wanna try to achieve? Serializing (like http://php.net/serialize) your complete row into one field? Don't wanna search in them, just store and deliver them in your results? Does that make sense? Sounds a bit strange :) Regards Stefan On Mon, Jan 24, 2011 at 10:03 AM, Papp Richard ccode...@gmail.com wrote: Hi Dennis, thank you for your answer, but didn't understand why you say it doesn't need serialization. I'm with the option C. but the main question is, how to put into one field a result of many fields: SELECT * FROM. thanks, Rich -Original Message- From: Dennis Gearon [mailto:gear...@sbcglobal.net] Sent: Monday, January 24, 2011 02:07 To: solr-user@lucene.apache.org Subject: Re: DIH serialize Depends on your process chain to the eventual viewer/consumer of the data. The questions to ask are: A/ Is the data IN Solr going to be viewed or processed in its original form: --set stored = 'true' ---no serialization needed. B/ If it's going to be anayzed and searched for separate from any other field, the analyzing will put it into an unreadable form. If you need to see it, then ---set indexed=true and stored=true ---no serializaton needed. C/ If it's NOT going to be viewed AS IS, and it's not going to be searched for AS IS, (i.e. other columns will be how the data is found), and you have another, serialzable format: --set indexed=false and stored=true --serialize AS PER THE INTENDED APPLICATION, not sure that Solr can do that at all. C/ If it's NOT going to be viewed AS IS, and it's not going to be searched for AS IS, (i.e. other columns will be how the data is found), and you have another, serialzable format: --set indexed=false and stored=true --serialize AS PER THE INTENDED APPLICATION, not sure that Solr can do that at all. D/ If it's NOT going to be viewed AS IS, BUT it's going to be searched for AS IS, (this column will be how the data is found), and you have another, serialzable format: --you need to put it into TWO columns --A SERIALIZED FIELD --set indexed=false and stored=true --AN UNSERIALIZED FIELD --set indexed=false and stored=true --serialize AS PER THE INTENDED APPLICATION, not sure that Solr can do that at all. Hope that helps! Dennis Gearon Signature Warning It is always a good idea to learn from your own mistakes. It is usually a better idea to learn from others' mistakes, so you do not have to make them yourself. from 'http://blogs.techrepublic.com.com/security/?p=4501tag=nl.e036' EARTH has a Right To Life, otherwise we all die. - Original Message From: Papp Richard ccode...@gmail.com To: solr-user@lucene.apache.org Sent: Sun, January 23, 2011 2:02:05 PM Subject: DIH serialize Hi all, I wasted the last few hours trying to serialize some column values (from mysql) into a Solr column, but I just can't find such a function. I'll use the value in PHP - I don't know if it is possible to serialize in PHP style at all. This is what I tried and works with a given factor: in schema.xml: field name=main_timetable type=text indexed=false stored=true multiValued=true / in DIH xml: dataConfig script![CDATA[ function my_serialize(row) { row.put('main_timetable', row.toString()); return row; } ]]/script . entity name=main_timetable query= SELECT * FROM shop_time_table stt WHERE stt.shop_id = '${shop.id }'; transformer=script:my_serialize . Can I use java directly in script (script language=Java) ? How could I achieve this? Or any other idea? I need these values together (from a row) and I need then in PHP to handle the result easily. thanks, Rich __ Information from ESET NOD32 Antivirus, version of virus signature database 5740 (20101228) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com __ Information
DIH serialize
Hi all, I wasted the last few hours trying to serialize some column values (from mysql) into a Solr column, but I just can't find such a function. I'll use the value in PHP - I don't know if it is possible to serialize in PHP style at all. This is what I tried and works with a given factor: in schema.xml: field name=main_timetable type=text indexed=false stored=true multiValued=true / in DIH xml: dataConfig script![CDATA[ function my_serialize(row) { row.put('main_timetable', row.toString()); return row; } ]]/script . entity name=main_timetable query= SELECT * FROM shop_time_table stt WHERE stt.shop_id = '${shop.id}'; transformer=script:my_serialize . Can I use java directly in script (script language=Java) ? How could I achieve this? Or any other idea? I need these values together (from a row) and I need then in PHP to handle the result easily. thanks, Rich
solr admin
Hello, is there any way to specify in the solr admin other than fields? and I'm nt talking about the full interface which is also very limited. like: score, fl, fq, ... and yes, I know that I can use the url... which indeed is not too handy. thanks, Rich __ Information from ESET NOD32 Antivirus, version of virus signature database 5659 (20101129) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com
special sorting
Hello, I have many pages with the same content in the search result (the result is the same for some of the cities from the same county)... which means that I have duplicate content. the filter query is something like: +locationId:(60 26a 39a) - for city with ID 60 and I get the same result for city with ID 62: +locationId:(62 26a 39a) (cityID, countyID, countryID) how could I use a sorting to have different docs order in results for different cities? (for the same city I need to have the same sort order always - it cannot be a simple random...) could I use somehow the cityID parameter as boost or score ? I tried but could't realise too much. thanks, Rich __ Information from ESET NOD32 Antivirus, version of virus signature database 5659 (20101129) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com
RE: solr admin
in Solr admin (http://localhost:8180/services/admin/) I can specify something like: +category_id:200 +xxx:300 but how can I specify a sort option? sort:category_id+asc regards, Rich -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Monday, November 29, 2010 22:00 To: solr-user@lucene.apache.org Subject: Re: solr admin I honestly don't understand what you're asking here. Specify what in solr admin other than fields? what is it you're trying to accomplish? Best Erick On Mon, Nov 29, 2010 at 2:56 PM, Papp Richard ccode...@gmail.com wrote: Hello, is there any way to specify in the solr admin other than fields? and I'm nt talking about the full interface which is also very limited. like: score, fl, fq, ... and yes, I know that I can use the url... which indeed is not too handy. thanks, Rich __ Information from ESET NOD32 Antivirus, version of virus signature database 5659 (20101129) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com __ Information from ESET NOD32 Antivirus, version of virus signature database 5659 (20101129) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com __ Information from ESET NOD32 Antivirus, version of virus signature database 5659 (20101129) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com
RE: special sorting
Hmm, any clue how to use it? use the location_id somehow? thanks, Rich -Original Message- From: Tommaso Teofili [mailto:tommaso.teof...@gmail.com] Sent: Monday, November 29, 2010 22:08 To: solr-user@lucene.apache.org Subject: Re: special sorting Perhaps, depending on your domain logic you could use function queries to achieve that. http://wiki.apache.org/solr/FunctionQuery#Sort_By_Function Regards, Tommaso 2010/11/29 Papp Richard ccode...@gmail.com Hello, I have many pages with the same content in the search result (the result is the same for some of the cities from the same county)... which means that I have duplicate content. the filter query is something like: +locationId:(60 26a 39a) - for city with ID 60 and I get the same result for city with ID 62: +locationId:(62 26a 39a) (cityID, countyID, countryID) how could I use a sorting to have different docs order in results for different cities? (for the same city I need to have the same sort order always - it cannot be a simple random...) could I use somehow the cityID parameter as boost or score ? I tried but could't realise too much. thanks, Rich __ Information from ESET NOD32 Antivirus, version of virus signature database 5659 (20101129) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com __ Information from ESET NOD32 Antivirus, version of virus signature database 5659 (20101129) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com __ Information from ESET NOD32 Antivirus, version of virus signature database 5659 (20101129) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com
RE: solr 4.0 - pagination
Dear Yonik, this is fantastic, but can you tell any time it will be ready ? I would need this feature in two weeks. Is it possible to finish and make an update in this time or should I look for another solution cocerning the pgaination (like implement just more results link instead of pagination) ? best regards, Rich -Original Message- From: ysee...@gmail.com [mailto:ysee...@gmail.com] On Behalf Of Yonik Seeley Sent: Saturday, October 30, 2010 19:29 To: solr-user@lucene.apache.org Subject: Re: solr 4.0 - pagination On Sat, Oct 30, 2010 at 12:22 PM, Papp Richard ccode...@gmail.com wrote: I'm using Solr 4.0 with grouping (field collapsing), but unfortunately I can't solve the pagination. It's not implemented yet, but I'm working on that right now. -Yonik http://www.lucidimagination.com __ Information from ESET NOD32 Antivirus, version of virus signature database 5576 (20101029) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com __ Information from ESET NOD32 Antivirus, version of virus signature database 5598 (20101107) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com
RE: solr 4.0 - pagination
thank you very much Yonik! you are a magician! regards, Rich -Original Message- From: ysee...@gmail.com [mailto:ysee...@gmail.com] On Behalf Of Yonik Seeley Sent: Sunday, November 07, 2010 18:04 To: Papp Richard Cc: solr-user@lucene.apache.org Subject: Re: solr 4.0 - pagination On Sun, Nov 7, 2010 at 10:55 AM, Papp Richard ccode...@gmail.com wrote: this is fantastic, but can you tell any time it will be ready ? It already is ;-) Grab the latest trunk or the latest nightly build. -Yonik http://www.lucidimagination.com __ Information from ESET NOD32 Antivirus, version of virus signature database 5598 (20101107) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com __ Information from ESET NOD32 Antivirus, version of virus signature database 5598 (20101107) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com
RE: solr 4.0 - pagination
Hi Yonik, I've just tried the latest stable version from nightly build: apache-solr-4.0-2010-11-05_08-06-28.war I have some concerns however: I have 3 documents; 2 in the first group, 1 in the 2nd group. 1. I got for matches 3 - which is good, but I still don't know how many groups I have. (using start = 0, rows = 10) 2. as far as I see the start / rows is working now, but the matches is returned incorrectly = it said matches = 3 instead of = 1, when I used start = 1, rows = 1 so can you help me, how to compute how many pages I'll have, because the matches can't use for this. regards, Rich -Original Message- From: ysee...@gmail.com [mailto:ysee...@gmail.com] On Behalf Of Yonik Seeley Sent: Sunday, November 07, 2010 18:04 To: Papp Richard Cc: solr-user@lucene.apache.org Subject: Re: solr 4.0 - pagination On Sun, Nov 7, 2010 at 10:55 AM, Papp Richard ccode...@gmail.com wrote: this is fantastic, but can you tell any time it will be ready ? It already is ;-) Grab the latest trunk or the latest nightly build. -Yonik http://www.lucidimagination.com __ Information from ESET NOD32 Antivirus, version of virus signature database 5598 (20101107) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com __ Information from ESET NOD32 Antivirus, version of virus signature database 5599 (20101107) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com
RE: solr 4.0 - pagination
Hey Yonik, Sorry, I think the matches is ok - because it probably returns always the total document number - however I don't know how to compute the number of pages. thanks, Rich -Original Message- From: ysee...@gmail.com [mailto:ysee...@gmail.com] On Behalf Of Yonik Seeley Sent: Sunday, November 07, 2010 18:04 To: Papp Richard Cc: solr-user@lucene.apache.org Subject: Re: solr 4.0 - pagination On Sun, Nov 7, 2010 at 10:55 AM, Papp Richard ccode...@gmail.com wrote: this is fantastic, but can you tell any time it will be ready ? It already is ;-) Grab the latest trunk or the latest nightly build. -Yonik http://www.lucidimagination.com __ Information from ESET NOD32 Antivirus, version of virus signature database 5598 (20101107) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com __ Information from ESET NOD32 Antivirus, version of virus signature database 5599 (20101107) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com
RE: solr 4.0 - pagination
I see. Let's assume that there are 1000 groups. Can I use safely (with no negative impact on memory usage or slowness) the start = 990, rows = 10 to get the latest page? Or this will not work, due you will need to compute all the groups till 1000, in order to return the last 10, and because of this the whole will be slow / memory usage will increase considerably. regards, Rich -Original Message- From: ysee...@gmail.com [mailto:ysee...@gmail.com] On Behalf Of Yonik Seeley Sent: Sunday, November 07, 2010 21:54 To: Papp Richard Cc: solr-user@lucene.apache.org Subject: Re: solr 4.0 - pagination On Sun, Nov 7, 2010 at 2:45 PM, Papp Richard ccode...@gmail.com wrote: Hi Yonik, I've just tried the latest stable version from nightly build: apache-solr-4.0-2010-11-05_08-06-28.war I have some concerns however: I have 3 documents; 2 in the first group, 1 in the 2nd group. 1. I got for matches 3 - which is good, but I still don't know how many groups I have. (using start = 0, rows = 10) 2. as far as I see the start / rows is working now, but the matches is returned incorrectly = it said matches = 3 instead of = 1, when I used start = 1, rows = 1 matches is the number of documents before grouping, so start/rows or group.offset/group.limit will not affect this number. so can you help me, how to compute how many pages I'll have, because the matches can't use for this. Solr doesn't even know given the current algorithm, hence it can't return that info. The issue is that to calculate the total number of groups, we would need to keep each group in memory (which could cause a big blowup if there are tons of groups). The current algorithm only keeps the top 10 groups (assuming rows=10) in memory at any one time, hence it has no idea what the total number of groups is. -Yonik http://www.lucidimagination.com __ Information from ESET NOD32 Antivirus, version of virus signature database 5599 (20101107) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com __ Information from ESET NOD32 Antivirus, version of virus signature database 5599 (20101107) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com
solr 4.0 - pagination
Hi all, I'm using Solr 4.0 with grouping (field collapsing), but unfortunately I can't solve the pagination. Mainly there are two problems: - the query fields start rows doesn't work anymore - beside of the values, it always returns the data as the start would be 0 (start = 0) - the result contains just the total document number and not the total groups number Can anyone help me, how to solve this? regards, Rich
RE: solr 4.0 - pagination
Can you estimate please when it will be done? thanks, Rich -Original Message- From: ysee...@gmail.com [mailto:ysee...@gmail.com] On Behalf Of Yonik Seeley Sent: Saturday, October 30, 2010 19:29 To: solr-user@lucene.apache.org Subject: Re: solr 4.0 - pagination On Sat, Oct 30, 2010 at 12:22 PM, Papp Richard ccode...@gmail.com wrote: I'm using Solr 4.0 with grouping (field collapsing), but unfortunately I can't solve the pagination. It's not implemented yet, but I'm working on that right now. -Yonik http://www.lucidimagination.com __ Information from ESET NOD32 Antivirus, version of virus signature database 5576 (20101029) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com __ Information from ESET NOD32 Antivirus, version of virus signature database 5576 (20101029) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com
RE: Grouping in solr ?
I'm really sorry - thank you for the note. -Original Message- From: Chris Hostetter [mailto:hossman_luc...@fucit.org] Sent: Tuesday, September 28, 2010 05:12 To: solr-user@lucene.apache.org Subject: Re: Grouping in solr ? : References: : abcc5d9ce0798544a169c584b8f1447d230313c...@exchange01.toolbox.local : In-Reply-To: : abcc5d9ce0798544a169c584b8f1447d230313c...@exchange01.toolbox.local : Subject: Grouping in solr ? http://people.apache.org/~hossman/#threadhijack Thread Hijacking on Mailing Lists When starting a new discussion on a mailing list, please do not reply to an existing message, instead start a fresh email. Even if you change the subject line of your email, other mail headers still track which thread you replied to and your question is hidden in that thread and gets less attention. It makes following discussions in the mailing list archives particularly difficult. See Also: http://en.wikipedia.org/wiki/User:DonDiego/Thread_hijacking -Hoss -- http://lucenerevolution.org/ ... October 7-8, Boston http://bit.ly/stump-hoss ... Stump The Chump! __ Information from ESET NOD32 Antivirus, version of virus signature database 5419 (20100902) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com __ Information from ESET NOD32 Antivirus, version of virus signature database 5419 (20100902) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com
Grouping in solr ?
Hi all, is it possible somehow to group documents? I have services as documents, and I would like to show the filtered services grouped by company. So I filter services by given criteria, but I show the results grouped by companay. If I got 1000 services, maybe I need to show just 100 companies (this will affect pagination as well), and how could I get the company info? Should I store the company info in each service (I don't need the compnany info to be indexed) ? regards, Rich __ Information from ESET NOD32 Antivirus, version of virus signature database 5419 (20100902) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com
RE: Grouping in solr ?
thank you! this is really helpful. just tried it and it's amazing. do you know, how trustable is a nightly built version (solr4) ? Rich -Original Message- From: Markus Jelsma [mailto:markus.jel...@buyways.nl] Sent: Thursday, September 23, 2010 22:38 To: solr-user@lucene.apache.org Subject: RE: Grouping in solr ? http://wiki.apache.org/solr/FieldCollapsing https://issues.apache.org/jira/browse/SOLR-236 -Original message- From: Papp Richard ccode...@gmail.com Sent: Thu 23-09-2010 21:29 To: solr-user@lucene.apache.org; Subject: Grouping in solr ? Hi all, is it possible somehow to group documents? I have services as documents, and I would like to show the filtered services grouped by company. So I filter services by given criteria, but I show the results grouped by companay. If I got 1000 services, maybe I need to show just 100 companies (this will affect pagination as well), and how could I get the company info? Should I store the company info in each service (I don't need the compnany info to be indexed) ? regards, Rich __ Information from ESET NOD32 Antivirus, version of virus signature database 5419 (20100902) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com __ Information from ESET NOD32 Antivirus, version of virus signature database 5419 (20100902) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com __ Information from ESET NOD32 Antivirus, version of virus signature database 5419 (20100902) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com
tricky range query?
Hi all, shortly my problem: I want to filter services based on timetables, let's consider the next timetable for a day: on the date of 15.10.2010: 10:00 - 11:00 12:00 - 12:30 14:30 - 16:00 17:00 - 20:00 how could i store the timetable in Solr and efficiently search in it (let's say filter those timetables which has an availability at 15:00) ? not to mention, that each service has a duration (so, if the service takes 90 mins, filtering by 15:00 shouldn't return the previous timetable, because there is not enough free time (just 60 mins in the above example)) how to solve this? any hints? regards, Rich
RE: tricky range query?
Hi Erik, first of all, thank you for your answer. Let me detail a bit the amount of data: - actually services going to persons, and the time table is per person (a person can have multiple services). - there will be around 10.000 person (or maybe 100.000 - I would like to say rather 100.000 than have problems later) - but time table can differ from week to week, so each person has many time table (one for each week) = so this means that if they have the timetables for ~3 months (12 weeks)... 100.000 x 12 ~ 1.000.000 timetabels... and each time table has 7 days... and on each day we have many periods (as someone books a service, the timetbale will be modified, and possible will result in time gaps, like I show in the example)... so all in all there are too many data, is it? - I've checkte the trie, but couldn't find too much info. I don't know if it could be a solution to us e it or not - I'm not a solr expert. regards, Rich -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Tuesday, September 21, 2010 14:40 To: solr-user@lucene.apache.org Subject: Re: tricky range query? How efficient it would be I don't know, but depending on how many services you're talking here, efficiency may not be that big of a deal... But storing each interval as its own record along with a duration should work. You could then form a query like duration:[90 to *] AND start_time:[* to 1500] AND end_time:[1500 TO *]. I'm not sure I'd want that kind of query on a gigabyte of records... But without knowing some more details, it's impossible to say whether this would be at all suitable... Best Erick On Tue, Sep 21, 2010 at 4:41 AM, Papp Richard ccode...@gmail.com wrote: Hi all, shortly my problem: I want to filter services based on timetables, let's consider the next timetable for a day: on the date of 15.10.2010: 10:00 - 11:00 12:00 - 12:30 14:30 - 16:00 17:00 - 20:00 how could i store the timetable in Solr and efficiently search in it (let's say filter those timetables which has an availability at 15:00) ? not to mention, that each service has a duration (so, if the service takes 90 mins, filtering by 15:00 shouldn't return the previous timetable, because there is not enough free time (just 60 mins in the above example)) how to solve this? any hints? regards, Rich __ Information from ESET NOD32 Antivirus, version of virus signature database 5419 (20100902) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com __ Information from ESET NOD32 Antivirus, version of virus signature database 5419 (20100902) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com
RE: tricky range query?
Hi Erick, don't really understand your question and what exactly the point is, but anyway. yes - there is a DB where data are stored, however the scheduling is just a part of the whole picture. I thought to use Solr for search / filtering results - the schedule (availability) is just one filter from the whole search process. Does it make sense for you? May I ask if you are a Solr specialist? I don't know how serious I have to take in account your answers. thank you, Rich -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Tuesday, September 21, 2010 20:36 To: solr-user@lucene.apache.org Subject: Re: tricky range query? So it sounds like you're working on some kind of scheduling app? Which makes me wonder why you're using SOLR. Much as I like it, this sounds more like a database application than a search application. What am I missing? Best Erick On Tue, Sep 21, 2010 at 1:05 PM, Papp Richard ccode...@gmail.com wrote: Hi Erik, first of all, thank you for your answer. Let me detail a bit the amount of data: - actually services going to persons, and the time table is per person (a person can have multiple services). - there will be around 10.000 person (or maybe 100.000 - I would like to say rather 100.000 than have problems later) - but time table can differ from week to week, so each person has many time table (one for each week) = so this means that if they have the timetables for ~3 months (12 weeks)... 100.000 x 12 ~ 1.000.000 timetabels... and each time table has 7 days... and on each day we have many periods (as someone books a service, the timetbale will be modified, and possible will result in time gaps, like I show in the example)... so all in all there are too many data, is it? - I've checkte the trie, but couldn't find too much info. I don't know if it could be a solution to us e it or not - I'm not a solr expert. regards, Rich -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Tuesday, September 21, 2010 14:40 To: solr-user@lucene.apache.org Subject: Re: tricky range query? How efficient it would be I don't know, but depending on how many services you're talking here, efficiency may not be that big of a deal... But storing each interval as its own record along with a duration should work. You could then form a query like duration:[90 to *] AND start_time:[* to 1500] AND end_time:[1500 TO *]. I'm not sure I'd want that kind of query on a gigabyte of records... But without knowing some more details, it's impossible to say whether this would be at all suitable... Best Erick On Tue, Sep 21, 2010 at 4:41 AM, Papp Richard ccode...@gmail.com wrote: Hi all, shortly my problem: I want to filter services based on timetables, let's consider the next timetable for a day: on the date of 15.10.2010: 10:00 - 11:00 12:00 - 12:30 14:30 - 16:00 17:00 - 20:00 how could i store the timetable in Solr and efficiently search in it (let's say filter those timetables which has an availability at 15:00) ? not to mention, that each service has a duration (so, if the service takes 90 mins, filtering by 15:00 shouldn't return the previous timetable, because there is not enough free time (just 60 mins in the above example)) how to solve this? any hints? regards, Rich __ Information from ESET NOD32 Antivirus, version of virus signature database 5419 (20100902) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com __ Information from ESET NOD32 Antivirus, version of virus signature database 5419 (20100902) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com __ Information from ESET NOD32 Antivirus, version of virus signature database 5419 (20100902) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com __ Information from ESET NOD32 Antivirus, version of virus signature database 5419 (20100902) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com
RE: trie
thank you guys for the answers... now I have to check / read some docs ;) Rich -Original Message- From: Simon Willnauer [mailto:simon.willna...@googlemail.com] Sent: Tuesday, September 21, 2010 23:00 To: solr-user@lucene.apache.org Subject: Re: trie 2010/9/21 Péter Király kirun...@gmail.com: You can read about it in Lucene in Action second edition. have a look at http://www.lucidimagination.com/developer/whitepaper/Whats-New-in-Apache-Lucene-3-0 page 4 to 8 should give you a good intro to the topic simon Péter 2010/9/21 Papp Richard ccode...@gmail.com: is there any good tutorial how to use and what is trie? what I found on the net is really blurry. rgeards, Rich __ Information from ESET NOD32 Antivirus, version of virus signature database 5419 (20100902) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com __ Information from ESET NOD32 Antivirus, version of virus signature database 5419 (20100902) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com __ Information from ESET NOD32 Antivirus, version of virus signature database 5419 (20100902) __ The message was checked by ESET NOD32 Antivirus. http://www.eset.com