RE: DIH serialize

2011-01-25 Thread Papp Richard
Dear Stefan,

  thank you for your help! 
  Well, I wrote a small script, even if not json, but works:

  script![CDATA[
function my_serialize(row)
{
  var st = ;
  
  st = row.get('stt_id') + || +
row.get('stt_name') + || +
row.get('stt_date_from') + || +
row.get('stt_date_to') + || +
row.get('stt_monday') + || +
row.get('stt_tuesday') + || +
row.get('stt_wednesday') + || +
row.get('stt_thursday') + || +
row.get('stt_friday') + || +
row.get('stt_saturday') + || +
row.get('stt_sunday') ;

  var ret = new java.util.HashMap();
  ret.put('main_timetable', st);
  
  return ret;
}
  ]]/script

regards,
  Rich

-Original Message-
From: Stefan Matheis [mailto:matheis.ste...@googlemail.com] 
Sent: Tuesday, January 25, 2011 11:13
To: solr-user@lucene.apache.org
Subject: Re: DIH serialize

Rich,

i played around for a few minutes with Script-Transformers, but i have not
enough knowledge to get anything done right know :/
My Idea was: looping over the given row, which should be a Java HashMap or
something like that? and do sth like this (pseudo-code):

var row_data = []
for( var key in row )
{
  row_data.push( '' + key + ' : ' + row[key] + '' );
}
row.put( 'whatever_field', '{' + row_data.join( ',' ) + '}' );

Which should result in a json-object like {'key1':'value1', 'key2':'value2'}
- and that should be okay to work with?

Regards
Stefan

On Mon, Jan 24, 2011 at 7:53 PM, Papp Richard ccode...@gmail.com wrote:

 Hi Stefan,

  yes, this is exactly what I intend - I don't want to search in this field
 - just quicly return me the result in a serialized form (the search
 criteria
 is on other fields). Well, if I could serialize the data exactly as like
 the
 PHP serialize() does I would be maximally satisfied, but any other form in
 which I could compact the data easily into one field I would be pleased.
  Can anyone help me? I guess the script is quite a good way, but I don't
 know which function should I use there to compact the data to be easily
 usable in PHP. Or any other method?

 thanks,
  Rich

 -Original Message-
 From: Stefan Matheis [mailto:matheis.ste...@googlemail.com]
 Sent: Monday, January 24, 2011 18:23
 To: solr-user@lucene.apache.org
 Subject: Re: DIH serialize

 Hi Rich,

 i'm a bit confused after reading your post .. what exactly you wanna try
to
 achieve? Serializing (like http://php.net/serialize) your complete row
 into
 one field? Don't wanna search in them, just store and deliver them in your
 results? Does that make sense? Sounds a bit strange :)

 Regards
 Stefan

 On Mon, Jan 24, 2011 at 10:03 AM, Papp Richard ccode...@gmail.com wrote:

  Hi Dennis,
 
   thank you for your answer, but didn't understand why you say it doesn't
  need serialization. I'm with the option C.
   but the main question is, how to put into one field a result of many
  fields: SELECT * FROM.
 
  thanks,
   Rich
 
  -Original Message-
  From: Dennis Gearon [mailto:gear...@sbcglobal.net]
  Sent: Monday, January 24, 2011 02:07
  To: solr-user@lucene.apache.org
  Subject: Re: DIH serialize
 
  Depends on your process chain to the eventual viewer/consumer of the
 data.
 
  The questions to ask are:
   A/ Is the data IN Solr going to be viewed or processed in its original
  form:
   --set stored = 'true'
  ---no serialization needed.
   B/ If it's going to be anayzed and searched for separate from any other
  field,
 
   the analyzing will put it into  an unreadable form. If you need to
 see
  it,
  then
  ---set indexed=true and stored=true
  ---no serializaton needed.   C/ If it's NOT going to be viewed AS
 IS,
  and
  it's not going to be searched for AS IS,
(i.e. other columns will be how the data is found), and you have
  another,
 
serialzable format:
--set indexed=false and stored=true
--serialize AS PER THE INTENDED APPLICATION,
not sure that Solr can do that at all.
   C/ If it's NOT going to be viewed AS IS, and it's not going to be
 searched
  for
  AS IS,
(i.e. other columns will be how the data is found), and you have
  another,
 
serialzable format:
--set indexed=false and stored=true
--serialize AS PER THE INTENDED APPLICATION,
not sure that Solr can do that at all.
   D/ If it's NOT going to be viewed AS IS, BUT it's going to be searched
 for
  AS
  IS,
(this column will be how the data is found), and you have another,
serialzable format:
--you need to put it into TWO columns
--A SERIALIZED FIELD
--set indexed=false and stored=true
 
   --AN UNSERIALIZED FIELD
--set indexed=false and stored=true
--serialize AS PER THE INTENDED APPLICATION,
not sure that Solr can do that at all.
 
  Hope that helps!
 
 
  Dennis Gearon
 
 
  Signature Warning

RE: DIH serialize

2011-01-24 Thread Papp Richard
Hi Dennis,

  thank you for your answer, but didn't understand why you say it doesn't need 
serialization. I'm with the option C.
  but the main question is, how to put into one field a result of many fields: 
SELECT * FROM.

thanks,
  Rich

-Original Message-
From: Dennis Gearon [mailto:gear...@sbcglobal.net] 
Sent: Monday, January 24, 2011 02:07
To: solr-user@lucene.apache.org
Subject: Re: DIH serialize

Depends on your process chain to the eventual viewer/consumer of the data.

The questions to ask are:
  A/ Is the data IN Solr going to be viewed or processed in its original form:
  --set stored = 'true'
 ---no serialization needed.
  B/ If it's going to be anayzed and searched for separate from any other 
field, 

  the analyzing will put it into  an unreadable form. If you need to see 
it, 
then
 ---set indexed=true and stored=true
 ---no serializaton needed.   C/ If it's NOT going to be viewed AS IS, and 
it's not going to be searched for AS IS,
   (i.e. other columns will be how the data is found), and you have 
another, 

   serialzable format:
   --set indexed=false and stored=true
   --serialize AS PER THE INTENDED APPLICATION,
   not sure that Solr can do that at all.
  C/ If it's NOT going to be viewed AS IS, and it's not going to be searched 
for 
AS IS,
   (i.e. other columns will be how the data is found), and you have 
another, 

   serialzable format:
   --set indexed=false and stored=true
   --serialize AS PER THE INTENDED APPLICATION,
   not sure that Solr can do that at all.
  D/ If it's NOT going to be viewed AS IS, BUT it's going to be searched for AS 
IS,
   (this column will be how the data is found), and you have another, 
   serialzable format:
   --you need to put it into TWO columns
   --A SERIALIZED FIELD
   --set indexed=false and stored=true

  --AN UNSERIALIZED FIELD
   --set indexed=false and stored=true
   --serialize AS PER THE INTENDED APPLICATION,
   not sure that Solr can do that at all.

Hope that helps!


Dennis Gearon


Signature Warning

It is always a good idea to learn from your own mistakes. It is usually a 
better 
idea to learn from others’ mistakes, so you do not have to make them yourself. 
from 'http://blogs.techrepublic.com.com/security/?p=4501tag=nl.e036'


EARTH has a Right To Life,
otherwise we all die.



- Original Message 
From: Papp Richard ccode...@gmail.com
To: solr-user@lucene.apache.org
Sent: Sun, January 23, 2011 2:02:05 PM
Subject: DIH serialize

Hi all,



  I wasted the last few hours trying to serialize some column values (from
mysql) into a Solr column, but I just can't find such a function. I'll use
the value in PHP - I don't know if it is possible to serialize in PHP style
at all. This is what I tried and works with a given factor:



in schema.xml:

   field name=main_timetable  type=text indexed=false
stored=true multiValued=true /



in DIH xml:



dataConfig

  script![CDATA[

function my_serialize(row)

{

  row.put('main_timetable', row.toString());

  return row;

}

  ]]/script



.



  entity name=main_timetable query=

SELECT * FROM shop_time_table stt WHERE stt.shop_id = '${shop.id}';

transformer=script:my_serialize



.

 



  Can I use java directly in script (script language=Java) ?

  How could I achieve this? Or any other idea? 

  I need these values together (from a row) and I need then in PHP to handle
the result easily.



thanks,

  Rich
 

__ Information from ESET NOD32 Antivirus, version of virus signature 
database 5740 (20101228) __

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com
 
 

__ Information from ESET NOD32 Antivirus, version of virus signature 
database 5740 (20101228) __

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com
 



RE: DIH serialize

2011-01-24 Thread Papp Richard
Hi Stefan,

  yes, this is exactly what I intend - I don't want to search in this field
- just quicly return me the result in a serialized form (the search criteria
is on other fields). Well, if I could serialize the data exactly as like the
PHP serialize() does I would be maximally satisfied, but any other form in
which I could compact the data easily into one field I would be pleased.
  Can anyone help me? I guess the script is quite a good way, but I don't
know which function should I use there to compact the data to be easily
usable in PHP. Or any other method?

thanks,
  Rich

-Original Message-
From: Stefan Matheis [mailto:matheis.ste...@googlemail.com] 
Sent: Monday, January 24, 2011 18:23
To: solr-user@lucene.apache.org
Subject: Re: DIH serialize

Hi Rich,

i'm a bit confused after reading your post .. what exactly you wanna try to
achieve? Serializing (like http://php.net/serialize) your complete row into
one field? Don't wanna search in them, just store and deliver them in your
results? Does that make sense? Sounds a bit strange :)

Regards
Stefan

On Mon, Jan 24, 2011 at 10:03 AM, Papp Richard ccode...@gmail.com wrote:

 Hi Dennis,

  thank you for your answer, but didn't understand why you say it doesn't
 need serialization. I'm with the option C.
  but the main question is, how to put into one field a result of many
 fields: SELECT * FROM.

 thanks,
  Rich

 -Original Message-
 From: Dennis Gearon [mailto:gear...@sbcglobal.net]
 Sent: Monday, January 24, 2011 02:07
 To: solr-user@lucene.apache.org
 Subject: Re: DIH serialize

 Depends on your process chain to the eventual viewer/consumer of the data.

 The questions to ask are:
  A/ Is the data IN Solr going to be viewed or processed in its original
 form:
  --set stored = 'true'
 ---no serialization needed.
  B/ If it's going to be anayzed and searched for separate from any other
 field,

  the analyzing will put it into  an unreadable form. If you need to
see
 it,
 then
 ---set indexed=true and stored=true
 ---no serializaton needed.   C/ If it's NOT going to be viewed AS IS,
 and
 it's not going to be searched for AS IS,
   (i.e. other columns will be how the data is found), and you have
 another,

   serialzable format:
   --set indexed=false and stored=true
   --serialize AS PER THE INTENDED APPLICATION,
   not sure that Solr can do that at all.
  C/ If it's NOT going to be viewed AS IS, and it's not going to be
searched
 for
 AS IS,
   (i.e. other columns will be how the data is found), and you have
 another,

   serialzable format:
   --set indexed=false and stored=true
   --serialize AS PER THE INTENDED APPLICATION,
   not sure that Solr can do that at all.
  D/ If it's NOT going to be viewed AS IS, BUT it's going to be searched
for
 AS
 IS,
   (this column will be how the data is found), and you have another,
   serialzable format:
   --you need to put it into TWO columns
   --A SERIALIZED FIELD
   --set indexed=false and stored=true

  --AN UNSERIALIZED FIELD
   --set indexed=false and stored=true
   --serialize AS PER THE INTENDED APPLICATION,
   not sure that Solr can do that at all.

 Hope that helps!


 Dennis Gearon


 Signature Warning
 
 It is always a good idea to learn from your own mistakes. It is usually a
 better
 idea to learn from others' mistakes, so you do not have to make them
 yourself.
 from 'http://blogs.techrepublic.com.com/security/?p=4501tag=nl.e036'


 EARTH has a Right To Life,
 otherwise we all die.



 - Original Message 
 From: Papp Richard ccode...@gmail.com
 To: solr-user@lucene.apache.org
 Sent: Sun, January 23, 2011 2:02:05 PM
 Subject: DIH serialize

 Hi all,



  I wasted the last few hours trying to serialize some column values (from
 mysql) into a Solr column, but I just can't find such a function. I'll use
 the value in PHP - I don't know if it is possible to serialize in PHP
style
 at all. This is what I tried and works with a given factor:



 in schema.xml:

   field name=main_timetable  type=text indexed=false
 stored=true multiValued=true /



 in DIH xml:



 dataConfig

  script![CDATA[

function my_serialize(row)

{

  row.put('main_timetable', row.toString());

  return row;

}

  ]]/script



 .



  entity name=main_timetable query=

SELECT * FROM shop_time_table stt WHERE stt.shop_id = '${shop.id
 }';

transformer=script:my_serialize



 .

 



  Can I use java directly in script (script language=Java) ?

  How could I achieve this? Or any other idea?

  I need these values together (from a row) and I need then in PHP to
handle
 the result easily.



 thanks,

  Rich


 __ Information from ESET NOD32 Antivirus, version of virus
 signature database 5740 (20101228) __

 The message was checked by ESET NOD32 Antivirus.

 http://www.eset.com



 __ Information

DIH serialize

2011-01-23 Thread Papp Richard
Hi all,

 

  I wasted the last few hours trying to serialize some column values (from
mysql) into a Solr column, but I just can't find such a function. I'll use
the value in PHP - I don't know if it is possible to serialize in PHP style
at all. This is what I tried and works with a given factor:

 

in schema.xml:

   field name=main_timetable  type=text indexed=false
stored=true multiValued=true /

 

in DIH xml:

 

dataConfig

  script![CDATA[

function my_serialize(row)

{

  row.put('main_timetable', row.toString());

  return row;

}

  ]]/script

 

.

 

  entity name=main_timetable query=

SELECT * FROM shop_time_table stt WHERE stt.shop_id = '${shop.id}';

transformer=script:my_serialize



.

 

 

  Can I use java directly in script (script language=Java) ?

  How could I achieve this? Or any other idea? 

  I need these values together (from a row) and I need then in PHP to handle
the result easily.

 

thanks,

  Rich



solr admin

2010-11-29 Thread Papp Richard
Hello,

  is there any way to specify in the solr admin other than fields? and I'm
nt talking about the full interface which is also very limited.

  like: score, fl, fq, ...

  and yes, I know that I can use the url... which indeed is not too handy.

thanks,
  Rich
 

__ Information from ESET NOD32 Antivirus, version of virus signature
database 5659 (20101129) __

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com
 



special sorting

2010-11-29 Thread Papp Richard
Hello,

  I have many pages with the same content in the search result (the result
is the same for some of the cities from the same county)... which means that
I have duplicate content.

  the filter query is something like: +locationId:(60 26a 39a) - for city
with ID 60
  and I get the same result for city with ID 62: +locationId:(62 26a 39a)
(cityID, countyID, countryID)

  how could I use a sorting to have different docs order in results for
different cities?
  (for the same city I need to have the same sort order always - it cannot
be a simple random...)

  could I use somehow the cityID parameter as boost or score ? I tried but
could't realise too much.

thanks,
  Rich
 

__ Information from ESET NOD32 Antivirus, version of virus signature
database 5659 (20101129) __

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com
 



RE: solr admin

2010-11-29 Thread Papp Richard
in Solr admin (http://localhost:8180/services/admin/)
I can specify something like:

+category_id:200 +xxx:300

but how can I specify a sort option?

sort:category_id+asc

regards,
  Rich

-Original Message-
From: Erick Erickson [mailto:erickerick...@gmail.com] 
Sent: Monday, November 29, 2010 22:00
To: solr-user@lucene.apache.org
Subject: Re: solr admin

I honestly don't understand what you're asking here. Specify what
in solr admin other than fields? what is it you're trying to accomplish?

Best
Erick

On Mon, Nov 29, 2010 at 2:56 PM, Papp Richard ccode...@gmail.com wrote:

 Hello,

  is there any way to specify in the solr admin other than fields? and I'm
 nt talking about the full interface which is also very limited.

  like: score, fl, fq, ...

  and yes, I know that I can use the url... which indeed is not too handy.

 thanks,
  Rich


 __ Information from ESET NOD32 Antivirus, version of virus
 signature
 database 5659 (20101129) __

 The message was checked by ESET NOD32 Antivirus.

 http://www.eset.com



 

__ Information from ESET NOD32 Antivirus, version of virus signature
database 5659 (20101129) __

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com
 
 

__ Information from ESET NOD32 Antivirus, version of virus signature
database 5659 (20101129) __

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com
 



RE: special sorting

2010-11-29 Thread Papp Richard
Hmm, any clue how to use it? use the location_id somehow?

thanks,
  Rich

-Original Message-
From: Tommaso Teofili [mailto:tommaso.teof...@gmail.com] 
Sent: Monday, November 29, 2010 22:08
To: solr-user@lucene.apache.org
Subject: Re: special sorting

Perhaps, depending on your domain logic you could use function queries to
achieve that.
http://wiki.apache.org/solr/FunctionQuery#Sort_By_Function
Regards,
Tommaso

2010/11/29 Papp Richard ccode...@gmail.com

 Hello,

  I have many pages with the same content in the search result (the result
 is the same for some of the cities from the same county)... which means
 that
 I have duplicate content.

  the filter query is something like: +locationId:(60 26a 39a) - for city
 with ID 60
  and I get the same result for city with ID 62: +locationId:(62 26a 39a)
 (cityID, countyID, countryID)

  how could I use a sorting to have different docs order in results for
 different cities?
  (for the same city I need to have the same sort order always - it cannot
 be a simple random...)

  could I use somehow the cityID parameter as boost or score ? I tried but
 could't realise too much.

 thanks,
  Rich


 __ Information from ESET NOD32 Antivirus, version of virus
 signature
 database 5659 (20101129) __

 The message was checked by ESET NOD32 Antivirus.

 http://www.eset.com



 

__ Information from ESET NOD32 Antivirus, version of virus signature
database 5659 (20101129) __

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com
 
 

__ Information from ESET NOD32 Antivirus, version of virus signature
database 5659 (20101129) __

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com
 



RE: solr 4.0 - pagination

2010-11-07 Thread Papp Richard
Dear Yonik,

  this is fantastic, but can you tell any time it will be ready ?
  I would need this feature in two weeks. Is it possible to finish and make
an update in this time or should I look for another solution cocerning the
pgaination (like implement just more results link instead of pagination) ?

best regards,
  Rich

-Original Message-
From: ysee...@gmail.com [mailto:ysee...@gmail.com] On Behalf Of Yonik Seeley
Sent: Saturday, October 30, 2010 19:29
To: solr-user@lucene.apache.org
Subject: Re: solr 4.0 - pagination

On Sat, Oct 30, 2010 at 12:22 PM, Papp Richard ccode...@gmail.com wrote:
  I'm using Solr 4.0 with grouping (field collapsing), but unfortunately I
 can't solve the pagination.

It's not implemented yet, but I'm working on that right now.

-Yonik
http://www.lucidimagination.com
 

__ Information from ESET NOD32 Antivirus, version of virus signature
database 5576 (20101029) __

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com
 
 

__ Information from ESET NOD32 Antivirus, version of virus signature
database 5598 (20101107) __

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com
 



RE: solr 4.0 - pagination

2010-11-07 Thread Papp Richard
thank you very much Yonik! 
you are a magician!

regards,
  Rich

-Original Message-
From: ysee...@gmail.com [mailto:ysee...@gmail.com] On Behalf Of Yonik Seeley
Sent: Sunday, November 07, 2010 18:04
To: Papp Richard
Cc: solr-user@lucene.apache.org
Subject: Re: solr 4.0 - pagination

On Sun, Nov 7, 2010 at 10:55 AM, Papp Richard ccode...@gmail.com wrote:
  this is fantastic, but can you tell any time it will be ready ?

It already is ;-)  Grab the latest trunk or the latest nightly build.

-Yonik
http://www.lucidimagination.com
 

__ Information from ESET NOD32 Antivirus, version of virus signature
database 5598 (20101107) __

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com
 
 

__ Information from ESET NOD32 Antivirus, version of virus signature
database 5598 (20101107) __

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com
 



RE: solr 4.0 - pagination

2010-11-07 Thread Papp Richard
Hi Yonik,

  I've just tried the latest stable version from nightly build:
apache-solr-4.0-2010-11-05_08-06-28.war

  I have some concerns however: I have 3 documents; 2 in the first group, 1
in the 2nd group.
  
  1. I got for matches 3 - which is good, but I still don't know how many
groups I have. (using start = 0, rows = 10)
  2. as far as I see the start / rows is working now, but the matches is
returned incorrectly = it said matches = 3 instead of = 1, when I used
start = 1, rows = 1

  so can you help me, how to compute how many pages I'll have, because the
matches can't use for this.

regards,
  Rich

-Original Message-
From: ysee...@gmail.com [mailto:ysee...@gmail.com] On Behalf Of Yonik Seeley
Sent: Sunday, November 07, 2010 18:04
To: Papp Richard
Cc: solr-user@lucene.apache.org
Subject: Re: solr 4.0 - pagination

On Sun, Nov 7, 2010 at 10:55 AM, Papp Richard ccode...@gmail.com wrote:
  this is fantastic, but can you tell any time it will be ready ?

It already is ;-)  Grab the latest trunk or the latest nightly build.

-Yonik
http://www.lucidimagination.com
 

__ Information from ESET NOD32 Antivirus, version of virus signature
database 5598 (20101107) __

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com
 
 

__ Information from ESET NOD32 Antivirus, version of virus signature
database 5599 (20101107) __

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com
 



RE: solr 4.0 - pagination

2010-11-07 Thread Papp Richard
Hey Yonik,

  Sorry, I think the matches is ok - because it probably returns always the
total document number - however I don't know how to compute the number of
pages.

thanks,
  Rich

-Original Message-
From: ysee...@gmail.com [mailto:ysee...@gmail.com] On Behalf Of Yonik Seeley
Sent: Sunday, November 07, 2010 18:04
To: Papp Richard
Cc: solr-user@lucene.apache.org
Subject: Re: solr 4.0 - pagination

On Sun, Nov 7, 2010 at 10:55 AM, Papp Richard ccode...@gmail.com wrote:
  this is fantastic, but can you tell any time it will be ready ?

It already is ;-)  Grab the latest trunk or the latest nightly build.

-Yonik
http://www.lucidimagination.com
 

__ Information from ESET NOD32 Antivirus, version of virus signature
database 5598 (20101107) __

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com
 
 

__ Information from ESET NOD32 Antivirus, version of virus signature
database 5599 (20101107) __

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com
 



RE: solr 4.0 - pagination

2010-11-07 Thread Papp Richard
I see. Let's assume that there are 1000 groups.
Can I use safely (with no negative impact on memory usage or slowness) the
start = 990, rows = 10 to get the latest page?
Or this will not work, due you will need to compute all the groups till
1000, in order to return the last 10, and because of this the whole will be
slow / memory usage will increase considerably.

regards,
  Rich

-Original Message-
From: ysee...@gmail.com [mailto:ysee...@gmail.com] On Behalf Of Yonik Seeley
Sent: Sunday, November 07, 2010 21:54
To: Papp Richard
Cc: solr-user@lucene.apache.org
Subject: Re: solr 4.0 - pagination

On Sun, Nov 7, 2010 at 2:45 PM, Papp Richard ccode...@gmail.com wrote:
 Hi Yonik,

  I've just tried the latest stable version from nightly build:
 apache-solr-4.0-2010-11-05_08-06-28.war

  I have some concerns however: I have 3 documents; 2 in the first group, 1
 in the 2nd group.

  1. I got for matches 3 - which is good, but I still don't know how many
 groups I have. (using start = 0, rows = 10)
  2. as far as I see the start / rows is working now, but the matches is
 returned incorrectly = it said matches = 3 instead of = 1, when I used
 start = 1, rows = 1

matches is the number of documents before grouping, so start/rows or
group.offset/group.limit will not affect this number.

  so can you help me, how to compute how many pages I'll have, because the
 matches can't use for this.

Solr doesn't even know given the current algorithm, hence it can't
return that info.

The issue is that to calculate the total number of groups, we would
need to keep each group in memory (which could cause a big blowup if
there are tons of groups).  The current algorithm only keeps the top
10 groups (assuming rows=10) in memory at any one time, hence it has
no idea what the total number of groups is.

-Yonik
http://www.lucidimagination.com
 

__ Information from ESET NOD32 Antivirus, version of virus signature
database 5599 (20101107) __

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com
 
 

__ Information from ESET NOD32 Antivirus, version of virus signature
database 5599 (20101107) __

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com
 



solr 4.0 - pagination

2010-10-30 Thread Papp Richard
Hi all,

 

  I'm using Solr 4.0 with grouping (field collapsing), but unfortunately I
can't solve the pagination.

  Mainly there are two problems:

-  the query fields start  rows doesn't work anymore -
beside of the values, it always returns the data as the start would be 0
(start = 0)

-  the result contains just the total document number and not
the total groups number

 

  Can anyone help me, how to solve this?

 

regards,

  Rich



RE: solr 4.0 - pagination

2010-10-30 Thread Papp Richard
Can you estimate please when it will be done?


thanks,
  Rich

-Original Message-
From: ysee...@gmail.com [mailto:ysee...@gmail.com] On Behalf Of Yonik Seeley
Sent: Saturday, October 30, 2010 19:29
To: solr-user@lucene.apache.org
Subject: Re: solr 4.0 - pagination

On Sat, Oct 30, 2010 at 12:22 PM, Papp Richard ccode...@gmail.com wrote:
  I'm using Solr 4.0 with grouping (field collapsing), but unfortunately I
 can't solve the pagination.

It's not implemented yet, but I'm working on that right now.

-Yonik
http://www.lucidimagination.com
 

__ Information from ESET NOD32 Antivirus, version of virus signature
database 5576 (20101029) __

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com
 
 

__ Information from ESET NOD32 Antivirus, version of virus signature
database 5576 (20101029) __

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com
 



RE: Grouping in solr ?

2010-09-30 Thread Papp Richard
I'm really sorry - thank you for the note.

-Original Message-
From: Chris Hostetter [mailto:hossman_luc...@fucit.org] 
Sent: Tuesday, September 28, 2010 05:12
To: solr-user@lucene.apache.org
Subject: Re: Grouping in solr ?

: References:
: abcc5d9ce0798544a169c584b8f1447d230313c...@exchange01.toolbox.local
: In-Reply-To:
: abcc5d9ce0798544a169c584b8f1447d230313c...@exchange01.toolbox.local
: Subject: Grouping in solr ?

http://people.apache.org/~hossman/#threadhijack
Thread Hijacking on Mailing Lists

When starting a new discussion on a mailing list, please do not reply to 
an existing message, instead start a fresh email.  Even if you change the 
subject line of your email, other mail headers still track which thread 
you replied to and your question is hidden in that thread and gets less 
attention.   It makes following discussions in the mailing list archives 
particularly difficult.
See Also:  http://en.wikipedia.org/wiki/User:DonDiego/Thread_hijacking



-Hoss

--
http://lucenerevolution.org/  ...  October 7-8, Boston
http://bit.ly/stump-hoss  ...  Stump The Chump!
 

__ Information from ESET NOD32 Antivirus, version of virus signature
database 5419 (20100902) __

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com
 
 

__ Information from ESET NOD32 Antivirus, version of virus signature
database 5419 (20100902) __

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com
 



Grouping in solr ?

2010-09-23 Thread Papp Richard
Hi all,

  is it possible somehow to group documents?
  I have services as documents, and I would like to show the filtered
services grouped by company. 
  So I filter services by given criteria, but I show the results grouped by
companay.
  If I got 1000 services, maybe I need to show just 100 companies (this will
affect pagination as well), and how could I get the company info? Should I
store the company info in each service (I don't need the compnany info to be
indexed) ?

regards,
  Rich
 

__ Information from ESET NOD32 Antivirus, version of virus signature
database 5419 (20100902) __

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com
 



RE: Grouping in solr ?

2010-09-23 Thread Papp Richard
thank you!
this is really helpful. just tried it and it's amazing.
do you know, how trustable is a nightly built version (solr4) ?

Rich

-Original Message-
From: Markus Jelsma [mailto:markus.jel...@buyways.nl] 
Sent: Thursday, September 23, 2010 22:38
To: solr-user@lucene.apache.org
Subject: RE: Grouping in solr ?

http://wiki.apache.org/solr/FieldCollapsing

https://issues.apache.org/jira/browse/SOLR-236

 
-Original message-
From: Papp Richard ccode...@gmail.com
Sent: Thu 23-09-2010 21:29
To: solr-user@lucene.apache.org; 
Subject: Grouping in solr ?

Hi all,

 is it possible somehow to group documents?
 I have services as documents, and I would like to show the filtered
services grouped by company. 
 So I filter services by given criteria, but I show the results grouped by
companay.
 If I got 1000 services, maybe I need to show just 100 companies (this will
affect pagination as well), and how could I get the company info? Should I
store the company info in each service (I don't need the compnany info to be
indexed) ?

regards,
 Rich


__ Information from ESET NOD32 Antivirus, version of virus signature
database 5419 (20100902) __

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com


 

__ Information from ESET NOD32 Antivirus, version of virus signature
database 5419 (20100902) __

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com
 
 

__ Information from ESET NOD32 Antivirus, version of virus signature
database 5419 (20100902) __

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com
 



tricky range query?

2010-09-21 Thread Papp Richard
Hi all,

 

  shortly my problem: I want to filter services based on timetables, let's
consider the next timetable for a day:

 

on the date of 15.10.2010:

10:00 - 11:00

12:00 - 12:30

14:30 - 16:00

17:00 - 20:00

 

  how could i store the timetable in Solr and efficiently search in it
(let's say filter those timetables which has an availability at 15:00) ? 

  not to mention, that each service has a duration (so, if the service takes
90 mins, filtering by 15:00 shouldn't return the previous timetable, because
there is not enough free time (just 60 mins in the above example))

 

  how to solve this? any hints?

 

regards,

  Rich



RE: tricky range query?

2010-09-21 Thread Papp Richard
Hi Erik,

  first of all, thank you for your answer. Let me detail a bit the amount of
data:

- actually services going to persons, and the time table is per person (a
person can have multiple services).
- there will be around 10.000 person (or maybe 100.000 - I would like to say
rather 100.000 than have problems later)
- but time table can differ from week to week, so each person has many time
table (one for each week) = so this means that if they have the timetables
for ~3 months (12 weeks)... 100.000 x 12 ~ 1.000.000 timetabels... and each
time table has 7 days... and on each day we have many periods (as someone
books a service, the timetbale will be modified, and possible will result in
time gaps, like I show in the example)... so all in all there are too many
data, is it?
- I've checkte the trie, but couldn't find too much info. I don't know if
it could be a solution to us e it or not - I'm not a solr expert.

regards,
  Rich


-Original Message-
From: Erick Erickson [mailto:erickerick...@gmail.com] 
Sent: Tuesday, September 21, 2010 14:40
To: solr-user@lucene.apache.org
Subject: Re: tricky range query?

How efficient it would be I don't know, but depending on how
many services you're talking here, efficiency may not be
that big of a deal...

But storing each interval as its own record along
with a duration should work. You could then form a query
like duration:[90 to *] AND start_time:[* to 1500] AND
end_time:[1500 TO *]. I'm not sure I'd want that kind of
query on a gigabyte of records...

But without knowing some more details, it's impossible to
say whether this would be at all suitable...

Best
Erick

On Tue, Sep 21, 2010 at 4:41 AM, Papp Richard ccode...@gmail.com wrote:

 Hi all,



  shortly my problem: I want to filter services based on timetables, let's
 consider the next timetable for a day:



 on the date of 15.10.2010:

 10:00 - 11:00

 12:00 - 12:30

 14:30 - 16:00

 17:00 - 20:00



  how could i store the timetable in Solr and efficiently search in it
 (let's say filter those timetables which has an availability at 15:00) ?

  not to mention, that each service has a duration (so, if the service
takes
 90 mins, filtering by 15:00 shouldn't return the previous timetable,
 because
 there is not enough free time (just 60 mins in the above example))



  how to solve this? any hints?



 regards,

  Rich


 

__ Information from ESET NOD32 Antivirus, version of virus signature
database 5419 (20100902) __

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com
 
 

__ Information from ESET NOD32 Antivirus, version of virus signature
database 5419 (20100902) __

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com
 



RE: tricky range query?

2010-09-21 Thread Papp Richard
Hi Erick,

  don't really understand your question and what exactly the point is, but
anyway. yes - there is a DB where data are stored, however the scheduling is
just a part of the whole picture. I thought to use Solr for search /
filtering results - the schedule (availability) is just one filter from the
whole search process. Does it make sense for you? May I ask if you are a
Solr specialist? I don't know how serious I have to take in account your
answers.

thank you,
  Rich

-Original Message-
From: Erick Erickson [mailto:erickerick...@gmail.com] 
Sent: Tuesday, September 21, 2010 20:36
To: solr-user@lucene.apache.org
Subject: Re: tricky range query?

So it sounds like you're working on some kind of scheduling app? Which
makes me wonder why you're using SOLR. Much as I like it, this sounds
more like a database application than a search application. What am I
missing?

Best
Erick

On Tue, Sep 21, 2010 at 1:05 PM, Papp Richard ccode...@gmail.com wrote:

 Hi Erik,

  first of all, thank you for your answer. Let me detail a bit the amount
of
 data:

 - actually services going to persons, and the time table is per person (a
 person can have multiple services).
 - there will be around 10.000 person (or maybe 100.000 - I would like to
 say
 rather 100.000 than have problems later)
 - but time table can differ from week to week, so each person has many
time
 table (one for each week) = so this means that if they have the
timetables
 for ~3 months (12 weeks)... 100.000 x 12 ~ 1.000.000 timetabels... and
each
 time table has 7 days... and on each day we have many periods (as someone
 books a service, the timetbale will be modified, and possible will result
 in
 time gaps, like I show in the example)... so all in all there are too many
 data, is it?
 - I've checkte the trie, but couldn't find too much info. I don't know
if
 it could be a solution to us e it or not - I'm not a solr expert.

 regards,
  Rich


 -Original Message-
 From: Erick Erickson [mailto:erickerick...@gmail.com]
 Sent: Tuesday, September 21, 2010 14:40
 To: solr-user@lucene.apache.org
 Subject: Re: tricky range query?

 How efficient it would be I don't know, but depending on how
 many services you're talking here, efficiency may not be
 that big of a deal...

 But storing each interval as its own record along
 with a duration should work. You could then form a query
 like duration:[90 to *] AND start_time:[* to 1500] AND
 end_time:[1500 TO *]. I'm not sure I'd want that kind of
 query on a gigabyte of records...

 But without knowing some more details, it's impossible to
 say whether this would be at all suitable...

 Best
 Erick

 On Tue, Sep 21, 2010 at 4:41 AM, Papp Richard ccode...@gmail.com wrote:

  Hi all,
 
 
 
   shortly my problem: I want to filter services based on timetables,
let's
  consider the next timetable for a day:
 
 
 
  on the date of 15.10.2010:
 
  10:00 - 11:00
 
  12:00 - 12:30
 
  14:30 - 16:00
 
  17:00 - 20:00
 
 
 
   how could i store the timetable in Solr and efficiently search in it
  (let's say filter those timetables which has an availability at 15:00) ?
 
   not to mention, that each service has a duration (so, if the service
 takes
  90 mins, filtering by 15:00 shouldn't return the previous timetable,
  because
  there is not enough free time (just 60 mins in the above example))
 
 
 
   how to solve this? any hints?
 
 
 
  regards,
 
   Rich
 
 


 __ Information from ESET NOD32 Antivirus, version of virus
 signature
 database 5419 (20100902) __

 The message was checked by ESET NOD32 Antivirus.

 http://www.eset.com



 __ Information from ESET NOD32 Antivirus, version of virus
 signature
 database 5419 (20100902) __

 The message was checked by ESET NOD32 Antivirus.

 http://www.eset.com



 

__ Information from ESET NOD32 Antivirus, version of virus signature
database 5419 (20100902) __

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com
 
 

__ Information from ESET NOD32 Antivirus, version of virus signature
database 5419 (20100902) __

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com
 



RE: trie

2010-09-21 Thread Papp Richard
thank you guys for the answers... now I have to check / read some docs ;)

Rich

-Original Message-
From: Simon Willnauer [mailto:simon.willna...@googlemail.com] 
Sent: Tuesday, September 21, 2010 23:00
To: solr-user@lucene.apache.org
Subject: Re: trie

2010/9/21 Péter Király kirun...@gmail.com:
 You can read about it in Lucene in Action second edition.
have a look at 
http://www.lucidimagination.com/developer/whitepaper/Whats-New-in-Apache-Lucene-3-0

page 4 to 8 should give you a good intro to the topic

simon

 Péter

 2010/9/21 Papp Richard ccode...@gmail.com:
  is there any good tutorial how to use and what is trie? what I found on the
 net is really blurry.

 rgeards,
  Rich


 __ Information from ESET NOD32 Antivirus, version of virus signature
 database 5419 (20100902) __

 The message was checked by ESET NOD32 Antivirus.

 http://www.eset.com




 

__ Information from ESET NOD32 Antivirus, version of virus signature 
database 5419 (20100902) __

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com
 
 

__ Information from ESET NOD32 Antivirus, version of virus signature 
database 5419 (20100902) __

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com