Re: Text searches and free form queries

2012-10-09 Thread Oleg Dulin




It works pretty fast.

Cool.
Just keep an eye out for how big the lucene token row gets.
Cheers




Indeed, it may get out of hand, but for now we are ok -- for the 
foreseable future I would say.


Should it get larger, I can split it up into rows -- i.e. all tokens 
that start with "a", all tokens that start with "b", etc.






Re: Text searches and free form queries

2012-10-08 Thread aaron morton
>  It works pretty fast.
Cool. 

Just keep an eye out for how big the lucene token row gets. 

Cheers

-
Aaron Morton
Freelance Developer
@aaronmorton
http://www.thelastpickle.com

On 7/10/2012, at 2:57 AM, Oleg Dulin  wrote:

> So, what I ended up doing is this --
> 
> As I write my records into the main CF, I tokenize some fields that I want to 
> search on using Lucene and write an index into a separate CF, such that my 
> columns are a composite of:
> 
> luceneToken:record key
> 
> I can then search my records by doing a slice for each lucene token in the 
> search query and then do an intersection of the sets. It works pretty fast.
> 
> Regards,
> Oleg
> 
> On 2012-09-05 01:28:44 +, aaron morton said:
> 
> AFAIk if you want to keep it inside cassandra then DSE, roll your own from 
> scratch or start with https://github.com/tjake/Solandra . 
> 
> Outside of Cassandra I've heard of people using Elastic Search or Solr which 
> I *think* is now faster at updating the index. 
> 
> Hope that helps. 
> 
>  
> -
> Aaron Morton
> Freelance Developer
> @aaronmorton
> http://www.thelastpickle.com
> 
> On 4/09/2012, at 3:00 AM, Andrey V. Panov  wrote:
> Some one did search on Lucene, but for very fresh data they build search 
> index in memory so data become available for search without delays.
> 
> On 3 September 2012 22:25, Oleg Dulin  wrote:
> Dear Distinguished Colleagues:
> 
> 
> -- 
> Regards,
> Oleg Dulin
> NYC Java Big Data Engineer
> http://www.olegdulin.com/



Re: Text searches and free form queries

2012-10-06 Thread Oleg Dulin

So, what I ended up doing is this --

As I write my records into the main CF, I tokenize some fields that I 
want to search on using Lucene and write an index into a separate CF, 
such that my columns are a composite of:


luceneToken:record key

I can then search my records by doing a slice for each lucene token in 
the search query and then do an intersection of the sets. It works 
pretty fast.


Regards,
Oleg

On 2012-09-05 01:28:44 +, aaron morton said:

AFAIk if you want to keep it inside cassandra then DSE, roll your own 
from scratch or start with https://github.com/tjake/Solandra . 


Outside of Cassandra I've heard of people using Elastic Search or Solr 
which I *think* is now faster at updating the index. 


Hope that helps. 

 
-
Aaron Morton
Freelance Developer
@aaronmorton
http://www.thelastpickle.com

On 4/09/2012, at 3:00 AM, Andrey V. Panov  wrote:
Some one did search on Lucene, but for very fresh data they build 
search index in memory so data become available for search without 
delays.


On 3 September 2012 22:25, Oleg Dulin  wrote:
Dear Distinguished Colleagues:



--
Regards,
Oleg Dulin
NYC Java Big Data Engineer
http://www.olegdulin.com/

Re: Text searches and free form queries

2012-09-04 Thread aaron morton
AFAIk if you want to keep it inside cassandra then DSE, roll your own from 
scratch or start with https://github.com/tjake/Solandra . 

Outside of Cassandra I've heard of people using Elastic Search or Solr which I 
*think* is now faster at updating the index. 

Hope that helps. 

 
-
Aaron Morton
Freelance Developer
@aaronmorton
http://www.thelastpickle.com

On 4/09/2012, at 3:00 AM, Andrey V. Panov  wrote:

> Some one did search on Lucene, but for very fresh data they build search 
> index in memory so data become available for search without delays.
> 
> On 3 September 2012 22:25, Oleg Dulin  wrote:
> Dear Distinguished Colleagues:
> 



Re: Text searches and free form queries

2012-09-03 Thread Andrey V. Panov
Some one did search on Lucene, but for very fresh data they build search
index in memory so data become available for search without delays.

On 3 September 2012 22:25, Oleg Dulin  wrote:

> Dear Distinguished Colleagues:
>
>