[jira] Commented: (SOLR-1814) select count(distinct fieldname) in SOLR

2010-03-12 Thread Marcus Herou (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-1814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12844437#action_12844437
 ] 

Marcus Herou commented on SOLR-1814:


Instead of having the file attached... 
http://svn.tailsweep.com/opensource/solr-contrib/trunk/src/main/java/org/apache/solr/handler/component/

Erik:
The facet counts is something else, it groups the counts based on the field 
supplied does it not? Perhaps facet.query (like you pointed out) can be used, I 
overlooked that. Never got an answer on the mailinglist so I implemented it 
instead :)

What I jave accomplished is this:

select count(distinct blog) from BlogEntries where ...somexpression...

One doc is in in this case a BlogEntry and each belongs to Blog (many-to-one). 
If this already can be accomplished in SOLR, my bad. Please tell me how.

Ted: 
Trove have two licenses GPL and ASL. I can use the ASL version if it helps. I 
only use Trove due to the efficiency, plain hashmaps can be used of course if 
it is a showstopper.



 select count(distinct fieldname) in SOLR
 

 Key: SOLR-1814
 URL: https://issues.apache.org/jira/browse/SOLR-1814
 Project: Solr
  Issue Type: New Feature
  Components: SearchComponents - other
Affects Versions: 1.4, 1.5, 1.6, 2.0
Reporter: Marcus Herou
 Fix For: 1.4, 1.5, 1.6, 2.0

 Attachments: CountComponent.java


 I have seen questions on the mailinglist about having the functionality for 
 counting distinct on a field. We at Tailsweep as well want to that in for 
 example our blogsearch.
 Example:
 You had 1345 hits on 244 blogs
 The 244 part is not possible in SOLR today (correct me if I am wrong). So 
 I've written a component which does this. Attaching it.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Issue Comment Edited: (SOLR-1814) select count(distinct fieldname) in SOLR

2010-03-12 Thread Marcus Herou (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-1814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12844437#action_12844437
 ] 

Marcus Herou edited comment on SOLR-1814 at 3/12/10 9:55 AM:
-

Instead of having the file attached... 
http://svn.tailsweep.com/opensource/solr-contrib/trunk/src/main/java/org/apache/solr/handler/component/

Erik:
The facet counts is something else, it groups the counts based on the field 
supplied does it not? Perhaps facet.query (like you pointed out) can be used, I 
overlooked that. Never got an answer on the mailinglist so I implemented it 
instead :)

What I have accomplished is this:

select count(distinct blog) from BlogEntries where ...somexpression...

One doc is in in this case a BlogEntry and each belongs to Blog (many-to-one). 
If this already can be accomplished in SOLR, my bad. Please tell me how.

Ted: 
Trove have two licenses GPL and ASL. I can use the ASL version if it helps. I 
only use Trove due to the efficiency, plain hashmaps can be used of course if 
it is a showstopper.



  was (Author: marcusherou):
Instead of having the file attached... 
http://svn.tailsweep.com/opensource/solr-contrib/trunk/src/main/java/org/apache/solr/handler/component/

Erik:
The facet counts is something else, it groups the counts based on the field 
supplied does it not? Perhaps facet.query (like you pointed out) can be used, I 
overlooked that. Never got an answer on the mailinglist so I implemented it 
instead :)

What I jave accomplished is this:

select count(distinct blog) from BlogEntries where ...somexpression...

One doc is in in this case a BlogEntry and each belongs to Blog (many-to-one). 
If this already can be accomplished in SOLR, my bad. Please tell me how.

Ted: 
Trove have two licenses GPL and ASL. I can use the ASL version if it helps. I 
only use Trove due to the efficiency, plain hashmaps can be used of course if 
it is a showstopper.


  
 select count(distinct fieldname) in SOLR
 

 Key: SOLR-1814
 URL: https://issues.apache.org/jira/browse/SOLR-1814
 Project: Solr
  Issue Type: New Feature
  Components: SearchComponents - other
Affects Versions: 1.4, 1.5, 1.6, 2.0
Reporter: Marcus Herou
 Fix For: 1.4, 1.5, 1.6, 2.0

 Attachments: CountComponent.java


 I have seen questions on the mailinglist about having the functionality for 
 counting distinct on a field. We at Tailsweep as well want to that in for 
 example our blogsearch.
 Example:
 You had 1345 hits on 244 blogs
 The 244 part is not possible in SOLR today (correct me if I am wrong). So 
 I've written a component which does this. Attaching it.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Issue Comment Edited: (SOLR-1814) select count(distinct fieldname) in SOLR

2010-03-12 Thread Marcus Herou (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-1814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12844437#action_12844437
 ] 

Marcus Herou edited comment on SOLR-1814 at 3/12/10 10:03 AM:
--

Instead of having the file attached... 
http://svn.tailsweep.com/opensource/solr-contrib/trunk/src/main/java/org/apache/solr/handler/component/

Erik:
The facet counts is something else, it groups the counts based on the field 
supplied does it not? Perhaps facet.query (like you pointed out) can be used, I 
overlooked that. Never got an answer on the mailinglist so I implemented it 
instead :)

Well the blogs is not a value it is a field of it's own.
We call it feedId and is a pointer to a row in the DB.
...
field name=feedId type=integer indexed=true stored=true 
required=true omitNorms=true /
...

What I have accomplished is this:

select count(distinct feedId) from FeedItem where ...somexpression...

One doc is in in this case a FeedItem and each belongs to Feed (many-to-one). 
If this already can be accomplished in SOLR, my bad. Please tell me how.

Ted: 
Trove have two licenses GPL and ASL. I can use the ASL version if it helps. I 
only use Trove due to the efficiency, plain hashmaps can be used of course if 
it is a showstopper.



  was (Author: marcusherou):
Instead of having the file attached... 
http://svn.tailsweep.com/opensource/solr-contrib/trunk/src/main/java/org/apache/solr/handler/component/

Erik:
The facet counts is something else, it groups the counts based on the field 
supplied does it not? Perhaps facet.query (like you pointed out) can be used, I 
overlooked that. Never got an answer on the mailinglist so I implemented it 
instead :)

What I have accomplished is this:

select count(distinct blog) from BlogEntries where ...somexpression...

One doc is in in this case a BlogEntry and each belongs to Blog (many-to-one). 
If this already can be accomplished in SOLR, my bad. Please tell me how.

Ted: 
Trove have two licenses GPL and ASL. I can use the ASL version if it helps. I 
only use Trove due to the efficiency, plain hashmaps can be used of course if 
it is a showstopper.


  
 select count(distinct fieldname) in SOLR
 

 Key: SOLR-1814
 URL: https://issues.apache.org/jira/browse/SOLR-1814
 Project: Solr
  Issue Type: New Feature
  Components: SearchComponents - other
Affects Versions: 1.4, 1.5, 1.6, 2.0
Reporter: Marcus Herou
 Fix For: 1.4, 1.5, 1.6, 2.0

 Attachments: CountComponent.java


 I have seen questions on the mailinglist about having the functionality for 
 counting distinct on a field. We at Tailsweep as well want to that in for 
 example our blogsearch.
 Example:
 You had 1345 hits on 244 blogs
 The 244 part is not possible in SOLR today (correct me if I am wrong). So 
 I've written a component which does this. Attaching it.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (SOLR-1814) select count(distinct fieldname) in SOLR

2010-03-12 Thread Marcus Herou (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-1814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1286#action_1286
 ] 

Marcus Herou commented on SOLR-1814:


Ted: I am an idiot about ASL. GNU Trove ( I mixed it up with something else ).

I can add code which uses Trove if available in the CP or plain Hashmaps if 
not. Think it exists some good collection utils in commons. Will look it up. 
Trove however is super.

 select count(distinct fieldname) in SOLR
 

 Key: SOLR-1814
 URL: https://issues.apache.org/jira/browse/SOLR-1814
 Project: Solr
  Issue Type: New Feature
  Components: SearchComponents - other
Affects Versions: 1.4, 1.5, 1.6, 2.0
Reporter: Marcus Herou
 Fix For: 1.4, 1.5, 1.6, 2.0

 Attachments: CountComponent.java


 I have seen questions on the mailinglist about having the functionality for 
 counting distinct on a field. We at Tailsweep as well want to that in for 
 example our blogsearch.
 Example:
 You had 1345 hits on 244 blogs
 The 244 part is not possible in SOLR today (correct me if I am wrong). So 
 I've written a component which does this. Attaching it.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Created: (SOLR-1814) select count(distinct fieldname) in SOLR

2010-03-10 Thread Marcus Herou (JIRA)
select count(distinct fieldname) in SOLR


 Key: SOLR-1814
 URL: https://issues.apache.org/jira/browse/SOLR-1814
 Project: Solr
  Issue Type: New Feature
  Components: SearchComponents - other
Affects Versions: 1.4
Reporter: Marcus Herou


I have seen questions on the mailinglist about having the functionality for 
counting distinct on a field. We at Tailsweep as well want to that in for 
example our blogsearch.

Example:
You had 1345 hits on 244 blogs

The 244 part is not possible in SOLR today (correct me if I am wrong). So I've 
written a component which does this. Attaching it.


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (SOLR-1814) select count(distinct fieldname) in SOLR

2010-03-10 Thread Marcus Herou (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-1814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Marcus Herou updated SOLR-1814:
---

Attachment: CountComponent.java

It has dependencies to GNU Trove tested against v 2.0.2
http://sourceforge.net/projects/trove4j/files/trove/archived/trove-2.0.2/trove-2.0.2.tar.gz/download

Trove have more memory efficient data structures so I used those instead. 
Perhaps should be broken out.

solrconfig.xml

arr name=last-components
  strcount/str   
/arr

searchComponent name=count 
class=org.apache.solr.handler.component.CountComponent /



 select count(distinct fieldname) in SOLR
 

 Key: SOLR-1814
 URL: https://issues.apache.org/jira/browse/SOLR-1814
 Project: Solr
  Issue Type: New Feature
  Components: SearchComponents - other
Affects Versions: 1.4
Reporter: Marcus Herou
 Attachments: CountComponent.java


 I have seen questions on the mailinglist about having the functionality for 
 counting distinct on a field. We at Tailsweep as well want to that in for 
 example our blogsearch.
 Example:
 You had 1345 hits on 244 blogs
 The 244 part is not possible in SOLR today (correct me if I am wrong). So 
 I've written a component which does this. Attaching it.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (SOLR-1814) select count(distinct fieldname) in SOLR

2010-03-10 Thread Marcus Herou (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-1814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Marcus Herou updated SOLR-1814:
---

Fix Version/s: 1.4
   2.0
   1.6
   1.5
Affects Version/s: 2.0
   1.6
   1.5

 select count(distinct fieldname) in SOLR
 

 Key: SOLR-1814
 URL: https://issues.apache.org/jira/browse/SOLR-1814
 Project: Solr
  Issue Type: New Feature
  Components: SearchComponents - other
Affects Versions: 1.4, 1.5, 1.6, 2.0
Reporter: Marcus Herou
 Fix For: 1.4, 1.5, 1.6, 2.0

 Attachments: CountComponent.java


 I have seen questions on the mailinglist about having the functionality for 
 counting distinct on a field. We at Tailsweep as well want to that in for 
 example our blogsearch.
 Example:
 You had 1345 hits on 244 blogs
 The 244 part is not possible in SOLR today (correct me if I am wrong). So 
 I've written a component which does this. Attaching it.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (SOLR-139) Support updateable/modifiable documents

2009-04-25 Thread Marcus Herou (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12702778#action_12702778
 ] 

Marcus Herou commented on SOLR-139:
---

It would make sense of adding ParallelReader functionality so a core can read 
from several index-dirs. 
Guess it complicates things a little since you would need to have support for 
adding data as well to more than one index.

Suggestion:
/update/coreX/index1 - Uses schema1.xml
/update/coreX/index2 - Uses schema2.xml
/select/coreX - Uses all schemas e.g. A ParallelReader.

Seing quite a lot questions on the mailinglist about users that want to be able 
to update a single field while maintaining the rest of the index intact (not 
reindex).



 Support updateable/modifiable documents
 ---

 Key: SOLR-139
 URL: https://issues.apache.org/jira/browse/SOLR-139
 Project: Solr
  Issue Type: New Feature
  Components: update
Reporter: Ryan McKinley
Assignee: Ryan McKinley
 Fix For: 1.5

 Attachments: Eriks-ModifiableDocument.patch, 
 Eriks-ModifiableDocument.patch, Eriks-ModifiableDocument.patch, 
 Eriks-ModifiableDocument.patch, Eriks-ModifiableDocument.patch, 
 Eriks-ModifiableDocument.patch, getStoredFields.patch, getStoredFields.patch, 
 getStoredFields.patch, getStoredFields.patch, getStoredFields.patch, 
 SOLR-139-IndexDocumentCommand.patch, SOLR-139-IndexDocumentCommand.patch, 
 SOLR-139-IndexDocumentCommand.patch, SOLR-139-IndexDocumentCommand.patch, 
 SOLR-139-IndexDocumentCommand.patch, SOLR-139-IndexDocumentCommand.patch, 
 SOLR-139-IndexDocumentCommand.patch, SOLR-139-IndexDocumentCommand.patch, 
 SOLR-139-IndexDocumentCommand.patch, SOLR-139-IndexDocumentCommand.patch, 
 SOLR-139-IndexDocumentCommand.patch, SOLR-139-ModifyInputDocuments.patch, 
 SOLR-139-ModifyInputDocuments.patch, SOLR-139-ModifyInputDocuments.patch, 
 SOLR-139-ModifyInputDocuments.patch, SOLR-139-XmlUpdater.patch, 
 SOLR-269+139-ModifiableDocumentUpdateProcessor.patch


 It would be nice to be able to update some fields on a document without 
 having to insert the entire document.
 Given the way lucene is structured, (for now) one can only modify stored 
 fields.
 While we are at it, we can support incrementing an existing value - I think 
 this only makes sense for numbers.
 for background, see:
 http://www.nabble.com/loading-many-documents-by-ID-tf3145666.html#a8722293

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.