[jira] Commented: (SOLR-1814) select count(distinct fieldname) in SOLR
[ https://issues.apache.org/jira/browse/SOLR-1814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12844437#action_12844437 ] Marcus Herou commented on SOLR-1814: Instead of having the file attached... http://svn.tailsweep.com/opensource/solr-contrib/trunk/src/main/java/org/apache/solr/handler/component/ Erik: The facet counts is something else, it groups the counts based on the field supplied does it not? Perhaps facet.query (like you pointed out) can be used, I overlooked that. Never got an answer on the mailinglist so I implemented it instead :) What I jave accomplished is this: select count(distinct blog) from BlogEntries where ...somexpression... One doc is in in this case a BlogEntry and each belongs to Blog (many-to-one). If this already can be accomplished in SOLR, my bad. Please tell me how. Ted: Trove have two licenses GPL and ASL. I can use the ASL version if it helps. I only use Trove due to the efficiency, plain hashmaps can be used of course if it is a showstopper. select count(distinct fieldname) in SOLR Key: SOLR-1814 URL: https://issues.apache.org/jira/browse/SOLR-1814 Project: Solr Issue Type: New Feature Components: SearchComponents - other Affects Versions: 1.4, 1.5, 1.6, 2.0 Reporter: Marcus Herou Fix For: 1.4, 1.5, 1.6, 2.0 Attachments: CountComponent.java I have seen questions on the mailinglist about having the functionality for counting distinct on a field. We at Tailsweep as well want to that in for example our blogsearch. Example: You had 1345 hits on 244 blogs The 244 part is not possible in SOLR today (correct me if I am wrong). So I've written a component which does this. Attaching it. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Issue Comment Edited: (SOLR-1814) select count(distinct fieldname) in SOLR
[ https://issues.apache.org/jira/browse/SOLR-1814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12844437#action_12844437 ] Marcus Herou edited comment on SOLR-1814 at 3/12/10 9:55 AM: - Instead of having the file attached... http://svn.tailsweep.com/opensource/solr-contrib/trunk/src/main/java/org/apache/solr/handler/component/ Erik: The facet counts is something else, it groups the counts based on the field supplied does it not? Perhaps facet.query (like you pointed out) can be used, I overlooked that. Never got an answer on the mailinglist so I implemented it instead :) What I have accomplished is this: select count(distinct blog) from BlogEntries where ...somexpression... One doc is in in this case a BlogEntry and each belongs to Blog (many-to-one). If this already can be accomplished in SOLR, my bad. Please tell me how. Ted: Trove have two licenses GPL and ASL. I can use the ASL version if it helps. I only use Trove due to the efficiency, plain hashmaps can be used of course if it is a showstopper. was (Author: marcusherou): Instead of having the file attached... http://svn.tailsweep.com/opensource/solr-contrib/trunk/src/main/java/org/apache/solr/handler/component/ Erik: The facet counts is something else, it groups the counts based on the field supplied does it not? Perhaps facet.query (like you pointed out) can be used, I overlooked that. Never got an answer on the mailinglist so I implemented it instead :) What I jave accomplished is this: select count(distinct blog) from BlogEntries where ...somexpression... One doc is in in this case a BlogEntry and each belongs to Blog (many-to-one). If this already can be accomplished in SOLR, my bad. Please tell me how. Ted: Trove have two licenses GPL and ASL. I can use the ASL version if it helps. I only use Trove due to the efficiency, plain hashmaps can be used of course if it is a showstopper. select count(distinct fieldname) in SOLR Key: SOLR-1814 URL: https://issues.apache.org/jira/browse/SOLR-1814 Project: Solr Issue Type: New Feature Components: SearchComponents - other Affects Versions: 1.4, 1.5, 1.6, 2.0 Reporter: Marcus Herou Fix For: 1.4, 1.5, 1.6, 2.0 Attachments: CountComponent.java I have seen questions on the mailinglist about having the functionality for counting distinct on a field. We at Tailsweep as well want to that in for example our blogsearch. Example: You had 1345 hits on 244 blogs The 244 part is not possible in SOLR today (correct me if I am wrong). So I've written a component which does this. Attaching it. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Issue Comment Edited: (SOLR-1814) select count(distinct fieldname) in SOLR
[ https://issues.apache.org/jira/browse/SOLR-1814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12844437#action_12844437 ] Marcus Herou edited comment on SOLR-1814 at 3/12/10 10:03 AM: -- Instead of having the file attached... http://svn.tailsweep.com/opensource/solr-contrib/trunk/src/main/java/org/apache/solr/handler/component/ Erik: The facet counts is something else, it groups the counts based on the field supplied does it not? Perhaps facet.query (like you pointed out) can be used, I overlooked that. Never got an answer on the mailinglist so I implemented it instead :) Well the blogs is not a value it is a field of it's own. We call it feedId and is a pointer to a row in the DB. ... field name=feedId type=integer indexed=true stored=true required=true omitNorms=true / ... What I have accomplished is this: select count(distinct feedId) from FeedItem where ...somexpression... One doc is in in this case a FeedItem and each belongs to Feed (many-to-one). If this already can be accomplished in SOLR, my bad. Please tell me how. Ted: Trove have two licenses GPL and ASL. I can use the ASL version if it helps. I only use Trove due to the efficiency, plain hashmaps can be used of course if it is a showstopper. was (Author: marcusherou): Instead of having the file attached... http://svn.tailsweep.com/opensource/solr-contrib/trunk/src/main/java/org/apache/solr/handler/component/ Erik: The facet counts is something else, it groups the counts based on the field supplied does it not? Perhaps facet.query (like you pointed out) can be used, I overlooked that. Never got an answer on the mailinglist so I implemented it instead :) What I have accomplished is this: select count(distinct blog) from BlogEntries where ...somexpression... One doc is in in this case a BlogEntry and each belongs to Blog (many-to-one). If this already can be accomplished in SOLR, my bad. Please tell me how. Ted: Trove have two licenses GPL and ASL. I can use the ASL version if it helps. I only use Trove due to the efficiency, plain hashmaps can be used of course if it is a showstopper. select count(distinct fieldname) in SOLR Key: SOLR-1814 URL: https://issues.apache.org/jira/browse/SOLR-1814 Project: Solr Issue Type: New Feature Components: SearchComponents - other Affects Versions: 1.4, 1.5, 1.6, 2.0 Reporter: Marcus Herou Fix For: 1.4, 1.5, 1.6, 2.0 Attachments: CountComponent.java I have seen questions on the mailinglist about having the functionality for counting distinct on a field. We at Tailsweep as well want to that in for example our blogsearch. Example: You had 1345 hits on 244 blogs The 244 part is not possible in SOLR today (correct me if I am wrong). So I've written a component which does this. Attaching it. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Commented: (SOLR-1814) select count(distinct fieldname) in SOLR
[ https://issues.apache.org/jira/browse/SOLR-1814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1286#action_1286 ] Marcus Herou commented on SOLR-1814: Ted: I am an idiot about ASL. GNU Trove ( I mixed it up with something else ). I can add code which uses Trove if available in the CP or plain Hashmaps if not. Think it exists some good collection utils in commons. Will look it up. Trove however is super. select count(distinct fieldname) in SOLR Key: SOLR-1814 URL: https://issues.apache.org/jira/browse/SOLR-1814 Project: Solr Issue Type: New Feature Components: SearchComponents - other Affects Versions: 1.4, 1.5, 1.6, 2.0 Reporter: Marcus Herou Fix For: 1.4, 1.5, 1.6, 2.0 Attachments: CountComponent.java I have seen questions on the mailinglist about having the functionality for counting distinct on a field. We at Tailsweep as well want to that in for example our blogsearch. Example: You had 1345 hits on 244 blogs The 244 part is not possible in SOLR today (correct me if I am wrong). So I've written a component which does this. Attaching it. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Created: (SOLR-1814) select count(distinct fieldname) in SOLR
select count(distinct fieldname) in SOLR Key: SOLR-1814 URL: https://issues.apache.org/jira/browse/SOLR-1814 Project: Solr Issue Type: New Feature Components: SearchComponents - other Affects Versions: 1.4 Reporter: Marcus Herou I have seen questions on the mailinglist about having the functionality for counting distinct on a field. We at Tailsweep as well want to that in for example our blogsearch. Example: You had 1345 hits on 244 blogs The 244 part is not possible in SOLR today (correct me if I am wrong). So I've written a component which does this. Attaching it. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Updated: (SOLR-1814) select count(distinct fieldname) in SOLR
[ https://issues.apache.org/jira/browse/SOLR-1814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcus Herou updated SOLR-1814: --- Attachment: CountComponent.java It has dependencies to GNU Trove tested against v 2.0.2 http://sourceforge.net/projects/trove4j/files/trove/archived/trove-2.0.2/trove-2.0.2.tar.gz/download Trove have more memory efficient data structures so I used those instead. Perhaps should be broken out. solrconfig.xml arr name=last-components strcount/str /arr searchComponent name=count class=org.apache.solr.handler.component.CountComponent / select count(distinct fieldname) in SOLR Key: SOLR-1814 URL: https://issues.apache.org/jira/browse/SOLR-1814 Project: Solr Issue Type: New Feature Components: SearchComponents - other Affects Versions: 1.4 Reporter: Marcus Herou Attachments: CountComponent.java I have seen questions on the mailinglist about having the functionality for counting distinct on a field. We at Tailsweep as well want to that in for example our blogsearch. Example: You had 1345 hits on 244 blogs The 244 part is not possible in SOLR today (correct me if I am wrong). So I've written a component which does this. Attaching it. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Updated: (SOLR-1814) select count(distinct fieldname) in SOLR
[ https://issues.apache.org/jira/browse/SOLR-1814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcus Herou updated SOLR-1814: --- Fix Version/s: 1.4 2.0 1.6 1.5 Affects Version/s: 2.0 1.6 1.5 select count(distinct fieldname) in SOLR Key: SOLR-1814 URL: https://issues.apache.org/jira/browse/SOLR-1814 Project: Solr Issue Type: New Feature Components: SearchComponents - other Affects Versions: 1.4, 1.5, 1.6, 2.0 Reporter: Marcus Herou Fix For: 1.4, 1.5, 1.6, 2.0 Attachments: CountComponent.java I have seen questions on the mailinglist about having the functionality for counting distinct on a field. We at Tailsweep as well want to that in for example our blogsearch. Example: You had 1345 hits on 244 blogs The 244 part is not possible in SOLR today (correct me if I am wrong). So I've written a component which does this. Attaching it. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Commented: (SOLR-139) Support updateable/modifiable documents
[ https://issues.apache.org/jira/browse/SOLR-139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12702778#action_12702778 ] Marcus Herou commented on SOLR-139: --- It would make sense of adding ParallelReader functionality so a core can read from several index-dirs. Guess it complicates things a little since you would need to have support for adding data as well to more than one index. Suggestion: /update/coreX/index1 - Uses schema1.xml /update/coreX/index2 - Uses schema2.xml /select/coreX - Uses all schemas e.g. A ParallelReader. Seing quite a lot questions on the mailinglist about users that want to be able to update a single field while maintaining the rest of the index intact (not reindex). Support updateable/modifiable documents --- Key: SOLR-139 URL: https://issues.apache.org/jira/browse/SOLR-139 Project: Solr Issue Type: New Feature Components: update Reporter: Ryan McKinley Assignee: Ryan McKinley Fix For: 1.5 Attachments: Eriks-ModifiableDocument.patch, Eriks-ModifiableDocument.patch, Eriks-ModifiableDocument.patch, Eriks-ModifiableDocument.patch, Eriks-ModifiableDocument.patch, Eriks-ModifiableDocument.patch, getStoredFields.patch, getStoredFields.patch, getStoredFields.patch, getStoredFields.patch, getStoredFields.patch, SOLR-139-IndexDocumentCommand.patch, SOLR-139-IndexDocumentCommand.patch, SOLR-139-IndexDocumentCommand.patch, SOLR-139-IndexDocumentCommand.patch, SOLR-139-IndexDocumentCommand.patch, SOLR-139-IndexDocumentCommand.patch, SOLR-139-IndexDocumentCommand.patch, SOLR-139-IndexDocumentCommand.patch, SOLR-139-IndexDocumentCommand.patch, SOLR-139-IndexDocumentCommand.patch, SOLR-139-IndexDocumentCommand.patch, SOLR-139-ModifyInputDocuments.patch, SOLR-139-ModifyInputDocuments.patch, SOLR-139-ModifyInputDocuments.patch, SOLR-139-ModifyInputDocuments.patch, SOLR-139-XmlUpdater.patch, SOLR-269+139-ModifiableDocumentUpdateProcessor.patch It would be nice to be able to update some fields on a document without having to insert the entire document. Given the way lucene is structured, (for now) one can only modify stored fields. While we are at it, we can support incrementing an existing value - I think this only makes sense for numbers. for background, see: http://www.nabble.com/loading-many-documents-by-ID-tf3145666.html#a8722293 -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.