[jira] Closed: (SOLR-495) Use jarjar links to transform javax.xml classpath to avoid collisions with other xml parsers.

2008-11-12 Thread JIRA
[ https://issues.apache.org/jira/browse/SOLR-495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Frössman closed SOLR-495. Resolution: Fixed Works without this work around for me with latest jboss 4.2 and solrj 1.3 final.

Re: [jira] Resolved: (SOLR-845) Solr 1.3.1 Release

2008-11-12 Thread Gunnar Wagenknecht
Ryan McKinley schrieb: Yes, you should make an issue related to the StAX dependency. Ok, will do. Without upgrading to java 1.6, what is your thought on how to remove the dependencies? I haven't looked if it's a JRE issue. Based on Shalin's response I assumed it's not. If JRE is an issue

Re: Katta's goodness for Solr

2008-11-12 Thread Grant Ingersoll
On Nov 11, 2008, at 1:15 PM, Otis Gospodnetic wrote: Quick thought. I saw Stefan's Katta presentation last night. Katta seems nice and simple. If I understood correctly, juicy stuff that is interesting to Solr is: - Katta has a notion of a Primary Master and N Secondary Slaves (no SPOF

[jira] Work started: (SOLR-284) Parsing Rich Document Types

2008-11-12 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on SOLR-284 started by Grant Ingersoll. Parsing Rich Document Types --- Key: SOLR-284 URL:

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-11-12 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12646947#action_12646947 ] Grant Ingersoll commented on SOLR-284: -- Some initial thoughts on moving forward: I

DataImportHandler and Blobs

2008-11-12 Thread Grant Ingersoll
Am I understanding the DIH correctly in that it doesn't work with Blobs and or binary things? I'm basing this off of JdbcDataSource.getARow() which seems to be the place that populates the Map that is then passed to the Transformer. One of the things that I think might be interesting is,

[jira] Updated: (SOLR-833) A DataSource to read data from a field as a reader

2008-11-12 Thread Shalin Shekhar Mangar (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shalin Shekhar Mangar updated SOLR-833: --- Attachment: SOLR-833.patch Bringing patch in sync with trunk (it broke due to to

[jira] Updated: (SOLR-847) Enhance the snappull command in ReplicationHandler to take in masterUrl

2008-11-12 Thread Noble Paul (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-847?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Noble Paul updated SOLR-847: Attachment: SOLR-847.patch take in an extra parameter while triggerring snappull Enhance the snappull

[jira] Commented: (SOLR-561) Solr replication by Solr (for windows also)

2008-11-12 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-561?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12646968#action_12646968 ] Otis Gospodnetic commented on SOLR-561: --- I wonder if it might be useful to add copy

[jira] Resolved: (SOLR-842) Better error handling for DIH

2008-11-12 Thread Shalin Shekhar Mangar (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shalin Shekhar Mangar resolved SOLR-842. Resolution: Fixed Committed revision 713335. Thanks Noble! Better error handling

[jira] Commented: (SOLR-651) A SearchComponent for fetching TF-IDF values

2008-11-12 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12646911#action_12646911 ] Grant Ingersoll commented on SOLR-651: -- I committed: freq - tf idf - df tf-idf - tf_idf

[jira] Resolved: (SOLR-833) A DataSource to read data from a field as a reader

2008-11-12 Thread Shalin Shekhar Mangar (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shalin Shekhar Mangar resolved SOLR-833. Resolution: Fixed Committed revision 713343. Thanks Noble! A DataSource to read

Re: DataImportHandler and Blobs

2008-11-12 Thread Shalin Shekhar Mangar
On Wed, Nov 12, 2008 at 10:44 PM, Grant Ingersoll [EMAIL PROTECTED]wrote: Am I understanding the DIH correctly in that it doesn't work with Blobs and or binary things? I'm basing this off of JdbcDataSource.getARow() which seems to be the place that populates the Map that is then passed to

ContentStream Strategy

2008-11-12 Thread Grant Ingersoll
So, I'm working on SOLR-284 (Rich Documents, aka Tika) and I'm looking at the RichDocumentReqHandler included in that patch and I notice it has a lot of overlap with CSVRequestHandler and the XMLUpdateRequestHandler such that there seems to be a fair amount of commonality for dealing with

[jira] Created: (SOLR-850) Addition of timeouts for distributed searching

2008-11-12 Thread patrick o'leary (JIRA)
Addition of timeouts for distributed searching -- Key: SOLR-850 URL: https://issues.apache.org/jira/browse/SOLR-850 Project: Solr Issue Type: Improvement Components: search Affects

[jira] Updated: (SOLR-850) Addition of timeouts for distributed searching

2008-11-12 Thread patrick o'leary (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] patrick o'leary updated SOLR-850: - Attachment: SOLR-850.patch Patches solr-1.3.0 release Allows for 2 timeout settings to be

Re: ContentStream Strategy

2008-11-12 Thread Erik Hatcher
+1 - keep it DRY! Erik On Nov 12, 2008, at 1:19 PM, Grant Ingersoll wrote: So, I'm working on SOLR-284 (Rich Documents, aka Tika) and I'm looking at the RichDocumentReqHandler included in that patch and I notice it has a lot of overlap with CSVRequestHandler and the

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-11-12 Thread Eric Pugh (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12647003#action_12647003 ] Eric Pugh commented on SOLR-284: Grant, I am really excited that you are looking at this

[jira] Commented: (SOLR-465) Add configurable DirectoryProvider so that alternate Directory implementations can be specified via solrconfig.xml

2008-11-12 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12647002#action_12647002 ] Yonik Seeley commented on SOLR-465: --- Hmmm, can you expand on what's wrong? Add

[jira] Created: (SOLR-851) Remove dependency on StAX Utils or make it optional

2008-11-12 Thread Gunnar Wagenknecht (JIRA)
Remove dependency on StAX Utils or make it optional --- Key: SOLR-851 URL: https://issues.apache.org/jira/browse/SOLR-851 Project: Solr Issue Type: Wish Affects Versions: 1.3

[jira] Created: (SOLR-852) Refactor common code in various handlers for working with ContentStream Objects

2008-11-12 Thread Grant Ingersoll (JIRA)
Refactor common code in various handlers for working with ContentStream Objects --- Key: SOLR-852 URL: https://issues.apache.org/jira/browse/SOLR-852 Project: Solr

[jira] Updated: (SOLR-850) Addition of timeouts for distributed searching

2008-11-12 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Otis Gospodnetic updated SOLR-850: -- Priority: Minor (was: Major) Fix Version/s: (was: 1.3) 1.4

[jira] Commented: (SOLR-465) Add configurable DirectoryProvider so that alternate Directory implementations can be specified via solrconfig.xml

2008-11-12 Thread Shalin Shekhar Mangar (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12647027#action_12647027 ] Shalin Shekhar Mangar commented on SOLR-465: bq. But couldn't one start the

[jira] Commented: (SOLR-465) Add configurable DirectoryProvider so that alternate Directory implementations can be specified via solrconfig.xml

2008-11-12 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12647022#action_12647022 ] Otis Gospodnetic commented on SOLR-465: --- But couldn't one start the container with the

[jira] Commented: (SOLR-465) Add configurable DirectoryProvider so that alternate Directory implementations can be specified via solrconfig.xml

2008-11-12 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12647034#action_12647034 ] Mark Miller commented on SOLR-465: -- bq. But couldn't one start the container with the

[jira] Work started: (SOLR-852) Refactor common code in various handlers for working with ContentStream Objects

2008-11-12 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on SOLR-852 started by Grant Ingersoll. Refactor common code in various handlers for working with ContentStream Objects

[jira] Updated: (SOLR-852) Refactor common code in various handlers for working with ContentStream Objects

2008-11-12 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll updated SOLR-852: - Attachment: SOLR-852.patch Here's a first crack at refactoring common code for ContentStream

[jira] Issue Comment Edited: (SOLR-851) Remove dependency on StAX Utils or make it optional

2008-11-12 Thread Gunnar Wagenknecht (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12647004#action_12647004 ] gunnar edited comment on SOLR-851 at 11/12/08 11:12 AM:

[jira] Commented: (SOLR-465) Add configurable DirectoryProvider so that alternate Directory implementations can be specified via solrconfig.xml

2008-11-12 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12647008#action_12647008 ] Mark Miller commented on SOLR-465: -- You will get an FSDirectory no matter which OS. Open is

[jira] Commented: (SOLR-852) Refactor common code in various handlers for working with ContentStream Objects

2008-11-12 Thread Erik Hatcher (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12647067#action_12647067 ] Erik Hatcher commented on SOLR-852: --- Patch looks good, Grant. The abstract class of

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-11-12 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12646987#action_12646987 ] Grant Ingersoll commented on SOLR-284: -- {quote} 3. Tika provides a mechanism for

[jira] Updated: (SOLR-346) need to improve snapinstaller to ignore non-snapshots in data directory

2008-11-12 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Otis Gospodnetic updated SOLR-346: -- Fix Version/s: 1.3.1 need to improve snapinstaller to ignore non-snapshots in data directory

[jira] Updated: (SOLR-830) snappuller picks bad snapshot name

2008-11-12 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-830?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Otis Gospodnetic updated SOLR-830: -- Fix Version/s: 1.3.1 snappuller picks bad snapshot name --

[jira] Updated: (SOLR-799) Add support for hash based exact/near duplicate document handling

2008-11-12 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller updated SOLR-799: - Attachment: SOLR-799.patch This ensures the id field stays unique. Are there any other issues that need to

[jira] Updated: (SOLR-852) Refactor common code in various handlers for working with ContentStream Objects

2008-11-12 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll updated SOLR-852: - Attachment: SOLR-852.patch Remove the setErrorHeader stuff and just set it in the load. I will

Re: DataImportHandler and Blobs

2008-11-12 Thread Grant Ingersoll
On Nov 12, 2008, at 12:56 PM, Shalin Shekhar Mangar wrote: I think the best way would be a TikaEntityProcessor which knows how to handle documents. I guess a typical use-case would be FileListEntityProcessor-TikaEntityProcessor as parent-child entities. Also see SOLR-833 which adds a

Re: Katta's goodness for Solr

2008-11-12 Thread Noble Paul നോബിള്‍ नोब्ळ्
The way we do distributed search is not straight forward . Introducing extra layers (LoadBalancers) in between the shards looks like a hack to me. Moreover , passing in the shard URL in the request is not a very nice design The clients should be ideally unaware of the fact that they are doing a

[jira] Created: (SOLR-853) Make DIH API friendly

2008-11-12 Thread Noble Paul (JIRA)
Make DIH API friendly - Key: SOLR-853 URL: https://issues.apache.org/jira/browse/SOLR-853 Project: Solr Issue Type: Improvement Components: contrib - DataImportHandler Reporter: Noble Paul DIH

Re: Katta's goodness for Solr

2008-11-12 Thread Otis Gospodnetic
I'm not sure if you are commenting on how Katta does things in that LoadBalancers part, but Katta doesn't do that as far as I know. Passing shard URL in request is the Solr thing, but I think we concluded shard URLs can also live in defaults for the handler, no? Otis -- Sematext --

[jira] Updated: (SOLR-850) Addition of timeouts for distributed searching

2008-11-12 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Otis Gospodnetic updated SOLR-850: -- Assignee: Otis Gospodnetic I don't see any harm in this. Current values (undefined) mean no

[jira] Assigned: (SOLR-236) Field collapsing

2008-11-12 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-236?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Otis Gospodnetic reassigned SOLR-236: - Assignee: (was: Otis Gospodnetic) Field collapsing

Re: Katta's goodness for Solr

2008-11-12 Thread Noble Paul നോബിള്‍ नोब्ळ्
On Thu, Nov 13, 2008 at 10:11 AM, Otis Gospodnetic [EMAIL PROTECTED] wrote: I'm not sure if you are commenting on how Katta does things in that LoadBalancers part, but Katta doesn't do that as far as I know. Passing shard URL in request is the Solr thing, but I think we concluded shard URLs

[jira] Updated: (SOLR-844) A SolrServer impl to front-end multiple urls

2008-11-12 Thread Noble Paul (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Noble Paul updated SOLR-844: Description: Currently a {{CommonsHttpSolrServer}} can talk to only one server. This demands that the user

[jira] Updated: (SOLR-844) A SolrServer impl to front-end multiple urls

2008-11-12 Thread Noble Paul (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Noble Paul updated SOLR-844: Description: Currently a {{CommonsHttpSolrServer}} can talk to only one server. This demands that the user

[jira] Updated: (SOLR-846) Out Of memory doing delta import with fetch size set to -1

2008-11-12 Thread Noble Paul (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-846?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Noble Paul updated SOLR-846: Attachment: SOLR-846.patch a partial solution. eventually we must stream it or persist the data Out Of