[jira] [Commented] (SOLR-12789) UIMA enhancements to allow for dynamic AE detection

2018-09-24 Thread Aaron LaBella (JIRA)


[ 
https://issues.apache.org/jira/browse/SOLR-12789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16625774#comment-16625774
 ] 

Aaron LaBella commented on SOLR-12789:
--

Hi Alexandre, thank for you the additional detail and background.  While I 
understand the goal here, I don't agree with how the end result was achieved.  
I think the real "issue" here was that the examples and documentation are 
stale.  Likewise, UIMA core can (and should) be upgraded to the latest 2.10.2, 
and the additional unnecessary dependencies should absolutely be removed from 
the dist.  I'm attaching a simple patch (*SOLR-12789-4.patch*) that does just 
this.  I would like to propose that we re-instate the contrib/uima project and 
apply my patch instead.  I think this is a fair compromise since 6 Java classes 
doesn't quite compromise as "dead weight", especially if those 6 classes 
provide direct end-user value.  While I would certain agree, UIMA has a steep 
learning curve, there are folks out there that are using it, and removing it 
entirely from the Solr dist is likely to do a disservice to those folks who are 
in-fact doing text analytics using it.

 

All that being said, I think the only thing that really remains is a clean-up 
of the documentation and examples.  I'm happy to do that over the next couple 
weeks if we agree to this strategy.

 

Thanks so much.

 

> UIMA enhancements to allow for dynamic AE detection
> ---
>
> Key: SOLR-12789
> URL: https://issues.apache.org/jira/browse/SOLR-12789
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>  Components: contrib - UIMA
>Affects Versions: 6.0
>Reporter: Aaron LaBella
>Priority: Major
>  Labels: ready-to-commit
> Attachments: SOLR-12789-1.patch, SOLR-12789-2.patch, 
> SOLR-12789-3.patch, SOLR-12789-4.patch
>
>
> I've been sitting on this patch for over 2 years (and likewise it's been 
> running IN production for the same) ... finally got around to contributing it 
> back to the community.  This change prepares the UIMAUpdateRequestProcessor 
> to allow subclasses to have additional control over how the analysis engine 
> is selected.  In my case, I wrote a sub-class that allows for *dynamic* 
> detection of the UIMA analysis engine based on the document fields.  ie: a 
> field in the document can be used to select different UIMA configurations and 
> rules.
>  
> Can someone please commit this as soon as possible.  I don't necessarily need 
> it to be back-ported, having in 7.4.1 would suffice.
> Thanks!



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-12789) UIMA enhancements to allow for dynamic AE detection

2018-09-24 Thread Aaron LaBella (JIRA)


 [ 
https://issues.apache.org/jira/browse/SOLR-12789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron LaBella updated SOLR-12789:
-
Attachment: SOLR-12789-4.patch

> UIMA enhancements to allow for dynamic AE detection
> ---
>
> Key: SOLR-12789
> URL: https://issues.apache.org/jira/browse/SOLR-12789
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>  Components: contrib - UIMA
>Affects Versions: 6.0
>Reporter: Aaron LaBella
>Priority: Major
>  Labels: ready-to-commit
> Attachments: SOLR-12789-1.patch, SOLR-12789-2.patch, 
> SOLR-12789-3.patch, SOLR-12789-4.patch
>
>
> I've been sitting on this patch for over 2 years (and likewise it's been 
> running IN production for the same) ... finally got around to contributing it 
> back to the community.  This change prepares the UIMAUpdateRequestProcessor 
> to allow subclasses to have additional control over how the analysis engine 
> is selected.  In my case, I wrote a sub-class that allows for *dynamic* 
> detection of the UIMA analysis engine based on the document fields.  ie: a 
> field in the document can be used to select different UIMA configurations and 
> rules.
>  
> Can someone please commit this as soon as possible.  I don't necessarily need 
> it to be back-ported, having in 7.4.1 would suffice.
> Thanks!



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-11694) Remove extremely outdated UIMA contrib module

2018-09-21 Thread Aaron LaBella (JIRA)


[ 
https://issues.apache.org/jira/browse/SOLR-11694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16623739#comment-16623739
 ] 

Aaron LaBella commented on SOLR-11694:
--

I'm confused... since when does a feature require the "committers" of a project 
to actually use it.  I thought the point of open source and a product / 
features overall was to provide value to the actual "users" of it.  And for 
what it's worth, I've personally provided a few patches to te Solr UIMA project 
over the past couple years.  I think there is a misconception to it "rotting" 
away.  There was nothing to upgrade or maintain because the feature works 
exactly as it was designed and written to work.  Again, I've never heard of a 
feature being completely removed like this when you have real users out there 
that are dependent upon it.

> Remove extremely outdated UIMA contrib module
> -
>
> Key: SOLR-11694
> URL: https://issues.apache.org/jira/browse/SOLR-11694
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>  Components: contrib - UIMA
>Reporter: Cassandra Targett
>Assignee: Alexandre Rafalovitch
>Priority: Major
> Fix For: 7.5, master (8.0)
>
> Attachments: SOLR-11694.patch
>
>
> A user on the [solr-user mailing list back in 
> June|https://mail-archives.apache.org/mod_mbox/lucene-solr-user/201706.mbox/%3CCANsk%2BC_PvZJ38AQ2VfzKRYSQn6c8b33kGvaXxR3qNS3GQ4VUKA%40mail.gmail.com%3E]
>  brought up the fact that IBM has bought Alchemy and keys are no longer 
> available to use Solr's UIMA contrib.
> Someone recently made a [similar 
> comment|https://lucene.apache.org/solr/guide/7_1/uima-integration.html#comment_7174]
>  to the Solr Ref Guide page and asking for a patch.
> I know next to nothing about UIMA, but figured it's worth an issue to 
> determine what to do here. I think folks are saying it's no longer usable? Or 
> maybe only usable by people who already have keys (which will possibly expire 
> at some point)?
> Anyone have an idea what needs to be done here? It seems we should have some 
> kind of answer, but if it's no longer usable perhaps we should retire the 
> contrib.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-12789) UIMA enhancements to allow for dynamic AE detection

2018-09-21 Thread Aaron LaBella (JIRA)


[ 
https://issues.apache.org/jira/browse/SOLR-12789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16623732#comment-16623732
 ] 

Aaron LaBella commented on SOLR-12789:
--

Thanks Alexandre ... I added comments in SOLR-11694 (as you already saw).  I'll 
need to discuss with my IBM colleagues and see what we want to do going 
forward.  Like I said, this is a feature we've built production applications on 
top of and removing it is just plain "not cool".  There were other ways it 
could've been cleaned up and maintained without removing it.

As a whole, I'm really disappointed in this decision from the Solr community, 
as I never heard of pulling out a function that works absolutely fine.  The 
reason that I would say its not "maintained" is that there's nothing wrong with 
the current feature.  If anything, it could be upgraded to a later UIMAJ but 
even that is easily worked around.  I really see this as a big step backwards 
for Solr with regards to text analytics.

> UIMA enhancements to allow for dynamic AE detection
> ---
>
> Key: SOLR-12789
> URL: https://issues.apache.org/jira/browse/SOLR-12789
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>  Components: contrib - UIMA
>Affects Versions: 6.0
>Reporter: Aaron LaBella
>Priority: Major
>  Labels: ready-to-commit
> Attachments: SOLR-12789-1.patch, SOLR-12789-2.patch, 
> SOLR-12789-3.patch
>
>
> I've been sitting on this patch for over 2 years (and likewise it's been 
> running IN production for the same) ... finally got around to contributing it 
> back to the community.  This change prepares the UIMAUpdateRequestProcessor 
> to allow subclasses to have additional control over how the analysis engine 
> is selected.  In my case, I wrote a sub-class that allows for *dynamic* 
> detection of the UIMA analysis engine based on the document fields.  ie: a 
> field in the document can be used to select different UIMA configurations and 
> rules.
>  
> Can someone please commit this as soon as possible.  I don't necessarily need 
> it to be back-ported, having in 7.4.1 would suffice.
> Thanks!



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-11694) Remove extremely outdated UIMA contrib module

2018-09-20 Thread Aaron LaBella (JIRA)


[ 
https://issues.apache.org/jira/browse/SOLR-11694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16622839#comment-16622839
 ] 

Aaron LaBella commented on SOLR-11694:
--

Apace UIMA 3.0 is in alpha state, and it'll probably be a long time before a.) 
its an actual 3.0 release and b.) anyone out there actually adopts it.  2.10.2 
is the latest and its from February of 2018.  I'm really not sure why we're 
pulling a feature out of Solr that has to be used out there by several people.  
What harm does it have to keep it there?  We're saving 7MB of space??  I don't 
actively follow the Solr mailing list and I'm not sure just because no one 
responded in ~6 months that you can safely assume that the entire community 
doesn't use UIMA.  I feel strongly that this is not the right decision here.  
If you want to clean up, I'd agree, remove the AlchemyAPI, OpenCalais, and 
Tagger.  Upgrade uima-core 2.3.1 to 2.10.2 and keep the rest of the Solr/UIMA 
classes – they work just fine.  There's no reason to pull this contrib project.

> Remove extremely outdated UIMA contrib module
> -
>
> Key: SOLR-11694
> URL: https://issues.apache.org/jira/browse/SOLR-11694
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>  Components: contrib - UIMA
>Reporter: Cassandra Targett
>Assignee: Alexandre Rafalovitch
>Priority: Major
> Fix For: 7.5, master (8.0)
>
> Attachments: SOLR-11694.patch
>
>
> A user on the [solr-user mailing list back in 
> June|https://mail-archives.apache.org/mod_mbox/lucene-solr-user/201706.mbox/%3CCANsk%2BC_PvZJ38AQ2VfzKRYSQn6c8b33kGvaXxR3qNS3GQ4VUKA%40mail.gmail.com%3E]
>  brought up the fact that IBM has bought Alchemy and keys are no longer 
> available to use Solr's UIMA contrib.
> Someone recently made a [similar 
> comment|https://lucene.apache.org/solr/guide/7_1/uima-integration.html#comment_7174]
>  to the Solr Ref Guide page and asking for a patch.
> I know next to nothing about UIMA, but figured it's worth an issue to 
> determine what to do here. I think folks are saying it's no longer usable? Or 
> maybe only usable by people who already have keys (which will possibly expire 
> at some point)?
> Anyone have an idea what needs to be done here? It seems we should have some 
> kind of answer, but if it's no longer usable perhaps we should retire the 
> contrib.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Comment Edited] (SOLR-11694) Remove extremely outdated UIMA contrib module

2018-09-20 Thread Aaron LaBella (JIRA)


[ 
https://issues.apache.org/jira/browse/SOLR-11694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16622539#comment-16622539
 ] 

Aaron LaBella edited comment on SOLR-11694 at 9/20/18 7:08 PM:
---

Hi Team,

 

_*How do we undo this commit/decision?*_

 

We use UIMA and SOLR +*every day*+ in production, and this decision will 
effectively freeze us on the last working Solr / UIMA version of 7.4.0.  Surely 
there are other folks out there doing text analytics with even older versions 
of UIMA and wish to use the latest and greatest solr?

 

Aso, even though SOLR/LUCENE "ship" with UIMA 2.3.1, I've been using UIMA 2.8.1 
for 2+ years by simply dropping in the more up-to-date JAR in the core-specific 
"lib" directory.  I'd have to look through all the lucene stuff again, but, I'm 
quite certain the lucene support for UIMA is still active as well, no?


was (Author: aaronlab):
Hi Team,

How do we undo this commit/decision?

We use UIMA and SOLR +*every day*+ in production, and this decision will 
effectively freeze us on the last working Solr / UIMA version of 7.4.0.  Surely 
there are other folks out there doing text analytics with even older versions 
of UIMA and wish to use the latest and greatest solr?

 

ALso, even though SOLR/LUCENE "ship" with UIMA 2.3.1, I've been using UIMA 
2.8.1 for 2+ years by simply dropping in the more up-to-date JAR in the 
core-specific "lib" directory.  I'd have to look through all the lucene stuff 
again, but, I'm quite certain the lucene support for UIMA is still active as 
well, no?

> Remove extremely outdated UIMA contrib module
> -
>
> Key: SOLR-11694
> URL: https://issues.apache.org/jira/browse/SOLR-11694
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>  Components: contrib - UIMA
>Reporter: Cassandra Targett
>Assignee: Alexandre Rafalovitch
>Priority: Major
> Fix For: 7.5, master (8.0)
>
> Attachments: SOLR-11694.patch
>
>
> A user on the [solr-user mailing list back in 
> June|https://mail-archives.apache.org/mod_mbox/lucene-solr-user/201706.mbox/%3CCANsk%2BC_PvZJ38AQ2VfzKRYSQn6c8b33kGvaXxR3qNS3GQ4VUKA%40mail.gmail.com%3E]
>  brought up the fact that IBM has bought Alchemy and keys are no longer 
> available to use Solr's UIMA contrib.
> Someone recently made a [similar 
> comment|https://lucene.apache.org/solr/guide/7_1/uima-integration.html#comment_7174]
>  to the Solr Ref Guide page and asking for a patch.
> I know next to nothing about UIMA, but figured it's worth an issue to 
> determine what to do here. I think folks are saying it's no longer usable? Or 
> maybe only usable by people who already have keys (which will possibly expire 
> at some point)?
> Anyone have an idea what needs to be done here? It seems we should have some 
> kind of answer, but if it's no longer usable perhaps we should retire the 
> contrib.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Comment Edited] (SOLR-11694) Remove extremely outdated UIMA contrib module

2018-09-20 Thread Aaron LaBella (JIRA)


[ 
https://issues.apache.org/jira/browse/SOLR-11694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16622539#comment-16622539
 ] 

Aaron LaBella edited comment on SOLR-11694 at 9/20/18 7:08 PM:
---

Hi Team,

How do we undo this commit/decision?

We use UIMA and SOLR +*every day*+ in production, and this decision will 
effectively freeze us on the last working Solr / UIMA version of 7.4.0.  Surely 
there are other folks out there doing text analytics with even older versions 
of UIMA and wish to use the latest and greatest solr?

 

ALso, even though SOLR/LUCENE "ship" with UIMA 2.3.1, I've been using UIMA 
2.8.1 for 2+ years by simply dropping in the more up-to-date JAR in the 
core-specific "lib" directory.  I'd have to look through all the lucene stuff 
again, but, I'm quite certain the lucene support for UIMA is still active as 
well, no?


was (Author: aaronlab):
Hi Team,

How do we undo this commit/decision?

We use UIMA and SOLR +*every day*+ in production, and this decision will 
effectively freeze us on the last working Solr / UIMA version of 7.4.0.  Surely 
there are other folks out there doing text analytics with even older versions 
of UIMA and wish to use the latest and greatest solr?

> Remove extremely outdated UIMA contrib module
> -
>
> Key: SOLR-11694
> URL: https://issues.apache.org/jira/browse/SOLR-11694
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>  Components: contrib - UIMA
>Reporter: Cassandra Targett
>Assignee: Alexandre Rafalovitch
>Priority: Major
> Fix For: 7.5, master (8.0)
>
> Attachments: SOLR-11694.patch
>
>
> A user on the [solr-user mailing list back in 
> June|https://mail-archives.apache.org/mod_mbox/lucene-solr-user/201706.mbox/%3CCANsk%2BC_PvZJ38AQ2VfzKRYSQn6c8b33kGvaXxR3qNS3GQ4VUKA%40mail.gmail.com%3E]
>  brought up the fact that IBM has bought Alchemy and keys are no longer 
> available to use Solr's UIMA contrib.
> Someone recently made a [similar 
> comment|https://lucene.apache.org/solr/guide/7_1/uima-integration.html#comment_7174]
>  to the Solr Ref Guide page and asking for a patch.
> I know next to nothing about UIMA, but figured it's worth an issue to 
> determine what to do here. I think folks are saying it's no longer usable? Or 
> maybe only usable by people who already have keys (which will possibly expire 
> at some point)?
> Anyone have an idea what needs to be done here? It seems we should have some 
> kind of answer, but if it's no longer usable perhaps we should retire the 
> contrib.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-11694) Remove extremely outdated UIMA contrib module

2018-09-20 Thread Aaron LaBella (JIRA)


[ 
https://issues.apache.org/jira/browse/SOLR-11694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16622539#comment-16622539
 ] 

Aaron LaBella commented on SOLR-11694:
--

Hi Team,

How do we undo this commit/decision?

We use UIMA and SOLR +*every day*+ in production, and this decision will 
effectively freeze us on the last working Solr / UIMA version of 7.4.0.  Surely 
there are other folks out there doing text analytics with even older versions 
of UIMA and wish to use the latest and greatest solr?

> Remove extremely outdated UIMA contrib module
> -
>
> Key: SOLR-11694
> URL: https://issues.apache.org/jira/browse/SOLR-11694
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>  Components: contrib - UIMA
>Reporter: Cassandra Targett
>Assignee: Alexandre Rafalovitch
>Priority: Major
> Fix For: 7.5, master (8.0)
>
> Attachments: SOLR-11694.patch
>
>
> A user on the [solr-user mailing list back in 
> June|https://mail-archives.apache.org/mod_mbox/lucene-solr-user/201706.mbox/%3CCANsk%2BC_PvZJ38AQ2VfzKRYSQn6c8b33kGvaXxR3qNS3GQ4VUKA%40mail.gmail.com%3E]
>  brought up the fact that IBM has bought Alchemy and keys are no longer 
> available to use Solr's UIMA contrib.
> Someone recently made a [similar 
> comment|https://lucene.apache.org/solr/guide/7_1/uima-integration.html#comment_7174]
>  to the Solr Ref Guide page and asking for a patch.
> I know next to nothing about UIMA, but figured it's worth an issue to 
> determine what to do here. I think folks are saying it's no longer usable? Or 
> maybe only usable by people who already have keys (which will possibly expire 
> at some point)?
> Anyone have an idea what needs to be done here? It seems we should have some 
> kind of answer, but if it's no longer usable perhaps we should retire the 
> contrib.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-12789) UIMA enhancements to allow for dynamic AE detection

2018-09-20 Thread Aaron LaBella (JIRA)


 [ 
https://issues.apache.org/jira/browse/SOLR-12789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron LaBella updated SOLR-12789:
-
Priority: Major  (was: Blocker)

> UIMA enhancements to allow for dynamic AE detection
> ---
>
> Key: SOLR-12789
> URL: https://issues.apache.org/jira/browse/SOLR-12789
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>  Components: contrib - UIMA
>Affects Versions: 6.0
>Reporter: Aaron LaBella
>Priority: Major
>  Labels: ready-to-commit
> Attachments: SOLR-12789-1.patch, SOLR-12789-2.patch, 
> SOLR-12789-3.patch
>
>
> I've been sitting on this patch for over 2 years (and likewise it's been 
> running IN production for the same) ... finally got around to contributing it 
> back to the community.  This change prepares the UIMAUpdateRequestProcessor 
> to allow subclasses to have additional control over how the analysis engine 
> is selected.  In my case, I wrote a sub-class that allows for *dynamic* 
> detection of the UIMA analysis engine based on the document fields.  ie: a 
> field in the document can be used to select different UIMA configurations and 
> rules.
>  
> Can someone please commit this as soon as possible.  I don't necessarily need 
> it to be back-ported, having in 7.4.1 would suffice.
> Thanks!



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-12789) UIMA enhancements to allow for dynamic AE detection

2018-09-20 Thread Aaron LaBella (JIRA)


 [ 
https://issues.apache.org/jira/browse/SOLR-12789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron LaBella updated SOLR-12789:
-
Description: 
I've been sitting on this patch for over 2 years (and likewise it's been 
running IN production for the same) ... finally got around to contributing it 
back to the community.  This change prepares the UIMAUpdateRequestProcessor to 
allow subclasses to have additional control over how the analysis engine is 
selected.  In my case, I wrote a sub-class that allows for *dynamic* detection 
of the UIMA analysis engine based on the document fields.  ie: a field in the 
document can be used to select different UIMA configurations and rules.

 

Can someone please commit this as soon as possible.  I don't necessarily need 
it to be back-ported, having in 7.4.1 would suffice.

Thanks!

  was:
I've been sitting on this patch for over 2 years (and likewise it's been 
running IN production for the same) ... finally got around to contributing it 
back to the community.  This change prepares the UIMAUpdateRequestProcessor to 
allow subclasses to have additional control over how the analysis engine is 
selected.  In my case, I wrote a sub-class that allows for *dynamic* detection 
of the UIMA analysis engine based on the document fields.  ie: a field in the 
document can be used to select different UIMA configurations and rules.

 

Can someone please commit this as soon as possible.

Thanks!


> UIMA enhancements to allow for dynamic AE detection
> ---
>
> Key: SOLR-12789
> URL: https://issues.apache.org/jira/browse/SOLR-12789
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>  Components: contrib - UIMA
>Affects Versions: 6.0
>Reporter: Aaron LaBella
>Priority: Blocker
>  Labels: ready-to-commit
> Attachments: SOLR-12789-1.patch, SOLR-12789-2.patch, 
> SOLR-12789-3.patch
>
>
> I've been sitting on this patch for over 2 years (and likewise it's been 
> running IN production for the same) ... finally got around to contributing it 
> back to the community.  This change prepares the UIMAUpdateRequestProcessor 
> to allow subclasses to have additional control over how the analysis engine 
> is selected.  In my case, I wrote a sub-class that allows for *dynamic* 
> detection of the UIMA analysis engine based on the document fields.  ie: a 
> field in the document can be used to select different UIMA configurations and 
> rules.
>  
> Can someone please commit this as soon as possible.  I don't necessarily need 
> it to be back-ported, having in 7.4.1 would suffice.
> Thanks!



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-12790) move ServerWrapper logic in LBHttpSolrClient to a method to allow subclasses override. expose other fields as protected

2018-09-20 Thread Aaron LaBella (JIRA)


 [ 
https://issues.apache.org/jira/browse/SOLR-12790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron LaBella updated SOLR-12790:
-
Description: 
moved ServerWrapper logic in LBHttpSolrClient.request(...) to a method to allow 
subclasses to override.  expose other fields as protected.  This allows 
sub-classes to apply different logic other than round-robin, when using the 
LBHttpSolrClient class.  In my case, we wish to use an algorithm that *only* 
uses the first / primary server unless it is unavailable, and then it uses the 
secondary server(s).

 

Can someone please commit this as soon as possible.  I don't necessarily need 
it to be back-ported, having in 7.4.1 would suffice.

Thanks!

  was:moved ServerWrapper logic in LBHttpSolrClient.request(...) to a method to 
allow subclasses to override.  expose other fields as protected.  This allows 
sub-classes to apply different logic other than round-robin, when using the 
LBHttpSolrClient class.  In my case, we wish to use an algorithm that *only* 
uses the first / primary server unless it is unavailable, and then it uses the 
secondary server(s).


> move ServerWrapper logic in LBHttpSolrClient to a method to allow subclasses 
> override.  expose other fields as protected
> 
>
> Key: SOLR-12790
> URL: https://issues.apache.org/jira/browse/SOLR-12790
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>  Components: SolrJ
>Affects Versions: 5.0
>Reporter: Aaron LaBella
>Priority: Blocker
>  Labels: ready-to-commit
> Attachments: SOLR-12790-1.patch
>
>
> moved ServerWrapper logic in LBHttpSolrClient.request(...) to a method to 
> allow subclasses to override.  expose other fields as protected.  This allows 
> sub-classes to apply different logic other than round-robin, when using the 
> LBHttpSolrClient class.  In my case, we wish to use an algorithm that *only* 
> uses the first / primary server unless it is unavailable, and then it uses 
> the secondary server(s).
>  
> Can someone please commit this as soon as possible.  I don't necessarily need 
> it to be back-ported, having in 7.4.1 would suffice.
> Thanks!



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-12790) move ServerWrapper logic in LBHttpSolrClient to a method to allow subclasses override. expose other fields as protected

2018-09-20 Thread Aaron LaBella (JIRA)


 [ 
https://issues.apache.org/jira/browse/SOLR-12790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron LaBella updated SOLR-12790:
-
Description: moved ServerWrapper logic in LBHttpSolrClient.request(...) to 
a method to allow subclasses to override.  expose other fields as protected.  
This allows sub-classes to apply different logic other than round-robin, when 
using the LBHttpSolrClient class.  In my case, we wish to use an algorithm that 
*only* uses the first / primary server unless it is unavailable, and then it 
uses the secondary server(s).  (was: moved ServerWrapper logic in 
LBHttpSolrClient.request(...) to a method to allow subclasses override.  expose 
other fields as protected.  This allows sub-classes to apply different logic 
other than round-robin, when using the LBHttpSolrClient class.  In my case, we 
wish to use an algorithm that *only* uses the first / primary server unless it 
is unavailable, and then it uses the secondary server(s).)

> move ServerWrapper logic in LBHttpSolrClient to a method to allow subclasses 
> override.  expose other fields as protected
> 
>
> Key: SOLR-12790
> URL: https://issues.apache.org/jira/browse/SOLR-12790
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>  Components: SolrJ
>Affects Versions: 5.0
>Reporter: Aaron LaBella
>Priority: Blocker
>  Labels: ready-to-commit
> Attachments: SOLR-12790-1.patch
>
>
> moved ServerWrapper logic in LBHttpSolrClient.request(...) to a method to 
> allow subclasses to override.  expose other fields as protected.  This allows 
> sub-classes to apply different logic other than round-robin, when using the 
> LBHttpSolrClient class.  In my case, we wish to use an algorithm that *only* 
> uses the first / primary server unless it is unavailable, and then it uses 
> the secondary server(s).



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-12790) move ServerWrapper logic in LBHttpSolrClient to a method to allow subclasses override. expose other fields as protected

2018-09-20 Thread Aaron LaBella (JIRA)


 [ 
https://issues.apache.org/jira/browse/SOLR-12790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron LaBella updated SOLR-12790:
-
Attachment: SOLR-12790-1.patch

> move ServerWrapper logic in LBHttpSolrClient to a method to allow subclasses 
> override.  expose other fields as protected
> 
>
> Key: SOLR-12790
> URL: https://issues.apache.org/jira/browse/SOLR-12790
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>  Components: SolrJ
>Affects Versions: 5.0
>Reporter: Aaron LaBella
>Priority: Blocker
>  Labels: ready-to-commit
> Attachments: SOLR-12790-1.patch
>
>
> moved ServerWrapper logic in LBHttpSolrClient.request(...) to a method to 
> allow subclasses override.  expose other fields as protected.  This allows 
> sub-classes to apply different logic other than round-robin, when using the 
> LBHttpSolrClient class.  In my case, we wish to use an algorithm that *only* 
> uses the first / primary server unless it is unavailable, and then it uses 
> the secondary server(s).



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (SOLR-12790) move ServerWrapper logic in LBHttpSolrClient to a method to allow subclasses override. expose other fields as protected

2018-09-20 Thread Aaron LaBella (JIRA)
Aaron LaBella created SOLR-12790:


 Summary: move ServerWrapper logic in LBHttpSolrClient to a method 
to allow subclasses override.  expose other fields as protected
 Key: SOLR-12790
 URL: https://issues.apache.org/jira/browse/SOLR-12790
 Project: Solr
  Issue Type: Improvement
  Security Level: Public (Default Security Level. Issues are Public)
  Components: SolrJ
Affects Versions: 5.0
Reporter: Aaron LaBella
 Attachments: SOLR-12790-1.patch

moved ServerWrapper logic in LBHttpSolrClient.request(...) to a method to allow 
subclasses override.  expose other fields as protected.  This allows 
sub-classes to apply different logic other than round-robin, when using the 
LBHttpSolrClient class.  In my case, we wish to use an algorithm that *only* 
uses the first / primary server unless it is unavailable, and then it uses the 
secondary server(s).



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-12789) UIMA enhancements to allow for dynamic AE detection

2018-09-20 Thread Aaron LaBella (JIRA)


 [ 
https://issues.apache.org/jira/browse/SOLR-12789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron LaBella updated SOLR-12789:
-
Description: 
I've been sitting on this patch for over 2 years (and likewise it's been 
running IN production for the same) ... finally got around to contributing it 
back to the community.  This change prepares the UIMAUpdateRequestProcessor to 
allow subclasses to have additional control over how the analysis engine is 
selected.  In my case, I wrote a sub-class that allows for *dynamic* detection 
of the UIMA analysis engine based on the document fields.  ie: a field in the 
document can be used to select different UIMA configurations and rules.

 

Can someone please commit this as soon as possible.

Thanks!

  was:
I've been sitting on this patch for 2 years (and likewise it's been running 
production for the same) ... finally got around to contributing it back to the 
community.  This change prepares the UIMAUpdateRequestProcessor to allow 
subclasses to have additional control over hhow the analysis engine is 
selected.  In my case, I wrote a sub-class that allows for *dynamic* detection 
of the UIMA analysis engine based on the document fields.  ie: a field in the 
document can be used to select different UIMA configurations and rules.

 

Can someone please commit this as soon as possible.

Thanks!


> UIMA enhancements to allow for dynamic AE detection
> ---
>
> Key: SOLR-12789
> URL: https://issues.apache.org/jira/browse/SOLR-12789
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>  Components: contrib - UIMA
>Affects Versions: 6.0
>Reporter: Aaron LaBella
>Priority: Blocker
>  Labels: ready-to-commit
> Attachments: SOLR-12789-1.patch, SOLR-12789-2.patch, 
> SOLR-12789-3.patch
>
>
> I've been sitting on this patch for over 2 years (and likewise it's been 
> running IN production for the same) ... finally got around to contributing it 
> back to the community.  This change prepares the UIMAUpdateRequestProcessor 
> to allow subclasses to have additional control over how the analysis engine 
> is selected.  In my case, I wrote a sub-class that allows for *dynamic* 
> detection of the UIMA analysis engine based on the document fields.  ie: a 
> field in the document can be used to select different UIMA configurations and 
> rules.
>  
> Can someone please commit this as soon as possible.
> Thanks!



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-12789) UIMA enhancements to allow for dynamic AE detection

2018-09-20 Thread Aaron LaBella (JIRA)


[ 
https://issues.apache.org/jira/browse/SOLR-12789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16622309#comment-16622309
 ] 

Aaron LaBella commented on SOLR-12789:
--

There is also a patch in here to add proper support for UIMA feature values 
which are arrays to map to multivalued fields

> UIMA enhancements to allow for dynamic AE detection
> ---
>
> Key: SOLR-12789
> URL: https://issues.apache.org/jira/browse/SOLR-12789
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>  Components: contrib - UIMA
>Affects Versions: 6.0
>Reporter: Aaron LaBella
>Priority: Blocker
>  Labels: ready-to-commit
> Attachments: SOLR-12789-1.patch, SOLR-12789-2.patch, 
> SOLR-12789-3.patch
>
>
> I've been sitting on this patch for 2 years (and likewise it's been running 
> production for the same) ... finally got around to contributing it back to 
> the community.  This change prepares the UIMAUpdateRequestProcessor to allow 
> subclasses to have additional control over hhow the analysis engine is 
> selected.  In my case, I wrote a sub-class that allows for *dynamic* 
> detection of the UIMA analysis engine based on the document fields.  ie: a 
> field in the document can be used to select different UIMA configurations and 
> rules.
>  
> Can someone please commit this as soon as possible.
> Thanks!



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-12789) UIMA enhancements to allow for dynamic AE detection

2018-09-20 Thread Aaron LaBella (JIRA)


 [ 
https://issues.apache.org/jira/browse/SOLR-12789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron LaBella updated SOLR-12789:
-
Attachment: SOLR-12789-3.patch

> UIMA enhancements to allow for dynamic AE detection
> ---
>
> Key: SOLR-12789
> URL: https://issues.apache.org/jira/browse/SOLR-12789
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>  Components: contrib - UIMA
>Affects Versions: 6.0
>Reporter: Aaron LaBella
>Priority: Blocker
>  Labels: ready-to-commit
> Attachments: SOLR-12789-1.patch, SOLR-12789-2.patch, 
> SOLR-12789-3.patch
>
>
> I've been sitting on this patch for 2 years (and likewise it's been running 
> production for the same) ... finally got around to contributing it back to 
> the community.  This change prepares the UIMAUpdateRequestProcessor to allow 
> subclasses to have additional control over hhow the analysis engine is 
> selected.  In my case, I wrote a sub-class that allows for *dynamic* 
> detection of the UIMA analysis engine based on the document fields.  ie: a 
> field in the document can be used to select different UIMA configurations and 
> rules.
>  
> Can someone please commit this as soon as possible.
> Thanks!



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-12789) UIMA enhancements to allow for dynamic AE detection

2018-09-20 Thread Aaron LaBella (JIRA)


 [ 
https://issues.apache.org/jira/browse/SOLR-12789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron LaBella updated SOLR-12789:
-
Attachment: SOLR-12789-1.patch

> UIMA enhancements to allow for dynamic AE detection
> ---
>
> Key: SOLR-12789
> URL: https://issues.apache.org/jira/browse/SOLR-12789
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>  Components: contrib - UIMA
>Affects Versions: 6.0
>Reporter: Aaron LaBella
>Priority: Blocker
>  Labels: ready-to-commit
> Attachments: SOLR-12789-1.patch, SOLR-12789-2.patch, 
> SOLR-12789-3.patch
>
>
> I've been sitting on this patch for 2 years (and likewise it's been running 
> production for the same) ... finally got around to contributing it back to 
> the community.  This change prepares the UIMAUpdateRequestProcessor to allow 
> subclasses to have additional control over hhow the analysis engine is 
> selected.  In my case, I wrote a sub-class that allows for *dynamic* 
> detection of the UIMA analysis engine based on the document fields.  ie: a 
> field in the document can be used to select different UIMA configurations and 
> rules.
>  
> Can someone please commit this as soon as possible.
> Thanks!



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-12789) UIMA enhancements to allow for dynamic AE detection

2018-09-20 Thread Aaron LaBella (JIRA)


 [ 
https://issues.apache.org/jira/browse/SOLR-12789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron LaBella updated SOLR-12789:
-
Attachment: SOLR-12789-2.patch

> UIMA enhancements to allow for dynamic AE detection
> ---
>
> Key: SOLR-12789
> URL: https://issues.apache.org/jira/browse/SOLR-12789
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>  Components: contrib - UIMA
>Affects Versions: 6.0
>Reporter: Aaron LaBella
>Priority: Blocker
>  Labels: ready-to-commit
> Attachments: SOLR-12789-1.patch, SOLR-12789-2.patch, 
> SOLR-12789-3.patch
>
>
> I've been sitting on this patch for 2 years (and likewise it's been running 
> production for the same) ... finally got around to contributing it back to 
> the community.  This change prepares the UIMAUpdateRequestProcessor to allow 
> subclasses to have additional control over hhow the analysis engine is 
> selected.  In my case, I wrote a sub-class that allows for *dynamic* 
> detection of the UIMA analysis engine based on the document fields.  ie: a 
> field in the document can be used to select different UIMA configurations and 
> rules.
>  
> Can someone please commit this as soon as possible.
> Thanks!



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (SOLR-12789) UIMA enhancements to allow for dynamic AE detection

2018-09-20 Thread Aaron LaBella (JIRA)
Aaron LaBella created SOLR-12789:


 Summary: UIMA enhancements to allow for dynamic AE detection
 Key: SOLR-12789
 URL: https://issues.apache.org/jira/browse/SOLR-12789
 Project: Solr
  Issue Type: Improvement
  Security Level: Public (Default Security Level. Issues are Public)
  Components: contrib - UIMA
Affects Versions: 6.0
Reporter: Aaron LaBella


I've been sitting on this patch for 2 years (and likewise it's been running 
production for the same) ... finally got around to contributing it back to the 
community.  This change prepares the UIMAUpdateRequestProcessor to allow 
subclasses to have additional control over hhow the analysis engine is 
selected.  In my case, I wrote a sub-class that allows for *dynamic* detection 
of the UIMA analysis engine based on the document fields.  ie: a field in the 
document can be used to select different UIMA configurations and rules.

 

Can someone please commit this as soon as possible.

Thanks!



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-8548) CorePropertiesLocator does NOT follow symlinks anymore

2016-01-15 Thread Aaron LaBella (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8548?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15101819#comment-15101819
 ] 

Aaron LaBella commented on SOLR-8548:
-

Alan, no particular reason other than I think it's silly to have the code check 
an unlimited number of directories/depth for the core.properties file.  I think 
2 or 3 at most would be a reasonable limit, not sure why anyone would want to 
bury their solr cores anymore beyond that.  Feel free to change that particular 
setting to whatever you feel is right.

Thanks.

> CorePropertiesLocator does NOT follow symlinks anymore
> --
>
> Key: SOLR-8548
> URL: https://issues.apache.org/jira/browse/SOLR-8548
> Project: Solr
>  Issue Type: Bug
>Affects Versions: 5.4
>Reporter: Aaron LaBella
>Assignee: Alan Woodward
>Priority: Blocker
> Attachments: SOLR-8548.patch
>
>
> Looks like CorePropertiesLocator was switched over to use the new java.nio 
> stuff... and the simplified version of the walkFileTree method does NOT 
> follow symlinks by default.  This is a critical feature that needs to be 
> supported for anyone who has their solr core(s) defined elsewhere, and set up 
> their solr home directory to point to those cores via symlinks.  It's 
> important to realize that this is the behavior that has always existed 
> pre-5.4.0 by the nature of using the java.io package to listFiles.
> I'm attaching a PATCH with the appropriate fix.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-8548) CorePropertiesLocator does NOT follow symlinks anymore

2016-01-14 Thread Aaron LaBella (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-8548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron LaBella updated SOLR-8548:

Description: 
Looks like CorePropertiesLocator was switched over to use the new java.nio 
stuff... and the simplified version of the walkFileTree method does NOT follow 
symlinks by default.  This is a critical feature that needs to be supported for 
anyone who has their solr core(s) defined elsewhere, and set up their solr home 
directory to point to those cores via symlinks.  It's important to realize that 
this is the behavior that has always existed pre-5.4.0 by the nature of using 
the java.io package to listFiles.

I'm attaching a PATCH with the appropriate fix.

  was:
Looks like CorePropertiesLocator was switched over to use the new java.nio 
stuff... and the simplified version of the walkFileTree method does NOT follow 
symlinks by default.  This is a critical feature that needs to be supported for 
anyone who has their solr core defined elsewhere, and set up their solr home 
directory to point to those cores via symlinks.  It's important to realize that 
this is the behavior that as always existed pre-5.4.0 by the nature of using 
the java.io package to listFiles.

I'm attaching a PATCH with the appropriate fix.


> CorePropertiesLocator does NOT follow symlinks anymore
> --
>
> Key: SOLR-8548
> URL: https://issues.apache.org/jira/browse/SOLR-8548
> Project: Solr
>  Issue Type: Bug
>Affects Versions: 5.4
>Reporter: Aaron LaBella
>Priority: Blocker
> Attachments: SOLR-8548.patch
>
>
> Looks like CorePropertiesLocator was switched over to use the new java.nio 
> stuff... and the simplified version of the walkFileTree method does NOT 
> follow symlinks by default.  This is a critical feature that needs to be 
> supported for anyone who has their solr core(s) defined elsewhere, and set up 
> their solr home directory to point to those cores via symlinks.  It's 
> important to realize that this is the behavior that has always existed 
> pre-5.4.0 by the nature of using the java.io package to listFiles.
> I'm attaching a PATCH with the appropriate fix.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-8548) CorePropertiesLocator does NOT follow symlinks anymore

2016-01-14 Thread Aaron LaBella (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-8548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron LaBella updated SOLR-8548:

Attachment: SOLR-8548.patch

patch to fix issue, please accept asap.

> CorePropertiesLocator does NOT follow symlinks anymore
> --
>
> Key: SOLR-8548
> URL: https://issues.apache.org/jira/browse/SOLR-8548
> Project: Solr
>  Issue Type: Bug
>Affects Versions: 5.4
>Reporter: Aaron LaBella
>Priority: Blocker
> Attachments: SOLR-8548.patch
>
>
> Looks like CorePropertiesLocator was switched over to use the new java.nio 
> stuff... and the simplified version of the walkFileTree method does NOT 
> follow symlinks by default.  This is a critical feature that needs to be 
> supported for anyone who has their solr core defined elsewhere, and set up 
> their solr home directory to point to those cores via symlinks.  It's 
> important to realize that this is the behavior that as always existed 
> pre-5.4.0 by the nature of using the java.io package to listFiles.
> I'm attaching a PATCH with the appropriate fix.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (SOLR-8548) CorePropertiesLocator does NOT follow symlinks anymore

2016-01-14 Thread Aaron LaBella (JIRA)
Aaron LaBella created SOLR-8548:
---

 Summary: CorePropertiesLocator does NOT follow symlinks anymore
 Key: SOLR-8548
 URL: https://issues.apache.org/jira/browse/SOLR-8548
 Project: Solr
  Issue Type: Bug
Affects Versions: 5.4
Reporter: Aaron LaBella
Priority: Blocker


Looks like CorePropertiesLocator was switched over to use the new java.nio 
stuff... and the simplified version of the walkFileTree method does NOT follow 
symlinks by default.  This is a critical feature that needs to be supported for 
anyone who has their solr core defined elsewhere, and set up their solr home 
directory to point to those cores via symlinks.  It's important to realize that 
this is the behavior that as always existed pre-5.4.0 by the nature of using 
the java.io package to listFiles.

I'm attaching a PATCH with the appropriate fix.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-7885) Add support for loading HTTP resources

2015-12-16 Thread Aaron LaBella (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-7885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15060905#comment-15060905
 ] 

Aaron LaBella commented on SOLR-7885:
-

BTW, trying to call command=reload-config=... on the DIH doesn't 
work -- since, if anything, it just updates the in memory config -- it doesn't 
persist the config to the local file system, which was half the point of the 
patch.

> Add support for loading HTTP resources
> --
>
> Key: SOLR-7885
> URL: https://issues.apache.org/jira/browse/SOLR-7885
> Project: Solr
>  Issue Type: Improvement
>  Components: contrib - DataImportHandler, SolrJ
>Affects Versions: 5.3
>Reporter: Aaron LaBella
> Attachments: SOLR-7885-1.patch, SOLR-7885-2.patch
>
>
> I have a need to be able to load data import handler configuration files from 
> an HTTP server instead of the local file system.  So, I modified 
> {code}org.apache.solr.core.SolrResourceLoader{code} and some of the 
> respective dataimport files in {code}org.apache.solr.handler.dataimport{code} 
> to be able to support doing this.  
> {code}solrconfig.xml{code} now has the option to define a parameter: 
> *configRemote*, and if defined (and it's an HTTP(s) URL), it'll attempt to 
> load the resource.  If successfully, it'll also persist the resource to the 
> local file system so that it is available on a solr server restart per chance 
> that the remote resource is currently unavailable.
> Lastly, to be consistent with the pattern that already exists in 
> SolrResourceLoader, this feature is *disabled* by default, and requires the 
> setting of an additional JVM property: 
> {code}-Dsolr.allow.http.resourceloading=true{code}.
> Please review and let me know if there is anything else that needs to be done 
> in order for this patch to make the next release.  As far as I can tell, it's 
> fully tested and ready to go.
> Thanks.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-7885) Add support for loading HTTP resources

2015-12-16 Thread Aaron LaBella (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-7885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15060901#comment-15060901
 ] 

Aaron LaBella commented on SOLR-7885:
-

I'm closing this issue as I literally just took my patch and moved it into a 
custom RequestHandler instead.
Thanks.

> Add support for loading HTTP resources
> --
>
> Key: SOLR-7885
> URL: https://issues.apache.org/jira/browse/SOLR-7885
> Project: Solr
>  Issue Type: Improvement
>  Components: contrib - DataImportHandler, SolrJ
>Affects Versions: 5.3
>Reporter: Aaron LaBella
> Attachments: SOLR-7885-1.patch, SOLR-7885-2.patch
>
>
> I have a need to be able to load data import handler configuration files from 
> an HTTP server instead of the local file system.  So, I modified 
> {code}org.apache.solr.core.SolrResourceLoader{code} and some of the 
> respective dataimport files in {code}org.apache.solr.handler.dataimport{code} 
> to be able to support doing this.  
> {code}solrconfig.xml{code} now has the option to define a parameter: 
> *configRemote*, and if defined (and it's an HTTP(s) URL), it'll attempt to 
> load the resource.  If successfully, it'll also persist the resource to the 
> local file system so that it is available on a solr server restart per chance 
> that the remote resource is currently unavailable.
> Lastly, to be consistent with the pattern that already exists in 
> SolrResourceLoader, this feature is *disabled* by default, and requires the 
> setting of an additional JVM property: 
> {code}-Dsolr.allow.http.resourceloading=true{code}.
> Please review and let me know if there is anything else that needs to be done 
> in order for this patch to make the next release.  As far as I can tell, it's 
> fully tested and ready to go.
> Thanks.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Closed] (SOLR-7885) Add support for loading HTTP resources

2015-12-16 Thread Aaron LaBella (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-7885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron LaBella closed SOLR-7885.
---
Resolution: Not A Problem

> Add support for loading HTTP resources
> --
>
> Key: SOLR-7885
> URL: https://issues.apache.org/jira/browse/SOLR-7885
> Project: Solr
>  Issue Type: Improvement
>  Components: contrib - DataImportHandler, SolrJ
>Affects Versions: 5.3
>Reporter: Aaron LaBella
> Attachments: SOLR-7885-1.patch, SOLR-7885-2.patch
>
>
> I have a need to be able to load data import handler configuration files from 
> an HTTP server instead of the local file system.  So, I modified 
> {code}org.apache.solr.core.SolrResourceLoader{code} and some of the 
> respective dataimport files in {code}org.apache.solr.handler.dataimport{code} 
> to be able to support doing this.  
> {code}solrconfig.xml{code} now has the option to define a parameter: 
> *configRemote*, and if defined (and it's an HTTP(s) URL), it'll attempt to 
> load the resource.  If successfully, it'll also persist the resource to the 
> local file system so that it is available on a solr server restart per chance 
> that the remote resource is currently unavailable.
> Lastly, to be consistent with the pattern that already exists in 
> SolrResourceLoader, this feature is *disabled* by default, and requires the 
> setting of an additional JVM property: 
> {code}-Dsolr.allow.http.resourceloading=true{code}.
> Please review and let me know if there is anything else that needs to be done 
> in order for this patch to make the next release.  As far as I can tell, it's 
> fully tested and ready to go.
> Thanks.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-7885) Add support for loading HTTP resources

2015-12-07 Thread Aaron LaBella (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-7885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15045068#comment-15045068
 ] 

Aaron LaBella commented on SOLR-7885:
-

Sure.  Our DIH configuration files are stored, protected and maintained on a 
*separate* HTTP server with an admin/editing interface.  This allows us to give 
users an actual front-end editor, make changes to the DIH, and have those 
changes reloaded in SOLR automatically -- *without* having to explicitly reload 
SOLR in any form (either at runtime or with a server restart).

I guess I didn't try reload-config because a.) I'm not really seeing any great 
documentation on this feature and b.) I'm confused if it actually *persists* an 
uploaded and/or reloaded configuration back to the local file system.  If not, 
then this is a problem because it means that if the SOLR server restarts it'll 
re-load the old configurations.

I suppose you could argue this as a "back door" to SOLR, but, it's also 
something that is disabled by default and users would have to consciously 
enable using -Dsolr.allow.http.resourceloading, assuming they are willing to 
accept the risk.

> Add support for loading HTTP resources
> --
>
> Key: SOLR-7885
> URL: https://issues.apache.org/jira/browse/SOLR-7885
> Project: Solr
>  Issue Type: Improvement
>  Components: contrib - DataImportHandler, SolrJ
>Affects Versions: 5.3
>Reporter: Aaron LaBella
> Attachments: SOLR-7885-1.patch, SOLR-7885-2.patch
>
>
> I have a need to be able to load data import handler configuration files from 
> an HTTP server instead of the local file system.  So, I modified 
> {code}org.apache.solr.core.SolrResourceLoader{code} and some of the 
> respective dataimport files in {code}org.apache.solr.handler.dataimport{code} 
> to be able to support doing this.  
> {code}solrconfig.xml{code} now has the option to define a parameter: 
> *configRemote*, and if defined (and it's an HTTP(s) URL), it'll attempt to 
> load the resource.  If successfully, it'll also persist the resource to the 
> local file system so that it is available on a solr server restart per chance 
> that the remote resource is currently unavailable.
> Lastly, to be consistent with the pattern that already exists in 
> SolrResourceLoader, this feature is *disabled* by default, and requires the 
> setting of an additional JVM property: 
> {code}-Dsolr.allow.http.resourceloading=true{code}.
> Please review and let me know if there is anything else that needs to be done 
> in order for this patch to make the next release.  As far as I can tell, it's 
> fully tested and ready to go.
> Thanks.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-7885) Add support for loading HTTP resources

2015-08-26 Thread Aaron LaBella (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-7885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14713468#comment-14713468
 ] 

Aaron LaBella commented on SOLR-7885:
-

Can someone take a look at this patch and commit if approved?
Thanks.

 Add support for loading HTTP resources
 --

 Key: SOLR-7885
 URL: https://issues.apache.org/jira/browse/SOLR-7885
 Project: Solr
  Issue Type: Improvement
  Components: contrib - DataImportHandler, SolrJ
Affects Versions: 5.3
Reporter: Aaron LaBella
 Attachments: SOLR-7885-1.patch, SOLR-7885-2.patch


 I have a need to be able to load data import handler configuration files from 
 an HTTP server instead of the local file system.  So, I modified 
 {code}org.apache.solr.core.SolrResourceLoader{code} and some of the 
 respective dataimport files in {code}org.apache.solr.handler.dataimport{code} 
 to be able to support doing this.  
 {code}solrconfig.xml{code} now has the option to define a parameter: 
 *configRemote*, and if defined (and it's an HTTP(s) URL), it'll attempt to 
 load the resource.  If successfully, it'll also persist the resource to the 
 local file system so that it is available on a solr server restart per chance 
 that the remote resource is currently unavailable.
 Lastly, to be consistent with the pattern that already exists in 
 SolrResourceLoader, this feature is *disabled* by default, and requires the 
 setting of an additional JVM property: 
 {code}-Dsolr.allow.http.resourceloading=true{code}.
 Please review and let me know if there is anything else that needs to be done 
 in order for this patch to make the next release.  As far as I can tell, it's 
 fully tested and ready to go.
 Thanks.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-7760) Fix method and field visibility for UIMAUpdateRequestProcessor and SolrUIMAConfiguration

2015-08-10 Thread Aaron LaBella (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-7760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14680114#comment-14680114
 ] 

Aaron LaBella commented on SOLR-7760:
-

Excellent, thanks so much guys!  I added another patch/enhancement last week 
too:

https://issues.apache.org/jira/browse/SOLR-7885

Are you able to take a look at that one and review/commit that too?

Thanks.

 Fix method and field visibility for UIMAUpdateRequestProcessor and 
 SolrUIMAConfiguration
 

 Key: SOLR-7760
 URL: https://issues.apache.org/jira/browse/SOLR-7760
 Project: Solr
  Issue Type: Improvement
  Components: contrib - UIMA
Affects Versions: 5x
Reporter: Aaron LaBella
Assignee: Tommaso Teofili
Priority: Critical
 Fix For: Trunk, 5.4

 Attachments: SOLR-7760.patch


 The methods in 
 {{solr/contrib/uima/src/java/org/apache/solr/uima/processor/SolrUIMAConfiguration.java}}
  are not public and they need to be for other code to be able to make use of 
 the configuration data, ie: mapped fields.   Likewise, 
 {{solr/contrib/uima/src/java/org/apache/solr/uima/processor/UIMAUpdateRequestProcessor.java}}
  does not have an accessor for the SolrUIMAConfiguration object



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (SOLR-7885) Add support for loading HTTP resources

2015-08-06 Thread Aaron LaBella (JIRA)
Aaron LaBella created SOLR-7885:
---

 Summary: Add support for loading HTTP resources
 Key: SOLR-7885
 URL: https://issues.apache.org/jira/browse/SOLR-7885
 Project: Solr
  Issue Type: Improvement
  Components: contrib - DataImportHandler, SolrJ
Affects Versions: 5.3
Reporter: Aaron LaBella


I have a need to be able to load data import handler configuration files from 
an HTTP server instead of the local file system.  So, I modified 
{code}org.apache.solr.core.SolrResourceLoader{code} and some of the respective 
dataimport files in {code}org.apache.solr.handler.dataimport{code} to be able 
to support doing this.  

{code}solrconfig.xml{code} now has the option to define a parameter: 
*configRemote*, and if defined (and it's an HTTP(s) URL), it'll attempt to load 
the resource.  If successfully, it'll also persist the resource to the local 
file system so that it is available on a solr server restart per chance that 
the remote resource is currently unavailable.

Lastly, to be consistent with the pattern that already exists in 
SolrResourceLoader, this feature is *disabled* by default, and requires the 
setting of an additional JVM property: 
{code}-Dsolr.allow.http.resourceloading=true{code}.

Please review and let me know if there is anything else that needs to be done 
in order for this patch to make the next release.  As far as I can tell, it's 
fully tested and ready to go.

Thanks.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-7760) Fix method and field visibility for UIMAUpdateRequestProcessor and SolrUIMAConfiguration

2015-08-06 Thread Aaron LaBella (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-7760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14660714#comment-14660714
 ] 

Aaron LaBella commented on SOLR-7760:
-

Is someone able to commit this patch soon?  It'd be great to get in the next 
release of solr.
Thanks!

 Fix method and field visibility for UIMAUpdateRequestProcessor and 
 SolrUIMAConfiguration
 

 Key: SOLR-7760
 URL: https://issues.apache.org/jira/browse/SOLR-7760
 Project: Solr
  Issue Type: Improvement
  Components: contrib - UIMA
Affects Versions: 5x
Reporter: Aaron LaBella
Priority: Critical
 Fix For: 5.3

 Attachments: SOLR-7760.patch


 The methods in 
 {{solr/contrib/uima/src/java/org/apache/solr/uima/processor/SolrUIMAConfiguration.java}}
  are not public and they need to be for other code to be able to make use of 
 the configuration data, ie: mapped fields.   Likewise, 
 {{solr/contrib/uima/src/java/org/apache/solr/uima/processor/UIMAUpdateRequestProcessor.java}}
  does not have an accessor for the SolrUIMAConfiguration object



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-7885) Add support for loading HTTP resources

2015-08-06 Thread Aaron LaBella (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-7885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron LaBella updated SOLR-7885:

Attachment: SOLR-7885-2.patch
SOLR-7885-1.patch

I split the patch into 2 parts, one for solr-core and one for dataimporthandler

 Add support for loading HTTP resources
 --

 Key: SOLR-7885
 URL: https://issues.apache.org/jira/browse/SOLR-7885
 Project: Solr
  Issue Type: Improvement
  Components: contrib - DataImportHandler, SolrJ
Affects Versions: 5.3
Reporter: Aaron LaBella
 Attachments: SOLR-7885-1.patch, SOLR-7885-2.patch


 I have a need to be able to load data import handler configuration files from 
 an HTTP server instead of the local file system.  So, I modified 
 {code}org.apache.solr.core.SolrResourceLoader{code} and some of the 
 respective dataimport files in {code}org.apache.solr.handler.dataimport{code} 
 to be able to support doing this.  
 {code}solrconfig.xml{code} now has the option to define a parameter: 
 *configRemote*, and if defined (and it's an HTTP(s) URL), it'll attempt to 
 load the resource.  If successfully, it'll also persist the resource to the 
 local file system so that it is available on a solr server restart per chance 
 that the remote resource is currently unavailable.
 Lastly, to be consistent with the pattern that already exists in 
 SolrResourceLoader, this feature is *disabled* by default, and requires the 
 setting of an additional JVM property: 
 {code}-Dsolr.allow.http.resourceloading=true{code}.
 Please review and let me know if there is anything else that needs to be done 
 in order for this patch to make the next release.  As far as I can tell, it's 
 fully tested and ready to go.
 Thanks.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-7760) Fix method and field visibility for UIMAUpdateRequestProcessor and SolrUIMAConfiguration

2015-07-27 Thread Aaron LaBella (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-7760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14643287#comment-14643287
 ] 

Aaron LaBella commented on SOLR-7760:
-

Yes, I wrote a UimaSearchHandler that extends SearchHandler to do adhoc uima 
analysis.  It is configured to reference the same updateRequestProcessorChain 
that is used during index time but in order to generically know what fields to 
return I need public access to the SolrUIMAConfiguration and fieldMappings.  I 
don't have time to write a testcase, but, here's some sample code that uses it:

{code:java}
  UpdateRequestProcessorChain chain = 
req.getCore().getUpdateProcessingChain(uimaChain);
  ListUpdateRequestProcessorFactory processors = 
chain.getProcessors();
  UpdateRequestProcessorFactory factory = processors.get(0);
  UpdateRequestProcessor processor = factory.getInstance(req, null, 
null);

  SolrUIMAConfiguration conf = 
((UIMAUpdateRequestProcessor)processor).getConfiguration();
  MapString, MapString, MapField map = 
conf.getTypesFeaturesFieldsMapping();
  for(Map.EntryString,MapString,MapField entry : map.entrySet())
  {
String annotationClass = entry.getKey();
annotationClasses.add(annotationClass);
  }
{code}

 Fix method and field visibility for UIMAUpdateRequestProcessor and 
 SolrUIMAConfiguration
 

 Key: SOLR-7760
 URL: https://issues.apache.org/jira/browse/SOLR-7760
 Project: Solr
  Issue Type: Improvement
  Components: contrib - UIMA
Affects Versions: 5x
Reporter: Aaron LaBella
Priority: Critical
 Fix For: 5.3

 Attachments: SOLR-7760.patch


 The methods in 
 {{solr/contrib/uima/src/java/org/apache/solr/uima/processor/SolrUIMAConfiguration.java}}
  are not public and they need to be for other code to be able to make use of 
 the configuration data, ie: mapped fields.   Likewise, 
 {{solr/contrib/uima/src/java/org/apache/solr/uima/processor/UIMAUpdateRequestProcessor.java}}
  does not have an accessor for the SolrUIMAConfiguration object



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-7760) Fix method and field visibility for UIMAUpdateRequestProcessor and SolrUIMAConfiguration

2015-07-06 Thread Aaron LaBella (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-7760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron LaBella updated SOLR-7760:

Description: The methods in 
{{solr/contrib/uima/src/java/org/apache/solr/uima/processor/SolrUIMAConfiguration.java}}
 are not public and they need to be for other code to be able to make use of 
the configuration data, ie: mapped fields.   Likewise, 
{{solr/contrib/uima/src/java/org/apache/solr/uima/processor/UIMAUpdateRequestProcessor.java}}
 does not have an accessor for the SolrUIMAConfiguration object  (was: The 
methods in 
solr/contrib/uima/src/java/org/apache/solr/uima/processor/SolrUIMAConfiguration.java
 are not public and they need to be for other code to be able to make use of 
the configuration data, ie: mapped fields.   Likewise, 
solr/contrib/uima/src/java/org/apache/solr/uima/processor/UIMAUpdateRequestProcessor.java
 does not have an accessor for the SolrUIMAConfiguration object)

 Fix method and field visibility for UIMAUpdateRequestProcessor and 
 SolrUIMAConfiguration
 

 Key: SOLR-7760
 URL: https://issues.apache.org/jira/browse/SOLR-7760
 Project: Solr
  Issue Type: Improvement
  Components: contrib - UIMA
Affects Versions: 5x
Reporter: Aaron LaBella
Priority: Critical
 Fix For: 5.3

 Attachments: SOLR-7760.patch


 The methods in 
 {{solr/contrib/uima/src/java/org/apache/solr/uima/processor/SolrUIMAConfiguration.java}}
  are not public and they need to be for other code to be able to make use of 
 the configuration data, ie: mapped fields.   Likewise, 
 {{solr/contrib/uima/src/java/org/apache/solr/uima/processor/UIMAUpdateRequestProcessor.java}}
  does not have an accessor for the SolrUIMAConfiguration object



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-7760) Fix method and field visibility for UIMAUpdateRequestProcessor and SolrUIMAConfiguration

2015-07-06 Thread Aaron LaBella (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-7760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14615457#comment-14615457
 ] 

Aaron LaBella commented on SOLR-7760:
-

Can someone with write access to Solr repository please review, apply and 
commit the patch above?  I'd like to see this in the next version of Solr.  
Thanks.

 Fix method and field visibility for UIMAUpdateRequestProcessor and 
 SolrUIMAConfiguration
 

 Key: SOLR-7760
 URL: https://issues.apache.org/jira/browse/SOLR-7760
 Project: Solr
  Issue Type: Improvement
  Components: contrib - UIMA
Affects Versions: 5x
Reporter: Aaron LaBella
Priority: Critical
 Fix For: 5.3

 Attachments: SOLR-7760.patch


 The methods in 
 solr/contrib/uima/src/java/org/apache/solr/uima/processor/SolrUIMAConfiguration.java
  are not public and they need to be for other code to be able to make use of 
 the configuration data, ie: mapped fields.   Likewise, 
 solr/contrib/uima/src/java/org/apache/solr/uima/processor/UIMAUpdateRequestProcessor.java
  does not have an accessor for the SolrUIMAConfiguration object



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-7760) Fix method and field visibility for UIMAUpdateRequestProcessor and SolrUIMAConfiguration

2015-07-06 Thread Aaron LaBella (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-7760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron LaBella updated SOLR-7760:

Attachment: SOLR-7760.patch

patch per issue description

 Fix method and field visibility for UIMAUpdateRequestProcessor and 
 SolrUIMAConfiguration
 

 Key: SOLR-7760
 URL: https://issues.apache.org/jira/browse/SOLR-7760
 Project: Solr
  Issue Type: Improvement
  Components: contrib - UIMA
Affects Versions: 5x
Reporter: Aaron LaBella
Priority: Critical
 Fix For: 5.3

 Attachments: SOLR-7760.patch


 The methods in 
 solr/contrib/uima/src/java/org/apache/solr/uima/processor/SolrUIMAConfiguration.java
  are not public and they need to be for other code to be able to make use of 
 the configuration data, ie: mapped fields.   Likewise, 
 solr/contrib/uima/src/java/org/apache/solr/uima/processor/UIMAUpdateRequestProcessor.java
  does not have an accessor for the SolrUIMAConfiguration object



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (SOLR-7760) Fix method and field visibility for UIMAUpdateRequestProcessor and SolrUIMAConfiguration

2015-07-06 Thread Aaron LaBella (JIRA)
Aaron LaBella created SOLR-7760:
---

 Summary: Fix method and field visibility for 
UIMAUpdateRequestProcessor and SolrUIMAConfiguration
 Key: SOLR-7760
 URL: https://issues.apache.org/jira/browse/SOLR-7760
 Project: Solr
  Issue Type: Improvement
  Components: contrib - UIMA
Affects Versions: 5x
Reporter: Aaron LaBella
Priority: Critical
 Fix For: 5.3


The methods in 
solr/contrib/uima/src/java/org/apache/solr/uima/processor/SolrUIMAConfiguration.java
 are not public and they need to be for other code to be able to make use of 
the configuration data, ie: mapped fields.   Likewise, 
solr/contrib/uima/src/java/org/apache/solr/uima/processor/UIMAUpdateRequestProcessor.java
 does not have an accessor for the SolrUIMAConfiguration object



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-6194) Allow access to DataImporter and DIHConfiguration

2014-06-27 Thread Aaron LaBella (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-6194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron LaBella updated SOLR-6194:


Attachment: SOLR-6194.patch

another small patch

 Allow access to DataImporter and DIHConfiguration
 -

 Key: SOLR-6194
 URL: https://issues.apache.org/jira/browse/SOLR-6194
 Project: Solr
  Issue Type: Improvement
  Components: contrib - DataImportHandler
Affects Versions: 4.10
Reporter: Aaron LaBella
Assignee: Shalin Shekhar Mangar
 Fix For: 4.10

 Attachments: SOLR-6194.patch, SOLR-6194.patch

   Original Estimate: 2h
  Remaining Estimate: 2h

 I'd like to change the visibility and access to a couple of the internal 
 classes of DataImportHandler, specifically DataImporter and DIHConfiguration. 
  My reasoning is that I've added the ability for a new data import handler 
 command called *getquery* that will return the exact queries (fully 
 resolved) that are executed for an entity within the data import 
 configuration.  This makes it much easier to debug the dih, rather than 
 turning on debug/verbose flags and digging through the raw response.  
 Additionally, it gives me a service that I can then go take the queries 
 from and run them.
 Here's a snippet of Java code that I can now execute now that I have access 
 to the DIHConfiguration:
 {code:title=Snippet.java|borderStyle=solid}
   /**
* @return a map of all the queries for each entity in the given config
*/
   protected MapString,String getEntityQueries(DIHConfiguration config, 
 MapString,Object params)
   {
 MapString,String queries = new LinkedHashMap();
 if (config != null  config.getEntities() != null)
 {
   //make a new variable resolve
   VariableResolver vr = new VariableResolver();
   vr.addNamespace(dataimporter.request,params);
   //for each entity
   for (Entity e : config.getEntities())
   {
 //get the query and resolve it
 if (e.getAllAttributes().containsKey(SqlEntityProcessor.QUERY))
 {
   String query = e.getAllAttributes().get(SqlEntityProcessor.QUERY);
   query = query.replaceAll(\\s+,  ).trim();
   String resolved = vr.replaceTokens(query);
   resolved = resolved.replaceAll(\\s+,  ).trim();
   queries.put(e.getName(),resolved);
   queries.put(e.getName()+_raw,query);
 }
   }
 }
 return queries;
   }
 {code}
 I'm attaching a patch that I would appreciate someone have a look for 
 consideration.  It's fully tested -- please let me know if there is something 
 else I need to do/test.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Reopened] (SOLR-6194) Allow access to DataImporter and DIHConfiguration

2014-06-27 Thread Aaron LaBella (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-6194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron LaBella reopened SOLR-6194:
-


Thanks shalin!   I'm attaching one more small patch that opens up a couple 
other methods.  Now with these fixes I have the ability to actually run the 
entity queries from my data import handler for easier debug/inspection, 
*without* actually having to run an import.  Can you please review and commit 
this small patch as well?

Aaron

 Allow access to DataImporter and DIHConfiguration
 -

 Key: SOLR-6194
 URL: https://issues.apache.org/jira/browse/SOLR-6194
 Project: Solr
  Issue Type: Improvement
  Components: contrib - DataImportHandler
Affects Versions: 4.10
Reporter: Aaron LaBella
Assignee: Shalin Shekhar Mangar
 Fix For: 4.10

 Attachments: SOLR-6194.patch, SOLR-6194.patch

   Original Estimate: 2h
  Remaining Estimate: 2h

 I'd like to change the visibility and access to a couple of the internal 
 classes of DataImportHandler, specifically DataImporter and DIHConfiguration. 
  My reasoning is that I've added the ability for a new data import handler 
 command called *getquery* that will return the exact queries (fully 
 resolved) that are executed for an entity within the data import 
 configuration.  This makes it much easier to debug the dih, rather than 
 turning on debug/verbose flags and digging through the raw response.  
 Additionally, it gives me a service that I can then go take the queries 
 from and run them.
 Here's a snippet of Java code that I can now execute now that I have access 
 to the DIHConfiguration:
 {code:title=Snippet.java|borderStyle=solid}
   /**
* @return a map of all the queries for each entity in the given config
*/
   protected MapString,String getEntityQueries(DIHConfiguration config, 
 MapString,Object params)
   {
 MapString,String queries = new LinkedHashMap();
 if (config != null  config.getEntities() != null)
 {
   //make a new variable resolve
   VariableResolver vr = new VariableResolver();
   vr.addNamespace(dataimporter.request,params);
   //for each entity
   for (Entity e : config.getEntities())
   {
 //get the query and resolve it
 if (e.getAllAttributes().containsKey(SqlEntityProcessor.QUERY))
 {
   String query = e.getAllAttributes().get(SqlEntityProcessor.QUERY);
   query = query.replaceAll(\\s+,  ).trim();
   String resolved = vr.replaceTokens(query);
   resolved = resolved.replaceAll(\\s+,  ).trim();
   queries.put(e.getName(),resolved);
   queries.put(e.getName()+_raw,query);
 }
   }
 }
 return queries;
   }
 {code}
 I'm attaching a patch that I would appreciate someone have a look for 
 consideration.  It's fully tested -- please let me know if there is something 
 else I need to do/test.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-6194) Allow access to DataImporter and DIHConfiguration

2014-06-27 Thread Aaron LaBella (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-6194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14046083#comment-14046083
 ] 

Aaron LaBella commented on SOLR-6194:
-

Shalin,

Good feedback.  Here's the static utility class I wrote:

{code:title=Snippet.java|borderStyle=solid}

//...

final public class SolrQueryUtils
{
  static public void runEntityQueries(
DIHConfiguration config, SolrCore core, DataImporter importer,
Entity entity, String entityName,
SolrQueryRequest req, SolrQueryResponse rsp,
MapString,Object params) throws Exception
  {
CollectionMapString,Object rows = new LinkedList();
EntityProcessorWrapper epw = null;
try
{
  if (entityName == null || entity.getName().equals(entityName))
  {
epw = SolrQueryUtils.getEntityProcessorWrapperFrom(
  core, importer, entity, req, rsp, params
);
if (epw != null)
{
  MapString, Object row = epw.nextRow();
  //just for sanity, TODO: add a better way to do this
  final int MAX = 1;
  int index = 0;
  while (row != null)
  {
if (index++  MAX) { rows.add(row); }
row = epw.nextRow();
  }
}
  }
}
catch (Exception ex)
{
  ex.printStackTrace();
}
finally
{
  if (epw != null)
  {
epw.getDatasource().close();
epw.close();
epw.destroy();
  }
}
if (entityName == null || entity.getName().equals(entityName))
{
  rsp.add(entity.getName()+_results,rows);
}
//potentially recurse if there are children
CollectionEntity children = entity.getChildren();
if (children != null)
{
  //for each child entity
  for (Entity child : children)
  {

SolrQueryUtils.runEntityQueries(config,core,importer,child,entityName,req,rsp,params);
  }
}
  }

  static public EntityProcessorWrapper getEntityProcessorWrapperFrom(
SolrCore core, DataImporter importer, Entity entity,
SolrQueryRequest req, SolrQueryResponse rsp,
MapString,Object params) throws Exception
  {
RequestInfo reqinfo = new RequestInfo(req,params,null);
DocBuilder docBuilder = importer.getDocBuilder(null,reqinfo);
EntityProcessorWrapper epw = docBuilder.getEntityProcessorWrapper(entity);
VariableResolver vr = new VariableResolver();
vr.addNamespace(dataimporter.request,params);
Context ctx = new ContextImpl(
  epw,  //EntityProcessorWrapper epw
  vr,   //VariableResolver resolver
  null, //DataSource ds
  Context.FULL_DUMP,//String currProcess
  reqinfo.getRawParams(),   //MapString, Object global
  null, //ContextImpl parentContext
  docBuilder//DocBuilder docBuilder
);
DataSource? ds = importer.getDataSourceInstance(
  entity, entity.getDataSourceName(), ctx
);
SolrQueryUtils.initDataSource(importer.getConfig(),core,entity,ds,ctx);
epw.setDatasource(ds);
epw.init(ctx);
epw.setInitalized(true);
return epw;
  }

  static public void initDataSource(
DIHConfiguration config, SolrCore core, Entity entity, DataSource? ds, 
Context ctx
  )
  {
//add all the properties from the core descriptor
CoreDescriptor descriptor = core.getCoreDescriptor();
MapString,Object globals = new LinkedHashMap();
globals = 
SolrQueryUtils.addProperties(descriptor.getPersistableStandardProperties(), 
globals);
globals = 
SolrQueryUtils.addProperties(descriptor.getPersistableUserProperties(), 
globals);
//sort the keys for easier debugging
globals = new TreeMap(globals);

//make a new variable for the datasource properties resolver
VariableResolver globalResolver = new VariableResolver(globals);
MapString,String dsProps = 
config.getDataSources().get(entity.getDataSourceName());
MapString,String dsPropsResolved = new LinkedHashMap();
for(Map.EntryString,String entry : dsProps.entrySet())
{
  
dsPropsResolved.put(entry.getKey(),globalResolver.replaceTokens(entry.getValue()));
}

Properties dsProperties = new Properties();
dsProperties.putAll(dsPropsResolved);
ds.init(ctx,dsProperties);
  }

  static public MapString,Object addProperties(Properties p, 
MapString,Object map)
  {
if (p != null  map != null)
{
  Enumeration? enumer = p.propertyNames();
  while(enumer.hasMoreElements())
  {
String key = enumer.nextElement().toString();
String value = p.getProperty(key);
map.put(key,value);
  }
}
return map;
  }
}

{code}

And, I have a custom data import handler that extends DataImportHandler.  I 
added support for calling the data import handler with command='runquery', 
which ends up looking up the entities from the config, and calling the 

[jira] [Commented] (SOLR-6194) Allow access to DataImporter and DIHConfiguration

2014-06-27 Thread Aaron LaBella (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-6194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14046087#comment-14046087
 ] 

Aaron LaBella commented on SOLR-6194:
-

NOTE: as you can see above ... actually getting the EntityProcessorWrapper, and 
initializing the datasource, seems like it is much more code than I would've 
hoped/expected for.  I think the solr data import handler framework is an 
incredible piece of software, it's just unfortunate that the API's aren't a 
little more open and flexible to allow for extensibility and use-cases beyond 
just getting data into solr.  I could envision using the code base to define 
adhoc queries that support things like variable resolution, evaluators, 
transformers, etc -- it's very powerful stuff ;-)

Thanks.

 Allow access to DataImporter and DIHConfiguration
 -

 Key: SOLR-6194
 URL: https://issues.apache.org/jira/browse/SOLR-6194
 Project: Solr
  Issue Type: Improvement
  Components: contrib - DataImportHandler
Affects Versions: 4.10
Reporter: Aaron LaBella
Assignee: Shalin Shekhar Mangar
 Fix For: 4.10

 Attachments: SOLR-6194.patch, SOLR-6194.patch

   Original Estimate: 2h
  Remaining Estimate: 2h

 I'd like to change the visibility and access to a couple of the internal 
 classes of DataImportHandler, specifically DataImporter and DIHConfiguration. 
  My reasoning is that I've added the ability for a new data import handler 
 command called *getquery* that will return the exact queries (fully 
 resolved) that are executed for an entity within the data import 
 configuration.  This makes it much easier to debug the dih, rather than 
 turning on debug/verbose flags and digging through the raw response.  
 Additionally, it gives me a service that I can then go take the queries 
 from and run them.
 Here's a snippet of Java code that I can now execute now that I have access 
 to the DIHConfiguration:
 {code:title=Snippet.java|borderStyle=solid}
   /**
* @return a map of all the queries for each entity in the given config
*/
   protected MapString,String getEntityQueries(DIHConfiguration config, 
 MapString,Object params)
   {
 MapString,String queries = new LinkedHashMap();
 if (config != null  config.getEntities() != null)
 {
   //make a new variable resolve
   VariableResolver vr = new VariableResolver();
   vr.addNamespace(dataimporter.request,params);
   //for each entity
   for (Entity e : config.getEntities())
   {
 //get the query and resolve it
 if (e.getAllAttributes().containsKey(SqlEntityProcessor.QUERY))
 {
   String query = e.getAllAttributes().get(SqlEntityProcessor.QUERY);
   query = query.replaceAll(\\s+,  ).trim();
   String resolved = vr.replaceTokens(query);
   resolved = resolved.replaceAll(\\s+,  ).trim();
   queries.put(e.getName(),resolved);
   queries.put(e.getName()+_raw,query);
 }
   }
 }
 return queries;
   }
 {code}
 I'm attaching a patch that I would appreciate someone have a look for 
 consideration.  It's fully tested -- please let me know if there is something 
 else I need to do/test.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (SOLR-6194) Allow access to DataImporter and DIHConfiguration

2014-06-23 Thread Aaron LaBella (JIRA)
Aaron LaBella created SOLR-6194:
---

 Summary: Allow access to DataImporter and DIHConfiguration
 Key: SOLR-6194
 URL: https://issues.apache.org/jira/browse/SOLR-6194
 Project: Solr
  Issue Type: Improvement
  Components: contrib - DataImportHandler
Affects Versions: 4.10
Reporter: Aaron LaBella
 Fix For: 4.10


I'd like to change the visibility and access to a couple of the internal 
classes of DataImportHandler, specifically DataImporter and DIHConfiguration.  
My reasoning is that I've added the ability for a new data import handler 
command called *getquery* that will return the exact queries (fully resolved) 
that are executed for an entity within the data import configuration.  This 
makes it much easier to debug the dih, rather than turning on debug/verbose 
flags and digging through the raw response.  Additionally, it gives me a 
service that I can then go take the queries from and run them.

Here's a snippet of Java code that I can now execute now that I have access to 
the DIHConfiguration:

  /**
   * @return a map of all the queries for each entity in the given config
   */
  protected MapString,String getEntityQueries(DIHConfiguration config, 
MapString,Object params)
  {
MapString,String queries = new LinkedHashMap();
if (config != null  config.getEntities() != null)
{
  //make a new variable resolve
  VariableResolver vr = new VariableResolver();
  vr.addNamespace(dataimporter.request,params);

  //for each entity
  for (Entity e : config.getEntities())
  {
//get the query and resolve it
if (e.getAllAttributes().containsKey(SqlEntityProcessor.QUERY))
{
  String query = e.getAllAttributes().get(SqlEntityProcessor.QUERY);
  query = query.replaceAll(\\s+,  ).trim();
  String resolved = vr.replaceTokens(query);
  resolved = resolved.replaceAll(\\s+,  ).trim();
  queries.put(e.getName(),resolved);
  queries.put(e.getName()+_raw,query);
}
  }
}
return queries;
  }

I'm attaching a patch that I would appreciate someone have a look for 
consideration.  It's fully tested -- please let me know if there is something 
else I need to do/test.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-6194) Allow access to DataImporter and DIHConfiguration

2014-06-23 Thread Aaron LaBella (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-6194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron LaBella updated SOLR-6194:


Attachment: SOLR-6194.patch

 Allow access to DataImporter and DIHConfiguration
 -

 Key: SOLR-6194
 URL: https://issues.apache.org/jira/browse/SOLR-6194
 Project: Solr
  Issue Type: Improvement
  Components: contrib - DataImportHandler
Affects Versions: 4.10
Reporter: Aaron LaBella
 Fix For: 4.10

 Attachments: SOLR-6194.patch

   Original Estimate: 2h
  Remaining Estimate: 2h

 I'd like to change the visibility and access to a couple of the internal 
 classes of DataImportHandler, specifically DataImporter and DIHConfiguration. 
  My reasoning is that I've added the ability for a new data import handler 
 command called *getquery* that will return the exact queries (fully 
 resolved) that are executed for an entity within the data import 
 configuration.  This makes it much easier to debug the dih, rather than 
 turning on debug/verbose flags and digging through the raw response.  
 Additionally, it gives me a service that I can then go take the queries 
 from and run them.
 Here's a snippet of Java code that I can now execute now that I have access 
 to the DIHConfiguration:
 {code:title=Snippet.java|borderStyle=solid}
   /**
* @return a map of all the queries for each entity in the given config
*/
   protected MapString,String getEntityQueries(DIHConfiguration config, 
 MapString,Object params)
   {
 MapString,String queries = new LinkedHashMap();
 if (config != null  config.getEntities() != null)
 {
   //make a new variable resolve
   VariableResolver vr = new VariableResolver();
   vr.addNamespace(dataimporter.request,params);
   //for each entity
   for (Entity e : config.getEntities())
   {
 //get the query and resolve it
 if (e.getAllAttributes().containsKey(SqlEntityProcessor.QUERY))
 {
   String query = e.getAllAttributes().get(SqlEntityProcessor.QUERY);
   query = query.replaceAll(\\s+,  ).trim();
   String resolved = vr.replaceTokens(query);
   resolved = resolved.replaceAll(\\s+,  ).trim();
   queries.put(e.getName(),resolved);
   queries.put(e.getName()+_raw,query);
 }
   }
 }
 return queries;
   }
 {code}
 I'm attaching a patch that I would appreciate someone have a look for 
 consideration.  It's fully tested -- please let me know if there is something 
 else I need to do/test.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-6194) Allow access to DataImporter and DIHConfiguration

2014-06-23 Thread Aaron LaBella (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-6194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron LaBella updated SOLR-6194:


Description: 
I'd like to change the visibility and access to a couple of the internal 
classes of DataImportHandler, specifically DataImporter and DIHConfiguration.  
My reasoning is that I've added the ability for a new data import handler 
command called *getquery* that will return the exact queries (fully resolved) 
that are executed for an entity within the data import configuration.  This 
makes it much easier to debug the dih, rather than turning on debug/verbose 
flags and digging through the raw response.  Additionally, it gives me a 
service that I can then go take the queries from and run them.

Here's a snippet of Java code that I can now execute now that I have access to 
the DIHConfiguration:

{code:title=Snippet.java|borderStyle=solid}
  /**
   * @return a map of all the queries for each entity in the given config
   */
  protected MapString,String getEntityQueries(DIHConfiguration config, 
MapString,Object params)
  {
MapString,String queries = new LinkedHashMap();
if (config != null  config.getEntities() != null)
{
  //make a new variable resolve
  VariableResolver vr = new VariableResolver();
  vr.addNamespace(dataimporter.request,params);

  //for each entity
  for (Entity e : config.getEntities())
  {
//get the query and resolve it
if (e.getAllAttributes().containsKey(SqlEntityProcessor.QUERY))
{
  String query = e.getAllAttributes().get(SqlEntityProcessor.QUERY);
  query = query.replaceAll(\\s+,  ).trim();
  String resolved = vr.replaceTokens(query);
  resolved = resolved.replaceAll(\\s+,  ).trim();
  queries.put(e.getName(),resolved);
  queries.put(e.getName()+_raw,query);
}
  }
}
return queries;
  }
{code}

I'm attaching a patch that I would appreciate someone have a look for 
consideration.  It's fully tested -- please let me know if there is something 
else I need to do/test.

  was:
I'd like to change the visibility and access to a couple of the internal 
classes of DataImportHandler, specifically DataImporter and DIHConfiguration.  
My reasoning is that I've added the ability for a new data import handler 
command called *getquery* that will return the exact queries (fully resolved) 
that are executed for an entity within the data import configuration.  This 
makes it much easier to debug the dih, rather than turning on debug/verbose 
flags and digging through the raw response.  Additionally, it gives me a 
service that I can then go take the queries from and run them.

Here's a snippet of Java code that I can now execute now that I have access to 
the DIHConfiguration:

  /**
   * @return a map of all the queries for each entity in the given config
   */
  protected MapString,String getEntityQueries(DIHConfiguration config, 
MapString,Object params)
  {
MapString,String queries = new LinkedHashMap();
if (config != null  config.getEntities() != null)
{
  //make a new variable resolve
  VariableResolver vr = new VariableResolver();
  vr.addNamespace(dataimporter.request,params);

  //for each entity
  for (Entity e : config.getEntities())
  {
//get the query and resolve it
if (e.getAllAttributes().containsKey(SqlEntityProcessor.QUERY))
{
  String query = e.getAllAttributes().get(SqlEntityProcessor.QUERY);
  query = query.replaceAll(\\s+,  ).trim();
  String resolved = vr.replaceTokens(query);
  resolved = resolved.replaceAll(\\s+,  ).trim();
  queries.put(e.getName(),resolved);
  queries.put(e.getName()+_raw,query);
}
  }
}
return queries;
  }

I'm attaching a patch that I would appreciate someone have a look for 
consideration.  It's fully tested -- please let me know if there is something 
else I need to do/test.


 Allow access to DataImporter and DIHConfiguration
 -

 Key: SOLR-6194
 URL: https://issues.apache.org/jira/browse/SOLR-6194
 Project: Solr
  Issue Type: Improvement
  Components: contrib - DataImportHandler
Affects Versions: 4.10
Reporter: Aaron LaBella
 Fix For: 4.10

 Attachments: SOLR-6194.patch

   Original Estimate: 2h
  Remaining Estimate: 2h

 I'd like to change the visibility and access to a couple of the internal 
 classes of DataImportHandler, specifically DataImporter and DIHConfiguration. 
  My reasoning is that I've added the ability for a new data import handler 
 command called *getquery* that will return the exact queries (fully 
 resolved) that are executed for an entity within the data import 
 configuration.  This makes it much easier to debug the dih, rather 

[jira] [Commented] (SOLR-6129) DateFormatTransformer doesn't resolve dateTimeFormat

2014-06-18 Thread Aaron LaBella (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-6129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14035676#comment-14035676
 ] 

Aaron LaBella commented on SOLR-6129:
-

excellent.  thanks!!

 DateFormatTransformer doesn't resolve dateTimeFormat
 

 Key: SOLR-6129
 URL: https://issues.apache.org/jira/browse/SOLR-6129
 Project: Solr
  Issue Type: Bug
  Components: contrib - DataImportHandler
Affects Versions: 4.8.1
Reporter: Aaron LaBella
Assignee: Shalin Shekhar Mangar
 Fix For: 4.9, 5.0

 Attachments: SOLR-6129.patch, SOLR-6129.patch

   Original Estimate: 1h
  Remaining Estimate: 1h

 The DateFormatTransformer doesn't use a VariableResolver to resolve the 
 format, which could potentially be passed in via a dataimport request 
 parameter.  The attached patch is tested and working.  Please include in 4.9.
 Thanks.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-6129) DateFormatTransformer doesn't resolve dateTimeFormat

2014-06-17 Thread Aaron LaBella (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-6129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14033942#comment-14033942
 ] 

Aaron LaBella commented on SOLR-6129:
-

Anyone out there?  This patch is ready to go -- just needs someone to 
approve/accept/commit.
Thanks.

 DateFormatTransformer doesn't resolve dateTimeFormat
 

 Key: SOLR-6129
 URL: https://issues.apache.org/jira/browse/SOLR-6129
 Project: Solr
  Issue Type: Bug
  Components: contrib - DataImportHandler
Affects Versions: 4.8.1
Reporter: Aaron LaBella
 Fix For: 4.9

 Attachments: SOLR-6129.patch

   Original Estimate: 1h
  Remaining Estimate: 1h

 The DateFormatTransformer doesn't use a VariableResolver to resolve the 
 format, which could potentially be passed in via a dataimport request 
 parameter.  The attached patch is tested and working.  Please include in 4.9.
 Thanks.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-6018) Solr DataImportHandler not finding dynamic fields

2014-06-06 Thread Aaron LaBella (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-6018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14019924#comment-14019924
 ] 

Aaron LaBella commented on SOLR-6018:
-

Any comments/updates on this issue?

 Solr DataImportHandler not finding dynamic fields
 -

 Key: SOLR-6018
 URL: https://issues.apache.org/jira/browse/SOLR-6018
 Project: Solr
  Issue Type: Bug
  Components: contrib - DataImportHandler
Affects Versions: 4.7
Reporter: Aaron LaBella
 Fix For: 4.9

 Attachments: 0001-dih-config-check-for-dynamic-fields.patch


 There is an issue with 
 *org.apache.solr.handler.dataimport.DocBuilder:addFields* around ~line 643.  
 The logic currently says see if you can find the field from the schema, ie:
 {code:title=DocBuilder.java|borderStyle=solid}
 SchemaField sf = schema.getFieldOrNull(key);
 {code}
 and, if not found, go ask DIHConfiguration to find it, ie:
 {code:title=DocBuilder.java|borderStyle=solid}
 sf = config.getSchemaField(key);
 {code}
 The latter call takes into account case-insensitivity, which is a big deal 
 since some databases, ie: DB2, upper case all the resulting column names.  In 
 order to not modify solr-core (ie: the match logic in IndexSchema), I'm 
 attaching a patch that makes DIHConfiguration apply the same case-insensitive 
 logic to the DynamicFields.
 Without this patch, dynamic fields will not be added to the index unless you 
 declare them like this:
 {code:xml}
   dynamicField name=*_S type=string indexed=true stored=true /
 {code}
 (note the capital S)
 which is in-consistent with what I believe to be solr schema conventions to 
 have all the schema fields as lower-case.
 Thanks.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-6129) DateFormatTransformer doesn't resolve dateTimeFormat

2014-06-05 Thread Aaron LaBella (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-6129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14018763#comment-14018763
 ] 

Aaron LaBella commented on SOLR-6129:
-

Can someone have a look at this patch and hopefully commit it?  I'm looking to 
get it in for the 4.9 release.
Thanks.

 DateFormatTransformer doesn't resolve dateTimeFormat
 

 Key: SOLR-6129
 URL: https://issues.apache.org/jira/browse/SOLR-6129
 Project: Solr
  Issue Type: Bug
  Components: contrib - DataImportHandler
Affects Versions: 4.8.1
Reporter: Aaron LaBella
 Fix For: 4.9

 Attachments: SOLR-6129.patch

   Original Estimate: 1h
  Remaining Estimate: 1h

 The DateFormatTransformer doesn't use a VariableResolver to resolve the 
 format, which could potentially be passed in via a dataimport request 
 parameter.  The attached patch is tested and working.  Please include in 4.9.
 Thanks.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (SOLR-6129) DateFormatTransformer doesn't resolve dateTimeFormat

2014-06-02 Thread Aaron LaBella (JIRA)
Aaron LaBella created SOLR-6129:
---

 Summary: DateFormatTransformer doesn't resolve dateTimeFormat
 Key: SOLR-6129
 URL: https://issues.apache.org/jira/browse/SOLR-6129
 Project: Solr
  Issue Type: Bug
  Components: contrib - DataImportHandler
Affects Versions: 4.8.1
Reporter: Aaron LaBella
 Fix For: 4.9


The DateFormatTransformer doesn't use a VariableResolver to resolve the format, 
which could potentially be passed in via a dataimport request parameter.  The 
attached patch is tested and working.  Please include in 4.9.

Thanks.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-6129) DateFormatTransformer doesn't resolve dateTimeFormat

2014-06-02 Thread Aaron LaBella (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-6129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron LaBella updated SOLR-6129:


Attachment: SOLR-6129.patch

 DateFormatTransformer doesn't resolve dateTimeFormat
 

 Key: SOLR-6129
 URL: https://issues.apache.org/jira/browse/SOLR-6129
 Project: Solr
  Issue Type: Bug
  Components: contrib - DataImportHandler
Affects Versions: 4.8.1
Reporter: Aaron LaBella
 Fix For: 4.9

 Attachments: SOLR-6129.patch

   Original Estimate: 1h
  Remaining Estimate: 1h

 The DateFormatTransformer doesn't use a VariableResolver to resolve the 
 format, which could potentially be passed in via a dataimport request 
 parameter.  The attached patch is tested and working.  Please include in 4.9.
 Thanks.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-6013) Fix method visibility of Evaluator, refactor DateFormatEvaluator for extensibility

2014-05-15 Thread Aaron LaBella (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-6013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13992748#comment-13992748
 ] 

Aaron LaBella commented on SOLR-6013:
-

Shalin, that's fine I suppose (sorry I didn't notice you changed them to 
public/final).  I'm just wondering though, wouldn't it make sense to access the 
bean properties using traditional getter methods instead of accessing them 
directly?  Just curious as to the reasoning of not providing the getters.  In 
either case, I'm fine with whatever you decide and re-closing this issue.

Thanks.

Aaron

 Fix method visibility of Evaluator, refactor DateFormatEvaluator for 
 extensibility
 --

 Key: SOLR-6013
 URL: https://issues.apache.org/jira/browse/SOLR-6013
 Project: Solr
  Issue Type: Improvement
  Components: contrib - DataImportHandler
Affects Versions: 4.7
Reporter: Aaron LaBella
Assignee: Shalin Shekhar Mangar
 Fix For: 4.9, 5.0

 Attachments: 0001-add-getters-for-datemathparser.patch, 
 0001-change-method-access-to-protected.patch, 
 0001-change-method-variable-visibility-and-refactor-for-extensibility.patch, 
 SOLR-6013.patch

   Original Estimate: 1h
  Remaining Estimate: 1h

 This is similar to issue 5981, the Evaluator class is declared as abstract, 
 yet the parseParams method is package private?  Surely this is an oversight, 
 as I wouldn't expect everyone writing their own evaluators to have to deal 
 with parsing the parameters.
 Similarly, I needed to refactor DateFormatEvaluator because I need to do some 
 custom date math/parsing and it wasn't written in a way that I can extend it.
 Please review/apply my attached patch to the next version of Solr, ie: 4.8 or 
 4.9 if I must wait.
 Thanks!



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Reopened] (SOLR-6013) Fix method visibility of Evaluator, refactor DateFormatEvaluator for extensibility

2014-05-05 Thread Aaron LaBella (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-6013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron LaBella reopened SOLR-6013:
-


Thanks Shalin, but, you forgot two of the public 'getter' methods that I added 
in the last patch to Evaluator.VariableWrapper:

public String getVarName() {
  return this.varName;
}

public VariableResolver getVariableResolver() {
  return this.vr;
}

Can you fix/commit/close?
Thanks.

Aaron

 Fix method visibility of Evaluator, refactor DateFormatEvaluator for 
 extensibility
 --

 Key: SOLR-6013
 URL: https://issues.apache.org/jira/browse/SOLR-6013
 Project: Solr
  Issue Type: Improvement
  Components: contrib - DataImportHandler
Affects Versions: 4.7
Reporter: Aaron LaBella
Assignee: Shalin Shekhar Mangar
 Fix For: 4.9, 5.0

 Attachments: 0001-add-getters-for-datemathparser.patch, 
 0001-change-method-access-to-protected.patch, 
 0001-change-method-variable-visibility-and-refactor-for-extensibility.patch, 
 SOLR-6013.patch

   Original Estimate: 1h
  Remaining Estimate: 1h

 This is similar to issue 5981, the Evaluator class is declared as abstract, 
 yet the parseParams method is package private?  Surely this is an oversight, 
 as I wouldn't expect everyone writing their own evaluators to have to deal 
 with parsing the parameters.
 Similarly, I needed to refactor DateFormatEvaluator because I need to do some 
 custom date math/parsing and it wasn't written in a way that I can extend it.
 Please review/apply my attached patch to the next version of Solr, ie: 4.8 or 
 4.9 if I must wait.
 Thanks!



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-3671) DIH doesn't use its own interface + writerImpl has no information about the request

2014-05-05 Thread Aaron LaBella (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-3671?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13989627#comment-13989627
 ] 

Aaron LaBella commented on SOLR-3671:
-

James,

I downloaded the patch and refactored my code/test case and everything works 
fine.  Can you go ahead and commit that to branch_4x?  Your patch cleans up the 
code to use the DIHWriter interface, which is better anyhow.

Thanks.

 DIH doesn't use its own interface + writerImpl has no information about the 
 request
 ---

 Key: SOLR-3671
 URL: https://issues.apache.org/jira/browse/SOLR-3671
 Project: Solr
  Issue Type: Improvement
  Components: contrib - DataImportHandler
Affects Versions: 4.0-ALPHA, 4.0-BETA
Reporter: Roman Chyla
Assignee: James Dyer
Priority: Minor
 Attachments: SOLR-3671.patch, SOLR-3671.patch


 The use case: I would like to extend DIH by providing a new writer, I have 
 tried everything but can't accomplish it without either a) duplicating whole 
 DIHandler or b) java reflection tricks. Almost everything inside DIH is 
 private and the mechanism to instantiate a new writer based on the 
 'writerImpl' mechanism seems lacking important functionality
 It doesn't give the new class a chance to get information about the request, 
 update processor. Also, the writer is instantiated twice (when 'writerImpl' 
 is there), which is really unnecessary.
 As a solution, the existing DIHandler.getSolrWriter() should instantiate the 
 appropriate writer and send it to DocBuilder (it is already doing that for 
 SolrWriter). And DocBuilder doesn't need to create a second (duplicate) writer



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-5981) Please change method visibility of getSolrWriter in DataImportHandler to public (or at least protected)

2014-05-05 Thread Aaron LaBella (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-5981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13989625#comment-13989625
 ] 

Aaron LaBella commented on SOLR-5981:
-

James,

I downloaded the patch from SOLR-3671 and refactored my code/test case and 
everything works fine.  Can you go ahead and commit that to branch_4x?  Your 
patch cleans up the code to use the DIHWriter interface, which is better anyhow.

Thanks.

 Please change method visibility of getSolrWriter in DataImportHandler to 
 public (or at least protected)
 ---

 Key: SOLR-5981
 URL: https://issues.apache.org/jira/browse/SOLR-5981
 Project: Solr
  Issue Type: Improvement
  Components: contrib - DataImportHandler
Affects Versions: 4.0
 Environment: Linux 3.13.9-200.fc20.x86_64
 Solr 4.6.0
Reporter: Aaron LaBella
Assignee: Shawn Heisey
Priority: Minor
 Fix For: 4.9, 5.0

 Attachments: SOLR-5981.patch

   Original Estimate: 1h
  Remaining Estimate: 1h

 I've been using the org.apache.solr.handler.dataimport.DataImportHandler for 
 a bit and it's an excellent model and architecture.  I'd like to extend the 
 usage of it to plugin my own DIHWriter, but, the code doesn't allow for it.  
 Please change ~line 227 in the DataImportHander class to be:
 public SolrWriter getSolrWriter
 instead of:
 private SolrWriter getSolrWriter
 or, at a minimum, protected, so that I can extend DataImportHandler and 
 override this method.
 Thank you *sincerely* in advance for the quick turn-around on this.  If the 
 change can be made in 4.6.0 and upstream, that'd be ideal.
 Thanks!



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-6013) Fix method visibility of Evaluator, refactor DateFormatEvaluator for extensibility

2014-04-28 Thread Aaron LaBella (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-6013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13982999#comment-13982999
 ] 

Aaron LaBella commented on SOLR-6013:
-

NOTE: the patches can be applied from oldest to newest

 Fix method visibility of Evaluator, refactor DateFormatEvaluator for 
 extensibility
 --

 Key: SOLR-6013
 URL: https://issues.apache.org/jira/browse/SOLR-6013
 Project: Solr
  Issue Type: Improvement
  Components: contrib - DataImportHandler
Affects Versions: 4.7
Reporter: Aaron LaBella
 Fix For: 4.9

 Attachments: 0001-add-getters-for-datemathparser.patch, 
 0001-change-method-access-to-protected.patch, 
 0001-change-method-variable-visibility-and-refactor-for-extensibility.patch

   Original Estimate: 1h
  Remaining Estimate: 1h

 This is similar to issue 5981, the Evaluator class is declared as abstract, 
 yet the parseParams method is package private?  Surely this is an oversight, 
 as I wouldn't expect everyone writing their own evaluators to have to deal 
 with parsing the parameters.
 Similarly, I needed to refactor DateFormatEvaluator because I need to do some 
 custom date math/parsing and it wasn't written in a way that I can extend it.
 Please review/apply my attached patch to the next version of Solr, ie: 4.8 or 
 4.9 if I must wait.
 Thanks!



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-6013) Fix method visibility of Evaluator, refactor DateFormatEvaluator for extensibility

2014-04-28 Thread Aaron LaBella (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-6013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron LaBella updated SOLR-6013:


Attachment: 0001-change-method-access-to-protected.patch

Thanks Shalin,

I'm attaching another patch to change the method accessors to protected 
(instead of public) and marked the methods as lucene.experimental.  Let me know 
if there's anything else.  Otherwise, can you, or someone else commit/push 
these patches into the 4.x branch so it makes the next release?

Thanks

 Fix method visibility of Evaluator, refactor DateFormatEvaluator for 
 extensibility
 --

 Key: SOLR-6013
 URL: https://issues.apache.org/jira/browse/SOLR-6013
 Project: Solr
  Issue Type: Improvement
  Components: contrib - DataImportHandler
Affects Versions: 4.7
Reporter: Aaron LaBella
 Fix For: 4.9

 Attachments: 0001-add-getters-for-datemathparser.patch, 
 0001-change-method-access-to-protected.patch, 
 0001-change-method-variable-visibility-and-refactor-for-extensibility.patch

   Original Estimate: 1h
  Remaining Estimate: 1h

 This is similar to issue 5981, the Evaluator class is declared as abstract, 
 yet the parseParams method is package private?  Surely this is an oversight, 
 as I wouldn't expect everyone writing their own evaluators to have to deal 
 with parsing the parameters.
 Similarly, I needed to refactor DateFormatEvaluator because I need to do some 
 custom date math/parsing and it wasn't written in a way that I can extend it.
 Please review/apply my attached patch to the next version of Solr, ie: 4.8 or 
 4.9 if I must wait.
 Thanks!



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-5981) Please change method visibility of getSolrWriter in DataImportHandler to public (or at least protected)

2014-04-25 Thread Aaron LaBella (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-5981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13981186#comment-13981186
 ] 

Aaron LaBella commented on SOLR-5981:
-

Sure ... I did a proof of concept to use the DataImportHandler framework to 
import into mongodb.  I think the architecture and functionality that DIH 
supports is fantastic (ie: evaluators, transformers, etc.), and the only 
import that mongodb supports (as far as I know) is a csv.

So, I took advantage of the solr code base here to do everything that the DIH 
does, ie: connect to a DB and get data, just instead of dumping the results 
into the solr index, I actually create mongodb documents.  Actually, my proof 
of concept supports two modes: insert and copy -- the former just inserts into 
mongodb and skips solr, the second will insert documents into both.

Turns out someone else had a similar idea, but, they re-wrote half the solr dih 
framework:
http://code.google.com/p/sql-to-nosql-importer/

My solution only requires a small extension... I'm happy to share it with the 
solr community if anyone else wants it.  I think using mongodb as the document 
store and solr to index just the fields of the document you want to search on 
has the most potential for serious scalability.

Let me know if you have any additional questions/thoughts/comments.

Thanks.

 Please change method visibility of getSolrWriter in DataImportHandler to 
 public (or at least protected)
 ---

 Key: SOLR-5981
 URL: https://issues.apache.org/jira/browse/SOLR-5981
 Project: Solr
  Issue Type: Improvement
  Components: contrib - DataImportHandler
Affects Versions: 4.0
 Environment: Linux 3.13.9-200.fc20.x86_64
 Solr 4.6.0
Reporter: Aaron LaBella
Assignee: Shawn Heisey
Priority: Minor
 Fix For: 4.9, 5.0

 Attachments: SOLR-5981.patch

   Original Estimate: 1h
  Remaining Estimate: 1h

 I've been using the org.apache.solr.handler.dataimport.DataImportHandler for 
 a bit and it's an excellent model and architecture.  I'd like to extend the 
 usage of it to plugin my own DIHWriter, but, the code doesn't allow for it.  
 Please change ~line 227 in the DataImportHander class to be:
 public SolrWriter getSolrWriter
 instead of:
 private SolrWriter getSolrWriter
 or, at a minimum, protected, so that I can extend DataImportHandler and 
 override this method.
 Thank you *sincerely* in advance for the quick turn-around on this.  If the 
 change can be made in 4.6.0 and upstream, that'd be ideal.
 Thanks!



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (SOLR-6018) Solr DataImportHandler not finding dynamic fields

2014-04-25 Thread Aaron LaBella (JIRA)
Aaron LaBella created SOLR-6018:
---

 Summary: Solr DataImportHandler not finding dynamic fields
 Key: SOLR-6018
 URL: https://issues.apache.org/jira/browse/SOLR-6018
 Project: Solr
  Issue Type: Bug
  Components: contrib - DataImportHandler
Affects Versions: 4.7
Reporter: Aaron LaBella
 Fix For: 4.9


There is an issue with org.apache.solr.handler.dataimport.DocBuilder.addFields 
around ~line 643.  The logic currently says see if you can find the field from 
the schema, ie:

SchemaField sf = schema.getFieldOrNull(key);

and, if not found, go ask DIHConfiguration to find it:




--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-6018) Solr DataImportHandler not finding dynamic fields

2014-04-25 Thread Aaron LaBella (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-6018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron LaBella updated SOLR-6018:


Description: 
There is an issue with org.apache.solr.handler.dataimport.DocBuilder.addFields 
around ~line 643.  The logic currently says see if you can find the field from 
the schema, ie:

SchemaField sf = schema.getFieldOrNull(key);

and, if not found, go ask DIHConfiguration to find it, ie:

sf = config.getSchemaField(key);

The latter call takes into account case-insensitivity, which is a big deal 
since some databases, ie: DB2, upper case all the resulting column names.  In 
order to not modify solr-core (ie: the match login in IndexSchema), I'm 
attaching a patch that makes DIHConfiguration apply the same case-insensitive 
logic to the DynamicFields.

Without this patch, dynamic fields will not be added to the index unless you 
declare them like this:

  dynamicField name=*_Stype=string   indexed=true stored=true 
/

(note the capital S)

which is in-consistent with what I believe to be solr schema conventions to 
have all the schema fields as lower-case.

Thanks.


  was:
There is an issue with org.apache.solr.handler.dataimport.DocBuilder.addFields 
around ~line 643.  The logic currently says see if you can find the field from 
the schema, ie:

SchemaField sf = schema.getFieldOrNull(key);

and, if not found, go ask DIHConfiguration to find it:



 Solr DataImportHandler not finding dynamic fields
 -

 Key: SOLR-6018
 URL: https://issues.apache.org/jira/browse/SOLR-6018
 Project: Solr
  Issue Type: Bug
  Components: contrib - DataImportHandler
Affects Versions: 4.7
Reporter: Aaron LaBella
 Fix For: 4.9


 There is an issue with 
 org.apache.solr.handler.dataimport.DocBuilder.addFields around ~line 643.  
 The logic currently says see if you can find the field from the schema, ie:
 SchemaField sf = schema.getFieldOrNull(key);
 and, if not found, go ask DIHConfiguration to find it, ie:
 sf = config.getSchemaField(key);
 The latter call takes into account case-insensitivity, which is a big deal 
 since some databases, ie: DB2, upper case all the resulting column names.  In 
 order to not modify solr-core (ie: the match login in IndexSchema), I'm 
 attaching a patch that makes DIHConfiguration apply the same case-insensitive 
 logic to the DynamicFields.
 Without this patch, dynamic fields will not be added to the index unless you 
 declare them like this:
   dynamicField name=*_Stype=string   indexed=true 
 stored=true /
 (note the capital S)
 which is in-consistent with what I believe to be solr schema conventions to 
 have all the schema fields as lower-case.
 Thanks.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-6018) Solr DataImportHandler not finding dynamic fields

2014-04-25 Thread Aaron LaBella (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-6018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron LaBella updated SOLR-6018:


Description: 
There is an issue with 
*org.apache.solr.handler.dataimport.DocBuilder:addFields* around ~line 643.  
The logic currently says see if you can find the field from the schema, ie:

{code:title=DocBuilder.java|borderStyle=solid}
SchemaField sf = schema.getFieldOrNull(key);
{code}

and, if not found, go ask DIHConfiguration to find it, ie:

{code:title=DocBuilder.java|borderStyle=solid}
sf = config.getSchemaField(key);
{code}

The latter call takes into account case-insensitivity, which is a big deal 
since some databases, ie: DB2, upper case all the resulting column names.  In 
order to not modify solr-core (ie: the match login in IndexSchema), I'm 
attaching a patch that makes DIHConfiguration apply the same case-insensitive 
logic to the DynamicFields.

Without this patch, dynamic fields will not be added to the index unless you 
declare them like this:

{code:xml}
  dynamicField name=*_Stype=string   indexed=true stored=true 
/
{code}

(note the capital S)

which is in-consistent with what I believe to be solr schema conventions to 
have all the schema fields as lower-case.

Thanks.


  was:
There is an issue with org.apache.solr.handler.dataimport.DocBuilder.addFields 
around ~line 643.  The logic currently says see if you can find the field from 
the schema, ie:

SchemaField sf = schema.getFieldOrNull(key);

and, if not found, go ask DIHConfiguration to find it, ie:

sf = config.getSchemaField(key);

The latter call takes into account case-insensitivity, which is a big deal 
since some databases, ie: DB2, upper case all the resulting column names.  In 
order to not modify solr-core (ie: the match login in IndexSchema), I'm 
attaching a patch that makes DIHConfiguration apply the same case-insensitive 
logic to the DynamicFields.

Without this patch, dynamic fields will not be added to the index unless you 
declare them like this:

  dynamicField name=*_Stype=string   indexed=true stored=true 
/

(note the capital S)

which is in-consistent with what I believe to be solr schema conventions to 
have all the schema fields as lower-case.

Thanks.



 Solr DataImportHandler not finding dynamic fields
 -

 Key: SOLR-6018
 URL: https://issues.apache.org/jira/browse/SOLR-6018
 Project: Solr
  Issue Type: Bug
  Components: contrib - DataImportHandler
Affects Versions: 4.7
Reporter: Aaron LaBella
 Fix For: 4.9

 Attachments: 0001-dih-config-check-for-dynamic-fields.patch


 There is an issue with 
 *org.apache.solr.handler.dataimport.DocBuilder:addFields* around ~line 643.  
 The logic currently says see if you can find the field from the schema, ie:
 {code:title=DocBuilder.java|borderStyle=solid}
 SchemaField sf = schema.getFieldOrNull(key);
 {code}
 and, if not found, go ask DIHConfiguration to find it, ie:
 {code:title=DocBuilder.java|borderStyle=solid}
 sf = config.getSchemaField(key);
 {code}
 The latter call takes into account case-insensitivity, which is a big deal 
 since some databases, ie: DB2, upper case all the resulting column names.  In 
 order to not modify solr-core (ie: the match login in IndexSchema), I'm 
 attaching a patch that makes DIHConfiguration apply the same case-insensitive 
 logic to the DynamicFields.
 Without this patch, dynamic fields will not be added to the index unless you 
 declare them like this:
 {code:xml}
   dynamicField name=*_Stype=string   indexed=true 
 stored=true /
 {code}
 (note the capital S)
 which is in-consistent with what I believe to be solr schema conventions to 
 have all the schema fields as lower-case.
 Thanks.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-6018) Solr DataImportHandler not finding dynamic fields

2014-04-25 Thread Aaron LaBella (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-6018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron LaBella updated SOLR-6018:


Description: 
There is an issue with 
*org.apache.solr.handler.dataimport.DocBuilder:addFields* around ~line 643.  
The logic currently says see if you can find the field from the schema, ie:

{code:title=DocBuilder.java|borderStyle=solid}
SchemaField sf = schema.getFieldOrNull(key);
{code}

and, if not found, go ask DIHConfiguration to find it, ie:

{code:title=DocBuilder.java|borderStyle=solid}
sf = config.getSchemaField(key);
{code}

The latter call takes into account case-insensitivity, which is a big deal 
since some databases, ie: DB2, upper case all the resulting column names.  In 
order to not modify solr-core (ie: the match login in IndexSchema), I'm 
attaching a patch that makes DIHConfiguration apply the same case-insensitive 
logic to the DynamicFields.

Without this patch, dynamic fields will not be added to the index unless you 
declare them like this:

{code:xml}
  dynamicField name=*_S type=string indexed=true stored=true /
{code}

(note the capital S)

which is in-consistent with what I believe to be solr schema conventions to 
have all the schema fields as lower-case.

Thanks.


  was:
There is an issue with 
*org.apache.solr.handler.dataimport.DocBuilder:addFields* around ~line 643.  
The logic currently says see if you can find the field from the schema, ie:

{code:title=DocBuilder.java|borderStyle=solid}
SchemaField sf = schema.getFieldOrNull(key);
{code}

and, if not found, go ask DIHConfiguration to find it, ie:

{code:title=DocBuilder.java|borderStyle=solid}
sf = config.getSchemaField(key);
{code}

The latter call takes into account case-insensitivity, which is a big deal 
since some databases, ie: DB2, upper case all the resulting column names.  In 
order to not modify solr-core (ie: the match login in IndexSchema), I'm 
attaching a patch that makes DIHConfiguration apply the same case-insensitive 
logic to the DynamicFields.

Without this patch, dynamic fields will not be added to the index unless you 
declare them like this:

{code:xml}
  dynamicField name=*_Stype=string   indexed=true stored=true 
/
{code}

(note the capital S)

which is in-consistent with what I believe to be solr schema conventions to 
have all the schema fields as lower-case.

Thanks.



 Solr DataImportHandler not finding dynamic fields
 -

 Key: SOLR-6018
 URL: https://issues.apache.org/jira/browse/SOLR-6018
 Project: Solr
  Issue Type: Bug
  Components: contrib - DataImportHandler
Affects Versions: 4.7
Reporter: Aaron LaBella
 Fix For: 4.9

 Attachments: 0001-dih-config-check-for-dynamic-fields.patch


 There is an issue with 
 *org.apache.solr.handler.dataimport.DocBuilder:addFields* around ~line 643.  
 The logic currently says see if you can find the field from the schema, ie:
 {code:title=DocBuilder.java|borderStyle=solid}
 SchemaField sf = schema.getFieldOrNull(key);
 {code}
 and, if not found, go ask DIHConfiguration to find it, ie:
 {code:title=DocBuilder.java|borderStyle=solid}
 sf = config.getSchemaField(key);
 {code}
 The latter call takes into account case-insensitivity, which is a big deal 
 since some databases, ie: DB2, upper case all the resulting column names.  In 
 order to not modify solr-core (ie: the match login in IndexSchema), I'm 
 attaching a patch that makes DIHConfiguration apply the same case-insensitive 
 logic to the DynamicFields.
 Without this patch, dynamic fields will not be added to the index unless you 
 declare them like this:
 {code:xml}
   dynamicField name=*_S type=string indexed=true stored=true /
 {code}
 (note the capital S)
 which is in-consistent with what I believe to be solr schema conventions to 
 have all the schema fields as lower-case.
 Thanks.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-6018) Solr DataImportHandler not finding dynamic fields

2014-04-25 Thread Aaron LaBella (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-6018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron LaBella updated SOLR-6018:


Attachment: 0001-dih-config-check-for-dynamic-fields.patch

 Solr DataImportHandler not finding dynamic fields
 -

 Key: SOLR-6018
 URL: https://issues.apache.org/jira/browse/SOLR-6018
 Project: Solr
  Issue Type: Bug
  Components: contrib - DataImportHandler
Affects Versions: 4.7
Reporter: Aaron LaBella
 Fix For: 4.9

 Attachments: 0001-dih-config-check-for-dynamic-fields.patch


 There is an issue with 
 *org.apache.solr.handler.dataimport.DocBuilder:addFields* around ~line 643.  
 The logic currently says see if you can find the field from the schema, ie:
 {code:title=DocBuilder.java|borderStyle=solid}
 SchemaField sf = schema.getFieldOrNull(key);
 {code}
 and, if not found, go ask DIHConfiguration to find it, ie:
 {code:title=DocBuilder.java|borderStyle=solid}
 sf = config.getSchemaField(key);
 {code}
 The latter call takes into account case-insensitivity, which is a big deal 
 since some databases, ie: DB2, upper case all the resulting column names.  In 
 order to not modify solr-core (ie: the match login in IndexSchema), I'm 
 attaching a patch that makes DIHConfiguration apply the same case-insensitive 
 logic to the DynamicFields.
 Without this patch, dynamic fields will not be added to the index unless you 
 declare them like this:
 {code:xml}
   dynamicField name=*_Stype=string   indexed=true 
 stored=true /
 {code}
 (note the capital S)
 which is in-consistent with what I believe to be solr schema conventions to 
 have all the schema fields as lower-case.
 Thanks.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-5981) Please change method visibility of getSolrWriter in DataImportHandler to public (or at least protected)

2014-04-25 Thread Aaron LaBella (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-5981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13981251#comment-13981251
 ] 

Aaron LaBella commented on SOLR-5981:
-

James, good point.  But, I think the natural place to start extending things is 
by extending the DataImportHandler so that you can take advantage of a request 
handler, ie:

requestHandler name=/dih 
class=org.apache.solr.handler.dataimport.DataImportHandler

Otherwise, you have to start doing all that work yourself.

 Please change method visibility of getSolrWriter in DataImportHandler to 
 public (or at least protected)
 ---

 Key: SOLR-5981
 URL: https://issues.apache.org/jira/browse/SOLR-5981
 Project: Solr
  Issue Type: Improvement
  Components: contrib - DataImportHandler
Affects Versions: 4.0
 Environment: Linux 3.13.9-200.fc20.x86_64
 Solr 4.6.0
Reporter: Aaron LaBella
Assignee: Shawn Heisey
Priority: Minor
 Fix For: 4.9, 5.0

 Attachments: SOLR-5981.patch

   Original Estimate: 1h
  Remaining Estimate: 1h

 I've been using the org.apache.solr.handler.dataimport.DataImportHandler for 
 a bit and it's an excellent model and architecture.  I'd like to extend the 
 usage of it to plugin my own DIHWriter, but, the code doesn't allow for it.  
 Please change ~line 227 in the DataImportHander class to be:
 public SolrWriter getSolrWriter
 instead of:
 private SolrWriter getSolrWriter
 or, at a minimum, protected, so that I can extend DataImportHandler and 
 override this method.
 Thank you *sincerely* in advance for the quick turn-around on this.  If the 
 change can be made in 4.6.0 and upstream, that'd be ideal.
 Thanks!



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-6018) Solr DataImportHandler not finding dynamic fields

2014-04-25 Thread Aaron LaBella (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-6018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron LaBella updated SOLR-6018:


Description: 
There is an issue with 
*org.apache.solr.handler.dataimport.DocBuilder:addFields* around ~line 643.  
The logic currently says see if you can find the field from the schema, ie:

{code:title=DocBuilder.java|borderStyle=solid}
SchemaField sf = schema.getFieldOrNull(key);
{code}

and, if not found, go ask DIHConfiguration to find it, ie:

{code:title=DocBuilder.java|borderStyle=solid}
sf = config.getSchemaField(key);
{code}

The latter call takes into account case-insensitivity, which is a big deal 
since some databases, ie: DB2, upper case all the resulting column names.  In 
order to not modify solr-core (ie: the match logic in IndexSchema), I'm 
attaching a patch that makes DIHConfiguration apply the same case-insensitive 
logic to the DynamicFields.

Without this patch, dynamic fields will not be added to the index unless you 
declare them like this:

{code:xml}
  dynamicField name=*_S type=string indexed=true stored=true /
{code}

(note the capital S)

which is in-consistent with what I believe to be solr schema conventions to 
have all the schema fields as lower-case.

Thanks.


  was:
There is an issue with 
*org.apache.solr.handler.dataimport.DocBuilder:addFields* around ~line 643.  
The logic currently says see if you can find the field from the schema, ie:

{code:title=DocBuilder.java|borderStyle=solid}
SchemaField sf = schema.getFieldOrNull(key);
{code}

and, if not found, go ask DIHConfiguration to find it, ie:

{code:title=DocBuilder.java|borderStyle=solid}
sf = config.getSchemaField(key);
{code}

The latter call takes into account case-insensitivity, which is a big deal 
since some databases, ie: DB2, upper case all the resulting column names.  In 
order to not modify solr-core (ie: the match login in IndexSchema), I'm 
attaching a patch that makes DIHConfiguration apply the same case-insensitive 
logic to the DynamicFields.

Without this patch, dynamic fields will not be added to the index unless you 
declare them like this:

{code:xml}
  dynamicField name=*_S type=string indexed=true stored=true /
{code}

(note the capital S)

which is in-consistent with what I believe to be solr schema conventions to 
have all the schema fields as lower-case.

Thanks.



 Solr DataImportHandler not finding dynamic fields
 -

 Key: SOLR-6018
 URL: https://issues.apache.org/jira/browse/SOLR-6018
 Project: Solr
  Issue Type: Bug
  Components: contrib - DataImportHandler
Affects Versions: 4.7
Reporter: Aaron LaBella
 Fix For: 4.9

 Attachments: 0001-dih-config-check-for-dynamic-fields.patch


 There is an issue with 
 *org.apache.solr.handler.dataimport.DocBuilder:addFields* around ~line 643.  
 The logic currently says see if you can find the field from the schema, ie:
 {code:title=DocBuilder.java|borderStyle=solid}
 SchemaField sf = schema.getFieldOrNull(key);
 {code}
 and, if not found, go ask DIHConfiguration to find it, ie:
 {code:title=DocBuilder.java|borderStyle=solid}
 sf = config.getSchemaField(key);
 {code}
 The latter call takes into account case-insensitivity, which is a big deal 
 since some databases, ie: DB2, upper case all the resulting column names.  In 
 order to not modify solr-core (ie: the match logic in IndexSchema), I'm 
 attaching a patch that makes DIHConfiguration apply the same case-insensitive 
 logic to the DynamicFields.
 Without this patch, dynamic fields will not be added to the index unless you 
 declare them like this:
 {code:xml}
   dynamicField name=*_S type=string indexed=true stored=true /
 {code}
 (note the capital S)
 which is in-consistent with what I believe to be solr schema conventions to 
 have all the schema fields as lower-case.
 Thanks.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-6013) Fix method visibility of Evaluator, refactor DateFormatEvaluator for extensibility

2014-04-24 Thread Aaron LaBella (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-6013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron LaBella updated SOLR-6013:


Attachment: 
0001-change-method-variable-visibility-and-refactor-for-extensibility.patch

attaching the patch for review/comments.

 Fix method visibility of Evaluator, refactor DateFormatEvaluator for 
 extensibility
 --

 Key: SOLR-6013
 URL: https://issues.apache.org/jira/browse/SOLR-6013
 Project: Solr
  Issue Type: Improvement
  Components: contrib - DataImportHandler
Affects Versions: 4.7
Reporter: Aaron LaBella
 Fix For: 4.8

 Attachments: 
 0001-change-method-variable-visibility-and-refactor-for-extensibility.patch

   Original Estimate: 1h
  Remaining Estimate: 1h

 This is similar to issue 5981, the Evaluator class is declared as abstract, 
 yet the parseParams method is package private?  Surely this is an oversight, 
 as I wouldn't expect everyone writing their own evaluators to have to deal 
 with parsing the parameters.
 Similarly, I needed to refactor DateFormatEvaluator because I need to do some 
 custom date math/parsing and it wasn't written in a way that I can extend it.
 Please review/apply my attached patch to the next version of Solr, ie: 4.8 or 
 4.9 if I must wait.
 Thanks!



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (SOLR-6013) Fix method visibility of Evaluator, refactor DateFormatEvaluator for extensibility

2014-04-24 Thread Aaron LaBella (JIRA)
Aaron LaBella created SOLR-6013:
---

 Summary: Fix method visibility of Evaluator, refactor 
DateFormatEvaluator for extensibility
 Key: SOLR-6013
 URL: https://issues.apache.org/jira/browse/SOLR-6013
 Project: Solr
  Issue Type: Improvement
  Components: contrib - DataImportHandler
Affects Versions: 4.7
Reporter: Aaron LaBella
 Fix For: 4.8
 Attachments: 
0001-change-method-variable-visibility-and-refactor-for-extensibility.patch

This is similar to issue 5981, the Evaluator class is declared as abstract, yet 
the parseParams method is package private?  Surely this is an oversight, as I 
wouldn't expect everyone writing their own evaluators to have to deal with 
parsing the parameters.

Similarly, I needed to refactor DateFormatEvaluator because I need to do some 
custom date math/parsing and it wasn't written in a way that I can extend it.

Please review/apply my attached patch to the next version of Solr, ie: 4.8 or 
4.9 if I must wait.

Thanks!



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-5981) Please change method visibility of getSolrWriter in DataImportHandler to public (or at least protected)

2014-04-24 Thread Aaron LaBella (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-5981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13980061#comment-13980061
 ] 

Aaron LaBella commented on SOLR-5981:
-

I'm not seeing this fix in the git mirror of lucene-solr?  I'm also wondering 
why it was moved from 4.8 into 4.9, I thought it was ready to go?

Thanks.

 Please change method visibility of getSolrWriter in DataImportHandler to 
 public (or at least protected)
 ---

 Key: SOLR-5981
 URL: https://issues.apache.org/jira/browse/SOLR-5981
 Project: Solr
  Issue Type: Improvement
  Components: contrib - DataImportHandler
Affects Versions: 4.0
 Environment: Linux 3.13.9-200.fc20.x86_64
 Solr 4.6.0
Reporter: Aaron LaBella
Assignee: Shawn Heisey
Priority: Minor
 Fix For: 4.9, 5.0

 Attachments: SOLR-5981.patch

   Original Estimate: 1h
  Remaining Estimate: 1h

 I've been using the org.apache.solr.handler.dataimport.DataImportHandler for 
 a bit and it's an excellent model and architecture.  I'd like to extend the 
 usage of it to plugin my own DIHWriter, but, the code doesn't allow for it.  
 Please change ~line 227 in the DataImportHander class to be:
 public SolrWriter getSolrWriter
 instead of:
 private SolrWriter getSolrWriter
 or, at a minimum, protected, so that I can extend DataImportHandler and 
 override this method.
 Thank you *sincerely* in advance for the quick turn-around on this.  If the 
 change can be made in 4.6.0 and upstream, that'd be ideal.
 Thanks!



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-6013) Fix method visibility of Evaluator, refactor DateFormatEvaluator for extensibility

2014-04-24 Thread Aaron LaBella (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-6013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron LaBella updated SOLR-6013:


Attachment: 0001-add-getters-for-datemathparser.patch

one more small patch after fully testing the changes for extensibility

 Fix method visibility of Evaluator, refactor DateFormatEvaluator for 
 extensibility
 --

 Key: SOLR-6013
 URL: https://issues.apache.org/jira/browse/SOLR-6013
 Project: Solr
  Issue Type: Improvement
  Components: contrib - DataImportHandler
Affects Versions: 4.7
Reporter: Aaron LaBella
 Fix For: 4.8

 Attachments: 0001-add-getters-for-datemathparser.patch, 
 0001-change-method-variable-visibility-and-refactor-for-extensibility.patch

   Original Estimate: 1h
  Remaining Estimate: 1h

 This is similar to issue 5981, the Evaluator class is declared as abstract, 
 yet the parseParams method is package private?  Surely this is an oversight, 
 as I wouldn't expect everyone writing their own evaluators to have to deal 
 with parsing the parameters.
 Similarly, I needed to refactor DateFormatEvaluator because I need to do some 
 custom date math/parsing and it wasn't written in a way that I can extend it.
 Please review/apply my attached patch to the next version of Solr, ie: 4.8 or 
 4.9 if I must wait.
 Thanks!



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-5981) Please change method visibility of getSolrWriter in DataImportHandler to public (or at least protected)

2014-04-24 Thread Aaron LaBella (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-5981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13980508#comment-13980508
 ] 

Aaron LaBella commented on SOLR-5981:
-

Shawn,

No problem, makes sense ... when you get a chance, can you go ahead and commit 
the patch?
Thanks!

 Please change method visibility of getSolrWriter in DataImportHandler to 
 public (or at least protected)
 ---

 Key: SOLR-5981
 URL: https://issues.apache.org/jira/browse/SOLR-5981
 Project: Solr
  Issue Type: Improvement
  Components: contrib - DataImportHandler
Affects Versions: 4.0
 Environment: Linux 3.13.9-200.fc20.x86_64
 Solr 4.6.0
Reporter: Aaron LaBella
Assignee: Shawn Heisey
Priority: Minor
 Fix For: 4.9, 5.0

 Attachments: SOLR-5981.patch

   Original Estimate: 1h
  Remaining Estimate: 1h

 I've been using the org.apache.solr.handler.dataimport.DataImportHandler for 
 a bit and it's an excellent model and architecture.  I'd like to extend the 
 usage of it to plugin my own DIHWriter, but, the code doesn't allow for it.  
 Please change ~line 227 in the DataImportHander class to be:
 public SolrWriter getSolrWriter
 instead of:
 private SolrWriter getSolrWriter
 or, at a minimum, protected, so that I can extend DataImportHandler and 
 override this method.
 Thank you *sincerely* in advance for the quick turn-around on this.  If the 
 change can be made in 4.6.0 and upstream, that'd be ideal.
 Thanks!



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-5981) Please change method visibility of getSolrWriter in DataImportHandler to public (or at least protected)

2014-04-13 Thread Aaron LaBella (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-5981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13967816#comment-13967816
 ] 

Aaron LaBella commented on SOLR-5981:
-

Erick,

Thanks -- will do.  I probably would've done that but my SVN skills aren't that 
great.  I accidentally built from trunk first, and then realized I should've 
built against a branch.  Then, I tried to run git svn clone ... but that seemed 
to take forever as well.  Just curious -- are there any plans to migrate 
lucene/solr to a git repository?  +1 for git from me ;-)

Thanks.

Aaron

 Please change method visibility of getSolrWriter in DataImportHandler to 
 public (or at least protected)
 ---

 Key: SOLR-5981
 URL: https://issues.apache.org/jira/browse/SOLR-5981
 Project: Solr
  Issue Type: Improvement
  Components: contrib - DataImportHandler
Affects Versions: 4.0
 Environment: Linux 3.13.9-200.fc20.x86_64
 Solr 4.6.0
Reporter: Aaron LaBella
Assignee: Shawn Heisey
Priority: Minor
 Fix For: 4.8, 5.0

 Attachments: SOLR-5981.patch

   Original Estimate: 1h
  Remaining Estimate: 1h

 I've been using the org.apache.solr.handler.dataimport.DataImportHandler for 
 a bit and it's an excellent model and architecture.  I'd like to extend the 
 usage of it to plugin my own DIHWriter, but, the code doesn't allow for it.  
 Please change ~line 227 in the DataImportHander class to be:
 public SolrWriter getSolrWriter
 instead of:
 private SolrWriter getSolrWriter
 or, at a minimum, protected, so that I can extend DataImportHandler and 
 override this method.
 Thank you *sincerely* in advance for the quick turn-around on this.  If the 
 change can be made in 4.6.0 and upstream, that'd be ideal.
 Thanks!



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-5981) Please change method visibility of getSolrWriter in DataImportHandler to public (or at least protected)

2014-04-12 Thread Aaron LaBella (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-5981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13967471#comment-13967471
 ] 

Aaron LaBella commented on SOLR-5981:
-

@Shawn, Thanks!  That was super fast.  The patch looks good to me.  And, for 
what it's worth, I had actually done a test before posting this issue in the 
svn trunk by changing the getSolrWriter method scope to public and it worked 
just fine, so, I'm sure changing everything to protected will be equally as 
helpful.  Thanks again, I think this is a great change to extend the usage of 
DataImportHandler.  I vote yes to the patch.

 Please change method visibility of getSolrWriter in DataImportHandler to 
 public (or at least protected)
 ---

 Key: SOLR-5981
 URL: https://issues.apache.org/jira/browse/SOLR-5981
 Project: Solr
  Issue Type: Improvement
  Components: contrib - DataImportHandler
Affects Versions: 4.0
 Environment: Linux 3.13.9-200.fc20.x86_64
 Solr 4.6.0
Reporter: Aaron LaBella
Assignee: Shawn Heisey
Priority: Minor
 Fix For: 4.8, 5.0

 Attachments: SOLR-5981.patch

   Original Estimate: 1h
  Remaining Estimate: 1h

 I've been using the org.apache.solr.handler.dataimport.DataImportHandler for 
 a bit and it's an excellent model and architecture.  I'd like to extend the 
 usage of it to plugin my own DIHWriter, but, the code doesn't allow for it.  
 Please change ~line 227 in the DataImportHander class to be:
 public SolrWriter getSolrWriter
 instead of:
 private SolrWriter getSolrWriter
 or, at a minimum, protected, so that I can extend DataImportHandler and 
 override this method.
 Thank you *sincerely* in advance for the quick turn-around on this.  If the 
 change can be made in 4.6.0 and upstream, that'd be ideal.
 Thanks!



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (SOLR-5981) Please change method visibility of getSolrWriter in DataImportHandler to public (or at least protected)

2014-04-11 Thread Aaron LaBella (JIRA)
Aaron LaBella created SOLR-5981:
---

 Summary: Please change method visibility of getSolrWriter in 
DataImportHandler to public (or at least protected)
 Key: SOLR-5981
 URL: https://issues.apache.org/jira/browse/SOLR-5981
 Project: Solr
  Issue Type: Bug
  Components: contrib - DataImportHandler
Affects Versions: 4.0
 Environment: Linux 3.13.9-200.fc20.x86_64
Solr 4.6.0
Reporter: Aaron LaBella
Priority: Blocker
 Fix For: 4.6


I've been using the org.apache.solr.handler.dataimport.DataImportHandler for a 
bit and it's an excellent model and architecture.  I'd like to extend the usage 
of it to plugin my own DIHWriter, but, the code doesn't allow for it.  Please 
change ~line 227 in the DataImportHander class to be:

public SolrWriter getSolrWriter

instead of:

private SolrWriter getSolrWriter

or, at a minimum, protected, so that I can extend DataImportHandler and 
override this method.

Thank you *sincerely* in advance for the quick turn-around on this.  If the 
change can be made in 4.6.0 and upstream, that'd be ideal.

Thanks!



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org