[jira] [Commented] (LUCENE-765) Index package level javadocs needs content

2018-03-15 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-765?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16400046#comment-16400046
 ] 

Lewis John McGibbney commented on LUCENE-765:
-

Hi Folks, I would be interested in taking this issue on as what I believe would 
be my first Lucene contribution.
The _index_ package is significant though. [~gsingers] can you scope out any 
immedite areas where you think the documentation would benefit?
Thanks

> Index package level javadocs needs content
> --
>
> Key: LUCENE-765
> URL: https://issues.apache.org/jira/browse/LUCENE-765
> Project: Lucene - Core
>  Issue Type: Wish
>  Components: general/javadocs
>Reporter: Grant Ingersoll
>Priority: Minor
>  Labels: newdev
>
> The org.apache.lucene.index package level javadocs are sorely lacking.  They 
> should be updated to give a summary of the important classes, how indexing 
> works, etc.  Maybe give an overview of how the different writers coordinate.  
> Links to file formats, information on the posting algorithm, etc. would be 
> helpful.
> See the search package javadocs as a sample of the kind of info that could go 
> here.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-2461) Forrest skin resource docs/skin/fontsize.js causes annoying javascript alert when embedded docs are viewed locally with Chrome

2016-10-05 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-2461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15550944#comment-15550944
 ] 

Lewis John McGibbney commented on LUCENE-2461:
--

This issue can be closed as wont fix?

> Forrest skin resource docs/skin/fontsize.js causes annoying javascript alert 
> when embedded docs are viewed locally with Chrome
> --
>
> Key: LUCENE-2461
> URL: https://issues.apache.org/jira/browse/LUCENE-2461
> Project: Lucene - Core
>  Issue Type: Bug
>  Components: core/other
>Affects Versions: 3.0.1
> Environment: linux/Google chrome
>Reporter: Sami Siren
>Priority: Trivial
> Attachments: LUCENE-2461.patch
>
>
> When displaying the documentation from local filesystem (file:// urls) a 
> JavaScript alert popup is displayed on every forrest generated page:
> "Error: SECURITY_ERR: DOM Exception 18"
> This seems to be related to Chrome not allowing reading cookies for such urls.
> One could fix this by patching the javascript file (docs/skin/fontsize.js) to 
> check for the protocol. 
> It does not seem to be currently possible to alter the text size on the skin 
> at all? If so then perhaps the whole .js file could be disabled instead?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-8714) Implement translation contrib package for LanguageTranslationUpdateProcessor's

2016-07-12 Thread Lewis John McGibbney (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-8714?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lewis John McGibbney updated SOLR-8714:
---
Fix Version/s: (was: 6.0)
   6.2

> Implement translation contrib package for LanguageTranslationUpdateProcessor's
> --
>
> Key: SOLR-8714
> URL: https://issues.apache.org/jira/browse/SOLR-8714
> Project: Solr
>  Issue Type: New Feature
>Reporter: Lewis John McGibbney
>Assignee: Tommaso Teofili
> Fix For: 6.2
>
>
> A while back over in Tika we implemented the 
> [Translator|https://github.com/apache/tika/blob/master/tika-core/src/main/java/org/apache/tika/language/translate/Translator.java]
>  interface. This now provides a number of 
> [implementations|https://github.com/apache/tika/tree/master/tika-translate/src/main/java/org/apache/tika/language/translate].
>  
> This issue will provide a  translation contrib package offering a 
> LanguageTranslationUpdateProcessor.
> The new processor will probably utilize the existing [Solr Language 
> Identifier|https://github.com/apache/lucene-solr/tree/master/solr/contrib/langid]
>  and would enable a document to be translated based upon a user defined 
> mapping. The LanguageTranslatorUpdateProcessor's should be pluggable and 
> would be placed in an UpdateChain the same as the 
> [LanguageIdentifierUpdateProcessor|https://github.com/apache/lucene-solr/blob/master/solr/contrib/langid/src/java/org/apache/solr/update/processor/LanguageIdentifierUpdateProcessor.java]'s
> It is my intent to also provide a wiki page which can be referenced and 
> maintained in conjunction with the code. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-18 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338199#comment-15338199
 ] 

Lewis John McGibbney commented on SOLR-8981:


Where is morph lines? Can someone please explain explicitly what the issue
is here and I'll have a crack at fixing it.
Thanks




-- 
*Lewis*


> Upgrade to Tika 1.13 when it is available
> -
>
> Key: SOLR-8981
> URL: https://issues.apache.org/jira/browse/SOLR-8981
> Project: Solr
>  Issue Type: Improvement
>  Components: contrib - Solr Cell (Tika extraction)
>Reporter: Tim Allison
>Assignee: Uwe Schindler
> Fix For: master (7.0), 6.2
>
>
> Tika 1.13 should be out within a month.  This includes PDFBox 2.0.0 and a 
> number of other upgrades and improvements.  
> If there are any showstoppers in 1.13 from Solr's side or requests before we 
> roll 1.13, let us know.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336217#comment-15336217
 ] 

Lewis John McGibbney commented on SOLR-8981:


Nice work




-- 
*Lewis*


> Upgrade to Tika 1.13 when it is available
> -
>
> Key: SOLR-8981
> URL: https://issues.apache.org/jira/browse/SOLR-8981
> Project: Solr
>  Issue Type: Improvement
>Reporter: Tim Allison
>Assignee: Uwe Schindler
>Priority: Minor
>
> Tika 1.13 should be out within a month.  This includes PDFBox 2.0.0 and a 
> number of other upgrades and improvements.  
> If there are any showstoppers in 1.13 from Solr's side or requests before we 
> roll 1.13, let us know.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-16 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15334085#comment-15334085
 ] 

Lewis John McGibbney commented on SOLR-8981:


Brilliant. The most recent patch I submitted matches Tika 1.13 dependencies
less scientific data formats and all of the other non 'document' formats.
Thanks for rebuilding Tim it's appreciated.




-- 
*Lewis*


> Upgrade to Tika 1.13 when it is available
> -
>
> Key: SOLR-8981
> URL: https://issues.apache.org/jira/browse/SOLR-8981
> Project: Solr
>  Issue Type: Improvement
>Reporter: Tim Allison
>Priority: Minor
>
> Tika 1.13 should be out within a month.  This includes PDFBox 2.0.0 and a 
> number of other upgrades and improvements.  
> If there are any showstoppers in 1.13 from Solr's side or requests before we 
> roll 1.13, let us know.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-05-26 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15302442#comment-15302442
 ] 

Lewis John McGibbney commented on SOLR-8981:


I am working on this again and will try to post a patch ASAP. 
[~talli...@mitre.org]. I have the following test failing in Solr
https://github.com/apache/lucene-solr/blob/master/solr/contrib/extraction/src/test/org/apache/solr/handler/extraction/ExtractingRequestHandlerTest.java#L505
I have been debugging the tests with no luck as of yet. I'll post a new PR 
later today. The new PR is rebased against lucene-solr master and Tika 1.13

> Upgrade to Tika 1.13 when it is available
> -
>
> Key: SOLR-8981
> URL: https://issues.apache.org/jira/browse/SOLR-8981
> Project: Solr
>  Issue Type: Improvement
>Reporter: Tim Allison
>Priority: Minor
>
> Tika 1.13 should be out within a month.  This includes PDFBox 2.0.0 and a 
> number of other upgrades and improvements.  
> If there are any showstoppers in 1.13 from Solr's side or requests before we 
> roll 1.13, let us know.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-8716) Upgrade to Apache Tika 1.12

2016-04-26 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15258091#comment-15258091
 ] 

Lewis John McGibbney commented on SOLR-8716:


Hi [~thetaphi], argh, yes you are right. Lets wait for Tika 1.13. There is [a 
conversation|http://www.mail-archive.com/dev%40tika.apache.org/msg17480.html] 
about this right now.


> Upgrade to Apache Tika 1.12
> ---
>
> Key: SOLR-8716
> URL: https://issues.apache.org/jira/browse/SOLR-8716
> Project: Solr
>  Issue Type: Improvement
>  Components: contrib - Solr Cell (Tika extraction)
>Reporter: Lewis John McGibbney
>Assignee: Uwe Schindler
> Fix For: master
>
> Attachments: LUCENE-7041.patch
>
>
> We recently released Apache Tika 1.12. In order to use the fixes provided 
> within the Tika.translate API I propose to upgrade Tika from 1.7 --> 1.12 in 
> lucene/ivy-versions.properties.
> Patch coming up.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-8716) Upgrade to Apache Tika 1.12

2016-04-20 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15251249#comment-15251249
 ] 

Lewis John McGibbney commented on SOLR-8716:


[~thetaphi] [~janhoy] out of curiosity how do patches in lucene-solr typically 
get reviewed? Do you have a pre-commit build or something set up? Thanks

> Upgrade to Apache Tika 1.12
> ---
>
> Key: SOLR-8716
> URL: https://issues.apache.org/jira/browse/SOLR-8716
> Project: Solr
>  Issue Type: Improvement
>  Components: contrib - Solr Cell (Tika extraction)
>Reporter: Lewis John McGibbney
>Assignee: Uwe Schindler
> Fix For: master
>
> Attachments: LUCENE-7041.patch
>
>
> We recently released Apache Tika 1.12. In order to use the fixes provided 
> within the Tika.translate API I propose to upgrade Tika from 1.7 --> 1.12 in 
> lucene/ivy-versions.properties.
> Patch coming up.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-8716) Upgrade to Apache Tika 1.12

2016-04-20 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15250819#comment-15250819
 ] 

Lewis John McGibbney commented on SOLR-8716:


PR is updated to
1) ensure that the new dependencies involved in the above parsers are 
lexicographically ordered in lucene/ivy-versions.properties, and
2) that they are included within solr/contrib/extraction/ivy.xml

> Upgrade to Apache Tika 1.12
> ---
>
> Key: SOLR-8716
> URL: https://issues.apache.org/jira/browse/SOLR-8716
> Project: Solr
>  Issue Type: Improvement
>  Components: contrib - Solr Cell (Tika extraction)
>Reporter: Lewis John McGibbney
>Assignee: Uwe Schindler
> Fix For: master
>
> Attachments: LUCENE-7041.patch
>
>
> We recently released Apache Tika 1.12. In order to use the fixes provided 
> within the Tika.translate API I propose to upgrade Tika from 1.7 --> 1.12 in 
> lucene/ivy-versions.properties.
> Patch coming up.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-8716) Upgrade to Apache Tika 1.12

2016-04-20 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15250294#comment-15250294
 ] 

Lewis John McGibbney commented on SOLR-8716:


New parsers are
{code}
org.apache.tika.parser.image.WebPParser
org.apache.tika.parser.microsoft.JackcessParser
org.apache.tika.parser.pkg.RarParser
org.apache.tika.parser.dif.DIFParser
org.apache.tika.parser.gdal.GDALParser
org.apache.tika.parser.pot.PooledTimeSeriesParser
org.apache.tika.parser.grib.GribParser
org.apache.tika.parser.jdbc.SQLite3Parser
org.apache.tika.parser.isatab.ISArchiveParser
org.apache.tika.parser.geoinfo.GeographicInformationParser
org.apache.tika.parser.geo.topic.GeoParser
org.apache.tika.parser.external.CompositeExternalParser
org.apache.tika.parser.journal.JournalParser
{code}

> Upgrade to Apache Tika 1.12
> ---
>
> Key: SOLR-8716
> URL: https://issues.apache.org/jira/browse/SOLR-8716
> Project: Solr
>  Issue Type: Improvement
>  Components: contrib - Solr Cell (Tika extraction)
>Reporter: Lewis John McGibbney
>Assignee: Uwe Schindler
> Fix For: master
>
> Attachments: LUCENE-7041.patch
>
>
> We recently released Apache Tika 1.12. In order to use the fixes provided 
> within the Tika.translate API I propose to upgrade Tika from 1.7 --> 1.12 in 
> lucene/ivy-versions.properties.
> Patch coming up.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-8716) Upgrade to Apache Tika 1.12

2016-04-20 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15249398#comment-15249398
 ] 

Lewis John McGibbney commented on SOLR-8716:


bq. Are there any new parsers in 1.12 that we could use - which do not have the 
required dependency added?

There are a few... I'll get a comprehensive list and update in due course. 

> Upgrade to Apache Tika 1.12
> ---
>
> Key: SOLR-8716
> URL: https://issues.apache.org/jira/browse/SOLR-8716
> Project: Solr
>  Issue Type: Improvement
>  Components: contrib - Solr Cell (Tika extraction)
>Reporter: Lewis John McGibbney
>Assignee: Uwe Schindler
> Fix For: master
>
> Attachments: LUCENE-7041.patch
>
>
> We recently released Apache Tika 1.12. In order to use the fixes provided 
> within the Tika.translate API I propose to upgrade Tika from 1.7 --> 1.12 in 
> lucene/ivy-versions.properties.
> Patch coming up.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-8716) Upgrade to Apache Tika 1.12

2016-04-19 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15249013#comment-15249013
 ] 

Lewis John McGibbney commented on SOLR-8716:


[~thetaphi] I managed to sort out the dependency stuff... I hope. Would 
appreciate peer review here again. Thanks. 

> Upgrade to Apache Tika 1.12
> ---
>
> Key: SOLR-8716
> URL: https://issues.apache.org/jira/browse/SOLR-8716
> Project: Solr
>  Issue Type: Improvement
>  Components: contrib - Solr Cell (Tika extraction)
>Reporter: Lewis John McGibbney
>Assignee: Uwe Schindler
> Fix For: master
>
> Attachments: LUCENE-7041.patch
>
>
> We recently released Apache Tika 1.12. In order to use the fixes provided 
> within the Tika.translate API I propose to upgrade Tika from 1.7 --> 1.12 in 
> lucene/ivy-versions.properties.
> Patch coming up.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Comment Edited] (SOLR-8714) Implement translation contrib package for LanguageTranslationUpdateProcessor's

2016-03-26 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15212885#comment-15212885
 ] 

Lewis John McGibbney edited comment on SOLR-8714 at 3/26/16 8:36 AM:
-

Hi [~teofili] I started a patch which I thought was sound. The blocker right 
now is SOLR-8716
If we can do the upgrade on Tika then this issue (with Joshua for example 
backing statistical machine translation via the [language 
packs|http://joshua-decoder.org/language-packs/] we've been generating) then 
this issue is IMHO a game changer for the way that Web crawlers harvest and 
make data available, useful and ultimately meaningful to us all. If we can get 
Solr doing statistical machine translation at indexing time then this is a game 
changer (of course others are doing it, but for the open source Apache Solr it 
would be excellent). 


was (Author: lewismc):
Hi [~teofili] I started a patch which I thought was sound. The blocker right 
now is SOLR-8716
If we can do the upgrade on Tika then this issue (with Joshua for example 
backing statistical machine translation via the language packs we've been 
generating) then this issue is IMHO a game changer for the way that Web 
crawlers harvest and make data available, useful and ultimately meaningful to 
us all. If we can get Solr doing statistical machine translation at indexing 
time then this is a game changer (of course others are doing it, but for the 
open source Apache Solr it would be excellent). 

> Implement translation contrib package for LanguageTranslationUpdateProcessor's
> --
>
> Key: SOLR-8714
> URL: https://issues.apache.org/jira/browse/SOLR-8714
> Project: Solr
>  Issue Type: New Feature
>Reporter: Lewis John McGibbney
> Fix For: master
>
>
> A while back over in Tika we implemented the 
> [Translator|https://github.com/apache/tika/blob/master/tika-core/src/main/java/org/apache/tika/language/translate/Translator.java]
>  interface. This now provides a number of 
> [implementations|https://github.com/apache/tika/tree/master/tika-translate/src/main/java/org/apache/tika/language/translate].
>  
> This issue will provide a  translation contrib package offering a 
> LanguageTranslationUpdateProcessor.
> The new processor will probably utilize the existing [Solr Language 
> Identifier|https://github.com/apache/lucene-solr/tree/master/solr/contrib/langid]
>  and would enable a document to be translated based upon a user defined 
> mapping. The LanguageTranslatorUpdateProcessor's should be pluggable and 
> would be placed in an UpdateChain the same as the 
> [LanguageIdentifierUpdateProcessor|https://github.com/apache/lucene-solr/blob/master/solr/contrib/langid/src/java/org/apache/solr/update/processor/LanguageIdentifierUpdateProcessor.java]'s
> It is my intent to also provide a wiki page which can be referenced and 
> maintained in conjunction with the code. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-8714) Implement translation contrib package for LanguageTranslationUpdateProcessor's

2016-03-26 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15212885#comment-15212885
 ] 

Lewis John McGibbney commented on SOLR-8714:


Hi [~teofili] I started a patch which I thought was sound. The blocker right 
now is SOLR-8716
If we can do the upgrade on Tika then this issue (with Joshua for example 
backing statistical machine translation via the language packs we've been 
generating) then this issue is IMHO a game changer for the way that Web 
crawlers harvest and make data available, useful and ultimately meaningful to 
us all. If we can get Solr doing statistical machine translation at indexing 
time then this is a game changer (of course others are doing it, but for the 
open source Apache Solr it would be excellent). 

> Implement translation contrib package for LanguageTranslationUpdateProcessor's
> --
>
> Key: SOLR-8714
> URL: https://issues.apache.org/jira/browse/SOLR-8714
> Project: Solr
>  Issue Type: New Feature
>Reporter: Lewis John McGibbney
> Fix For: master
>
>
> A while back over in Tika we implemented the 
> [Translator|https://github.com/apache/tika/blob/master/tika-core/src/main/java/org/apache/tika/language/translate/Translator.java]
>  interface. This now provides a number of 
> [implementations|https://github.com/apache/tika/tree/master/tika-translate/src/main/java/org/apache/tika/language/translate].
>  
> This issue will provide a  translation contrib package offering a 
> LanguageTranslationUpdateProcessor.
> The new processor will probably utilize the existing [Solr Language 
> Identifier|https://github.com/apache/lucene-solr/tree/master/solr/contrib/langid]
>  and would enable a document to be translated based upon a user defined 
> mapping. The LanguageTranslatorUpdateProcessor's should be pluggable and 
> would be placed in an UpdateChain the same as the 
> [LanguageIdentifierUpdateProcessor|https://github.com/apache/lucene-solr/blob/master/solr/contrib/langid/src/java/org/apache/solr/update/processor/LanguageIdentifierUpdateProcessor.java]'s
> It is my intent to also provide a wiki page which can be referenced and 
> maintained in conjunction with the code. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-7041) Upgrade to Apache Tika 1.12

2016-02-22 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-7041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15157776#comment-15157776
 ] 

Lewis John McGibbney commented on LUCENE-7041:
--

ACK, this guidance is very helpful. I've been picking my way through the Lucene 
and Solr build with interest, and yes your right the dependency soup which Tika 
brings is pretty much hellish. Regardless, I'll revisit this right now and make 
best efforts to get an update patch through to you. Can I also please ask, 
which is the best method for providing the update? via patch or pull request on 
Github or both?
Thanks

> Upgrade to Apache Tika 1.12
> ---
>
> Key: LUCENE-7041
> URL: https://issues.apache.org/jira/browse/LUCENE-7041
> Project: Lucene - Core
>  Issue Type: Improvement
>  Components: general/build
>Reporter: Lewis John McGibbney
>Assignee: Uwe Schindler
> Fix For: master
>
> Attachments: LUCENE-7041.patch
>
>
> We recently released Apache Tika 1.12. In order to use the fixes provided 
> within the Tika.translate API I propose to upgrade Tika from 1.7 --> 1.12 in 
> lucene/ivy-versions.properties.
> Patch coming up.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (LUCENE-7041) Upgrade to Apache Tika 1.12

2016-02-22 Thread Lewis John McGibbney (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-7041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lewis John McGibbney updated LUCENE-7041:
-
Attachment: LUCENE-7041.patch

Git-generated patch. This is trivial but as mentioned over on [0] I am getting 
failing tests locally. 

[0] https://github.com/apache/lucene-solr/pull/10

> Upgrade to Apache Tika 1.12
> ---
>
> Key: LUCENE-7041
> URL: https://issues.apache.org/jira/browse/LUCENE-7041
> Project: Lucene - Core
>  Issue Type: Improvement
>  Components: general/build
>Reporter: Lewis John McGibbney
>Assignee: Uwe Schindler
> Fix For: master
>
> Attachments: LUCENE-7041.patch
>
>
> We recently released Apache Tika 1.12. In order to use the fixes provided 
> within the Tika.translate API I propose to upgrade Tika from 1.7 --> 1.12 in 
> lucene/ivy-versions.properties.
> Patch coming up.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Comment Edited] (LUCENE-7041) Upgrade to Apache Tika 1.12

2016-02-22 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-7041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15157661#comment-15157661
 ] 

Lewis John McGibbney edited comment on LUCENE-7041 at 2/22/16 9:17 PM:
---

Running test suite just now, this looks to be a very trivial patch but will be 
really cool when we can do language translation when indexing documents. 
[~thetaphi], thanks for assigning yourself to this one. 


was (Author: lewismc):
Running test suit just now, this looks to be a very trivial patch but will be 
really cool when we can do language translation when indexing documents. 
[~thetaphi], thanks for assigning yourself to this one. 

> Upgrade to Apache Tika 1.12
> ---
>
> Key: LUCENE-7041
> URL: https://issues.apache.org/jira/browse/LUCENE-7041
> Project: Lucene - Core
>  Issue Type: Improvement
>  Components: general/build
>Reporter: Lewis John McGibbney
>Assignee: Uwe Schindler
> Fix For: master
>
> Attachments: LUCENE-7041.patch
>
>
> We recently released Apache Tika 1.12. In order to use the fixes provided 
> within the Tika.translate API I propose to upgrade Tika from 1.7 --> 1.12 in 
> lucene/ivy-versions.properties.
> Patch coming up.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-7041) Upgrade to Apache Tika 1.12

2016-02-22 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-7041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15157663#comment-15157663
 ] 

Lewis John McGibbney commented on LUCENE-7041:
--

agreed, the parsers module is getting very bulky in tTika these days so some 
dependency management is definitely required here. 

> Upgrade to Apache Tika 1.12
> ---
>
> Key: LUCENE-7041
> URL: https://issues.apache.org/jira/browse/LUCENE-7041
> Project: Lucene - Core
>  Issue Type: Improvement
>  Components: general/build
>Reporter: Lewis John McGibbney
>Assignee: Uwe Schindler
> Fix For: master
>
>
> We recently released Apache Tika 1.12. In order to use the fixes provided 
> within the Tika.translate API I propose to upgrade Tika from 1.7 --> 1.12 in 
> lucene/ivy-versions.properties.
> Patch coming up.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-7041) Upgrade to Apache Tika 1.12

2016-02-22 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-7041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15157661#comment-15157661
 ] 

Lewis John McGibbney commented on LUCENE-7041:
--

Running test suit just now, this looks to be a very trivial patch but will be 
really cool when we can do language translation when indexing documents. 
[~thetaphi], thanks for assigning yourself to this one. 

> Upgrade to Apache Tika 1.12
> ---
>
> Key: LUCENE-7041
> URL: https://issues.apache.org/jira/browse/LUCENE-7041
> Project: Lucene - Core
>  Issue Type: Improvement
>  Components: general/build
>Reporter: Lewis John McGibbney
>Assignee: Uwe Schindler
> Fix For: master
>
>
> We recently released Apache Tika 1.12. In order to use the fixes provided 
> within the Tika.translate API I propose to upgrade Tika from 1.7 --> 1.12 in 
> lucene/ivy-versions.properties.
> Patch coming up.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (LUCENE-7041) Upgrade to Apache Tika 1.12

2016-02-22 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created LUCENE-7041:


 Summary: Upgrade to Apache Tika 1.12
 Key: LUCENE-7041
 URL: https://issues.apache.org/jira/browse/LUCENE-7041
 Project: Lucene - Core
  Issue Type: Improvement
  Components: general/build
Reporter: Lewis John McGibbney
 Fix For: master


We recently released Apache Tika 1.12. In order to use the fixes provided 
within the Tika.translate API I propose to upgrade Tika from 1.7 --> 1.12 in 
lucene/ivy-versions.properties.
Patch coming up.




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (SOLR-8714) Implement translation contrib package for LanguageTranslationUpdateProcessor's

2016-02-22 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created SOLR-8714:
--

 Summary: Implement translation contrib package for 
LanguageTranslationUpdateProcessor's
 Key: SOLR-8714
 URL: https://issues.apache.org/jira/browse/SOLR-8714
 Project: Solr
  Issue Type: New Feature
Reporter: Lewis John McGibbney
 Fix For: master


A while back over in Tika we implemented the 
[Translator|https://github.com/apache/tika/blob/master/tika-core/src/main/java/org/apache/tika/language/translate/Translator.java]
 interface. This now provides a number of 
[implementations|https://github.com/apache/tika/tree/master/tika-translate/src/main/java/org/apache/tika/language/translate].
 
This issue will provide a  translation contrib package offering a 
LanguageTranslationUpdateProcessor.
The new processor will probably utilize the existing [Solr Language 
Identifier|https://github.com/apache/lucene-solr/tree/master/solr/contrib/langid]
 and would enable a document to be translated based upon a user defined 
mapping. The LanguageTranslatorUpdateProcessor's should be pluggable and would 
be placed in an UpdateChain the same as the 
[LanguageIdentifierUpdateProcessor|https://github.com/apache/lucene-solr/blob/master/solr/contrib/langid/src/java/org/apache/solr/update/processor/LanguageIdentifierUpdateProcessor.java]'s
It is my intent to also provide a wiki page which can be referenced and 
maintained in conjunction with the code. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-3198) Apache Solr to adhere to Apache Project Branding Requirements

2014-03-16 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-3198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13937111#comment-13937111
 ] 

Lewis John McGibbney commented on SOLR-3198:


[~yo...@apache.org] 
bq. I'd rather not add TM to the logo... it's not actually required
This is not correct. Please see [0]. 
I think it important for a subproject with the exposure and reputation like 
Solr to adhere to branding requirements just as every other project should. 
If there are branding issues, these should also be included in reports sent to 
board@ and some effort put in to addressing the issues.
I'll be happy to take another look into the branding requirements if it would 
be of any help? 

[0] http://www.apache.org/foundation/marks/pmcs.html#graphics

 Apache Solr to adhere to Apache Project Branding Requirements 
 --

 Key: SOLR-3198
 URL: https://issues.apache.org/jira/browse/SOLR-3198
 Project: Solr
  Issue Type: New Feature
  Components: documentation
Reporter: Lewis John McGibbney
 Fix For: 4.8

 Attachments: SOLR-3198.patch, Solr_tm.png


 The ASF project branding requirements [0] provide guidelines for projects to 
 follow and adhre to.
 This is a trivial task, so I'll patch the site and upload it. 
 [0] http://www.apache.org/foundation/marks/pmcs.html



--
This message was sent by Atlassian JIRA
(v6.2#6252)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org