[GitHub] lucene-solr pull request #468: jira/SOLR-12423

2018-10-11 Thread tballison
GitHub user tballison opened a pull request: https://github.com/apache/lucene-solr/pull/468 jira/SOLR-12423 Upgrade to Tika 1.19.1, first draft You can merge this pull request into a Git repository by running: $ git pull https://github.com/tballison/lucene-solr jira/SOLR-12423

[GitHub] lucene-solr issue #328: SOLR-12034

2018-10-01 Thread tballison
Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/328 @romseygeek , y, happy to fix/update this. I'll take a look later today. Part of the reason I gave up on this is that I didn't like the changes I had to make at the Lucene

[GitHub] lucene-solr issue #328: SOLR-12034

2018-10-01 Thread tballison
Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/328 Wow...long time since I've visited this code. Now I think I recall...the ugliness that I don't like imposing on the CustomAnalyzer's API is that it holds its own ResourceLoa

[GitHub] lucene-solr issue #328: SOLR-12034

2018-10-01 Thread tballison
Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/328 I'll leave this open a bit for discussion in case someone can think of a solution. My intuition is in accord with @uschindler '

[GitHub] lucene-solr issue #328: SOLR-12034

2018-10-02 Thread tballison
Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/328 Thank you, @romseygeek , for thinking of this PR. I'm closing it because I don't want to wreck the API of CustomAnalyz

[GitHub] lucene-solr pull request #328: SOLR-12034

2018-10-02 Thread tballison
Github user tballison closed the pull request at: https://github.com/apache/lucene-solr/pull/328 --- - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org

[GitHub] lucene-solr pull request #418: SOLR-12423 - upgrade to Tika 1.18, first draf...

2018-07-13 Thread tballison
GitHub user tballison opened a pull request: https://github.com/apache/lucene-solr/pull/418 SOLR-12423 - upgrade to Tika 1.18, first draft You can merge this pull request into a Git repository by running: $ git pull https://github.com/tballison/lucene-solr jira/SOLR-12423

[GitHub] lucene-solr pull request #418: SOLR-12423 - upgrade to Tika 1.18, first draf...

2018-07-13 Thread tballison
Github user tballison closed the pull request at: https://github.com/apache/lucene-solr/pull/418 --- - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org

[GitHub] lucene-solr issue #418: SOLR-12423 - upgrade to Tika 1.18, first draft

2018-07-13 Thread tballison
Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/418 wrong issue number...ugh --- - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail

[GitHub] lucene-solr pull request #419: SOLR-12551 - upgrade to Tika 1.18, first draf...

2018-07-13 Thread tballison
GitHub user tballison opened a pull request: https://github.com/apache/lucene-solr/pull/419 SOLR-12551 - upgrade to Tika 1.18, first draft You can merge this pull request into a Git repository by running: $ git pull https://github.com/tballison/lucene-solr jira/SOLR-12551

[GitHub] lucene-solr pull request: Lucene5205

2014-07-18 Thread tballison
GitHub user tballison opened a pull request: https://github.com/apache/lucene-solr/pull/64 Lucene5205 First attempt at pull request for merge from trunk on Lucene5205. Let's see how much gitiocy this displays... You can merge this pull request into a Git repository by ru

[GitHub] lucene-solr pull request: LUCENE-5839: Fix regex in AnalyzingQuery...

2014-07-21 Thread tballison
GitHub user tballison opened a pull request: https://github.com/apache/lucene-solr/pull/67 LUCENE-5839: Fix regex in AnalyzingQueryParser LUCENE-5839: Fix regex in AnalyzingQueryParser You can merge this pull request into a Git repository by running: $ git pull https

[GitHub] lucene-solr pull request: Lucene5205

2014-07-21 Thread tballison
Github user tballison closed the pull request at: https://github.com/apache/lucene-solr/pull/64 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature

[GitHub] lucene-solr pull request: Lucene5205

2014-07-24 Thread tballison
GitHub user tballison opened a pull request: https://github.com/apache/lucene-solr/pull/68 Lucene5205 LUCENE-5205 1) merge from trunk 2) roll in March 10, 2014 LUCENE-5205 patch for improved stopword handling 3) roll in SOLR-5410 You can merge this pull request into a

[GitHub] lucene-solr pull request #332: LUCENE-8186

2018-03-05 Thread tballison
GitHub user tballison opened a pull request: https://github.com/apache/lucene-solr/pull/332 LUCENE-8186 check for multitermaware tokenizer in CustomAnalyzer's normalize(). You can merge this pull request into a Git repository by running: $ git pull https://github.com/tbal

[GitHub] lucene-solr issue #332: LUCENE-8186

2018-03-05 Thread tballison
Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/332 Patch already attached...please ignore... --- - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional

[GitHub] lucene-solr pull request #332: LUCENE-8186

2018-03-05 Thread tballison
Github user tballison closed the pull request at: https://github.com/apache/lucene-solr/pull/332 --- - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org

[GitHub] lucene-solr pull request #75: LUCENE-7434, first draft

2016-09-01 Thread tballison
GitHub user tballison opened a pull request: https://github.com/apache/lucene-solr/pull/75 LUCENE-7434, first draft LUCENE-7434, first draft You can merge this pull request into a Git repository by running: $ git pull https://github.com/tballison/lucene-solr master

[GitHub] lucene-solr pull request #44: SOLR-8981

2016-06-16 Thread tballison
GitHub user tballison opened a pull request: https://github.com/apache/lucene-solr/pull/44 SOLR-8981 SOLR-8981 upgrade to Tika 1.13 You can merge this pull request into a Git repository by running: $ git pull https://github.com/tballison/lucene-solr SOLR-8981 Alternatively

[GitHub] lucene-solr issue #44: SOLR-8981

2016-06-16 Thread tballison
Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/44 I think I got it... ant precommit worked in Linux with these modifications. I kept getting hangs with ant jar-checksums in Windows. --- If your project is set up for it, you can reply to

[GitHub] lucene-solr issue #44: SOLR-8981

2016-06-17 Thread tballison
Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/44 Git (well, it was my fault, don't get me wrong) added the \r\n somehow. I had turned off autocrlf earlier. > C:\...>git config --get core.autocrlf input I

[GitHub] lucene-solr issue #44: SOLR-8981

2016-06-17 Thread tballison
Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/44 Y, I did run the extraction tests. That was the error we were getting initially, but which (without explanation) disappeared on my most recent integration attempt. --- If your project is

[GitHub] lucene-solr issue #44: SOLR-8981

2016-06-17 Thread tballison
Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/44 argh... will take a look. The test passed if you assumed that the html had two bodies, but that's crazy... --- If your project is set up for it, you can reply to this email and

[GitHub] lucene-solr pull request #44: SOLR-8981

2016-06-17 Thread tballison
GitHub user tballison reopened a pull request: https://github.com/apache/lucene-solr/pull/44 SOLR-8981 SOLR-8981 upgrade to Tika 1.13 You can merge this pull request into a Git repository by running: $ git pull https://github.com/tballison/lucene-solr SOLR-8981 Alternatively

[GitHub] lucene-solr pull request #44: SOLR-8981

2016-06-17 Thread tballison
Github user tballison closed the pull request at: https://github.com/apache/lucene-solr/pull/44 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature

[GitHub] lucene-solr issue #44: SOLR-8981

2016-06-17 Thread tballison
Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/44 No, it is a self-contained test with a test file. +1 on local and _only_ local. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well

[GitHub] lucene-solr issue #44: SOLR-8981

2016-06-17 Thread tballison
Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/44 Just found it. Confirming that fix doesn't break anything else. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If

[GitHub] lucene-solr issue #44: SOLR-8981

2016-06-17 Thread tballison
Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/44 The XHTMLContentHandler adds and . In out-of-the-box Tika with the DefaultHtmlMapper, "body" tags are not in the list of "SAFE_ELEMENTS", which means that the html&#

[GitHub] lucene-solr issue #44: SOLR-8981

2016-06-17 Thread tballison
Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/44 Not willing to point fingers... :) I'd like to track down the change in our history between 1.7 and 1.13 so that I actually understand what happened --- If your project is set u

[GitHub] lucene-solr issue #44: SOLR-8981

2016-06-17 Thread tballison
Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/44 Our bug introduced in TIKA-995. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] lucene-solr issue #44: SOLR-8981

2016-06-17 Thread tballison
Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/44 WebP is an image format. Jackcess encrypt is the library that allows users to decrypt MSAccess files. Please give it a go with Java 9. I can't easily test the morphlines stuf

[GitHub] lucene-solr issue #44: SOLR-8981

2016-06-17 Thread tballison
Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/44 There will likely be some conflicts with bouncy castle. Tika 1.13: bcmail-jdk15on 1.54 bcprov-jdk15on 1.54 vs. Solr: org.bouncycastle.version = 1.45

[GitHub] lucene-solr issue #44: SOLR-8981

2016-06-17 Thread tballison
Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/44 If we leave out updating bouncycastle, I'm fairly confident that users will run problems at run time if they try to decrypt MSAccess and probably PDF and doc. We had a b

[GitHub] lucene-solr issue #44: SOLR-8981

2016-06-17 Thread tballison
Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/44 > I also only have Windows :) How can you live with the failed builds?!? I wanted to help with [morphlines](https://mail-archives.apache.org/mod_mbox/lucene-solr-user/201606.m

[GitHub] lucene-solr pull request #172: SOLR-9552

2017-03-21 Thread tballison
GitHub user tballison opened a pull request: https://github.com/apache/lucene-solr/pull/172 SOLR-9552 Upgrade to Tika 1.14 You can merge this pull request into a Git repository by running: $ git pull https://github.com/tballison/lucene-solr SOLR-9552 Alternatively you can

[GitHub] lucene-solr issue #273: SOLR-11622: Fix mime4j library dependency for Tika

2017-12-04 Thread tballison
Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/273 This was the patch I had before running ant precommit, etc., which is still running. You'll need to run ant- clean-jars, jar-checksums and do the git add/rm before this will

[GitHub] lucene-solr issue #273: SOLR-11622: Fix mime4j library dependency for Tika

2017-12-04 Thread tballison
Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/273 [SOLR-11622-tallison.diff.txt](https://github.com/apache/lucene-solr/files/1528516/SOLR-11622-tallison.diff.txt) This is the full `git diff 83753d0..d2f40af > SOLR-11622-tallison.d

[GitHub] lucene-solr issue #282: SOLR-11622: Fix mime4j library dependency for Tika

2017-12-05 Thread tballison
Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/282 let me know if I should offer a comprehensive PR including your work on my own, or if my "notes" on your other PR are sufficient.

[GitHub] lucene-solr pull request #291: jira/solr-11701

2017-12-14 Thread tballison
GitHub user tballison opened a pull request: https://github.com/apache/lucene-solr/pull/291 jira/solr-11701 SOLR-11701 upgrade to Tika 1.17 You can merge this pull request into a Git repository by running: $ git pull https://github.com/tballison/lucene-solr jira/solr-11701

[GitHub] lucene-solr pull request #259: SOLR-10335

2017-10-05 Thread tballison
GitHub user tballison opened a pull request: https://github.com/apache/lucene-solr/pull/259 SOLR-10335 Upgrade to Tika 1.16 You can merge this pull request into a Git repository by running: $ git pull https://github.com/tballison/lucene-solr SOLR-10335 Alternatively you can

[GitHub] lucene-solr pull request #82: First draft of LUCENE-5317

2016-09-23 Thread tballison
GitHub user tballison opened a pull request: https://github.com/apache/lucene-solr/pull/82 First draft of LUCENE-5317 First draft of LUCENE-5317 You can merge this pull request into a Git repository by running: $ git pull https://github.com/tballison/lucene-solr LUCENE-5317

[GitHub] lucene-solr pull request #328: SOLR-12034

2018-02-26 Thread tballison
GitHub user tballison opened a pull request: https://github.com/apache/lucene-solr/pull/328 SOLR-12034 First draft of SOLR-12034 -- not ready for committing. Some non-flaky tests are now failing. You can merge this pull request into a Git repository by running: $ git pull

[GitHub] lucene-solr pull request #329: SOLR-12035

2018-02-26 Thread tballison
GitHub user tballison opened a pull request: https://github.com/apache/lucene-solr/pull/329 SOLR-12035 don't forget to copy charfilters into nostopanalyzer You can merge this pull request into a Git repository by running: $ git pull https://github.com/tballison/lucene

[GitHub] lucene-solr pull request #322: SOLR-11976 - fix bug in TokenizerChain

2018-02-12 Thread tballison
GitHub user tballison opened a pull request: https://github.com/apache/lucene-solr/pull/322 SOLR-11976 - fix bug in TokenizerChain Currently overwrites tokenfilters rather than chaining. You can merge this pull request into a Git repository by running: $ git pull https