Fwd: A way to connect Solr to SIS

2012-04-29 Thread Mattmann, Chris A (388J)
Probably of interest to Solr devs as well, FYI. Cheers, Chris Begin forwarded message: From: Yiyu Li yiy...@usc.edu Date: April 29, 2012 4:34:46 PM PDT To: sis-...@incubator.apache.org Subject: A way to connect Solr to SIS Reply-To: sis-...@incubator.apache.org Dear All, This is Yiyu

Re: Analyzers and sorting with a custom analysis chain

2011-09-03 Thread Mattmann, Chris A (388J)
Bremen http://www.thetaphi.de eMail: u...@thetaphi.de -Original Message- From: Mattmann, Chris A (388J) [mailto:chris.a.mattm...@jpl.nasa.gov] Sent: Saturday, September 03, 2011 6:18 AM To: dev@lucene.apache.org Cc: yo...@lucidimagination.com; solr-u...@lucene.apache.org Subject

Re: Analyzers and sorting with a custom analysis chain

2011-09-03 Thread Mattmann, Chris A (388J)
on the issue. You should not consume token streams on construction. - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de -Original Message- From: Mattmann, Chris A (388J) [mailto:chris.a.mattm...@jpl.nasa.gov] Sent: Saturday

Analyzers and sorting with a custom analysis chain

2011-09-02 Thread Mattmann, Chris A (388J)
Hi Everyone, I've got an Analysis question related to both Lucene and Solr (sorry for the cross posting). I've created a custom analysis chain as part of a field type for the title field in my schema representing Businesses. I've created an additional field called title_sort where I copied the

Re: Analyzers and sorting with a custom analysis chain

2011-09-02 Thread Mattmann, Chris A (388J)
Hi Yonik, On Sep 2, 2011, at 7:47 PM, Yonik Seeley wrote: On Fri, Sep 2, 2011 at 10:26 PM, Mattmann, Chris A (388J) chris.a.mattm...@jpl.nasa.gov wrote: I'm left with childrenshospitallosangeles as a single token resultant from the chain. So, when I go to sort the titles in Solr, I use

Re: Analyzers and sorting with a custom analysis chain

2011-09-02 Thread Mattmann, Chris A (388J)
On Sep 2, 2011, at 8:53 PM, Mattmann, Chris A (388J) wrote: I think in spelling this out though, I might have elaborated my problem. Since the method I call in the constructor for my CombiningFilter is super(mergeStreamTokens(in)) where mergeStreamTokens is a static method, I think I
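
The advice given in this thread ("You should not consume token streams on construction") can be illustrated with a minimal sketch. This is plain Python, not the Lucene TokenFilter API: the class below only borrows the name CombiningFilter from the thread, and the four input tokens are an assumed split of the childrenshospitallosangeles example. The constructor stores the upstream stream and reads nothing; the merge happens on the first request for a token.

```python
from typing import Iterable, Iterator


class CombiningFilter:
    """Hypothetical stand-in for the thread's CombiningFilter (not Lucene code)."""

    def __init__(self, upstream: Iterable[str]) -> None:
        # Only remember the upstream stream here; do not read from it yet.
        self._upstream = upstream
        self._done = False

    def __iter__(self) -> Iterator[str]:
        return self

    def __next__(self) -> str:
        if self._done:
            raise StopIteration
        self._done = True
        # The upstream is consumed lazily, on the first request for a token.
        merged = "".join(self._upstream)
        if not merged:
            raise StopIteration
        return merged
```

Constructing the filter is then free of side effects on the upstream stream, which is the property a call such as super(mergeStreamTokens(in)) in a constructor gives up.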

Re: New Website [Help Wanted]

2011-08-10 Thread Mattmann, Chris A (388J)
Wow, Grant, that looks awesome, great job on the new site! Cheers, Chris On Aug 10, 2011, at 6:46 AM, Grant Ingersoll wrote: I've ported the large majority of our old, crappy website to the shiny new ASF CMS. Not only is this easier to maintain b/c we no longer need Forrest, but it also

Re: Lucene Spatial Future

2011-04-03 Thread Mattmann, Chris A (388J)
Hi Ryan, On Apr 3, 2011, at 12:50 PM, Ryan McKinley wrote: 2. 3rd party tools -- In general people working on complex geographic problems use JTS and other LGPL tools. There is some great work happening at Apache SIS now, but it is a long way from being a viable ASL alternative. Thanks

Re: wind down for 3.1?

2011-02-12 Thread Mattmann, Chris A (388J)
Hi David, On Feb 12, 2011, at 4:38 PM, David Smiley (@MITRE.org) wrote: I'm tempted to also bring up my distaste for the next version of Solr being 3.something instead of 1.5 (in fact I just did) but I'll just leave it at that. AFAIK that battle was lost months ago. :) You're not alone in

Re: Let's drop Maven Artifacts !

2011-01-17 Thread Mattmann, Chris A (388J)
On Jan 17, 2011, at 8:06 AM, Steven A Rowe wrote: On 1/17/2011 at 1:53 AM, Michael Busch wrote: I don't think any user needs the ability to run an ant target on Lucene's sources to produce maven artifacts I want to be able to make modifications to the Lucene source, install Maven snapshot

Re: Changes Mess

2010-12-06 Thread Mattmann, Chris A (388J)
CHANGES file: LUCENE-2658: Exceptions while processing term vectors enabled for multiple fields could lead to invalid ArrayIndexOutOfBoundsExceptions. JIRA description: LUCENE-2658: TestIndexWriterExceptions random failure: AIOOBE in ByteBlockPool.allocSlice So you see the story, I hit

Re: Changes Mess

2010-12-06 Thread Mattmann, Chris A (388J)
Would you mind naming these Apache projects? I'd like to take a look. Tika, Nutch, OODT. Cheers, Chris ++ Chris Mattmann, Ph.D. Senior Computer Scientist NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 171-266B,

Re: Changes Mess

2010-12-05 Thread Mattmann, Chris A (388J)
Hi Mark, RE: the credit system. JIRA provides a contribution report here, like this one that I generated for Lucene 3.1: http://s.apache.org/BpL Just click on Reports > Contribution Report in the upper right of JIRA on the main project summary page. We've been using this in Tika since the

Re: Changes Mess

2010-12-05 Thread Mattmann, Chris A (388J)
: On 12/5/2010 at 12:19 PM, Robert Muir wrote: On Sun, Dec 5, 2010 at 12:08 PM, Mattmann, Chris A (388J) chris.a.mattm...@jpl.nasa.gov wrote: Hi Mark, RE: the credit system. JIRA provides a contribution report here, like this one that I generated for Lucene 3.1: My concern

Re: Changes Mess

2010-12-05 Thread Mattmann, Chris A (388J)
Yet another way would be to declare the problem non-existent and screw our users by insulting them with a honking great mass of changes without any indication about what they are or how they are inter-related. (You won't be surprised at this point, I think, by my -1 to this.) Right, I'm

Re: Rethinking spatial implementation

2010-11-08 Thread Mattmann, Chris A (388J)
Hi All, FYI, Apache SIS [1], currently Incubating, is working on building an ASLv2 licensed library comparable to JTS or GeoTools. You'll notice that most of the GIS related libs out there are GPL or LGPL (or at least I did), so I decided to do something about it. If anyone else is interested

Re: discussion about release frequency.

2010-09-18 Thread Mattmann, Chris A (388J)
Hi Robert, I can help a little here. Check out this guide: http://maven.apache.org/guides/mini/guide-central-repository-upload.html The long and the short of it is that there are several canonical Maven repos that are sync'ed to Ibiblio and Maven central. Apache has one (through

[Spatial] Geonames and extension to Spatial Solution for Solr

2010-08-24 Thread Mattmann, Chris A (388J)
Hi Folks, You may have noticed over the past few days a bunch of Spatial related contributions, in particular: SOLR-2073 Geonames.org UpdateProcessor for Spatial SOLR-2074 GeoRSS ResponseWriter SOLR-2075 SpatialQParserPlugin and HostIP adaptor SOLR-2076 Spatial example schema updates SOLR-2077

Re: Incubator PMC/Board report for August 2010 (gene...@incubator.apache.org)

2010-08-08 Thread Mattmann, Chris A (388J)
Marvin, report looks good +1 from me (I signed off on the wiki too). Cheers, Chris On 8/8/10 4:02 PM, Marvin Humphrey mar...@rectangular.com wrote: On Fri, Aug 06, 2010 at 11:25:20AM -0700, Chris Hostetter wrote: : Here are some examples of OODT ones that we've filed so we can get an idea :

Re: Incubator PMC/Board report for August 2010 (gene...@incubator.apache.org)

2010-08-06 Thread Mattmann, Chris A (388J)
FYI, the Lucy SVN area has been created in the Incubator in r982976, r982977, r982978 and r982980. Hoss, can you remove the Lucene copy? Cheers, Chris On 8/5/10 9:01 PM, Mattmann, Chris A (388J) chris.a.mattm...@jpl.nasa.gov wrote: Hi Marvin, There is an action item under Mentors

Re: Incubator PMC/Board report for August 2010 (gene...@incubator.apache.org)

2010-08-05 Thread Mattmann, Chris A (388J)
Hi Marvin, There is an action item under Mentors: Tell Mentors to track progress in the file 'incubator/projects/{project.name}.html' /me clears throat. Mentors, please track progress in the file 'incubator/projects/lucy.html'. /me marks said action item as completed.

FW: [ESIP-all] Announcement AGU Session Earth and Space Science Informatics IN10: Open Source Remote Sensing for Environmental Mapping and Analysis

2010-08-04 Thread Mattmann, Chris A (388J)
(apologies for the cross posting) All, FYI below is some information on two special sessions of AGU this December in San Francisco, CA. The first involves open source software and remote sensing. If you are using any Apache software in the area of remote sensing, you might consider submitting to

Re: [Lucy] Roadmap for first release

2010-07-30 Thread Mattmann, Chris A (388J)
On 7/30/10 9:55 AM, Marvin Humphrey mar...@rectangular.com wrote: On Thu, Jul 29, 2010 at 08:31:27PM -0700, Mattmann, Chris A (388J) wrote: Thanks, Peter for the email. I kind of guessed it was more related to KinoSearch. Since Lucy is about to assimilate the KinoSearch code base, they are now

Re: [Lucy] Roadmap for first release

2010-07-29 Thread Mattmann, Chris A (388J)
Guys: Can you enlighten me as to what this has to do with *Apache* Lucy? And furthermore, what it has to do with the *Incubator podling*? Thanks, Chris On 7/29/10 6:40 PM, Peter Karman pe...@peknet.com wrote: Marvin Humphrey wrote on 7/29/10 8:37 PM: Peter, I'm willing to go with making

Re: Lucy JIRA perms/Admin

2010-07-25 Thread Mattmann, Chris A (388J)
Hi Marvin, Thanks. Comments below: Did you file a JIRA issue in the INFRA space to roll up all the other issues like mailing lists, SVN, etc. Looks like you found the one I did for Tika - why not just create something similar? If you already have, can you pass along the ticket key so I (and

Re: Lucy JIRA perms/Admin

2010-07-25 Thread Mattmann, Chris A (388J)
Hi Marvin, Maybe just #1, which I've already claimed -- let's not duplicate. :) I'm not proposing to. But I'm proposing rather than one Death-star like issue to rule them all, how about some separate traceable activities that you can farm out and not be the single point of failure on?

Re: Lucy JIRA perms/Admin

2010-07-25 Thread Mattmann, Chris A (388J)
Wow, seriously On 7/25/10 6:11 PM, Chris Hostetter hossman_l...@fucit.org wrote: : Yeah not sure myself. I'll send an email to gene...@incubator asking if : there is anything that needs to be done on that end. The reason I ask is : that e.g., if we wanted to update our incubator website,

Re: Lucy trademark issues

2010-07-25 Thread Mattmann, Chris A (388J)
Hi Marvin, On 7/25/10 9:10 PM, Marvin Humphrey mar...@rectangular.com wrote: I suspect that we will need guidance from legal affairs with regards to Zettair and possibly the others if we are to continue under the name Lucy. Mentors, what do you suggest as a next step? I would recommend

Re: [Lucy] Set up for new mailing lists

2010-07-24 Thread Mattmann, Chris A (388J)
Ehrm, Guys, I would recommend just filing INFRA requests here [1]. INFRA can set up all the mailing lists below for you and are the only ones with Apache karma necessary to do so. I'd be happy to file the tickets, if you guys are OK with that. Cheers, Chris [1]

Re: lucy-issues list

2010-07-24 Thread Mattmann, Chris A (388J)
+1, sounds good to me to have the additional lucy-issues@ lists per the specification you mention below, Marvin... Cheers, Chris On 7/24/10 10:16 AM, Marvin Humphrey mar...@rectangular.com wrote: Greets, Lucene sub-projects traditionally have all JIRA notifications sent to the dev list.

Re: Lucy JIRA perms/Admin

2010-07-24 Thread Mattmann, Chris A (388J)
Marvin, Did you file a JIRA issue in the INFRA space to roll up all the other issues like mailing lists, SVN, etc. Looks like you found the one I did for Tika - why not just create something similar? If you already have, can you pass along the ticket key so I (and others) could monitor it?

Re: Lucy JIRA perms/Admin

2010-07-24 Thread Mattmann, Chris A (388J)
I just encountered something similar in OODT-ville, and the answer I got back from INFRA was that they were going to phase out the current way that JIRA is administered and move towards "roles" rather than "groups" in JIRA. See here [1] for an explanation. I think we can add people to the

Re: https://svn.apache.org/repos/asf/incubator/lucy/

2010-07-24 Thread Mattmann, Chris A (388J)
Thanks Hoss! On 7/24/10 12:01 PM, Chris Hostetter hossman_l...@fucit.org wrote: Apparently I already had the necessary karma to create the new Lucy SVN repository, but I don't have the necessary karma to *grant* karma, so I've sent an email to the IPMC containing the necessary patch...

Re: Lucy JIRA perms/Admin

2010-07-24 Thread Mattmann, Chris A (388J)
+1... On 7/24/10 12:22 PM, Marvin Humphrey mar...@rectangular.com wrote: On Sat, Jul 24, 2010 at 11:58:03AM -0700, Chris Hostetter wrote: In general I don't think there are any set guidelines on how the Jira Roles are used (they are totally independent of LDAP and SVN perms) so the Lucy

[RESULT] [VOTE] Apache Nutch 1.1 Release Candidate #4

2010-06-15 Thread Mattmann, Chris A (388J)
Hi Folks, Okey dokey, this VOTE has passed with the following PMC binding VOTEs: Chris Mattmann Doğacan Güney Julien Nioche Dennis Kubes I'll go ahead and push out (finally!) the Nutch 1.1 release to the mirrors and get the monkey off my back! Thanks to all for participating in the VOTE!

[SOLR] use of solr.* in e.g., response writers, but other places

2010-05-29 Thread Mattmann, Chris A (388J)
Hey All, So, I'm working on SOLR-1925 [1], and I'm asking myself again: why the use of e.g., solr.* in the class name, as in e.g., solr.XSLTResponseWriter? I mean I've brought this up before and I've heard the history, but in reality they all just default to some package that's loaded via

Re: [SOLR] use of solr.* in e.g., response writers, but other places

2010-05-29 Thread Mattmann, Chris A (388J)
(same reason svn uses the svn: in the properties that SVN expects to own) So, it's almost like a protocol or namespace more so than a class package. I guess it's just the use of the . which is kind of confusing to me then. The . indicates to me that it's a class I guess from an eyeballing it
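
The reading of solr.* as a namespace-like alias rather than a literal class package can be sketched as a small lookup. This is an illustrative Python sketch, not Solr's actual loader: the base package, the list of sub-packages, their search order, and the resolve function are all assumptions made for the example.

```python
from typing import Optional, Set

# Assumed base package and sub-package search order; illustrative only.
BASE = "org.apache.solr."
SUBPACKAGES = ("", "request.", "response.", "handler.")


def resolve(alias: str, known_classes: Set[str]) -> Optional[str]:
    """Map a 'solr.Name' alias to the first matching fully qualified name."""
    if not alias.startswith("solr."):
        # Not an alias: treat the string as a fully qualified class name.
        return alias if alias in known_classes else None
    short = alias[len("solr."):]
    for sub in SUBPACKAGES:
        candidate = BASE + sub + short
        if candidate in known_classes:
            return candidate
    return None
```

Under these assumptions, solr.XSLTResponseWriter resolves to whichever candidate package actually holds the class, so the "." after solr marks a prefix to be expanded and not a package boundary.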

Re: [SOLR] use of solr.* in e.g., response writers, but other places

2010-05-29 Thread Mattmann, Chris A (388J)
Yep agreed, just me floating off in the clouds per usual... Cheers, Chris On 5/29/10 9:32 PM, Chris Hostetter hossman_luc...@fucit.org wrote: : So, it's almost like a protocol or namespace more so than a class package. I : guess it's just the use of the . which is kind of confusing to me

Re: Adding CLucene as a Lucene subproject

2010-05-19 Thread Mattmann, Chris A (388J)
Hi Itamar, I think what you would do is throw together a proposal mentioning things like: * who would be the initial committers for the project * whether those committers have Apache ICLAs [1] on file, or not * what's the rationale behind the project (yours would have strong rationale, since

[VOTE] Apache Nutch 1.1 Release Candidate #3

2010-05-08 Thread Mattmann, Chris A (388J)
Hi Folks, I have posted an updated candidate for the Apache Nutch 1.1 release. The source code is at: http://people.apache.org/~mattmann/apache-nutch-1.1/rc3/ The major differences between this release and rc #2 are the application of: NUTCH-816, NUTCH-732, NUTCH-815, NUTCH-814, and NUTCH-812

Re: Solr/Lucene unit tests broken?

2010-05-05 Thread Mattmann, Chris A (388J)
this from biting us again whenever version numbers bump) On Tue, May 4, 2010 at 5:35 PM, Mattmann, Chris A (388J) chris.a.mattm...@jpl.nasa.gov wrote: Hey Guys, With r941046, I get a little further and then it dies on a different test: [junit] Testsuite

Re: Solr/Lucene unit tests broken?

2010-05-05 Thread Mattmann, Chris A (388J)
Thanks guys! On 5/5/10 7:04 AM, Robert Muir rcm...@gmail.com wrote: Erik has committed ant logic to delete old jar files... instead of manually removing the old files, you should be able to just svn up and go! On Wed, May 5, 2010 at 10:02 AM, Mattmann, Chris A (388J) chris.a.mattm

Re: Solr/Lucene unit tests broken?

2010-05-05 Thread Mattmann, Chris A (388J)
to delete old jar files... instead of manually removing the old files, you should be able to just svn up and go! On Wed, May 5, 2010 at 10:02 AM, Mattmann, Chris A (388J) chris.a.mattm...@jpl.nasa.gov wrote: Hey Robert, Thanks so much for the help. I'll give it a try and report back if there's any

Re: [jira] Resolved: (TIKA-419) Allow parser lookup from a custom class loader

2010-05-04 Thread Mattmann, Chris A (388J)
Nice, Jukka! Cheers, Chris On 5/4/10 9:09 AM, Jukka Zitting (JIRA) j...@apache.org wrote: [ https://issues.apache.org/jira/browse/TIKA-419?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-419.

Solr/Lucene unit tests broken?

2010-05-04 Thread Mattmann, Chris A (388J)
Hey Guys, Checked out the latest Solr build, r941013, and unit tests aren't passing for me: [junit] [junit] Testsuite: org.apache.lucene.util.automaton.TestBasicOperations [junit] Tests run: 6, Failures: 0, Errors: 0, Time elapsed: 0.428 sec [junit] BUILD FAILED

Re: Solr/Lucene unit tests broken?

2010-05-04 Thread Mattmann, Chris A (388J)
] Cheers, Chris On 5/4/10 12:58 PM, Mattmann, Chris A (388J) chris.a.mattm...@jpl.nasa.gov wrote: Hey Guys, Checked out the latest Solr build, r941013, and unit tests aren't passing for me: [junit] [junit] Testsuite: org.apache.lucene.util.automaton.TestBasicOperations [junit] Tests

Re: Solr/Lucene unit tests broken?

2010-05-04 Thread Mattmann, Chris A (388J)
changed. Also see the mail about refactoring to a new 3.x branch. Trunk will be 4.0 and may break suddenly. - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de From: Mattmann, Chris A (388J) [mailto:chris.a.mattm

FW: [netcdf-java] [netcdfgroup] NetCDF jars=Maven Central Repos?

2010-04-29 Thread Mattmann, Chris A (388J)
Schlining bschlin...@gmail.com Date: Thu, 29 Apr 2010 08:42:17 -0700 To: John Caron ca...@unidata.ucar.edu Cc: netcdf-j...@unidata.ucar.edu, Mattmann, Chris A (388J) chris.a.mattm...@jpl.nasa.gov, Orion Poplawski or...@cora.nwra.com Subject: Re: [netcdf-java] [netcdfgroup] NetCDF jars=Maven Central

Re: Running ANT; was -- Re: [VOTE] Apache Nutch 1.1 Release Candidate #2

2010-04-26 Thread Mattmann, Chris A (388J)
On 4/26/10 7:24 AM, David M. Cole d...@colegroup.com wrote: At 10:55 PM -0700 4/25/10, Mattmann, Chris A (388J) wrote: Most folks that use Nutch are likely familiar with running ant IMHO. I guess then I fall into the category of not most folks. Have been running Nutch for about 14 months and I

Re: [VOTE] Apache Nutch 1.1 Release Candidate #2

2010-04-26 Thread Mattmann, Chris A (388J)
a TLP that you delay this release by a few weeks and have the vote done under the auspices of the Nutch PMC? Cheers, Grant On Apr 26, 2010, at 1:55 AM, Mattmann, Chris A (388J) wrote: Hi Folks, I have posted an updated candidate for the Apache Nutch 1.1 release. The source code

Re: [VOTE] Apache Nutch 1.1 Release Candidate #2

2010-04-26 Thread Mattmann, Chris A (388J)
Hey Andrzej, Okey dokey, np! Let's get the patch in first :) I can cut as many RCs as needed. Cheers, Chris On 4/26/10 11:30 AM, Andrzej Bialecki a...@getopt.org wrote: On 2010-04-26 17:19, Mattmann, Chris A (388J) wrote: Hi Grant, Thanks. I think it actually makes sense to finish off 1.1

Re: NetCDF jars=Maven Central Repos?

2010-04-26 Thread Mattmann, Chris A (388J)
Hi Folks, I never heard back regarding this message. Any thoughts? Thanks! Cheers, Chris On 4/12/10 8:53 AM, Mattmann, Chris A (388J) chris.a.mattm...@jpl.nasa.gov wrote: Hi there NetCDF'ers, We've been working [1] on integrating NetCDF support into Apache Tika [2]. Tika uses Maven2 as its

Re: [netcdfgroup] NetCDF jars=Maven Central Repos?

2010-04-26 Thread Mattmann, Chris A (388J)
currently to optimize that. netcdf-java handles a lot of file formats, but all are assumed to be random access. John Caron On 4/26/2010 2:21 PM, Mattmann, Chris A (388J) wrote: Re: NetCDF jars=Maven Central Repos? Hi Folks, I never heard back regarding this message. Any thoughts? Thanks! Cheers

Re: [netcdfgroup] NetCDF jars=Maven Central Repos?

2010-04-26 Thread Mattmann, Chris A (388J)
FYI, [1] from here refers to: [1] http://issues.apache.org/jira/browse/TIKA-153 On 4/26/10 6:46 PM, Mattmann, Chris A (388J) chris.a.mattm...@jpl.nasa.gov wrote: (copying tika-dev@lucene.apache.org so that we can get some dev help in Tika-land) Hi John, Thanks for the information - we have

[VOTE] Apache Nutch 1.1 Release Candidate #2

2010-04-25 Thread Mattmann, Chris A (388J)
Hi Folks, I have posted an updated candidate for the Apache Nutch 1.1 release. The source code is at: http://people.apache.org/~mattmann/apache-nutch-1.1/rc2/ The major difference between this release and rc #1 is the application of NUTCH-812 - Crawl.java incorrectly uses the Generator API

Apache Tika is a top-level project!

2010-04-23 Thread Mattmann, Chris A (388J)
Hi All, The board has approved the Tika TLP. Yay! I've started the process of moving Tika to its TLP status, and filed INFRA issues [1] [2] and [3] for moving the mailing lists, SVN, and creating UNIX groups respectively. If there's anything I missed, let me know. I've asked that all current

Re: Nutch, Solr and Lucene architecture diagrams

2010-04-22 Thread Mattmann, Chris A (388J)
Hi Grant, I've attached one for Nutch from a while back that I made for a lecture I gave at USC. Cheers, Chris On 4/22/10 10:56 AM, Grant Ingersoll gsing...@apache.org wrote: Hi All, The ASF has been asked by an Industry Analyst group (to remain unnamed at this point) to provide

Re: [VOTE 2] Board resolution for Nutch as TLP

2010-04-12 Thread Mattmann, Chris A (388J)
+1, thanks for pushing this forward Andrzej! Cheers, Chris On 4/12/10 4:32 AM, Doğacan Güney doga...@gmail.com wrote: On Mon, Apr 12, 2010 at 14:08, Andrzej Bialecki a...@getopt.org wrote: Hi, Take two, after s/crawling/search/ ... Following the discussion, below is the text of the

Re: [DISCUSS] Board resolution for Nutch as TLP

2010-04-11 Thread Mattmann, Chris A (388J)
Hi Dogacan, +1 to calling it a web search platform, since I agree, it’s not just a crawler. Cheers, Chris On 4/11/10 11:40 AM, Doğacan Güney doga...@gmail.com wrote: Hi, On Sat, Apr 10, 2010 at 16:32, Jukka Zitting jukka.zitt...@gmail.com wrote: Hi, On Fri, Apr 9, 2010 at 6:52 PM,

Re: Adding jpeg parser to nutch

2010-04-10 Thread Mattmann, Chris A (388J)
Hi David, The latest Nutch release candidate (1.1, http://svn.apache.org/repos/asf/lucene/nutch/tags/1.1) includes the tika-parser plugin, which provides a JpegParser (see here: http://bit.ly/b0zRX8) that hopefully can suit your needs. Let me know what you think. Cheers, Chris On 4/10/10

Re: [DISCUSS] Board resolution for Nutch as TLP

2010-04-09 Thread Mattmann, Chris A (388J)
Hi Andrzej, +1, with the following amendment: RESOLVED, that all responsibilities pertaining to the Apache Lucene Nutch sub-project encumbered upon the Apache Nutch Project are hereafter discharged. This should read: RESOLVED, that all responsibilities pertaining to the Apache Lucene

Re: release of 1.1?

2010-04-06 Thread Mattmann, Chris A (388J)
Thanks Julien! OK, I'll cut the RC at some point today. Thanks! Cheers, Chris On 4/6/10 4:47 AM, Julien Nioche lists.digitalpeb...@gmail.com wrote: Chris, Just to let you know that I have committed https://issues.apache.org/jira/browse/NUTCH-810 which was the last open issue before the

[VOTE] Apache Nutch 1.1 Release Candidate #1

2010-04-06 Thread Mattmann, Chris A (388J)
Hi Folks, I have posted a candidate for the Apache Nutch 1.1 release. The source code is at: http://people.apache.org/~mattmann/apache-nutch-1.1/rc1/ See the included CHANGES.txt file for details on release contents and latest changes. The release was made using the Nutch release process,

Re: Question: Nutch 0.8.2 and Nutch 0.7.3?

2010-04-04 Thread Mattmann, Chris A (388J)
Hey Andrzej, http://svn.apache.org/repos/asf/lucene/nutch/branches/branch-0.8/ That's the code that was intended to become 0.8.2 ... However, I'm not sure whether there's any benefit in releasing either of these. Those who really had the need to track this branch (or 0.7) likely used the

Question: Nutch 0.8.2 and Nutch 0.7.3?

2010-04-03 Thread Mattmann, Chris A (388J)
Hey Guys, Question. I see 2 releases that haven't been cut in JIRA: 0.8.2: https://issues.apache.org/jira/secure/IssueNavigator.jspa?reset=true&pid=10680&fixfor=12312064 0.7.3: https://issues.apache.org/jira/secure/IssueNavigator.jspa?reset=true&pid=10680&fixfor=12312176 I'm happy to cut 0.8.2

Student Project, Apache Tika

2010-04-03 Thread Mattmann, Chris A (388J)
Hey All, FYI: http://tikatoolkit.googlecode.com/files/Project_Proposal_Feb%202010.pdf I stumbled on this while googling around for Tika articles for my CSCI 572 Search Engines class at USC this summer. Looks like some students are using Tika in their project at Loyola University Chicago. Hope to

[RESULT] [VOTE] Apache Tika 0.7 Release Candidate #1

2010-04-03 Thread Mattmann, Chris A (388J)
Hi All, This VOTE has passed. +1s: Binding: Chris A. Mattmann Jukka Zitting Grant Ingersoll Uwe Schindler Non-Binding: Oleg Tikhonov I'll get started pushing the releases out to the mirrors, and then send an ANNOUNCE to annou...@. Thanks, again, everyone! Cheers, Chris

Re: [VOTE] Apache Tika 0.7 Release Candidate #1

2010-04-02 Thread Mattmann, Chris A (388J)
to include the sha1 of the src archive from jzitting. Will do on both, going forward. * +1 for having a direct link to tika-app on the website. Cheers, Chris On 4/1/10 11:41 PM, Jukka Zitting jukka.zitt...@gmail.com wrote: Hi, On Wed, Mar 31, 2010 at 10:01 PM, Mattmann, Chris A (388J

Re: [VOTE] Apache Tika 0.7 Release Candidate #1

2010-04-02 Thread Mattmann, Chris A (388J)
, Apr 2, 2010 at 4:14 PM, Mattmann, Chris A (388J) chris.a.mattm...@jpl.nasa.gov wrote: +1s, so technically we could still do the 72 hrs and still be OK, but I'm fine with giving folks some more time to take a look I'm fine with closing the vote already at 72 hours since the p.a.o outage only

Re: [VOTE] Apache Tika 0.7 Release Candidate #1

2010-04-02 Thread Mattmann, Chris A (388J)
Grant, FYI: On 4/2/10 7:14 AM, Mattmann, Chris A (388J) chris.a.mattm...@jpl.nasa.gov wrote: * Thanks for comments on the CHANGES from gsingers, and the mention to include the sha1 of the src archive from jzitting. Will do on both, going forward. I added a stub for this in Tika 0.8: http

Re: Welcome Uwe Schindler to the Lucene PMC

2010-04-01 Thread Mattmann, Chris A (388J)
Welcome, Uwe! Cheers, Chris On 4/1/10 4:05 AM, Grant Ingersoll gsing...@apache.org wrote: I'm pleased to announce that the Lucene PMC has voted to add Uwe Schindler to the PMC. Uwe has been doing a lot of work in Lucene and Solr, including several of the last releases in Lucene. Please

Re: 1.1 release?

2010-03-31 Thread Mattmann, Chris A (388J)
Hey Guys, OK I'm finally getting around to this: I am going to push all the current 1.1 JIRA issues out and set their fix version to nil. Once I'm done with this, I'll wait 48 hrs to see if there is anything that anyone really wants to get into 1.1. So, please, take a look here [1] and make

[VOTE] Apache Tika 0.7 Release Candidate #1

2010-03-31 Thread Mattmann, Chris A (388J)
Hi Folks, I have posted a candidate for the Apache Tika 0.7 release. The source code is at: http://people.apache.org/~mattmann/apache-tika-0.7/rc1/ See the included CHANGES.txt file for details on release contents and latest changes. The release was made using the Maven2 release plugin,

Re: Detector results for Excel formats

2010-03-23 Thread Mattmann, Chris A (388J)
Hi Simon, Can you prepare a patch, and post it to JIRA? I'll happily take a look. Thanks, Chris On 3/23/10 3:43 AM, Simon Tyler sty...@mimecast.net wrote: I have had a further look at the nature of the failure to detect the type of the particular file and still feel it is a bug. This is an

Re: Branding Solr+Lucene

2010-03-22 Thread Mattmann, Chris A (388J)
What are the implications of this new branding effort with the brands for the existing Lucene and Solr? Will the names Lucene and Solr cease in the mainstream in favor of a merged name? Cheers, Chris On 3/22/10 11:02 AM, Steven A Rowe sar...@syr.edu wrote: Now that Solr and Lucene live

Re: OutOfMemory exception

2010-03-22 Thread Mattmann, Chris A (388J)
Hi Sangri, How big is the XML file you're trying to parse? If you find it's large (on the order of 100s of MBs-1GBs), it's certainly possible it could take a while (depending on your underlying machine architecture) to parse it. If you need to increase the heap size for Tika, you would do it

[VOTE] Apache Tika TLP Board Resolution

2010-03-22 Thread Mattmann, Chris A (388J)
Hi Folks, Per the existing discussions and feedback: http://www.mail-archive.com/tika-dev@lucene.apache.org/msg01456.html http://www.mail-archive.com/tika-dev@lucene.apache.org/msg01479.html I'd like to call a community VOTE thread on the below board resolution to make Apache Tika a Top Level

Re: [VOTE] Apache Tika TLP Board Resolution

2010-03-22 Thread Mattmann, Chris A (388J)
Oh, btw, forgot to add my +1. Cheers, Chris On 3/22/10 7:43 PM, Mattmann, Chris A (388J) chris.a.mattm...@jpl.nasa.gov wrote: Hi Folks, Per the existing discussions and feedback: http://www.mail-archive.com/tika-dev@lucene.apache.org/msg01456.html http://www.mail-archive.com/tika-dev

Re: [DISCUSS] Nutch as a top level project (TLP)?

2010-03-20 Thread Mattmann, Chris A (388J)
Hey Andrzej, I'd be +1 for Nutch being a TLP. I don't think it'll change much (other than to provide more visibility/etc., and to allow more focused decision making by the folks in the Nutch community). The infrastructure moves required to move to TLP status are moving mailing lists, moving

Re: rough outline of where Solr's going

2010-03-18 Thread Mattmann, Chris A (388J)
On 3/18/10 11:25 AM, Yonik Seeley ysee...@gmail.com wrote: On Thu, Mar 18, 2010 at 2:16 PM, Chris Hostetter hossman_luc...@fucit.org wrote: 3.1 may make life easy for us as developers, but is likely to be just as confusing to users as if we called the next version Q We're jumping to version

Re: rough outline of where Solr's going

2010-03-18 Thread Mattmann, Chris A (388J)
: We're jumping to version 3.1 because we're releasing at the same time, : and are based on Lucene 3.1. You say it like it's a done deal, but I don't get the impression that I'm the only one who thinks it's unnecessary. +1, I'm right there with you on this Hoss. My point is really

Re: rough outline of where Solr's going

2010-03-18 Thread Mattmann, Chris A (388J)
: Sorry about the following non serious reply: : : It hasn't seemed to hurt the most popular software in the world to be way : worse than that ;) : : 1, 2, 3, NT, 95, 98, 98SE, ME, CE, 2000, XP, 2003, Vista, 2008, 7 (by who's a) 2000 came out before ME b) NT, CE, and 2003 (a server

[PROPOSAL] Apache Tika TLP board resolution

2010-03-18 Thread Mattmann, Chris A (388J)
Hi Guys, A little late, but better than never. WDYT? I'll leave this up until next Monday, at which point I'll call a [VOTE] thread, and then take it to the Lucene PMC shortly thereafter (hopefully mid/late next week) so that we could potentially make the next board meeting, if the community

Re: rough outline of where Solr's going

2010-03-17 Thread Mattmann, Chris A (388J)
Hi All, : In the interest of moving forward, perhaps we should just focus on the : immediate next major release - 3.1. What happens after can wait. We : never planned for absolutely all the what if's in Solr before the : merge - I'm not sure why we would need to now. I suppose, but

Re: lucene and solr trunk

2010-03-15 Thread Mattmann, Chris A (388J)
Hi Hoss, : (i suspect a whole lot of people who only care about the core library are : going to really adamantly not want to have to check out all of Solr just : to work on the core) : : This wouldn't really be merged development now would it? : When I run 'ant test' I want the Solr

[DISCUSS] Apache Tika as TLP

2010-03-12 Thread Mattmann, Chris A (388J)
Hey Guys, If you're on gene...@lucene then you've probably seen it mentioned it's worth discussing Tika as an Apache TLP. I'd like to get the community view on this since I think it likely makes sense: * Nothing in Tika depends on any of the other Lucene project jars * Tika provides

1.1 release?

2010-03-09 Thread Mattmann, Chris A (388J)
Hey Guys, I have some extra time this weekend and early next week. Want me to be the RM and push out a 1.1 release? Any blockers? I'm happy to do it just let me know. Cheers, Chris ++ Chris Mattmann, Ph.D. Senior Computer

Re: Bug in tika-config xml

2010-02-09 Thread Mattmann, Chris A (388J)
Hi Martin, I filed TIKA-376 to track this issue, and fixed it in r908050. Thanks! Cheers, Chris On 2/9/10 4:17 AM, Martin Gerhardy martin.gerha...@carus-it.com wrote: hi mame should be name i suppose ;) parser mame=parse-rtf class=org.apache.tika.parser.rtf.RTFParser

[ANNOUNCE] Apache Tika 0.6 released

2010-01-31 Thread Mattmann, Chris A (388J)
(...apologies for the cross posting...) The Apache Lucene project is pleased to announce the release of Apache Tika 0.6. The release contents have been pushed out to the main Apache release site and the m2 ibiblio sync, so the releases should be available as soon as the mirrors get the syncs.

Re: CHANGES.txt updates for SOLR-1516 and SOLR-1592

2010-01-27 Thread Mattmann, Chris A (388J)
Thanks, Hoss, no problemo, appreciate it! On 1/26/10 12:22 PM, Chris Hostetter hossman_luc...@fucit.org wrote: : Not to be a pest, but there's no CHANGES.txt updates for SOLR-1516 and : SOLR-1592. Could someone update them? A trivial patch is attached... Sorry about that. Every change (with

[RESULT] [VOTE] Apache Tika 0.6 release candidate #1

2010-01-27 Thread Mattmann, Chris A (388J)
, Chris On 1/27/10 12:01 PM, Ted Dunning ted.dunn...@gmail.com wrote: +0 On Wed, Jan 27, 2010 at 7:26 AM, Grant Ingersoll gsing...@apache.org wrote: On Jan 20, 2010, at 1:56 AM, Mattmann, Chris A (388J) wrote: Hi Folks, I have posted a candidate for the Apache Tika 0.6 release

Re: [VOTE] Apache Tika 0.6 release candidate #1

2010-01-21 Thread Mattmann, Chris A (388J)
Hi Karl, This references a mail about Tika 0.5 and not Release 0.6. OK? The release # referenced in the email isn't as important as the process described. The process has been and will be used in all Tika releases from 0.5 onward. Is there a site available for review as well? Not sure

Re: [VOTE] Apache Tika 0.6 release candidate #1

2010-01-20 Thread Mattmann, Chris A (388J)
Chris, 2010/1/20 Mattmann, Chris A (388J) chris.a.mattm...@jpl.nasa.gov Hi Folks, I have posted a candidate for the Apache Tika 0.6 release. The source code is at: http://people.apache.org/~mattmann/apache-tika-0.6/rc1/ See the included CHANGES.txt file for details on release contents

[VOTE] Apache Tika 0.6 release candidate #1

2010-01-19 Thread Mattmann, Chris A (388J)
Hi Folks, I have posted a candidate for the Apache Tika 0.6 release. The source code is at: http://people.apache.org/~mattmann/apache-tika-0.6/rc1/ See the included CHANGES.txt file for details on release contents and latest changes. The release was made using the Maven2 release plugin,

CHANGES.txt updates for SOLR-1516 and SOLR-1592

2010-01-12 Thread Mattmann, Chris A (388J)
Hi Guys, Not to be a pest, but there's no CHANGES.txt updates for SOLR-1516 and SOLR-1592. Could someone update them? A trivial patch is attached... Cheers, Chris ++ Chris Mattmann, Ph.D. Senior Computer Scientist NASA Jet

Re: CHANGES.txt updates for SOLR-1516 and SOLR-1592

2010-01-12 Thread Mattmann, Chris A (388J)
Dang, the mailing list stripped the attachment. Here's a link to one: http://sunset.usc.edu/~mattmann/CHANGES-solr.patch Cheers, Chris On 1/12/10 10:39 PM, Mattmann, Chris A (388J) chris.a.mattm...@jpl.nasa.gov wrote: Hi Guys, Not to be a pest, but there's no CHANGES.txt updates for SOLR-1516

Re: TIKA-103 - Excel Number/Date Formatting.

2010-01-08 Thread Mattmann, Chris A (388J)
Thanks Dave, sounds good. Please feel free to update CHANGES.txt and I'll move forward with RC activities. Cheers, Chris On 1/8/10 12:25 PM, Dave Meikle loo...@gmail.com wrote: Hi Chris, 2010/1/8 Mattmann, Chris A (388J) chris.a.mattm...@jpl.nasa.gov In the interest of living up to my

Re: svn commit: r894477 - /lucene/solr/trunk/src/java/org/apache/solr/search/function/distance/DistanceUtils.java

2009-12-30 Thread Mattmann, Chris A (388J)
Hi Yonik, What does this tag mean/do? Cheers, Chris On 12/29/09 12:30 PM, yo...@apache.org yo...@apache.org wrote: Author: yonik Date: Tue Dec 29 20:30:53 2009 New Revision: 894477 URL: http://svn.apache.org/viewvc?rev=894477&view=rev Log: SOLR-1586: add solr-internal tag Modified:
