+1
From: lewis john mcgibbney
Reply-To: "dev@nutch.apache.org"
Date: Tuesday, June 9, 2020 at 3:21 PM
To: "dev@nutch.apache.org"
Subject: [EXTERNAL] [PROPOSAL] Replace whitelist blacklist with allowlist denylist
Hi Folks,
What
I would like to propose that we replace
Thamme worked on this…check where he left off…
From: lewis john mcgibbney
Reply-To: "dev@nutch.apache.org"
Date: Thursday, November 29, 2018 at 1:13 PM
To: "dev@nutch.apache.org"
Subject: Maven vs Gradle for Nutch Build System
Hi Folks,
Seb and I were talking build systems this
From: bineesh k
Date: Wednesday, October 3, 2018 at 12:37 AM
To: "dev-ow...@tika.apache.org"
Subject: Solr/Nutch /tika config for PDF crawling
Hello Tika Team,
Need help on Solr/Nutch setup for crawling the PDF pages
We are using Nutch 1.15 and Solr 7.3.1 for our setup.
++1!
Sounds great.
Cheers,
Chris
From: Sebastian Nagel
Reply-To: "dev@nutch.apache.org"
Date: Monday, June 11, 2018 at 7:35 AM
To: "u...@nutch.apache.org"
Cc: "dev@nutch.apache.org"
Subject: Preparing to release Nutch 1.15 ?
Hi all,
almost 80 fixes and
Yay, go Seb, go!
On 12/22/17, 8:38 AM, "Sebastian Nagel" wrote:
Hi Folks,
thanks to everyone who was able to review the release candidate!
72 hours have passed, please see below for vote results.
[8] +1 Release this package as Apache
+1 this makes sense to me! (
Happy to help test.
Cheers,
Chris
On 12/8/17, 2:53 PM, "Sebastian Nagel" wrote:
Hi all,
50+ issues fixed
https://issues.apache.org/jira/projects/NUTCH/versions/12340218
Of course, as always and still many
Hey Semyon,
FWIW, use this for Github contribution guidelines:
https://github.com/apache/nutch/#contributing
I may have some time this weekend to look at 2441.
Thanks,
Chris
On 11/9/17, 8:34 AM, "Semyon Semyonov" wrote:
Dear all,
Could you review
Great job!
From: Julien Nioche
Reply-To: "dev@nutch.apache.org"
Date: Friday, June 9, 2017 at 2:28 AM
To: "crawler-comm...@googlegroups.com" ,
"bixo-...@yahoogroups.com" ,
he.org/nutch/WhiteListRobots
++++++
Chris Mattmann, Ph.D.
Principal Data Scientist, Engineering Administrative Office (3010)
Manager, Open Source Projects Formulation and Development Office (8212)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 180-503E, Mailstop:
Hi Everyone,
I proposed this earlier, and we said we’d wait until after the
1.11 release. So it’s time to VOTE to move Nutch to Git. So
far, the following people have expressed +1s and if I don’t hear
otherwise, I will implicitly count their VOTE from the DISCUSS
thread:
+1 PMC
Chris Mattmann
tly count their VOTE from the DISCUSS
>thread:
>
>+1 PMC
>
>Chris Mattmann*
>Sebastian Nagel*
>Michael Joyce*
>Asitang Mishra*
>Dennis Kubes*
>BlackIce
>
>Everyone else (or those above that would like to amend their VOTE),
>please VOTE below. I will leave
will be ignored
allowed:http://baron.pagemewhen.com/~chris/
[chipotle:~/src/nutch] mattmann%
Thanks,
Chris Mattmann
---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/32451/#review77856
---
Ship it!
Ship It!
- Chris Mattmann
On March 26, 2015, 2:47 a.m
Great, can you attach a patch for this?
Chris Mattmann
chris.mattm...@gmail.com
-Original Message-
From: MengYing Wang mengyingwa...@gmail.com
Date: Thursday, November 20, 2014 at 7:02 PM
To: Lewis John Mcgibbney lewis.mcgibb...@gmail.com
Cc: dev
. To reply, visit:
https://reviews.apache.org/r/9119/#review52796
---
On Sept. 10, 2014, 3:15 a.m., Chris Mattmann wrote:
---
This is an automatically generated e-mail. To reply
/9119/#review52809
---
On Sept. 10, 2014, 3:15 a.m., Chris Mattmann wrote:
---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/9119
-
./trunk/src/java/org/apache/nutch/tools/FileDumper.java PRE-CREATION
Diff: https://reviews.apache.org/r/9119/diff/
Testing
---
Testing it on DARPA XDATA XNET.
Thanks,
Chris Mattmann
---
On Sept. 6, 2014, 4:57 a.m., Chris Mattmann wrote:
---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/9119
.
Diffs (updated)
-
./trunk/src/java/org/apache/nutch/tools/FileDumper.java PRE-CREATION
Diff: https://reviews.apache.org/r/9119/diff/
Testing
---
Testing it on DARPA XDATA XNET.
Thanks,
Chris Mattmann
That is frickin' awesome Juls. You may want to contact Sally (s...@apache.org),
ASF VP of Press and Marketing, and suggest to her that this deserves a Tweet,
at the least.
Cheers!
Chris
-Original Message-
From: Julien Nioche lists.digitalpeb...@gmail.com
Reply-To: dev@nutch.apache.org
Hey Guys,
I submitted the below talk on Apache Tika, Nutch and Solr to ApacheCon NA
2014:
Real Data Science: Exploring the FBI's Vault dataset with Apache Tika,
Nutch and Solr
Event ApacheCon North America
Submission Type Lightning Talk
Category Developer
Biography Chris Mattmann has a wealth
Hey Jul,
A lot are using the Apache CMS:
http://www.apache.org/dev/cms.html
That's infra recommended. Besides that some are using
Confluence; some use Maven; others use Markdown via CMS, etc.
My +1 would be for the CMS, but I don't have time to set it
up (luckily infra can help and we can
to verify the downloads
using signatures found on the Apache site:
http://www.apache.org/dist/nutch/KEYS-1.1.txt
For more information on Apache Nutch, visit the project home page:
http://nutch.apache.org
-- Chris Mattmann (on behalf of the Apache Nutch community)